linux-scsi.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: problems with scsi_transport_fc and qla2xxx
       [not found] ` <978150825.20060227210552@netvision.net.il>
@ 2006-03-01  6:14   ` Andrew Morton
  2006-03-01 20:25     ` Stefan Kaltenbrunner
  0 siblings, 1 reply; 17+ messages in thread
From: Andrew Morton @ 2006-03-01  6:14 UTC (permalink / raw)
  To: Maxim Kozover; +Cc: linux-kernel, linux-scsi, Andrew Vasquez

Maxim Kozover <maximkoz@netvision.net.il> wrote:
>
> Hi!

(cc's added)

> Most of the problem seems to be a QLogic driver problem.
> HBAs are connected to target via FC switch.
> 
> 1. If I have several LUNs on each HBA, with QLogic only 1 directory
> per adapter (for LUN 0) is created in /sys/class/fc_remote_ports,
> while with Emulex a directory for every LUN is created.
> 
> 2. The situation I described occurs with QLogic only if the cable
> connecting between HBA and switch is pulled out/in. If I
> connect/disconnect the cable between switch and target, disks come
> back.
> 
> 3. With Emulex in both cases disks come back.
> 
> However, both with Emulex and QLogic stale directories in
> /sys/classfc_remote_ports are left.
> For example, with Emulex if I had in the beginning
> rport-6:0-0  rport-6:0-1  rport-6:0-2  rport-7:0-0  rport-7:0-1  rport-7:0-2
> then disconnected adapter 7, got
> rport-6:0-0  rport-6:0-1  rport-6:0-2  rport-7:0-0  rport-7:0-2
> (7-0-0 and 7-0-2 didn't disappear while 7-0-1 did)
> connected 7 back
> rport-6:0-0  rport-6:0-1  rport-6:0-2  rport-7:0-2  rport-7:0-4
> rport-7:0-5 rport-7:0-6
> (7-0-0 disappeared, but 7-0-2 is still here).
> 
> Thanks,
> 
> Maxim.
> 
> MK> Hi!
> MK> I'm checking 2.6.16-rc5 with 2 QLogic 2312 adapters using qla2xxx
> MK> driver from 2.6.16-rc5.
> MK> As with earlier kernels, I think > 2.6.12 (since scsi_transport_fc
> MK> gained functionality) I have the following problem.
> MK> 2 scsi hosts available, 4 and 5 (for QLogic).
> MK> I disconnect the cable from one of QLogic cards. After timeout I have
> MK> the message
> MK> rport-4:0-0: blocked FC remote port time out: removing target and saving binding
> MK> and appropriate SCSI devices that came from adapter 4 disappear from
> MK> /proc/scsi/scsi.
> MK> So far, so good.
> MK> I reconnect the cable, the directory
> MK> /sys/class/fc_remote_ports/rport-4:0-1 appears along with the old
> MK> ones rport-4:0-0 and rport-5:0-0, so currently I have 3.
> MK> However, no automatic rescan appears on adapter 4.
> MK> What's worse, if I try echo "0 1 0" > /sys/class/scsi_host/host4/scan
> MK> the process is stuck.
> MK> Please advise.
> 
> MK> Thanks,
> 
> MK> Maxim.
> 
> 
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: problems with scsi_transport_fc and qla2xxx
  2006-03-01  6:14   ` problems with scsi_transport_fc and qla2xxx Andrew Morton
@ 2006-03-01 20:25     ` Stefan Kaltenbrunner
  2006-03-01 21:08       ` Andrew Vasquez
  0 siblings, 1 reply; 17+ messages in thread
From: Stefan Kaltenbrunner @ 2006-03-01 20:25 UTC (permalink / raw)
  To: Andrew Morton; +Cc: Maxim Kozover, linux-kernel, linux-scsi, Andrew Vasquez

Andrew Morton wrote:
> Maxim Kozover <maximkoz@netvision.net.il> wrote:
> 
>>Hi!
> 
> 
> (cc's added)
> 
> 
>>Most of the problem seems to be a QLogic driver problem.
>>HBAs are connected to target via FC switch.
>>
>>1. If I have several LUNs on each HBA, with QLogic only 1 directory
>>per adapter (for LUN 0) is created in /sys/class/fc_remote_ports,
>>while with Emulex a directory for every LUN is created.
>>
>>2. The situation I described occurs with QLogic only if the cable
>>connecting between HBA and switch is pulled out/in. If I
>>connect/disconnect the cable between switch and target, disks come
>>back.

I can confirm that very problem (pulling the cable between HBA and
switch results in only LUN 0 or nothing coming back afterward) on
2.6.15.4 here too.


Stefan

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: problems with scsi_transport_fc and qla2xxx
  2006-03-01 20:25     ` Stefan Kaltenbrunner
@ 2006-03-01 21:08       ` Andrew Vasquez
  2006-03-02 16:32         ` Maxim Kozover
  0 siblings, 1 reply; 17+ messages in thread
From: Andrew Vasquez @ 2006-03-01 21:08 UTC (permalink / raw)
  To: Stefan Kaltenbrunner
  Cc: Andrew Morton, Maxim Kozover, linux-kernel, linux-scsi

On Wed, 01 Mar 2006, Stefan Kaltenbrunner wrote:

> Andrew Morton wrote:
> > Maxim Kozover <maximkoz@netvision.net.il> wrote:
> > 
> >>Hi!
> > 
> > 
> > (cc's added)
> > 
> > 
> >>Most of the problem seems to be a QLogic driver problem.
> >>HBAs are connected to target via FC switch.
> >>
> >>1. If I have several LUNs on each HBA, with QLogic only 1 directory
> >>per adapter (for LUN 0) is created in /sys/class/fc_remote_ports,
> >>while with Emulex a directory for every LUN is created.
> >>
> >>2. The situation I described occurs with QLogic only if the cable
> >>connecting between HBA and switch is pulled out/in. If I
> >>connect/disconnect the cable between switch and target, disks come
> >>back.
> 
> I can confirm that very problem (pulling the cable between HBA and
> switch results in only LUN 0 or nothing coming back afterward) on
> 2.6.15.4 here too.

Please try recent 2.6.16-rcX kernels as there have been a number of
patches submitted since 2.6.15 which (attempt to) address most of
these holes -- I'm still trying to get additional details on Maxim's.

387f96b4d9391bf3ce6928fb9cd90c9c7df37291 [PATCH] qla2xxx: Close window on race between rport removal and fcport transition.
77427f514f88143bfef41ba8c1e624bc45f42297 [SCSI] qla2xxx: Drop legacy 'bypass lun scan for tape device' code.
052c40c83b4ca37be226112049b60097cb9961e1 [SCSI] qla2xxx: Correct issue where the rport's upcall was not being made after relogin.
d97994dc1fddcbb8212b745d9c9c9ce96262155c [SCSI] qla2xxx: Correct synchronization issues during rport addition/deletion.
79f89a4296ff22f09baf538d4ff2a6d0c3097a73 [SCSI] qla2xxx: Disable port-type RSCN handling via driver state-machine.


Regards,
Andrew Vasquez

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re:Re: problems with scsi_transport_fc and qla2xxx
  2006-03-01 21:08       ` Andrew Vasquez
@ 2006-03-02 16:32         ` Maxim Kozover
  2006-03-02 17:38           ` Andrew Vasquez
  2006-03-06 20:00           ` Mike Snitzer
  0 siblings, 2 replies; 17+ messages in thread
From: Maxim Kozover @ 2006-03-02 16:32 UTC (permalink / raw)
  To: Andrew Vasquez
  Cc: Stefan Kaltenbrunner, Andrew Morton, linux-kernel, linux-scsi

Hi Andrew!
Today I tested disconnecting QLogic port.
Adapter 4 is connected via switch to a storage and 3 LUNs are seen via
the adapter.
Only 1 rport is created (for FCP Target) while in Emulex case there
were 3: (Fabric Port, Directory Server and FCP Target, FCP Initiator).
# ls /sys/class/fc_remote_ports/
rport-4:0-0
# cat /sys/class/fc_remote_ports/*/roles
FCP Target

Default dev_loss_tmo is 6 (1+5) while in Emulex case the default was 35.

After disconnecting the cable between the HBA and the switch
qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
 rport-4:0-0: blocked FC remote port time out: removing target and saving binding

# ls /sys/class/fc_remote_ports/
rport-4:0-0
# cat /sys/class/fc_remote_ports/*/roles
unknown

Relevant scsi devices are removed from /proc/scsi/scsi.

After reconnecting the cable
qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).

# ls /sys/class/fc_remote_ports/
rport-4:0-0
# cat /sys/class/fc_remote_ports/*/roles
FCP Target

However, scsi devices don't reappear in /proc/scsi/scsi.
When I issue rescan, the command is stuck
echo - - - > /sys/class/scsi_host/host4/scan

Please advise.

Thanks,

Maxim.

I'm including /var/log/messages output:
# egrep -v 'Vendor|cron' /var/log/messages
Mar  2 19:13:49 multipath kernel: QLogic Fibre Channel HBA Driver
Mar  2 19:13:49 multipath kernel: GSI 24 sharing vector 0xE9 and IRQ 24
Mar  2 19:13:49 multipath kernel: ACPI: PCI Interrupt 0000:03:01.0[A] -> GSI 48 (level, low) -> IRQ 233
Mar  2 19:13:49 multipath kernel: qla2xxx 0000:03:01.0: Found an ISP2312, irq 233, iobase 0xffffc20000004000
Mar  2 19:13:49 multipath kernel: qla2xxx 0000:03:01.0: Configuring PCI space...
Mar  2 19:13:49 multipath kernel: qla2xxx 0000:03:01.0: Configure NVRAM parameters...
Mar  2 19:13:49 multipath kernel: qla2xxx 0000:03:01.0: Verifying loaded RISC code...
Mar  2 19:13:49 multipath kernel: qla2xxx 0000:03:01.0: Waiting for LIP to complete...
Mar  2 19:13:51 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
Mar  2 19:13:51 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
Mar  2 19:13:51 multipath kernel: qla2xxx 0000:03:01.0: Topology - (F_Port), Host Loop address 0xffff
Mar  2 19:13:52 multipath kernel: scsi4 : qla2xxx
Mar  2 19:13:52 multipath kernel: qla2xxx 0000:03:01.0:
Mar  2 19:13:52 multipath kernel:  QLogic Fibre Channel HBA Driver: 8.01.04-k
Mar  2 19:13:52 multipath kernel:   QLogic QLA2340 -
Mar  2 19:13:52 multipath kernel:   ISP2312: PCI-X (133 MHz) @ 0000:03:01.0 hdma-, host#=4, fw=3.03.18 IPX
Mar  2 19:13:52 multipath kernel: GSI 25 sharing vector 0x32 and IRQ 25
Mar  2 19:13:52 multipath kernel: ACPI: PCI Interrupt 0000:06:01.0[A] -> GSI 72 (level, low) -> IRQ 50
Mar  2 19:13:52 multipath kernel: qla2xxx 0000:06:01.0: Configuring PCI space...
Mar  2 19:13:52 multipath kernel: qla2xxx 0000:06:01.0: Configure NVRAM parameters...
Mar  2 19:13:52 multipath kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Mar  2 19:13:52 multipath kernel: SCSI device sdb: 33554432 512-byte hdwr sectors (17180 MB)
Mar  2 19:13:52 multipath kernel: sdb: Write Protect is off
Mar  2 19:13:52 multipath kernel: SCSI device sdb: drive cache: write through
Mar  2 19:13:52 multipath kernel: SCSI device sdb: 33554432 512-byte hdwr sectors (17180 MB)
Mar  2 19:13:52 multipath kernel: sdb: Write Protect is off
Mar  2 19:13:52 multipath kernel: SCSI device sdb: drive cache: write through
Mar  2 19:13:52 multipath kernel:  sdb: unknown partition table
Mar  2 19:13:52 multipath kernel: sd 4:0:0:0: Attached scsi disk sdb
Mar  2 19:13:52 multipath kernel: qla2xxx 0000:06:01.0: Verifying loaded RISC code...
Mar  2 19:13:52 multipath kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Mar  2 19:13:52 multipath scsi.agent[4288]: disk at /devices/pci0000:00/0000:00:02.0/0000:01:00.2/0000:03:01.0/host4/rport-4:0-0/target4:0:0/4:0:0:0
Mar  2 19:13:52 multipath kernel: SCSI device sdc: 67108864 512-byte hdwr sectors (34360 MB)
Mar  2 19:13:52 multipath kernel: sdc: Write Protect is off
Mar  2 19:13:52 multipath kernel: SCSI device sdc: drive cache: write through
Mar  2 19:13:52 multipath kernel: SCSI device sdc: 67108864 512-byte hdwr sectors (34360 MB)
Mar  2 19:13:52 multipath kernel: sdc: Write Protect is off
Mar  2 19:13:52 multipath kernel: SCSI device sdc: drive cache: write through
Mar  2 19:13:52 multipath kernel:  sdc: unknown partition table
Mar  2 19:13:52 multipath kernel: sd 4:0:0:1: Attached scsi disk sdc
Mar  2 19:13:52 multipath kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Mar  2 19:13:52 multipath kernel: sdd : very big device. try to use READ CAPACITY(16).
Mar  2 19:13:52 multipath kernel: SCSI device sdd: 9797894144 512-byte hdwr sectors (5016522 MB)
Mar  2 19:13:52 multipath kernel: sdd: Write Protect is off
Mar  2 19:13:52 multipath kernel: SCSI device sdd: drive cache: write through
Mar  2 19:13:52 multipath kernel: sdd : very big device. try to use READ CAPACITY(16).
Mar  2 19:13:52 multipath kernel: SCSI device sdd: 9797894144 512-byte hdwr sectors (5016522 MB)
Mar  2 19:13:52 multipath kernel: sdd: Write Protect is off
Mar  2 19:13:52 multipath kernel: SCSI device sdd: drive cache: write through
Mar  2 19:13:52 multipath scsi.agent[4313]: disk at /devices/pci0000:00/0000:00:02.0/0000:01:00.2/0000:03:01.0/host4/rport-4:0-0/target4:0:0/4:0:0:1
Mar  2 19:13:52 multipath kernel:  sdd: unknown partition table
Mar  2 19:13:52 multipath kernel: sd 4:0:0:2: Attached scsi disk sdd
Mar  2 19:13:52 multipath scsi.agent[4331]: disk at /devices/pci0000:00/0000:00:02.0/0000:01:00.2/0000:03:01.0/host4/rport-4:0-0/target4:0:0/4:0:0:2
Mar  2 19:13:52 multipath kernel: qla2xxx 0000:06:01.0: Waiting for LIP to complete...
Mar  2 19:14:12 multipath kernel: qla2xxx 0000:06:01.0: Cable is unplugged...
Mar  2 19:14:12 multipath kernel: scsi5 : qla2xxx
Mar  2 19:14:12 multipath kernel: qla2xxx 0000:06:01.0:
Mar  2 19:14:12 multipath kernel:  QLogic Fibre Channel HBA Driver: 8.01.04-k
Mar  2 19:14:12 multipath kernel:   QLogic QLA2340 -
Mar  2 19:14:12 multipath kernel:   ISP2312: PCI-X (133 MHz) @ 0000:06:01.0 hdma-, host#=5, fw=3.03.18 IPX
Mar  2 19:14:17 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  2 19:14:18 multipath last message repeated 19 times
Mar  2 19:22:02 multipath kernel: qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
Mar  2 19:22:08 multipath kernel:  rport-4:0-0: blocked FC remote port time out: removing target and saving binding
Mar  2 19:24:04 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
Mar  2 19:24:04 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).


^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re:Re: problems with scsi_transport_fc and qla2xxx
  2006-03-02 16:32         ` Maxim Kozover
@ 2006-03-02 17:38           ` Andrew Vasquez
  2006-03-02 22:15             ` Maxim Kozover
  2006-03-06 20:00           ` Mike Snitzer
  1 sibling, 1 reply; 17+ messages in thread
From: Andrew Vasquez @ 2006-03-02 17:38 UTC (permalink / raw)
  To: Maxim Kozover
  Cc: Stefan Kaltenbrunner, Andrew Morton, linux-kernel, linux-scsi

On Thu, 02 Mar 2006, Maxim Kozover wrote:

> Today I tested disconnecting QLogic port.
> Adapter 4 is connected via switch to a storage and 3 LUNs are seen via
> the adapter.
> Only 1 rport is created (for FCP Target) while in Emulex case there
> were 3: (Fabric Port, Directory Server and FCP Target, FCP Initiator).

That's correct, we currently don't make an upcall for the SNS server
port nor the switch fabric port.

> # ls /sys/class/fc_remote_ports/
> rport-4:0-0
> # cat /sys/class/fc_remote_ports/*/roles
> FCP Target
> 
> Default dev_loss_tmo is 6 (1+5) while in Emulex case the default was 35.
> 
> After disconnecting the cable between the HBA and the switch
> qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
>  rport-4:0-0: blocked FC remote port time out: removing target and saving binding
> 
> # ls /sys/class/fc_remote_ports/
> rport-4:0-0
> # cat /sys/class/fc_remote_ports/*/roles
> unknown
> 
> Relevant scsi devices are removed from /proc/scsi/scsi.
> 
> After reconnecting the cable
> qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
> qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
> 
> # ls /sys/class/fc_remote_ports/
> rport-4:0-0
> # cat /sys/class/fc_remote_ports/*/roles
> FCP Target
> 
> However, scsi devices don't reappear in /proc/scsi/scsi.
> When I issue rescan, the command is stuck
> echo - - - > /sys/class/scsi_host/host4/scan
> 
> Please advise.

Could you add the enable-debug patch I sent you earlier and retry the
test?  Again forward the relevent snippets from var/log/messages.

Here's the patch again.

--
av

---

diff --git a/drivers/scsi/qla2xxx/qla_dbg.h b/drivers/scsi/qla2xxx/qla_dbg.h
index 935a59a..632f653 100644
--- a/drivers/scsi/qla2xxx/qla_dbg.h
+++ b/drivers/scsi/qla2xxx/qla_dbg.h
@@ -9,6 +9,7 @@
  */
 /* #define QL_DEBUG_LEVEL_1  */ /* Output register accesses to COM1 */
 /* #define QL_DEBUG_LEVEL_2  */ /* Output error msgs to COM1 */
+#define QL_DEBUG_LEVEL_2   /* Output error msgs to COM1 */
 /* #define QL_DEBUG_LEVEL_3  */ /* Output function trace msgs to COM1 */
 /* #define QL_DEBUG_LEVEL_4  */ /* Output NVRAM trace msgs to COM1 */
 /* #define QL_DEBUG_LEVEL_5  */ /* Output ring trace msgs to COM1 */
diff --git a/drivers/scsi/qla2xxx/qla_settings.h b/drivers/scsi/qla2xxx/qla_settings.h
index 363205c..b2e22b0 100644
--- a/drivers/scsi/qla2xxx/qla_settings.h
+++ b/drivers/scsi/qla2xxx/qla_settings.h
@@ -8,7 +8,7 @@
  * Compile time Options:
  *     0 - Disable and 1 - Enable
  */
-#define DEBUG_QLA2100		0	/* For Debug of qla2x00 */
+#define DEBUG_QLA2100		1	/* For Debug of qla2x00 */
 
 #define USE_ABORT_TGT		1	/* Use Abort Target mbx cmd */
 



^ permalink raw reply related	[flat|nested] 17+ messages in thread

* Re: Re: problems with scsi_transport_fc and qla2xxx
  2006-03-02 17:38           ` Andrew Vasquez
@ 2006-03-02 22:15             ` Maxim Kozover
  2006-03-06 19:06               ` Andrew Vasquez
  0 siblings, 1 reply; 17+ messages in thread
From: Maxim Kozover @ 2006-03-02 22:15 UTC (permalink / raw)
  To: Andrew Vasquez
  Cc: Stefan Kaltenbrunner, Andrew Morton, linux-kernel, linux-scsi

Hi Andrew!
Please see the log with debug-patch.
The module is loaded with option qlport_down_retry=1.
Adapter 4 is connected to switch, adapter 5 doesn't have cable attached.
After reconnecting the cable the disks don't reappear and rescan is stuck.
Before applying your patches ghost rport was staying, now it's OK.

Thanks,

Maxim.

# egrep -v 'Vendor|cron' /var/log/messages
Mar  3 01:05:17 multipath kernel: QLogic Fibre Channel HBA Driver
Mar  3 01:05:17 multipath kernel: GSI 24 sharing vector 0xE9 and IRQ 24
Mar  3 01:05:17 multipath kernel: ACPI: PCI Interrupt 0000:03:01.0[A] -> GSI 48 (level, low) -> IRQ 233
Mar  3 01:05:17 multipath kernel: qla2xxx 0000:03:01.0: Found an ISP2312, irq 233, iobase 0xffffc20000004000
Mar  3 01:05:17 multipath kernel: qla2xxx 0000:03:01.0: Configuring PCI space...
Mar  3 01:05:17 multipath kernel: qla2xxx 0000:03:01.0: Configure NVRAM parameters...
Mar  3 01:05:17 multipath kernel: qla2xxx 0000:03:01.0: Verifying loaded RISC code...
Mar  3 01:05:17 multipath kernel: scsi(4): **** Load RISC code ****
Mar  3 01:05:17 multipath kernel: scsi(4): Verifying Checksum of loaded RISC code.
Mar  3 01:05:17 multipath kernel: scsi(4): Checksum OK, start firmware.
Mar  3 01:05:17 multipath kernel: scsi(4): Issue init firmware.
Mar  3 01:05:17 multipath kernel: qla2xxx 0000:03:01.0: Waiting for LIP to complete...
Mar  3 01:05:19 multipath kernel: scsi(4): Asynchronous LIP RESET (f7f7).
Mar  3 01:05:19 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
Mar  3 01:05:19 multipath kernel: scsi(4): Asynchronous P2P MODE received.
Mar  3 01:05:20 multipath kernel: scsi(4): Asynchronous LOOP UP (2 Gbps).
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
Mar  3 01:05:20 multipath kernel: scsi(4): Asynchronous PORT UPDATE.
Mar  3 01:05:20 multipath kernel: scsi(4): Port database changed ffff 0006 0000.
Mar  3 01:05:20 multipath kernel: scsi(4): F/W Ready - OK
Mar  3 01:05:20 multipath kernel: scsi(4): fw_state=3 curr time=ffff3dac.
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:03:01.0: Topology - (F_Port), Host Loop address 0xffff
Mar  3 01:05:20 multipath kernel: scsi(4): Configure loop -- dpc flags =0x4080040
Mar  3 01:05:20 multipath kernel: scsi(4): RSCN queue entry[0] = [00/000000].
Mar  3 01:05:20 multipath kernel: scsi(4): device_resync: rscn overflow.
Mar  3 01:05:20 multipath kernel: scsi(4): RFT_ID exiting normally.
Mar  3 01:05:20 multipath kernel: scsi(4): RFF_ID exiting normally.
Mar  3 01:05:20 multipath kernel: scsi(4): RNN_ID exiting normally.
Mar  3 01:05:20 multipath kernel: scsi(4): RSNN_NN exiting normally.
Mar  3 01:05:20 multipath kernel: scsi(4): GID_PT entry - nn 200000e08b079a69 pn 210000e08b079a69 portid=010700.
Mar  3 01:05:20 multipath kernel: scsi(4): GID_PT entry - nn 2000001738279c00 pn 1000001738279c11 portid=010200.
Mar  3 01:05:20 multipath kernel: scsi(4): device wrap (010200)
Mar  3 01:05:20 multipath kernel: scsi(4): Trying Fabric Login w/loop id 0x0081 for port 010200.
Mar  3 01:05:20 multipath kernel: scsi(4): LOOP READY
Mar  3 01:05:20 multipath kernel: DEBUG: detect hba 4 at address = ffff81006f5f8548
Mar  3 01:05:20 multipath kernel: scsi4 : qla2xxx
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:03:01.0:
Mar  3 01:05:20 multipath kernel:  QLogic Fibre Channel HBA Driver: 8.01.04-k-debug
Mar  3 01:05:20 multipath kernel:   QLogic QLA2340 -
Mar  3 01:05:20 multipath kernel:   ISP2312: PCI-X (133 MHz) @ 0000:03:01.0 hdma-, host#=4, fw=3.03.18 IPX
Mar  3 01:05:20 multipath kernel: GSI 25 sharing vector 0x32 and IRQ 25
Mar  3 01:05:20 multipath kernel: ACPI: PCI Interrupt 0000:06:01.0[A] -> GSI 72 (level, low) -> IRQ 50
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:06:01.0: Found an ISP2312, irq 50, iobase 0xffffc20000006000
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:06:01.0: Configuring PCI space...
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:06:01.0: Configure NVRAM parameters...
Mar  3 01:05:20 multipath kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Mar  3 01:05:20 multipath kernel: SCSI device sdb: 33554432 512-byte hdwr sectors (17180 MB)
Mar  3 01:05:20 multipath kernel: sdb: Write Protect is off
Mar  3 01:05:20 multipath kernel: SCSI device sdb: drive cache: write through
Mar  3 01:05:20 multipath kernel: SCSI device sdb: 33554432 512-byte hdwr sectors (17180 MB)
Mar  3 01:05:20 multipath kernel: sdb: Write Protect is off
Mar  3 01:05:20 multipath kernel: SCSI device sdb: drive cache: write through
Mar  3 01:05:20 multipath kernel:  sdb: unknown partition table
Mar  3 01:05:20 multipath kernel: sd 4:0:0:0: Attached scsi disk sdb
Mar  3 01:05:20 multipath kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Mar  3 01:05:20 multipath kernel: SCSI device sdc: 67108864 512-byte hdwr sectors (34360 MB)
Mar  3 01:05:20 multipath kernel: sdc: Write Protect is off
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:06:01.0: Verifying loaded RISC code...
Mar  3 01:05:20 multipath kernel: scsi(5): **** Load RISC code ****
Mar  3 01:05:20 multipath kernel: SCSI device sdc: drive cache: write through
Mar  3 01:05:20 multipath kernel: SCSI device sdc: 67108864 512-byte hdwr sectors (34360 MB)
Mar  3 01:05:20 multipath kernel: sdc: Write Protect is off
Mar  3 01:05:20 multipath kernel: SCSI device sdc: drive cache: write through
Mar  3 01:05:20 multipath kernel:  sdc: unknown partition table
Mar  3 01:05:20 multipath kernel: sd 4:0:0:1: Attached scsi disk sdc
Mar  3 01:05:20 multipath scsi.agent[4295]: disk at /devices/pci0000:00/0000:00:02.0/0000:01:00.2/0000:03:01.0/host4/rport-4:0-0/target4:0:0/4:0:0:0
Mar  3 01:05:20 multipath kernel:   Type:   Direct-Access                      ANSI SCSI revision: 05
Mar  3 01:05:20 multipath kernel: scsi(5): Verifying Checksum of loaded RISC code.
Mar  3 01:05:20 multipath kernel: sdd : very big device. try to use READ CAPACITY(16).
Mar  3 01:05:20 multipath kernel: SCSI device sdd: 9797894144 512-byte hdwr sectors (5016522 MB)
Mar  3 01:05:20 multipath kernel: sdd: Write Protect is off
Mar  3 01:05:20 multipath scsi.agent[4330]: disk at /devices/pci0000:00/0000:00:02.0/0000:01:00.2/0000:03:01.0/host4/rport-4:0-0/target4:0:0/4:0:0:1
Mar  3 01:05:20 multipath kernel: SCSI device sdd: drive cache: write through
Mar  3 01:05:20 multipath scsi.agent[4342]: disk at /devices/pci0000:00/0000:00:02.0/0000:01:00.2/0000:03:01.0/host4/rport-4:0-0/target4:0:0/4:0:0:2
Mar  3 01:05:20 multipath kernel: sdd : very big device. try to use READ CAPACITY(16).
Mar  3 01:05:20 multipath kernel: SCSI device sdd: 9797894144 512-byte hdwr sectors (5016522 MB)
Mar  3 01:05:20 multipath kernel: scsi(5): Checksum OK, start firmware.
Mar  3 01:05:20 multipath kernel: sdd: Write Protect is off
Mar  3 01:05:20 multipath kernel: SCSI device sdd: drive cache: write through
Mar  3 01:05:20 multipath kernel:  sdd: unknown partition table
Mar  3 01:05:20 multipath kernel: sd 4:0:0:2: Attached scsi disk sdd
Mar  3 01:05:20 multipath kernel: scsi(5): Issue init firmware.
Mar  3 01:05:20 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0006/7ee5.
Mar  3 01:05:20 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0007/7ee5.
Mar  3 01:05:20 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0004/7ee5.
Mar  3 01:05:20 multipath kernel: qla2xxx 0000:06:01.0: Waiting for LIP to complete...
Mar  3 01:05:21 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0081/0006/7ee5.
Mar  3 01:05:41 multipath kernel: qla2xxx 0000:06:01.0: Cable is unplugged...
Mar  3 01:05:41 multipath kernel: scsi(5): fw_state=4 curr time=ffff522d.
Mar  3 01:05:41 multipath kernel: scsi(5): Firmware ready **** FAILED ****.
Mar  3 01:05:41 multipath kernel: DEBUG: detect hba 5 at address = ffff81006e8f0548
Mar  3 01:05:41 multipath kernel: scsi5 : qla2xxx
Mar  3 01:05:41 multipath kernel: qla2xxx 0000:06:01.0:
Mar  3 01:05:41 multipath kernel:  QLogic Fibre Channel HBA Driver: 8.01.04-k-debug
Mar  3 01:05:41 multipath kernel:   QLogic QLA2340 -
Mar  3 01:05:41 multipath kernel:   ISP2312: PCI-X (133 MHz) @ 0000:06:01.0 hdma-, host#=5, fw=3.03.18 IPX
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:46 multipath kernel: qla2x00_mailbox_command(5): **** FAILED. mbx0=4005, mbx1=0, mbx2=6e92, cmd=6b ****
Mar  3 01:05:46 multipath kernel: qla2x00_get_link_status(5): failed=102.
Mar  3 01:05:46 multipath kernel: qla2xxx 0000:06:01.0: Unable to retrieve host statistics (65535).
Mar  3 01:05:50 multipath kernel: scsi(5): Loop Down - aborting the queues before time expire
-------------------------------
Disconnecting the cable
Mar  3 01:06:34 multipath kernel: scsi(4): Asynchronous LOOP DOWN (2).
Mar  3 01:06:34 multipath kernel: qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
Mar  3 01:06:34 multipath kernel: scsi(4): fcport-0 - port retry count: 0 remaining
Mar  3 01:06:40 multipath kernel:  rport-4:0-0: blocked FC remote port time out: removing target and saving binding
Mar  3 01:06:42 multipath kernel: scsi(4): Loop Down - aborting the queues before time expire
Mar  3 01:07:22 multipath kernel: scsi(4): Asynchronous LIP RESET (f7f7).
Mar  3 01:07:22 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
Mar  3 01:07:22 multipath kernel: scsi(4): qla2x00_reset_marker()
Mar  3 01:07:22 multipath kernel: scsi(4): Asynchronous P2P MODE received.
Mar  3 01:07:22 multipath kernel: scsi(4): qla2x00_reset_marker()
Mar  3 01:07:22 multipath kernel: scsi(4): Asynchronous LOOP UP (2 Gbps).
Mar  3 01:07:22 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
Mar  3 01:07:22 multipath kernel: scsi(4): Asynchronous PORT UPDATE.
Mar  3 01:07:22 multipath kernel: scsi(4): Port database changed ffff 0006 7ee5.
Mar  3 01:07:22 multipath kernel: scsi(4): qla2x00_loop_resync()
Mar  3 01:07:22 multipath kernel: scsi(4): F/W Ready - OK
Mar  3 01:07:22 multipath kernel: scsi(4): fw_state=3 curr time=ffffb533.
Mar  3 01:07:22 multipath kernel: scsi(4): Configure loop -- dpc flags =0x4090060
Mar  3 01:07:22 multipath kernel: scsi(4): RSCN queue entry[0] = [00/000000].
Mar  3 01:07:22 multipath kernel: scsi(4): device_resync: rscn overflow.
Mar  3 01:07:22 multipath kernel: scsi(4): RFT_ID exiting normally.
Mar  3 01:07:22 multipath kernel: scsi(4): RFF_ID exiting normally.
Mar  3 01:07:22 multipath kernel: scsi(4): RNN_ID exiting normally.
Mar  3 01:07:22 multipath kernel: scsi(4): RSNN_NN exiting normally.
Mar  3 01:07:22 multipath kernel: scsi(4): GID_PT entry - nn 200000e08b079a69 pn 210000e08b079a69 portid=010700.
Mar  3 01:07:22 multipath kernel: scsi(4): GID_PT entry - nn 2000001738279c00 pn 1000001738279c11 portid=010200.
Mar  3 01:07:22 multipath kernel: scsi(4): device wrap (010200)
Mar  3 01:07:22 multipath kernel: scsi(4): Trying Fabric Login w/loop id 0x0081 for port 010200.
Mar  3 01:07:22 multipath kernel: scsi(4): LOOP READY
Mar  3 01:07:22 multipath kernel: scsi(4): qla2x00_loop_resync - end
Mar  3 01:07:22 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0081/0007/7ee5.
Mar  3 01:07:22 multipath kernel: scsi(4:0:0): status_entry: Port Down pid=43, compl status=0x29, port state=0x4
Mar  3 01:07:22 multipath kernel: scsi(4): Port login retry: 1000001738279c11, id = 0x0081 retry cnt=8
Mar  3 01:07:23 multipath kernel: scsi(4): fcport-0 - port retry count: 0 remaining
Mar  3 01:07:23 multipath kernel: scsi(4): qla2x00_port_login()
Mar  3 01:07:23 multipath kernel: scsi(4): Trying Fabric Login w/loop id 0x0081 for port 010200.
Mar  3 01:07:23 multipath kernel: scsi(4): port login OK: logged in ID 0x81
Mar  3 01:07:23 multipath kernel: scsi(4): qla2x00_port_login - end
Mar  3 01:07:23 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0006/0001.
Mar  3 01:07:23 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0007/0001.
Mar  3 01:07:23 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0004/0001.
Mar  3 01:07:24 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0081/0006/0001.
Mar  3 01:09:56 multipath kernel: scsi(5): Loop down exceed 4 mins - restarting queues.



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: problems with scsi_transport_fc and qla2xxx
  2006-03-02 22:15             ` Maxim Kozover
@ 2006-03-06 19:06               ` Andrew Vasquez
  0 siblings, 0 replies; 17+ messages in thread
From: Andrew Vasquez @ 2006-03-06 19:06 UTC (permalink / raw)
  To: Maxim Kozover
  Cc: Stefan Kaltenbrunner, Andrew Morton, linux-kernel, linux-scsi

On Fri, 03 Mar 2006, Maxim Kozover wrote:

> Please see the log with debug-patch.
> The module is loaded with option qlport_down_retry=1.
> Adapter 4 is connected to switch, adapter 5 doesn't have cable attached.
> After reconnecting the cable the disks don't reappear and rescan is stuck.
> Before applying your patches ghost rport was staying, now it's OK.

Before you try the patch I sent earlier, could you send be the output
from the following:

	# echo t > /proc/sysrq-trigger

Thanks,
Andrew Vasquez

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re: problems with scsi_transport_fc and qla2xxx
  2006-03-02 16:32         ` Maxim Kozover
  2006-03-02 17:38           ` Andrew Vasquez
@ 2006-03-06 20:00           ` Mike Snitzer
  2006-03-06 20:09             ` Maxim Kozover
  1 sibling, 1 reply; 17+ messages in thread
From: Mike Snitzer @ 2006-03-06 20:00 UTC (permalink / raw)
  To: Maxim Kozover
  Cc: Andrew Vasquez, Stefan Kaltenbrunner, Andrew Morton,
	linux-kernel, linux-scsi

On 3/2/06, Maxim Kozover <maximkoz@netvision.net.il> wrote:
> Hi Andrew!
> Today I tested disconnecting QLogic port.
> Adapter 4 is connected via switch to a storage and 3 LUNs are seen via
> the adapter.
> Only 1 rport is created (for FCP Target) while in Emulex case there
> were 3: (Fabric Port, Directory Server and FCP Target, FCP Initiator).
> # ls /sys/class/fc_remote_ports/
> rport-4:0-0
> # cat /sys/class/fc_remote_ports/*/roles
> FCP Target
>
> Default dev_loss_tmo is 6 (1+5) while in Emulex case the default was 35.
>
> After disconnecting the cable between the HBA and the switch
> qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
>  rport-4:0-0: blocked FC remote port time out: removing target and saving binding
>
> # ls /sys/class/fc_remote_ports/
> rport-4:0-0
> # cat /sys/class/fc_remote_ports/*/roles
> unknown
>
> Relevant scsi devices are removed from /proc/scsi/scsi.
>
> After reconnecting the cable
> qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
> qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
>
> # ls /sys/class/fc_remote_ports/
> rport-4:0-0
> # cat /sys/class/fc_remote_ports/*/roles
> FCP Target
>
> However, scsi devices don't reappear in /proc/scsi/scsi.
> When I issue rescan, the command is stuck
> echo - - - > /sys/class/scsi_host/host4/scan

Historically the qlogic driver rescan is a 2-phase process:
1) schedule the rescan, e.g.: echo scsi-qlascan > /proc/scsi/qla2xxx/4
2) rescan, e.g.: echo - - - > /sys/class/scsi_host/host4/scan

BUT, I've just used scsi-qlascan to discover _new_ devices... not
existing devices that experienced FC connection loss.  I assume the
qla driver _should_ just bring those lost devices back?  But does the
historic 2-phase rescan for new devices speak to why the qlogic driver
doesn't automagically bring the old devices back?  Or has the latest
qlogic driver in mainline advanced past this 2-phase requirement in
general?

regards,
Mike

Mike

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re: problems with scsi_transport_fc and qla2xxx
  2006-03-06 20:00           ` Mike Snitzer
@ 2006-03-06 20:09             ` Maxim Kozover
  2006-03-06 21:28               ` Andrew Vasquez
  0 siblings, 1 reply; 17+ messages in thread
From: Maxim Kozover @ 2006-03-06 20:09 UTC (permalink / raw)
  To: Mike Snitzer; +Cc: Andrew Vasquez, linux-kernel, linux-scsi

Hi Mike!
Unfortunately I don't have the directory /proc/scsi/qla2xxx.
However the target sees PRLI from the host again after reconnecting
the cable between the initiator and the switch.
Does it mean the rediscovering new devices on initiator side is
already done?

Thanks,

Maxim.

MS> Historically the qlogic driver rescan is a 2-phase process:
MS> 1) schedule the rescan, e.g.: echo scsi-qlascan > /proc/scsi/qla2xxx/4
MS> 2) rescan, e.g.: echo - - - > /sys/class/scsi_host/host4/scan

MS> BUT, I've just used scsi-qlascan to discover _new_ devices... not
MS> existing devices that experienced FC connection loss.  I assume the
MS> qla driver _should_ just bring those lost devices back?  But does the
MS> historic 2-phase rescan for new devices speak to why the qlogic driver
MS> doesn't automagically bring the old devices back?  Or has the latest
MS> qlogic driver in mainline advanced past this 2-phase requirement in
MS> general?

MS> regards,
MS> Mike

MS> Mike



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re: problems with scsi_transport_fc and qla2xxx
  2006-03-06 21:28               ` Andrew Vasquez
@ 2006-03-06 20:57                 ` Maxim Kozover
  2006-03-06 21:09                 ` Maxim Kozover
  1 sibling, 0 replies; 17+ messages in thread
From: Maxim Kozover @ 2006-03-06 20:57 UTC (permalink / raw)
  To: Andrew Vasquez; +Cc: Mike Snitzer, linux-kernel, linux-scsi

Hi Andrew!
AV> I also noticed that scsi_transport_fc.c::fc_user_scan() is not called
AV> with the host_lock held... hmm..  could you try out the patch I sent
AV> earlier and provide the results.

AV> Also, could you send the "echo t > /proc/..." output after the cable
AV> has been reinserted, but, before the 'echo "- - -" > /sys/class' scan
AV> is initiated.

AV> thanks,
AV> av

Here's sysrq output after reconnecting cable without manual disk
rescan. Before applying a patch.
The same lock exists:
#001:             [ffff81006ee20080] {scsi_host_alloc}
.. held by:         scsi_wq_4: 4255 [ffff81006f9147b0, 110]
... acquired at:               scsi_scan_target+0x51/0x87 [scsi_mod]

Thanks,

Maxim.

SysRq : Show State

                                                       sibling
  task                 PC          pid father child younger older
init          S ffff81007fdd3d78     0     1      0     2               (NOTLB)
ffff81007fdd3d78 0000000000000000 0000000000000282 0000000000000000
       ffff81007f6126d8 0000000000000000 0000000000000000 000000000000cc00
       ffff81007fdc4770 0000000000000ee6
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff8018189c>{do_select+944}
       <ffffffff80181431>{__pollwait+0} <ffffffff80181b7a>{sys_select+651}
       <ffffffff8010a70a>{system_call+126}
migration/0   S ffff81007fdd7ec8     0     2      1             3       (L-TLB)
ffff81007fdd7ec8 000000017ead3830 ffff810003018b00 ffff81000301f800
       0000000000000002 0000000000000000 0000000000000000 0000000000000096
       ffff81007fdc4040 0000000000000d38
Call Trace: <ffffffff801255b3>{__wake_up_common+67}
       <ffffffff8012687c>{migration_thread+360} <ffffffff80126714>{migration_thread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
ksoftirqd/0   S ffff81007fddbf08     0     3      1             4     2 (L-TLB)
ffff81007fddbf08 ffff81007ebd1080 000000008031dafb 0000000000000000
       0000000000000000 0000000000000000 0000000000000000 000000000000c6b5
       ffff81007fdc67b0 00000000000000b9
Call Trace: <ffffffff801309a5>{ksoftirqd+0} <ffffffff801309dd>{ksoftirqd+56}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
watchdog/0    S ffff81007fdddea8     0     4      1             5     3 (L-TLB)
ffff81007fdddea8 ffffffff8038f000 0000000000000000 ffff81007fddde68
       0000000000000200 0000000000000000 0000000000000000 000000007fdddf08
       ffff81007fdc6080 0000000000000789
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8014cb3d>{watchdog+0}
       <ffffffff8031e54a>{schedule_timeout+138} <ffffffff8013435d>{process_timeout+0}
       <ffffffff80134645>{msleep_interruptible+46} <ffffffff8014cb8c>{watchdog+79}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
migration/1   S ffff81007fddfec8     0     5      1             6     4 (L-TLB)
ffff81007fddfec8 000000007a26c080 ffff810003020b00 ffff810003017800
       0000000000000002 0000000000000000 0000000000000001 0000000100000096
       ffff81007fdc77f0 00000000000010bc
Call Trace: <ffffffff801255b3>{__wake_up_common+67}
       <ffffffff8012687c>{migration_thread+360} <ffffffff80126714>{migration_thread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
ksoftirqd/1   S ffff810003363f08     0     6      1             7     5 (L-TLB)
ffff810003363f08 ffff81007ed8e7f0 000000008031dafb 0000000000000000
       0000000000000000 0000000100000000 0000000000000001 00000001000014e5
       ffff81007fdc70c0 0000000000000075
Call Trace: <ffffffff801309a5>{ksoftirqd+0} <ffffffff801309dd>{ksoftirqd+56}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
watchdog/1    S ffff81007fd85ea8     0     7      1             8     6 (L-TLB)
ffff81007fd85ea8 ffff810003364100 0000000000000000 ffff81007fd85e68
       0000000000000200 0000000000000000 0000000000000001 000000017fd85f08
       ffff810003364830 000000000000008d
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8014cb3d>{watchdog+0}
       <ffffffff8031e54a>{schedule_timeout+138} <ffffffff8013435d>{process_timeout+0}
       <ffffffff80134645>{msleep_interruptible+46} <ffffffff8014cb8c>{watchdog+79}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
events/0      S ffff810037f3de98     0     8      1             9     7 (L-TLB)
ffff810037f3de98 0000000000000286 ffffffff801339a1 0000000000000286
       0000000000000286 0000000300000000 0000000000000000 0000000000000000
       ffff810037fef870 0000000000002478
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
events/1      S ffff810037e57e98     0     9      1            10     8 (L-TLB)
ffff810037e57e98 0000000000000246 0000000000000246 ffff81007e992800
       000000007e9929e8 0000000000000000 ffff810037e46520 0000000100000000
       ffff810037fef140 0000000000000134
Call Trace: <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013ae41>{worker_thread+211} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
khelper       S ffff810037f39e98     0    10      1            11     9 (L-TLB)
ffff810037f39e98 ffff81006ee95b80 0000000000000001 ffff81006ee95b98
       0000000000000000 0000000000000000 ffff810037e46420 0000000000000000
       ffff810037fee770 00000000000001c2
Call Trace: <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013ae41>{worker_thread+211} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kthread       S ffff810037c2de98     0    11      1    15     181    10 (L-TLB)
ffff810037c2de98 ffff81006fbe5b68 0000000000000001 ffff81006fbe5b80
       0000000000000000 0000000300000000 0000000000000001 0000000100000000
       ffff810037fee040 0000000000000151
Call Trace: <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013ae41>{worker_thread+211} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kblockd/0     S ffff810037dc3e98     0    15     11            16       (L-TLB)
ffff810037dc3e98 ffff81007f99c000 ffff81007f618ce8 ffff81007f618ce8
       ffffffff801dd758 0000000300000000 0000000000000000 0000000000000000
       ffff81007f580830 0000000000001488
Call Trace: <ffffffff801dd758>{elv_next_request+62}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kblockd/1     S ffff810037dc7e98     0    16     11            17    15 (L-TLB)
ffff810037dc7e98 ffff81007f99c000 ffff81007f618ce8 ffff81007f618ce8
       ffffffff801dd758 0000000300000000 0000000000000001 0000000100000000
       ffff81007f580100 000000000000142c
Call Trace: <ffffffff801dd758>{elv_next_request+62}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kacpid        S ffff810037de7e98     0    17     11           115    16 (L-TLB)
ffff810037de7e98 0000000000000000 0000000000000006 00000000000006cc
       0000000000000000 0000000000000000 0000000000000001 0000000137de7e28
       ffff810003369870 00000000000001cc
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
khubd         S ffff810037dede28     0   115     11           179    17 (L-TLB)
ffff810037dede28 0000000000000001 800001807f5107b0 ffff81007e4af800
       0000000000100100 0000000000000000 00000000ffffffff 00000001ffffffff
       ffff81007f5107b0 00000000000006a4
Call Trace: <ffffffff80287ff6>{hub_port_status+83} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8028921a>{hub_thread+0} <ffffffff80289d06>{hub_thread+2796}
       <ffffffff8031dafb>{thread_return+100} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff801255b3>{__wake_up_common+67} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8028921a>{hub_thread+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
pdflush       S ffff81007f773ec8     0   179     11           180   115 (L-TLB)
ffff81007f773ec8 ffff81007f773e28 ffff810037fee040 0000000000000000
       000000007f505870 0000000000000000 0000000000000292 000000010301f800
       ffff81007f505870 00000000000000d2
Call Trace: <ffffffff80125835>{set_user_nice+261} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80153f7f>{pdflush+220} <ffffffff80153ea3>{pdflush+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
pdflush       S ffff81007f76fec8     0   180     11           182   179 (L-TLB)
ffff81007f76fec8 ffffffff8039aaa0 ffffffff80133a6a 0000000000000286
       00000000000035c8 0000000000000000 ffff81007fdd3dc8 00000001fffffffc
       ffff81007f517140 0000000000000652
Call Trace: <ffffffff80133a6a>{__mod_timer+168} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80153f7f>{pdflush+220} <ffffffff80153ea3>{pdflush+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
aio/0         S ffff81007f659e98     0   182     11           183   180 (L-TLB)
ffff81007f659e98 ffff81007f51a9c0 0000000000000009 00000000000009ba
       0000000000000002 ffff81007fdd4088 0000000000000000 000000007f659e28
       ffff81007f7240c0 0000000000000239
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kswapd0       S ffff81007f4fdeb8     0   181      1           431    11 (L-TLB)
ffff81007f4fdeb8 ffff81007f4fddf8 0000000000000001 0000000000000001
       0000000000000000 0000000000000000 0000000000000296 000000010301f800
       ffff81007f540080 00000000000002fa
Call Trace: <ffffffff801363fb>{do_notify_parent+404}
       <ffffffff8015769b>{kswapd+247} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8012e510>{do_exit+2116} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8010b846>{child_rip+8} <ffffffff801575a4>{kswapd+0}
       <ffffffff8010b83e>{child_rip+0}
aio/1         S ffff81007f6f9e98     0   183     11           258   182 (L-TLB)
ffff81007f6f9e98 ffff81007f6cd660 0000000000000009 00000000000009ba
       0000000000000002 ffff81007fdd4088 0000000000000001 000000017f6f9e28
       ffff81007f687100 000000000000015c
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kseriod       S ffff81007f6dbe88     0   258     11           316   183 (L-TLB)
ffff81007f6dbe88 ffff81007f78cda0 ffffffff8031ed71 0000000000000000
       0000000000000000 0000000000008080 0000000000000000 00000000803ce7f0
       ffff81007f682770 000000000000062d
Call Trace: <ffffffff8031ed71>{__mutex_unlock_slowpath+476}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8025133b>{serio_thread+664}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff802510a3>{serio_thread+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
kpsmoused     S ffff81007f7e3e98     0   316     11           337   258 (L-TLB)
ffff81007f7e3e98 ffff81007ff59ba0 0000000000000009 00000000000009ba
       0000000000000002 ffff81007fdd4088 0000000000000000 000000007f7e3e28
       ffff81007f564040 000000000000024c
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
ata/0         S ffff81007ec65e98     0   337     11           338   316 (L-TLB)
ffff81007ec65e98 0000000000000000 0000000000000006 00000000000006cc
       0000000000000000 0000000000000000 0000000000000000 000000007ec65e28
       ffff81007f64e7b0 00000000000002ed
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
ata/1         S ffff81007ec67e98     0   338     11           344   337 (L-TLB)
ffff81007ec67e98 ffffffff8031f917 0000000000000010 0000000000000246
       000000007ec67df8 0000000000000000 ffff81007f6b7ae0 0000000100000000
       ffff81007f682040 00000000000051f3
Call Trace: <ffffffff8031f917>{_spin_unlock_irqrestore+11}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_eh_0     S ffff81007ec73e98     0   344     11           345   338 (L-TLB)
ffff81007ec73e98 0000000000000000 0000000000000006 00000000000006cc
       0000000000000000 0000000000000000 0000000000000001 000000017ec73e28
       ffff81007f64e080 00000000000000c6
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031dafb>{thread_return+100}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0}
       <ffffffff88005e10>{:scsi_mod:scsi_error_handler+100}
       <ffffffff801255b3>{__wake_up_common+67} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_eh_1     S ffff81007ec13e98     0   345     11           382   344 (L-TLB)
ffff81007ec13e98 0000000100000001 0000000000000006 00000000000006cc
       0000000000000001 0000000000000001 0000000000000001 000000017ec13e28
       ffff81007f60e7b0 00000000000000ba
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff88005e10>{:scsi_mod:scsi_error_handler+100}
       <ffffffff801255b3>{__wake_up_common+67} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_eh_3     S ffff81007ec1fe98     0   382     11           385   345 (L-TLB)
ffff81007ec1fe98 0000000000000000 0000000000000009 00000000000009ba
       00000000732f0000 0000000000000000 0000000000000001 000000017ec1fe28
       ffff81007f7247f0 00000000000000a3
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031dafb>{thread_return+100}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0}
       <ffffffff88005e10>{:scsi_mod:scsi_error_handler+100}
       <ffffffff801255b3>{__wake_up_common+67} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_wq_3     S ffff81007f58fe98     0   385     11           417   382 (L-TLB)
ffff81007f58fe98 2222222222222222 2222222222222222 2222222222222222
       2222222222222222 0000000300000000 0000000000000001 0000000100000000
       ffff81007f7dc870 0000000000000843
Call Trace: <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kmirrord      S ffff81007ef15e98     0   417     11          1935   385 (L-TLB)
ffff81007ef15e98 0000000000000011 ffff81007ef15eb8 ffff81007ef0e320
       ffff81007edc7870 0000000000000282 0000000000000001 0000000100000100
       ffff81007edc7870 00000000000004d0
Call Trace: <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kjournald     S ffff81007ede1e88     0   431      1          1358   181 (L-TLB)
ffff81007ede1e88 ffff81007edc95f8 ffffffff88087585 00000fdc00000000
       000000006ebc6024 0000000000000000 ffff81007f62a000 000000017278ddf0
       ffff81007ed8e7f0 00000000000001b6
Call Trace: <ffffffff88087585>{:jbd:journal_commit_transaction+4225}
       <ffffffff88089467>{:jbd:kjournald+466} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff8012e510>{do_exit+2116}
       <ffffffff88089290>{:jbd:commit_timeout+0} <ffffffff8010b846>{child_rip+8}
       <ffffffff88089295>{:jbd:kjournald+0} <ffffffff8010b83e>{child_rip+0}
udevd         S ffff81007da91d78     0  1358      1          2104   431 (NOTLB)
ffff81007da91d78 0000000000000206 0000000000000206 0000000000000001
       0000004400000001 ffff81000000cc00 0000000000000001 0000000100000286
       ffff81007edc8100 0000000000001f20
Call Trace: <ffffffff80151fd1>{__alloc_pages+102} <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff802bc43c>{datagram_poll+42}
       <ffffffff802bc412>{datagram_poll+0} <ffffffff8018189c>{do_select+944}
       <ffffffff801a5313>{proc_delete_inode+0} <ffffffff80181431>{__pollwait+0}
       <ffffffff8031f883>{_spin_lock_irqsave+11} <ffffffff80181b7a>{sys_select+651}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8016fddd>{sys_read+69}
       <ffffffff8010a70a>{system_call+126}
kauditd       S ffff81007d5f7ea8     0  1935     11          2043   417 (L-TLB)
ffff81007d5f7ea8 00000000000009ba 0000000000000002 ffff81007f70e908
       0000000000000001 ffff81007d5f7e28 0000000000000000 0000000000000000
       ffff81007e562140 0000000000000140
Call Trace: <ffffffff80149c98>{kauditd_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80149c98>{kauditd_thread+0} <ffffffff80149dd7>{kauditd_thread+319}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
kmpathd/0     S ffff81007d031e98     0  2043     11          2044  1935 (L-TLB)
ffff81007d031e98 2d18130000000006 0000000000000008 00000000000008c0
       00000000000000db 0000000000000000 0000000000000001 000000007d031e28
       ffff81007e58d7b0 000000000000011c
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031dafb>{thread_return+100}
       <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013ae41>{worker_thread+211} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
kmpathd/1     S ffff81007d027e98     0  2044     11          4224  2043 (L-TLB)
ffff81007d027e98 0000000000000000 0000000000000008 00000000000008c0
       000000007da3a890 0000000000000000 0000000000000001 000000017d027e28
       ffff81007ead57f0 00000000000001b8
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031dafb>{thread_return+100}
       <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013ae41>{worker_thread+211} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013ddeb>{kthread+200}
       <ffffffff8010b846>{child_rip+8} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013dd23>{kthread+0} <ffffffff8010b83e>{child_rip+0}
kjournald     S ffff81007a0a9e88     0  2104      1          2716  1358 (L-TLB)
ffff81007a0a9e88 0000003000000010 ffff81007a0a9e98 ffff81007a0a9dd8
       0000000079fcfb28 0000000000000000 0000000000000005 0000000100000000
       ffff81007e9de080 0000000000000922
Call Trace: <ffffffff88089467>{:jbd:kjournald+466} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff88089290>{:jbd:commit_timeout+0} <ffffffff8010b846>{child_rip+8}
       <ffffffff88089295>{:jbd:kjournald+0} <ffffffff8010b83e>{child_rip+0}
dhclient      S ffff810079229d78     0  2716      1          2760  2104 (NOTLB)
ffff810079229d78 0000000000000000 0000004400000001 ffff81000000cc00
       000200d00000015c 0000000000000282 0000000000000000 00000000000000d0
       ffff81007a0d9040 0000000000001125
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff802bc412>{datagram_poll+0}
       <ffffffff8018189c>{do_select+944} <ffffffff80181431>{__pollwait+0}
       <ffffffff80181b7a>{sys_select+651} <ffffffff8010a70a>{system_call+126}
syslogd       R  running task       0  2760      1          2764  2716 (NOTLB)
klogd         S ffff81007906fbd8     0  2764      1          2775  2760 (NOTLB)
ffff81007906fbd8 ffff81007edc8830 ffff810037fdac80 ffff810037c1e690
       0000000078715000 000000005b169863 0000000000000000 000000007906fbf8
       ffff81007ebd1080 00000000000004c0
Call Trace: <ffffffff80123eda>{try_to_wake_up+1035}
       <ffffffff8031e4de>{schedule_timeout+30} <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff8013e2f6>{prepare_to_wait_exclusive+21} <ffffffff803177cf>{unix_wait_for_peer+163}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff80125609>{__wake_up+54}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff803182c8>{unix_dgram_sendmsg+950}
       <ffffffff802b5561>{do_sock_write+196} <ffffffff802b56b1>{sock_aio_write+79}
       <ffffffff8031dafb>{thread_return+100} <ffffffff8016fbe9>{do_sync_write+201}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8016fd0d>{vfs_write+231} <ffffffff8016fe4b>{sys_write+69}
       <ffffffff8010a70a>{system_call+126}
irqbalance    S ffff8100791f1e98     0  2775      1          2787  2764 (NOTLB)
ffff8100791f1e98 00000002540be400 ffffffff8013031b 00000000440caf80
       000000000005b900 0000000000019a0c 0000000000000001 00000001bbf35151
       ffff81007e484040 0000000000011881
Call Trace: <ffffffff8013031b>{getnstimeofday+16} <ffffffff8014059e>{enqueue_hrtimer+93}
       <ffffffff801406ac>{hrtimer_start+195} <ffffffff8031f1fd>{schedule_hrtimer+36}
       <ffffffff80140933>{hrtimer_nanosleep+91} <ffffffff80137877>{do_sigaction+568}
       <ffffffff80136d74>{sigprocmask+216} <ffffffff80140a2d>{sys_nanosleep+85}
       <ffffffff8010a70a>{system_call+126}
portmap       S ffff810079063e78     0  2787      1          2807  2775 (NOTLB)
ffff810079063e78 0000000000000001 0000004400000001 ffff81000000cc00
       00000000802b4b33 0000000000000000 ffff81000000cc00 00000001000000d0
       ffff81007eece0c0 000000000000098b
Call Trace: <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff80181f2c>{do_sys_poll+610}
       <ffffffff80181431>{__pollwait+0} <ffffffff80182049>{sys_poll+74}
       <ffffffff8010a70a>{system_call+126}
rpc.statd     S ffff81007958bd78     0  2807      1          2846  2787 (NOTLB)
ffff81007958bd78 0000000000000001 0000004400000001 ffff81000000cc00
       000200d0802b516e 0000000000000286 0000000000000001 00000001000000d0
       ffff81007e4f0100 00000000000019d8
Call Trace: <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff802e4789>{tcp_poll+42}
       <ffffffff8018189c>{do_select+944} <ffffffff80181431>{__pollwait+0}
       <ffffffff8013e451>{bit_waitqueue+53} <ffffffff80181b7a>{sys_select+651}
       <ffffffff8010a70a>{system_call+126}
rpc.idmapd    S ffff810078d43e88     0  2846      1          2937  2807 (NOTLB)
ffff810078d43e88 ffff81007e5300c0 ffffffff80160d90 ffff810002e3bf90
       000000008015b7f2 0000000000000000 0000000000000001 0000000100000246
       ffff81007ec83870 0000000000000816
Call Trace: <ffffffff80160d90>{page_add_file_rmap+36}
       <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff801953e9>{sys_epoll_wait+400}
       <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8010a70a>{system_call+126}
acpid         S ffff810078849d78     0  2937      1          2968  2846 (NOTLB)
ffff810078849d78 ffff810078849e88 ffffffff8014fef4 0000000000000000
       0000000000000286 0000000000000001 0000000000000001 0000000100000001
       ffff81007ec71100 0000000000005e59
Call Trace: <ffffffff8014fef4>{__generic_file_aio_write_nolock+873}
       <ffffffff80151fd1>{__alloc_pages+102} <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8031f883>{_spin_lock_irqsave+11} <ffffffff8013e1d6>{add_wait_queue+18}
       <ffffffff8018189c>{do_select+944} <ffffffff80181431>{__pollwait+0}
       <ffffffff80181b7a>{sys_select+651} <ffffffff8016fe4b>{sys_write+69}
       <ffffffff8010a70a>{system_call+126}
cupsd         S ffff810078a9bd78     0  2968      1          3073  2937 (NOTLB)
ffff810078a9bd78 0000000000000000 0000000000000282 0000000000000014
       ffffffff802b4ff1 0000000000000000 0000000000000000 000000000000cc00
       ffff81007e58d080 0000000000000865
Call Trace: <ffffffff802b4ff1>{sock_sendmsg+265} <ffffffff801339a1>{lock_timer_base+27}
       <ffffffff8031e54a>{schedule_timeout+138} <ffffffff8013435d>{process_timeout+0}
       <ffffffff8018189c>{do_select+944} <ffffffff80181431>{__pollwait+0}
       <ffffffff80181b7a>{sys_select+651} <ffffffff8010a70a>{system_call+126}
sshd          S ffff8100761bbd78     0  3073      1          3094  2968 (NOTLB)
ffff8100761bbd78 0000000000000000 0000000000000286 0000000000000005
       ffff81007da0f480 0000000000000001 0000000000000001 000000010000cc00
       ffff81007edc7140 0000000000003c68
Call Trace: <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff802e4789>{tcp_poll+42}
       <ffffffff8018189c>{do_select+944} <ffffffff80181431>{__pollwait+0}
       <ffffffff80181b7a>{sys_select+651} <ffffffff8010a70a>{system_call+126}
xinetd        S ffff81007626dd78     0  3094      1          3113  3073 (NOTLB)
ffff81007626dd78 0000000000000000 0000000000000282 0000000000000046
       00000000802b4ff1 0000000000000000 0000004400000001 000000000000cc00
       ffff81007ed1b7b0 00000000000007c9
Call Trace: <ffffffff80151fd1>{__alloc_pages+102} <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff802e4789>{tcp_poll+42}
       <ffffffff8018189c>{do_select+944} <ffffffff80181431>{__pollwait+0}
       <ffffffff80181b7a>{sys_select+651} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8010a70a>{system_call+126}
sendmail      S ffff8100769b5d78     0  3113      1          3121  3094 (NOTLB)
ffff8100769b5d78 0000000000000000 0000000000000286 0000000000000033
       000000006ed12014 0000000000000000 0000004400000001 000000010000cc00
       ffff81007e5300c0 0000000000002cfd
Call Trace: <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff8018189c>{do_select+944}
       <ffffffff80181431>{__pollwait+0} <ffffffff80181b7a>{sys_select+651}
       <ffffffff8010a70a>{system_call+126}
sendmail      S ffff810076c6ff68     0  3121      1          3170  3113 (NOTLB)
ffff810076c6ff68 0000000000000000 0000000000030002 0000000000000000
       00000000440caeea 0000000000000000 0000000000000001 00000001798e6698
       ffff81007e614870 000000000000fcc1
Call Trace: <ffffffff80137caa>{sys_pause+23} <ffffffff8010a70a>{system_call+126}
gpm           S ffff8100771b1d78     0  3170      1          3229  3121 (NOTLB)
ffff8100771b1d78 ffff81007232c830 ffff81007f763568 0000000000000000
       0000000000000282 0000000000000000 0000000000000000 0000000000000000
       ffff81007f53c040 0000000000000149
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff8018189c>{do_select+944}
       <ffffffff80181431>{__pollwait+0} <ffffffff80181b7a>{sys_select+651}
       <ffffffff802b4d59>{sock_map_fd+305} <ffffffff8010a70a>{system_call+126}
htt           S ffff810076f63ea8     0  3229      1  3230    3242  3170 (NOTLB)
ffff810076f63ea8 0000000000000000 ffff81000337b0c0 0000000001200011
       0000000000000000 0000000000000000 ffffffff8013b8c1 0000000076aae010
       ffff81000337b0c0 00000000000024e9
Call Trace: <ffffffff8013b8c1>{attach_pid+28} <ffffffff801ed478>{__up_read+16}
       <ffffffff8032163c>{do_page_fault+1000} <ffffffff8012f1ef>{do_wait+2585}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8010a70a>{system_call+126}
htt_server    S ffff810077227e78     0  3230   3229                     (NOTLB)
ffff810077227e78 0000000000000000 0000000000000000 0000000000000000
       0000000000000282 0000000000000000 ffffffff8015b7f2 0000000000000000
       ffff81007e310140 0000000000000f32
Call Trace: <ffffffff8015b7f2>{__handle_mm_fault+1372}
       <ffffffff8031f883>{_spin_lock_irqsave+11} <ffffffff801339a1>{lock_timer_base+27}
       <ffffffff8031e54a>{schedule_timeout+138} <ffffffff8013435d>{process_timeout+0}
       <ffffffff80181f2c>{do_sys_poll+610} <ffffffff80181431>{__pollwait+0}
       <ffffffff80182049>{sys_poll+74} <ffffffff8010a70a>{system_call+126}
cannaserver   S ffff810076733d78     0  3242      1          3254  3229 (NOTLB)
ffff810076733d78 ffffffff8038f000 0000000000000000 0000000000000000
       0000000000000282 ffff810076733d88 0000000000000000 0000000000000000
       ffff81007e739100 0000000000000633
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff8018189c>{do_select+944}
       <ffffffff80181431>{__pollwait+0} <ffffffff80181b7a>{sys_select+651}
       <ffffffff8010a70a>{system_call+126}
crond         S ffff810073dbfe98     0  3254      1          3295  3242 (NOTLB)
ffff810073dbfe98 0000000df8475800 ffffffff8013031b 00000000440caf79
       00000000000d47d4 0000000000018d58 0000000000000000 00000000bbf35151
       ffff81007e6997f0 000000000000130f
Call Trace: <ffffffff8013031b>{getnstimeofday+16} <ffffffff8014059e>{enqueue_hrtimer+93}
       <ffffffff801406ac>{hrtimer_start+195} <ffffffff8031f1fd>{schedule_hrtimer+36}
       <ffffffff80140933>{hrtimer_nanosleep+91} <ffffffff80137877>{do_sigaction+568}
       <ffffffff80136d74>{sigprocmask+216} <ffffffff80140a2d>{sys_nanosleep+85}
       <ffffffff8010a70a>{system_call+126}
xfs           S ffff810073641d78     0  3295      1          3314  3254 (NOTLB)
ffff810073641d78 ffff810003364100 0000000000000001 0000000000000000
       0000000000000282 0000000000000000 0000000000000001 0000000100000000
       ffff81007e5247f0 00000000000003d1
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff8018189c>{do_select+944}
       <ffffffff80181431>{__pollwait+0} <ffffffff80181b7a>{sys_select+651}
       <ffffffff8010a70a>{system_call+126}
atd           S ffff8100737f1e98     0  3314      1          3333  3295 (NOTLB)
ffff8100737f1e98 00000045d964b800 ffffffff8013031b 00000000440caeed
       0000000000077199 0000000000007b32 0000000000000000 00000000bbf35151
       ffff81007f60e080 00000000000036c3
Call Trace: <ffffffff8013031b>{getnstimeofday+16} <ffffffff8014059e>{enqueue_hrtimer+93}
       <ffffffff801406ac>{hrtimer_start+195} <ffffffff8031f1fd>{schedule_hrtimer+36}
       <ffffffff80140933>{hrtimer_nanosleep+91} <ffffffff80137877>{do_sigaction+568}
       <ffffffff80136d74>{sigprocmask+216} <ffffffff80140a2d>{sys_nanosleep+85}
       <ffffffff8010a70a>{system_call+126}
dbus-daemon-1 S ffff810072a7de78     0  3333      1          3343  3314 (NOTLB)
ffff810072a7de78 ffff810000000000 ffff810072a7ddd8 0000000000000000
       0000000000000286 0000000000000000 0000000000000001 0000000100000001
       ffff810079e48040 00000000000013d2
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff80181f2c>{do_sys_poll+610}
       <ffffffff80181431>{__pollwait+0} <ffffffff80182049>{sys_poll+74}
       <ffffffff8010a70a>{system_call+126}
cups-config-d S ffff810072b33e78     0  3343      1          3363  3333 (NOTLB)
ffff810072b33e78 0000000000000000 0000000000000000 0000000000000000
       0000000000000282 ffff810002cbf750 0000000000000000 0000000000000000
       ffff81007e549140 00000000000045c9
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff80181f2c>{do_sys_poll+610}
       <ffffffff80181431>{__pollwait+0} <ffffffff80182049>{sys_poll+74}
       <ffffffff8010a70a>{system_call+126}
login         S ffff810072bc5ea8     0  3363      1  3967    3364  3343 (NOTLB)
ffff810072bc5ea8 0000000000000000 ffff81007a26c7b0 0000000001200011
       0000000000000000 ffff810072bc5e18 0000000000000000 000000007ede2ff8
       ffff81007a26c7b0 0000000000000b53
Call Trace: <ffffffff801ed478>{__up_read+16} <ffffffff8032163c>{do_page_fault+1000}
       <ffffffff8012f1ef>{do_wait+2585} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80137b93>{sys_rt_sigaction+97} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8010a70a>{system_call+126}
mingetty      S ffff8100726bfda8     0  3364      1          3365  3363 (NOTLB)
ffff8100726bfda8 0000000000000020 0000000000000004 0000000000000410
       0000000000000246 0000000000000000 0000000000000001 00000000726bfd38
       ffff81007e549870 000000000001a64c
Call Trace: <ffffffff80123eda>{try_to_wake_up+1035}
       <ffffffff8031e4de>{schedule_timeout+30} <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff80236612>{read_chan+990}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8031f47f>{__down_failed+53}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80231317>{tty_read+136}
       <ffffffff8016fa82>{vfs_read+209} <ffffffff8016fddd>{sys_read+69}
       <ffffffff8010a70a>{system_call+126}
mingetty      S ffff810072babda8     0  3365      1          3368  3364 (NOTLB)
ffff810072babda8 0000000000000020 ffff81007e2e1800 0000000000000246
       0000000000000246 0000000000000000 ffffffff8012bce6 0000000100000246
       ffff81007e0eb770 00000000000054ec
Call Trace: <ffffffff8012bce6>{release_console_sem+377}
       <ffffffff8031e4de>{schedule_timeout+30} <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff80236612>{read_chan+990}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80230fce>{tty_ldisc_deref+101} <ffffffff80231317>{tty_read+136}
       <ffffffff8016fa82>{vfs_read+209} <ffffffff8016fddd>{sys_read+69}
       <ffffffff8010a70a>{system_call+126}
mingetty      S ffff810072badda8     0  3368      1          3369  3365 (NOTLB)
ffff810072badda8 0000000000000020 ffff81007e2e1000 0000000000000246
       0000000000000246 0000000000000000 ffffffff8012bce6 0000000100000246
       ffff810079df1770 0000000000008895
Call Trace: <ffffffff8012bce6>{release_console_sem+377}
       <ffffffff8031e4de>{schedule_timeout+30} <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff80236612>{read_chan+990}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80230fce>{tty_ldisc_deref+101} <ffffffff80231317>{tty_read+136}
       <ffffffff8016fa82>{vfs_read+209} <ffffffff8016fddd>{sys_read+69}
       <ffffffff8010a70a>{system_call+126}
mingetty      S ffff810072bbbda8     0  3369      1          3371  3368 (NOTLB)
ffff810072bbbda8 0000000000000020 ffff81007e288c00 0000000000000246
       0000000000000246 0000000000000000 ffffffff8012bce6 0000000100000246
       ffff810079c90100 0000000000008342
Call Trace: <ffffffff8012bce6>{release_console_sem+377}
       <ffffffff8031e4de>{schedule_timeout+30} <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff8013e1d6>{add_wait_queue+18} <ffffffff80236612>{read_chan+990}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80230fce>{tty_ldisc_deref+101} <ffffffff80231317>{tty_read+136}
       <ffffffff8016fa82>{vfs_read+209} <ffffffff8016fddd>{sys_read+69}
       <ffffffff8010a70a>{system_call+126}
mingetty      S ffff810072bc7da8     0  3371      1          3581  3369 (NOTLB)
ffff810072bc7da8 0000000000000020 ffff81007e288000 0000000000000246
       0000000000000246 0000000000004e7d 0000000000000000 0000000000000246
       ffff81007eece7f0 0000000000004fa0
Call Trace: <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8031f883>{_spin_lock_irqsave+11} <ffffffff8013e1d6>{add_wait_queue+18}
       <ffffffff80236612>{read_chan+990} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80230fce>{tty_ldisc_deref+101}
       <ffffffff80231317>{tty_read+136} <ffffffff8016fa82>{vfs_read+209}
       <ffffffff8016fddd>{sys_read+69} <ffffffff8010a70a>{system_call+126}
mingetty      S ffff8100721e7da8     0  3581      1          3584  3371 (NOTLB)
ffff8100721e7da8 0000000000000020 ffff81007e49fa00 0000000000000246
       0000000000000246 0000000000004e7d 0000000000000000 0000000000000246
       ffff81007ead3100 000000000000be06
Call Trace: <ffffffff8031e4de>{schedule_timeout+30}
       <ffffffff8031f883>{_spin_lock_irqsave+11} <ffffffff8013e1d6>{add_wait_queue+18}
       <ffffffff80236612>{read_chan+990} <ffffffff80125562>{default_wake_function+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80230fce>{tty_ldisc_deref+101}
       <ffffffff80231317>{tty_read+136} <ffffffff8016fa82>{vfs_read+209}
       <ffffffff8016fddd>{sys_read+69} <ffffffff8010a70a>{system_call+126}
gdm-binary    S ffff810072337e78     0  3584      1  3845    4253  3581 (NOTLB)
ffff810072337e78 0000000000000000 0000000000000282 0000000000000001
       0000000003017800 0000000000000000 0000004400000001 000000000000cc00
       ffff8100721bb830 0000000000000518
Call Trace: <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff80181f2c>{do_sys_poll+610}
       <ffffffff80181431>{__pollwait+0} <ffffffff80182049>{sys_poll+74}
       <ffffffff8010a70a>{system_call+126}
gdm-binary    S ffff81007215dde8     0  3845   3584  3846               (NOTLB)
ffff81007215dde8 000000003ad45f4e 0000000000000020 0000000000000000
       0000000000000009 000000000000099c 0000000000000001 0000000171bcc0f8
       ffff81007ede3040 0000000000000519
Call Trace: <ffffffff8017ad22>{pipe_wait+118} <ffffffff8013e39e>{autoremove_wake_function+0}
       <ffffffff8013e39e>{autoremove_wake_function+0} <ffffffff8017b086>{pipe_readv+658}
       <ffffffff8017b130>{pipe_read+26} <ffffffff8016fa82>{vfs_read+209}
       <ffffffff8016fddd>{sys_read+69} <ffffffff8010a70a>{system_call+126}
X             S ffff81007206bd78     0  3846   3845          3966       (NOTLB)
ffff81007206bd78 ffffffff8038f000 ffff81007206bdc8 0000000000000000
       0000000000000282 0000000000000000 0000000000000000 000000007ea25800
       ffff81007232c830 000000000000a27d
Call Trace: <ffffffff8031f883>{_spin_lock_irqsave+11}
       <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff8018189c>{do_select+944}
       <ffffffff80181431>{__pollwait+0} <ffffffff80181b7a>{sys_select+651}
       <ffffffff8010a70a>{system_call+126}
gdmgreeter    S ffff810070303e78     0  3966   3845                3846 (NOTLB)
ffff810070303e78 ffff81007232c830 ffff810070303dd8 0000000000000000
       0000000000000282 0000000000000000 0000000000000000 0000000000000000
       ffff810079894870 000000000000015a
Call Trace: <ffffffff801339a1>{lock_timer_base+27} <ffffffff8031e54a>{schedule_timeout+138}
       <ffffffff8013435d>{process_timeout+0} <ffffffff80181f2c>{do_sys_poll+610}
       <ffffffff80181431>{__pollwait+0} <ffffffff80182049>{sys_poll+74}
       <ffffffff8010a70a>{system_call+126}
bash          R  running task       0  3967   3363                     (NOTLB)
scsi_eh_4     S ffff81006ee2be98     0  4224     11          4255  2044 (L-TLB)
ffff81006ee2be98 0000000000000000 0000000000000008 00000000000008c0
       0000000000000000 0000000000000000 0000000000000000 000000006ee2be28
       ffff81007f5407b0 0000000000000121
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff88005e10>{:scsi_mod:scsi_error_handler+100}
       <ffffffff801255b3>{__wake_up_common+67} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
qla2xxx_4_dpc S ffff81007f4ede18     0  4253      1          4408  3584 (L-TLB)
ffff81007f4ede18 0000000000004000 ffff81007f985a00 0000000000000000
       0000000000200200 00000000ffffdb57 0000000000000000 000000006ee234e8
       ffff81006fbf9830 00000000000a624e
Call Trace: <ffffffff8031f7b7>{__down_interruptible+203}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013ab66>{__queue_work+78}
       <ffffffff8031f4b9>{__down_failed_interruptible+53} <ffffffff881b952c>{:qla2xxx:.text.lock.qla_os+15}
       <ffffffff8010b846>{child_rip+8} <ffffffff881b8cb8>{:qla2xxx:qla2x00_do_dpc+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_wq_4     D ffff81006ee95a88     0  4255     11          4264  4224 (L-TLB)
ffff81006ee95a88 0001122000011220 0000000000011220 ffff81007628a780
       ffff81007f985a00 ffff81007f9d8580 0000000000000000 0000000000000246
       ffff81006f9147b0 0000000000001045
Call Trace: <ffffffff8031dc2a>{wait_for_completion+165}
       <ffffffff80125562>{default_wake_function+0} <ffffffff80125562>{default_wake_function+0}
       <ffffffff801e025d>{blk_execute_rq+242} <ffffffff80174151>{bio_alloc_bioset+198}
       <ffffffff801deb05>{blk_recount_segments+124} <ffffffff88006cf9>{:scsi_mod:scsi_execute+216}
       <ffffffff88006d96>{:scsi_mod:scsi_execute_req+120} <ffffffff88009411>{:scsi_mod:scsi_probe_and_add_lun+523}
       <ffffffff88009160>{:scsi_mod:scsi_alloc_target+512}
       <ffffffff88009d1d>{:scsi_mod:__scsi_scan_target+196}
       <ffffffff8031ea88>{__mutex_lock_slowpath+768} <ffffffff8800a2ea>{:scsi_mod:scsi_scan_target+111}
       <ffffffff88046aac>{:scsi_transport_fc:fc_scsi_scan_rport+0}
       <ffffffff8013ad24>{run_workqueue+161} <ffffffff8013ad6e>{worker_thread+0}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013ae73>{worker_thread+261}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_eh_5     S ffff81006eef5e98     0  4264     11          4410  4255 (L-TLB)
ffff81006eef5e98 0000000000000005 0000000000000009 00000000000009ba
       0000000000000002 0000000000000000 0000000000000001 000000016eef5e28
       ffff81007e739830 000000000000012c
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031dafb>{thread_return+100}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0}
       <ffffffff88005e10>{:scsi_mod:scsi_error_handler+100}
       <ffffffff801255b3>{__wake_up_common+67} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff88005dac>{:scsi_mod:scsi_error_handler+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}
qla2xxx_5_dpc S ffff81006fa1fe18     0  4408      1                4253 (L-TLB)
ffff81006fa1fe18 ffffffff8038f000 0000000000000000 000000000000004c
       00000000722a8980 0000000000000000 ffff8100722a8980 0000000000000000
       ffff81007e5240c0 0000000000000182
Call Trace: <ffffffff80123600>{activate_task+140} <ffffffff80123eda>{try_to_wake_up+1035}
       <ffffffff8031f7b7>{__down_interruptible+203} <ffffffff80125562>{default_wake_function+0}
       <ffffffff8031f4b9>{__down_failed_interruptible+53} <ffffffff881b952c>{:qla2xxx:.text.lock.qla_os+15}
       <ffffffff8010b846>{child_rip+8} <ffffffff881b8cb8>{:qla2xxx:qla2x00_do_dpc+0}
       <ffffffff8010b83e>{child_rip+0}
scsi_wq_5     S ffff81006fa99e98     0  4410     11                4264 (L-TLB)
ffff81006fa99e98 00005555556a41d8 0000000000000009 00000000000009ba
       00002b693ef0b820 00002b693ef0b810 0000000000000001 000000016fa99e28
       ffff81007e6990c0 0000000000000174
Call Trace: <ffffffff8031f92f>{_spin_unlock_irq+7} <ffffffff8031f92f>{_spin_unlock_irq+7}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ad6e>{worker_thread+0} <ffffffff8013ae41>{worker_thread+211}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff80125562>{default_wake_function+0} <ffffffff8013de14>{keventd_create_kthread+0}
       <ffffffff8013ddeb>{kthread+200} <ffffffff8010b846>{child_rip+8}
       <ffffffff8013de14>{keventd_create_kthread+0} <ffffffff8013dd23>{kthread+0}
       <ffffffff8010b83e>{child_rip+0}

Showing all blocking locks in the system:
S            init:    1 [ffff81007fdc4770, 116] (not blocked on mutex)
S     migration/0:    2 [ffff81007fdc4040,   0] (not blocked on mutex)
S     ksoftirqd/0:    3 [ffff81007fdc67b0, 135] (not blocked on mutex)
S      watchdog/0:    4 [ffff81007fdc6080,   0] (not blocked on mutex)
S     migration/1:    5 [ffff81007fdc77f0,   0] (not blocked on mutex)
S     ksoftirqd/1:    6 [ffff81007fdc70c0, 134] (not blocked on mutex)
S      watchdog/1:    7 [ffff810003364830,   0] (not blocked on mutex)
S        events/0:    8 [ffff810037fef870, 110] (not blocked on mutex)
S        events/1:    9 [ffff810037fef140, 110] (not blocked on mutex)
S         khelper:   10 [ffff810037fee770, 110] (not blocked on mutex)
S         kthread:   11 [ffff810037fee040, 110] (not blocked on mutex)
S       kblockd/0:   15 [ffff81007f580830, 110] (not blocked on mutex)
S       kblockd/1:   16 [ffff81007f580100, 110] (not blocked on mutex)
S          kacpid:   17 [ffff810003369870, 114] (not blocked on mutex)
S           khubd:  115 [ffff81007f5107b0, 110] (not blocked on mutex)
S         pdflush:  179 [ffff81007f505870, 120] (not blocked on mutex)
S         pdflush:  180 [ffff81007f517140, 115] (not blocked on mutex)
S           aio/0:  182 [ffff81007f7240c0, 110] (not blocked on mutex)
S         kswapd0:  181 [ffff81007f540080, 117] (not blocked on mutex)
S           aio/1:  183 [ffff81007f687100, 111] (not blocked on mutex)
S         kseriod:  258 [ffff81007f682770, 110] (not blocked on mutex)
S       kpsmoused:  316 [ffff81007f564040, 111] (not blocked on mutex)
S           ata/0:  337 [ffff81007f64e7b0, 114] (not blocked on mutex)
S           ata/1:  338 [ffff81007f682040, 110] (not blocked on mutex)
S       scsi_eh_0:  344 [ffff81007f64e080, 114] (not blocked on mutex)
S       scsi_eh_1:  345 [ffff81007f60e7b0, 114] (not blocked on mutex)
S       scsi_eh_3:  382 [ffff81007f7247f0, 111] (not blocked on mutex)
S       scsi_wq_3:  385 [ffff81007f7dc870, 111] (not blocked on mutex)
S        kmirrord:  417 [ffff81007edc7870, 115] (not blocked on mutex)
S       kjournald:  431 [ffff81007ed8e7f0, 115] (not blocked on mutex)
S           udevd: 1358 [ffff81007edc8100, 111] (not blocked on mutex)
S         kauditd: 1935 [ffff81007e562140, 110] (not blocked on mutex)
S       kmpathd/0: 2043 [ffff81007e58d7b0, 112] (not blocked on mutex)
S       kmpathd/1: 2044 [ffff81007ead57f0, 112] (not blocked on mutex)
S       kjournald: 2104 [ffff81007e9de080, 119] (not blocked on mutex)
S        dhclient: 2716 [ffff81007a0d9040, 116] (not blocked on mutex)
R         syslogd: 2760 [ffff81007edc8830, 116] (not blocked on mutex)
S           klogd: 2764 [ffff81007ebd1080, 115] (not blocked on mutex)
S      irqbalance: 2775 [ffff81007e484040, 116] (not blocked on mutex)
S         portmap: 2787 [ffff81007eece0c0, 116] (not blocked on mutex)
S       rpc.statd: 2807 [ffff81007e4f0100, 122] (not blocked on mutex)
S      rpc.idmapd: 2846 [ffff81007ec83870, 116] (not blocked on mutex)
S           acpid: 2937 [ffff81007ec71100, 119] (not blocked on mutex)
S           cupsd: 2968 [ffff81007e58d080, 116] (not blocked on mutex)
S            sshd: 3073 [ffff81007edc7140, 117] (not blocked on mutex)
S          xinetd: 3094 [ffff81007ed1b7b0, 115] (not blocked on mutex)
S        sendmail: 3113 [ffff81007e5300c0, 116] (not blocked on mutex)
S        sendmail: 3121 [ffff81007e614870, 119] (not blocked on mutex)
S             gpm: 3170 [ffff81007f53c040, 115] (not blocked on mutex)
S             htt: 3229 [ffff81000337b0c0, 121] (not blocked on mutex)
S      htt_server: 3230 [ffff81007e310140, 117] (not blocked on mutex)
S     cannaserver: 3242 [ffff81007e739100, 116] (not blocked on mutex)
S           crond: 3254 [ffff81007e6997f0, 116] (not blocked on mutex)
S             xfs: 3295 [ffff81007e5247f0, 116] (not blocked on mutex)
S             atd: 3314 [ffff81007f60e080, 118] (not blocked on mutex)
S   dbus-daemon-1: 3333 [ffff810079e48040, 116] (not blocked on mutex)
S cups-config-dae: 3343 [ffff81007e549140, 119] (not blocked on mutex)
S           login: 3363 [ffff81007a26c7b0, 116] (not blocked on mutex)
S        mingetty: 3364 [ffff81007e549870, 118] (not blocked on mutex)
S        mingetty: 3365 [ffff81007e0eb770, 120] (not blocked on mutex)
S        mingetty: 3368 [ffff810079df1770, 121] (not blocked on mutex)
S        mingetty: 3369 [ffff810079c90100, 121] (not blocked on mutex)
S        mingetty: 3371 [ffff81007eece7f0, 118] (not blocked on mutex)
S        mingetty: 3581 [ffff81007ead3100, 122] (not blocked on mutex)
S      gdm-binary: 3584 [ffff8100721bb830, 115] (not blocked on mutex)
?      gdm-binary: 3845 [ffff81007ede3040, 117] (not blocked on mutex)
S               X: 3846 [ffff81007232c830, 115] (not blocked on mutex)
S      gdmgreeter: 3966 [ffff810079894870, 115] (not blocked on mutex)
R            bash: 3967 [ffff81000337b7f0, 116] (not blocked on mutex)
S       scsi_eh_4: 4224 [ffff81007f5407b0, 112] (not blocked on mutex)
S   qla2xxx_4_dpc: 4253 [ffff81006fbf9830, 100] (not blocked on mutex)
D       scsi_wq_4: 4255 [ffff81006f9147b0, 110] (not blocked on mutex)
S       scsi_eh_5: 4264 [ffff81007e739830, 111] (not blocked on mutex)
S   qla2xxx_5_dpc: 4408 [ffff81007e5240c0, 100] (not blocked on mutex)
S       scsi_wq_5: 4410 [ffff81007e6990c0, 111] (not blocked on mutex)

---------------------------
| showing all locks held: |
---------------------------

#001:             [ffff81006ee20080] {scsi_host_alloc}
.. held by:         scsi_wq_4: 4255 [ffff81006f9147b0, 110]
... acquired at:               scsi_scan_target+0x51/0x87 [scsi_mod]

=============================================



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re: problems with scsi_transport_fc and qla2xxx
  2006-03-06 21:28               ` Andrew Vasquez
  2006-03-06 20:57                 ` Maxim Kozover
@ 2006-03-06 21:09                 ` Maxim Kozover
       [not found]                   ` <20060306232831.GS6278@andrew-vasquezs-powerbook-g4-15.local>
  1 sibling, 1 reply; 17+ messages in thread
From: Maxim Kozover @ 2006-03-06 21:09 UTC (permalink / raw)
  To: Andrew Vasquez; +Cc: Mike Snitzer, linux-kernel, linux-scsi

Hi Andrew!
After applying the patch the same lock exists:
#001:             [ffff81006edc4080] {scsi_host_alloc}
.. held by:         scsi_wq_4: 4255 [ffff81007edaf770, 110]
... acquired at:               scsi_scan_target+0x51/0x87 [scsi_mod]

Thanks,

Maxim.

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: problems with scsi_transport_fc and qla2xxx
  2006-03-06 20:09             ` Maxim Kozover
@ 2006-03-06 21:28               ` Andrew Vasquez
  2006-03-06 20:57                 ` Maxim Kozover
  2006-03-06 21:09                 ` Maxim Kozover
  0 siblings, 2 replies; 17+ messages in thread
From: Andrew Vasquez @ 2006-03-06 21:28 UTC (permalink / raw)
  To: Maxim Kozover; +Cc: Mike Snitzer, linux-kernel, linux-scsi

On Mon, 06 Mar 2006, Maxim Kozover wrote:

> Unfortunately I don't have the directory /proc/scsi/qla2xxx.

The two stage discovery process has not been needed since FC transport
integration.  Instead, the driver simply makes up-calls to signal
rport visiblity (add on PLOGI/PRLI; delete on LOGO/cable-pull/etc).

> However the target sees PRLI from the host again after reconnecting
> the cable between the initiator and the switch.
> Does it mean the rediscovering new devices on initiator side is
> already done?
> 

Yes, after plugging the cable back in, the driver rediscovers ports:

	Mar  3 01:07:22 multipath kernel: scsi(4): RSNN_NN exiting normally.
	Mar  3 01:07:22 multipath kernel: scsi(4): GID_PT entry - nn 200000e08b079a69 pn 210000e08b079a69 portid=010700.
	Mar  3 01:07:22 multipath kernel: scsi(4): GID_PT entry - nn 2000001738279c00 pn 1000001738279c11 portid=010200.
	Mar  3 01:07:22 multipath kernel: scsi(4): device wrap (010200)

Initiates PLOGI/PRLI: 

	Mar  3 01:07:22 multipath kernel: scsi(4): Trying Fabric Login w/loop id 0x0081 for port 010200.

And upcall via fc_remote_port_add() is done.

	Mar  3 01:07:22 multipath kernel: scsi(4): LOOP READY
	Mar  3 01:07:22 multipath kernel: scsi(4): qla2x00_loop_resync - end

Firmware then notifies software that the port has logged out:

	Mar  3 01:07:22 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0081/0007/7ee5.
	Mar  3 01:07:22 multipath kernel: scsi(4:0:0): status_entry: Port Down pid=43, compl status=0x29, port state=0x4

A CDB also returns with a completion status of PORT_LOGGED_OUT.  From
the driver's DPC routine (process-context), the upcall to
fc_remote_port_delete() is issued:

Driver attempts a relogin:

	Mar  3 01:07:22 multipath kernel: scsi(4): Port login retry: 1000001738279c11, id = 0x0081 retry cnt=8
	Mar  3 01:07:23 multipath kernel: scsi(4): fcport-0 - port retry count: 0 remaining
	Mar  3 01:07:23 multipath kernel: scsi(4): qla2x00_port_login()
	Mar  3 01:07:23 multipath kernel: scsi(4): Trying Fabric Login w/loop id 0x0081 for port 010200.

Relogin complete

	Mar  3 01:07:23 multipath kernel: scsi(4): port login OK: logged in ID 0x81

Upcall to fc_remote_port_add() done.

	Mar  3 01:07:23 multipath kernel: scsi(4): qla2x00_port_login - end
	Mar  3 01:07:23 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0006/0001.
	Mar  3 01:07:23 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0007/0001.
	Mar  3 01:07:23 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0000/0004/0001.
	Mar  3 01:07:24 multipath kernel: scsi(4): Asynchronous PORT UPDATE ignored 0081/0006/0001.

I also noticed that scsi_transport_fc.c::fc_user_scan() is not called
with the host_lock held... hmm..  could you try out the patch I sent
earlier and provide the results.

Also, could you send the "echo t > /proc/..." output after the cable
has been reinserted, but, before the 'echo "- - -" > /sys/class' scan
is initiated.

thanks,
av

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re[8]: problems with scsi_transport_fc and qla2xxx
       [not found]                                 ` <20060310231344.GB641@andrew-vasquezs-powerbook-g4-15.local>
@ 2006-03-11 21:10                                   ` Maxim Kozover
  2006-03-12  9:28                                     ` Arjan van de Ven
  2006-03-13 23:19                                     ` Andrew Vasquez
  0 siblings, 2 replies; 17+ messages in thread
From: Maxim Kozover @ 2006-03-11 21:10 UTC (permalink / raw)
  To: Andrew Vasquez; +Cc: linux-kernel, linux-scsi

Hi Andrew!
Congratulations! The kernel from scsi-rc-fixes git and your patch are
working.
By the way, could you, please, tell me how I get only scsi patches
from the git repository, cause I got the whole kernel by using
cg-clone http://kernel.org/pub/scm/linux/kernel/git/jejb/scsi-rc-fixes-2.6.git

Now the process looks like following:
Mar 11 23:54:22 multipath kernel: qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
Mar 11 23:54:28 multipath kernel:  rport-4:0-0: blocked FC remote port time out:
 removing target and saving binding
Mar 11 23:54:37 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
Mar 11 23:54:37 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
Mar 11 23:54:59 multipath kernel:  4:0:0:0: timing out command, waited 22s

And the disks appear.
Could you tell me, please, where this 22sec timeout came from?

Again, congratulations for good work!

Thanks much,

Maxim.


^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re[8]: problems with scsi_transport_fc and qla2xxx
  2006-03-11 21:10                                   ` Re[8]: " Maxim Kozover
@ 2006-03-12  9:28                                     ` Arjan van de Ven
  2006-03-12 11:46                                       ` Maxim Kozover
  2006-03-13 23:19                                     ` Andrew Vasquez
  1 sibling, 1 reply; 17+ messages in thread
From: Arjan van de Ven @ 2006-03-12  9:28 UTC (permalink / raw)
  To: Maxim Kozover; +Cc: linux-scsi, linux-kernel, Andrew Vasquez

On Sun, 2006-03-12 at 00:10 +0300, Maxim Kozover wrote:
> Hi Andrew!
> Congratulations! The kernel from scsi-rc-fixes git and your patch are
> working.
> By the way, could you, please, tell me how I get only scsi patches
> from the git repository, cause I got the whole kernel by using
> cg-clone http://kernel.org/pub/scm/linux/kernel/git/jejb/scsi-rc-fixes-2.6.git
> 
> Now the process looks like following:
> Mar 11 23:54:22 multipath kernel: qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
> Mar 11 23:54:28 multipath kernel:  rport-4:0-0: blocked FC remote port time out:
>  removing target and saving binding
> Mar 11 23:54:37 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
> Mar 11 23:54:37 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
> Mar 11 23:54:59 multipath kernel:  4:0:0:0: timing out command, waited 22s
> 
> And the disks appear.
> Could you tell me, please, where this 22sec timeout came from?

looks like your fiber fabric decided to renegotiate, and halfway it went
for a coffee and donuts break to not upset the union rules :)

I've seen LOOP negotiations take 10+ seconds before, and that is on a
really simple setup.... so nothing super special 


^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re: Re[8]: problems with scsi_transport_fc and qla2xxx
  2006-03-12  9:28                                     ` Arjan van de Ven
@ 2006-03-12 11:46                                       ` Maxim Kozover
  0 siblings, 0 replies; 17+ messages in thread
From: Maxim Kozover @ 2006-03-12 11:46 UTC (permalink / raw)
  To: Arjan van de Ven; +Cc: linux-scsi, linux-kernel, Andrew Vasquez

OK, Arjan, thanks!

Maxim.

AvdV> looks like your fiber fabric decided to renegotiate, and halfway it went
AvdV> for a coffee and donuts break to not upset the union rules :)

AvdV> I've seen LOOP negotiations take 10+ seconds before, and that is on a
AvdV> really simple setup.... so nothing super special 



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: Re[8]: problems with scsi_transport_fc and qla2xxx
  2006-03-11 21:10                                   ` Re[8]: " Maxim Kozover
  2006-03-12  9:28                                     ` Arjan van de Ven
@ 2006-03-13 23:19                                     ` Andrew Vasquez
  2006-03-20 10:45                                       ` Maxim Kozover
  1 sibling, 1 reply; 17+ messages in thread
From: Andrew Vasquez @ 2006-03-13 23:19 UTC (permalink / raw)
  To: Maxim Kozover; +Cc: linux-kernel, linux-scsi, Michael Reed, James.Smart

On Sun, 12 Mar 2006, Maxim Kozover wrote:

> Congratulations! The kernel from scsi-rc-fixes git and your patch are
> working.

Actually Mike R. and James S. deserve the credit for the composite
patch which consists of:

1) [PATCH] FC transport : Avoid device offline cases by stalling aborts until device unblocked
   http://marc.theaimsgroup.com/?l=linux-scsi&m=114225658724378&w=2

2) Serialize scan work during fc_remote_port_delete() so rport removal
doesn't deadlock midlayer scans.  The problem you were seeing.  (Mike
R.)

3) rport race fixes during removal (James S.).

> By the way, could you, please, tell me how I get only scsi patches
> from the git repository, cause I got the whole kernel by using
> cg-clone http://kernel.org/pub/scm/linux/kernel/git/jejb/scsi-rc-fixes-2.6.git
> 
> Now the process looks like following:
> Mar 11 23:54:22 multipath kernel: qla2xxx 0000:03:01.0: LOOP DOWN detected (2).
> Mar 11 23:54:28 multipath kernel:  rport-4:0-0: blocked FC remote port time out:
>  removing target and saving binding
> Mar 11 23:54:37 multipath kernel: qla2xxx 0000:03:01.0: LIP reset occured (f7f7).
> Mar 11 23:54:37 multipath kernel: qla2xxx 0000:03:01.0: LOOP UP detected (2 Gbps).
> Mar 11 23:54:59 multipath kernel:  4:0:0:0: timing out command, waited 22s
> 
> And the disks appear.
> Could you tell me, please, where this 22sec timeout came from?

Essentially there's currently several issues with rport consumers
making delete() calls during mid-layer scanning.

I'm hoping at a minimum we can get Mike R's fixes into 2.6.16, and
address the additional races going forward...  James/Mike?

Here's a minimal the serialize scan-work patch, could you check to see
that this addresses your issue?  Start with any latest linux-2.6.git
tree.

Thanks,
Andrew

---

diff --git a/drivers/scsi/scsi_transport_fc.c b/drivers/scsi/scsi_transport_fc.c
index 929032e..3d09920 100644
--- a/drivers/scsi/scsi_transport_fc.c
+++ b/drivers/scsi/scsi_transport_fc.c
@@ -1649,6 +1649,8 @@ fc_remote_port_delete(struct fc_rport  *
 		return;
 	}
 
+	/* flush any scan work */ /* which can sleep */
+	scsi_flush_work(rport_to_shost(rport));
 	scsi_target_block(&rport->dev);
 
 	/* cap the length the devices can be blocked until they are deleted */

^ permalink raw reply related	[flat|nested] 17+ messages in thread

* Re: Re: Re[8]: problems with scsi_transport_fc and qla2xxx
  2006-03-13 23:19                                     ` Andrew Vasquez
@ 2006-03-20 10:45                                       ` Maxim Kozover
  0 siblings, 0 replies; 17+ messages in thread
From: Maxim Kozover @ 2006-03-20 10:45 UTC (permalink / raw)
  To: Andrew Vasquez; +Cc: linux-kernel, linux-scsi, Michael Reed, James.Smart

Hi Andrew!
Unfortunately I see that scan-work patch is not included in
2.6.16 and the usual lock appears:
#001:             [ffff8100708a8080] {scsi_host_alloc}
.. held by:         scsi_wq_4: 3912 [ffff810071edf870, 110]
... acquired at:               scsi_scan_target+0x51/0x87 [scsi_mod]

Applying the patch you sent solves the problem, i.e. disks appear again after
22 sec timeout (why?).

Thanks,

Maxim.

Tuesday, March 14, 2006, 2:19:03 AM, you wrote:

AV> diff --git a/drivers/scsi/scsi_transport_fc.c
AV> b/drivers/scsi/scsi_transport_fc.c
AV> index 929032e..3d09920 100644
AV> --- a/drivers/scsi/scsi_transport_fc.c
AV> +++ b/drivers/scsi/scsi_transport_fc.c
AV> @@ -1649,6 +1649,8 @@ fc_remote_port_delete(struct fc_rport  *
AV>                 return;
AV>         }
AV>  
AV> +       /* flush any scan work */ /* which can sleep */
AV> +       scsi_flush_work(rport_to_shost(rport));
AV>         scsi_target_block(&rport->dev);
AV>  
AV>         /* cap the length the devices can be blocked until they are deleted */



^ permalink raw reply	[flat|nested] 17+ messages in thread

end of thread, other threads:[~2006-03-20 11:46 UTC | newest]

Thread overview: 17+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <1413265398.20060227150526@netvision.net.il>
     [not found] ` <978150825.20060227210552@netvision.net.il>
2006-03-01  6:14   ` problems with scsi_transport_fc and qla2xxx Andrew Morton
2006-03-01 20:25     ` Stefan Kaltenbrunner
2006-03-01 21:08       ` Andrew Vasquez
2006-03-02 16:32         ` Maxim Kozover
2006-03-02 17:38           ` Andrew Vasquez
2006-03-02 22:15             ` Maxim Kozover
2006-03-06 19:06               ` Andrew Vasquez
2006-03-06 20:00           ` Mike Snitzer
2006-03-06 20:09             ` Maxim Kozover
2006-03-06 21:28               ` Andrew Vasquez
2006-03-06 20:57                 ` Maxim Kozover
2006-03-06 21:09                 ` Maxim Kozover
     [not found]                   ` <20060306232831.GS6278@andrew-vasquezs-powerbook-g4-15.local>
     [not found]                     ` <1219491790.20060307124035@netvision.net.il>
     [not found]                       ` <20060307172227.GE6275@andrew-vasquezs-powerbook-g4-15.local>
     [not found]                         ` <1343850424.20060307231141@netvision.net.il>
     [not found]                           ` <20060308080050.GF9956@andrew-vasquezs-powerbook-g4-15.local>
     [not found]                             ` <20060308154341.GA1779@andrew-vasquezs-powerbook-g4-15.local>
     [not found]                               ` <1502511597.20060308213247@netvision.net.il>
     [not found]                                 ` <20060310231344.GB641@andrew-vasquezs-powerbook-g4-15.local>
2006-03-11 21:10                                   ` Re[8]: " Maxim Kozover
2006-03-12  9:28                                     ` Arjan van de Ven
2006-03-12 11:46                                       ` Maxim Kozover
2006-03-13 23:19                                     ` Andrew Vasquez
2006-03-20 10:45                                       ` Maxim Kozover

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).