All of lore.kernel.org
 help / color / mirror / Atom feed
* HD somtimes hanging
@ 2003-07-24 13:12 Thomas Beutin
  2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb
  0 siblings, 1 reply; 9+ messages in thread
From: Thomas Beutin @ 2003-07-24 13:12 UTC (permalink / raw)
  To: linux-scsi

Hi,

sorry, if this question is already answered, but i cannot find
a searchable mailing list archiv. In this case it would be nice
if You can give me a pointer to the answers.

I have big problems with my secend an third hard disk on a
Adaptec AIC-7902 U320 sitting on an intel SE7501HG2 board.
The disks are SCA disks FUJITSU  Model: MAP3367NC Rev: 0105
plugged into a hot swap back plane SCA HSBP M15 (Vendor
is ESG-SHV).
The first disk is an IBM IC35L036UCPR15-0 (Rev: S70H) and
runs fine.
When i try to access one of this FUJITSU disks, the system
hangs a couple of minutes and after that my dmesg command
ouput is full of "Dump Card State" (see below).
After a while the system (vanilla linux SMP kernel 2.4.21
running on a slackware 9.0) comes back to normal operation.
The system is'nt comletly hanging, it seems a file system
access in an already open shell freezes the shell, while
doing other stuff is ok.
Have You any idea what's going on? Is the backplane kidding
me (i've never bevor worked with an intelligent
SCSI-ID-consuming backplane)? I've found the fact, that the
disks are a short time available after a cold reboot (no
power in the machine at all), but later (2..3 hours) the
problems appeares again.

I've put some output and messages below. If You need more,
please let me know.

Any help or links to explanations welcome!

Thanks,
-tb

# cat /proc/scsi/scsi 
Attached devices: 
Host: scsi0 Channel: 00 Id: 00 Lun: 00
  Vendor: IBM      Model: IC35L036UCPR15-0 Rev: S70H
  Type:   Direct-Access                    ANSI SCSI revision: 03
Host: scsi0 Channel: 00 Id: 01 Lun: 00
  Vendor: FUJITSU  Model: MAP3367NC        Rev: 0105
  Type:   Direct-Access                    ANSI SCSI revision: 03
Host: scsi0 Channel: 00 Id: 06 Lun: 00
  Vendor: ESG-SHV  Model: SCA HSBP M15     Rev: 0.10
  Type:   Processor                        ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
  Vendor: FUJITSU  Model: MAP3367NC        Rev: 0105
  Type:   Direct-Access                    ANSI SCSI revision: 03
Host: scsi2 Channel: 00 Id: 00 Lun: 00
  Vendor: TEAC     Model: CD-552E          Rev: T016
  Type:   CD-ROM                           ANSI SCSI revision: 02

# uname -a
Linux stulle 2.4.21 #2 SMP Mon Jun 30 12:46:15 CEST 2003 i686 unknown

# lsmod 
Module                  Size  Used by    Not tainted
raid1                  14412   1 
md                     60640   1  [raid1]
uhci                   27248   0  (unused)
usbcore                66240   1  [uhci]
i810_rng                2820   0  (unused)
reiserfs              185264   1 
floppy                 51548   0 
serial                 49028   0  (unused)
rtc                     7612   0  (unused)
ide-scsi               10544   0 
e1000                  54464   1 


------ BEGIN dmesg snip ------
DevQ(0:0:0): 0 waiting
DevQ(0:1:0): 0 waiting
DevQ(0:2:0): 0 waiting
DevQ(0:6:0): 0 waiting
scsi0:0:2:0: Cmd aborted from QINFIFO
scsi0:0:2:0: Attempting to abort cmd f14af000
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi0: Dumping Card State at program address 0xb Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] 
DFFSTAT[0x11] SCSISIGI[0x74] SCSIPHASE[0x0] SCSIBUS[0x0] 
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10] 
SEQINTCTL[0x0] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0] 
SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0] 
SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x80] 
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x81] 

SCB Count = 64 CMDS_PENDING = 2 LASTSCB 0x3e CURRSCB 0x3e NEXTSCB 0xff00
qinstart = 53881 qinfifonext = 53881
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
 17 SCB_CONTROL[0x60] SCB_SCSIID[0x27] SCB_TAG[0x11] 
 42 SCB_CONTROL[0x60] SCB_SCSIID[0x27] SCB_TAG[0x2a] 
Total 2
Kernel Free SCB list: 62 48 1 3 50 38 22 4 63 55 5 53 49 30 58 28 15 14 19 26 33 10 35 6 46 23 59 24 43 45 16 25 29 8 51 32 41 2 56 44 7 54 13 9 27 34 39 37 21 57 11 40 18 20 47 31 52 36 12 0 61 60 
Sequencer Complete DMA-inprog list: 
Sequencer Complete list: 
Sequencer DMA-Up and Complete list: 

scsi0: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0, LJSCB 0xff00
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] 
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] 
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x0] 
scsi0: FIFO1 Active, LONGJMP == 0x247, SCB 0x2a, LJSCB 0x2a
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x28] DFSTATUS[0x1] 
SG_CACHE_SHADOW[0x50] SG_STATE[0x3] DFFSXFRCTL[0x0] 
SOFFCNT[0x0] MDFFSTAT[0xa] SHADDR = 0x02d693c00, SHCNT = 0x400 
HADDR = 0x02d693c00, HCNT = 0x400 CCSGCTL[0x10] 
LQIN: 0x4 0x0 0x0 0x2a 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0xf4 0x0 0x0 0x0 0x2 0x0 
scsi0: LQISTATE = 0x29, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
SIMODE0[0xc] 
CCSCBCTL[0x4] 
scsi0: REG0 == 0x3e, SINDEX = 0x133, DINDEX = 0x106
scsi0: SCBPTR == 0x3e, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xffe1
CDB 0 0 0 0 0 0
STACK: 0x120 0x120 0x0 0x247 0x247 0x235 0x2e 0x1
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
------ END dmesg snip ------
-- 
Thomas Beutin                             tb@laokoon.IN-Berlin.DE
Beam me up, Scotty. There is no intelligent live down in Redmond.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 13:12 HD somtimes hanging Thomas Beutin
@ 2003-07-24 14:36 ` Duncan Gibb
  2003-07-24 15:44   ` Thomas Beutin
  2003-07-24 17:47   ` Justin T. Gibbs
  0 siblings, 2 replies; 9+ messages in thread
From: Duncan Gibb @ 2003-07-24 14:36 UTC (permalink / raw)
  To: Thomas Beutin; +Cc: linux-scsi

On Thu, 2003-07-24 at 14:12, Thomas Beutin wrote:

TB> I have big problems with my secend an third hard disk
TB> on a Adaptec AIC-7902 U320 sitting on an intel
TB> SE7501HG2 board.

I'm not sure I can help you, but I can certainly tell you you're not
alone.  I have very similar problems with the on-board AIC7902 on my
SuperMicro X5DA8 board.  This uses the 7505 chipset rather than the
7501, but AFAIK, both are i810-derivatives from Linux's point of view.

Just before I read your post, I built a 2.4.21-bk17 kernel in the hope
that this would have been fixed some time in the last couple of months. 
I'm not really in to running experimental kernels, so usually I won't
stray further than the -ac series.


TB> The first disk is an IBM IC35L036UCPR15-0 (Rev: S70H)
TB> and runs fine.

In my case /proc/scsi/scsi reads:

Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
  Vendor: COMPAQ   Model: ST32171W         Rev: 0388
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 02 Lun: 00
  Vendor: EPSON SC Model: ANNER GT-6500    Rev: 1.23
  Type:   Processor                        ANSI SCSI revision: 01
Host: scsi1 Channel: 00 Id: 03 Lun: 00
  Vendor: YAMAHA   Model: CRW2100S         Rev: 1.0H
  Type:   CD-ROM                           ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 00
  Vendor: MPL      Model: MC-DISK-D-1S[0]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 01
  Vendor: MPL      Model: MC-DISK-D-1S[0]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 02
  Vendor: MPL      Model: MC-DISK-D-1S[1]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 03
  Vendor: MPL      Model: MC-DISK-D-1S[1]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 04
  Vendor: MPL      Model: MC-DISK-D-1S[1]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 05
  Vendor: MPL      Model: MC-DISK-D-1S[1]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 06
  Vendor: MPL      Model: MC-DISK-D-1S[0]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 07
  Vendor: MPL      Model: MC-DISK-D-1S[0]  Rev: 3.3
  Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi2 Channel: 00 Id: 00 Lun: 00
  Vendor: SONY     Model: DVD RW DRU-500A  Rev: 2.0c
  Type:   CD-ROM                           ANSI SCSI revision: 02


The "Compaq" (ie Seagate) disk is the system swap disk, and it must be
working fine or the whole machine would hang (I presume).

Also I think the MPL device (which is a PC-card/CompactFlash reader)
should have only two LUNs, since that's how many slots it has... 
"scsi2" is ide-scsi.


TB> The system is'nt comletly hanging, it seems a file
TB> system access in an already open shell freezes the
TB> shell, while doing other stuff is ok.

If I try to scan something, Xsane hangs for a while, then recovers OK. 
But if I put a CD in the Yamaha drive, anything which touches the scsi
subsystem from then on will hang.

Here's what's in /var/log/messages:

-- Xsane-scan-recover------------------------------------------

scsi1:0:2:0: Attempting to abort cmd f7be0800: 0x8 0x0 0x0 0x2 0x78 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x1ae Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
SEQINTCTL[0x0] SEQ_FLAGS[0x40] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0]
SIMODE1[0xac] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0]
Jul 24 14:07:51 localhost kernel:
SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0
qinstart = 9041 qinfifonext = 9042
QINFIFO: 0x4
WAITING_TID_QUEUES:
Pending list:
Jul 24 14:07:51 localhost kernel:   4 FIFO_USE[0x0] SCB_CONTROL[0x48]
SCB_SCSIID[0x27]
Jul 24 14:07:51 localhost kernel:   3 FIFO_USE[0x0] SCB_CONTROL[0x40]
SCB_SCSIID[0x37]
Total 2
Kernel Free SCB list: 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
Jul 24 14:07:51 localhost kernel:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x4
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]
CCSCBCTL[0x0]
scsi1: REG0 == 0x4, SINDEX = 0x133, DINDEX = 0x108
scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffdb
CDB 43 0 0 0 0 0
STACK: 0xe2 0x0 0x0 0x0 0x0 0x0 0x0 0x29
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
scsi1:0:2:0: Cmd aborted from QINFIFO

-- 69-second pause at this point

scsi1:0:2:0: Attempting to abort cmd f7be0800: 0x0 0x0 0x0 0x0 0x0 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x1ae Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
SEQINTCTL[0x0] SEQ_FLAGS[0x40] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0]
SIMODE1[0xac] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0]
Jul 24 14:08:01 localhost kernel:
SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0
qinstart = 9041 qinfifonext = 9042
QINFIFO: 0x4
WAITING_TID_QUEUES:
Pending list:
Jul 24 14:08:01 localhost kernel:   4 FIFO_USE[0x0] SCB_CONTROL[0x48]
SCB_SCSIID[0x27]
Jul 24 14:08:01 localhost kernel:   3 FIFO_USE[0x0] SCB_CONTROL[0x40]
SCB_SCSIID[0x37]
Total 2
Kernel Free SCB list: 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
Jul 24 14:08:01 localhost kernel:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x4
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]
CCSCBCTL[0x0]
scsi1: REG0 == 0x4, SINDEX = 0x133, DINDEX = 0x108
scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffdb
CDB 43 0 0 0 0 0
STACK: 0xe2 0x0 0x0 0x0 0x0 0x0 0x0 0x29
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
scsi1:0:2:0: Cmd aborted from QINFIFO
scsi1:0:3:0: Attempting to abort cmd f7be0c00: 0x43 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0xc 0x40
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x1ae Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
SEQINTCTL[0x0] SEQ_FLAGS[0x40] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0]
SIMODE1[0xac] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0]
Jul 24 14:08:02 localhost kernel:
SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0
qinstart = 9041 qinfifonext = 9041
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
Jul 24 14:08:02 localhost kernel:   3 FIFO_USE[0x0] SCB_CONTROL[0x40]
SCB_SCSIID[0x37]
Total 1
Kernel Free SCB list: 4 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
Jul 24 14:08:02 localhost kernel:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x4
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]
CCSCBCTL[0x0]
scsi1: REG0 == 0x4, SINDEX = 0x133, DINDEX = 0x108
scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffdb
CDB 43 0 0 0 0 0
STACK: 0xe2 0x0 0x0 0x0 0x0 0x0 0x0 0x29
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
scsi1:0:3:0: Unable to deliver message
Recovery code sleeping

-- four-second pause

Recovery code awake
Timer Expired
scsi1: Device reset returning 0x2003
Recovery code sleeping

-- five-second pause

Recovery code awake
Timer Expired
scsi1: Device reset returning 0x2003
Recovery SCB completes

-- Xsane-scan-recover------------------------------------------



-- Insert-CD-ROM-and-hang-system-------------------------------

scsi1:0:3:0: Attempting to abort cmd f7be0c00: 0x28 0x0 0x0 0x0 0x0 0x10
0x0 0x0 0x1 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x2c Mode 0x22
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
DFFSTAT[0x31] SCSISIGI[0x48] SCSIPHASE[0x0] SCSIBUS[0x88]
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
SEQINTCTL[0x0] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8]
SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0]
Jul 24 14:19:15 localhost kernel:
SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x2 CURRSCB 0x2 NEXTSCB 0x0
qinstart = 18960 qinfifonext = 18960
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
Jul 24 14:19:15 localhost kernel:   2 FIFO_USE[0x0] SCB_CONTROL[0x44]
SCB_SCSIID[0x37]
Total 1
Kernel Free SCB list: 3 4 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
Jul 24 14:19:15 localhost kernel:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
scsi1: FIFO1 Free, LONGJMP == 0x80ec, SCB 0x2
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]
CCSCBCTL[0x4]
scsi1: REG0 == 0x2, SINDEX = 0x122, DINDEX = 0x1c0
scsi1: SCBPTR == 0xff02, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0
CDB 2 1 0 0 0 0
STACK: 0x15 0x0 0x0 0x0 0x0 0x0 0x0 0x29
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 1 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
(scsi1:A:3:0): Device is disconnected, re-queuing SCB
Recovery code sleeping
(scsi1:A:3:0): Abort Message Sent

-- five-second pause

Recovery code awake
Timer Expired
scsi1:0:3:0: Attempting to abort cmd f7be0e00: 0x0 0x0 0x0 0x0 0x0 0x0
scsi1:0:3:0: Command found on device queue

-- ten-second pause

scsi1:0:3:0: Attempting to abort cmd f7be0e00: 0x0 0x0 0x0 0x0 0x0 0x0
scsi1:0:3:0: Command found on device queue
Recovery code sleeping

-- five-second pause

Recovery code awake
Timer Expired
scsi1: Device reset returning 0x2003
Recovery SCB completes
Recovery SCB completes

-- 42-second pause

scsi1:0:3:0: Attempting to abort cmd f7be0c00: 0x28 0x0 0x0 0x0 0x0 0x10
0x0 0x0 0x1 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x94 Mode 0x0
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
DFFSTAT[0x31] SCSISIGI[0x48] SCSIPHASE[0x0] SCSIBUS[0x88]
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
SEQINTCTL[0x80] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8]
SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0]
Jul 24 14:20:17 localhost kernel:
SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x2 CURRSCB 0x2 NEXTSCB 0x0
qinstart = 20221 qinfifonext = 20221
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
Jul 24 14:20:17 localhost kernel:   2 FIFO_USE[0x0] SCB_CONTROL[0x44]
SCB_SCSIID[0x37]
Total 1
Kernel Free SCB list: 3 4 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
Jul 24 14:20:17 localhost kernel:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
scsi1: FIFO1 Free, LONGJMP == 0x80ec, SCB 0x2
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]
CCSCBCTL[0x4]
scsi1: REG0 == 0x27a5, SINDEX = 0x111, DINDEX = 0x1c0
scsi1: SCBPTR == 0x0, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0
CDB 0 0 0 0 0 0
STACK: 0x26 0x14 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
(scsi1:A:3:0): Device is disconnected, re-queuing SCB
Recovery code sleeping
(scsi1:A:3:0): Abort Message Sent

-- four-second pause

Recovery code awake
Timer Expired
scsi: device set offline - not ready or command retry failed after bus
reset: host 1 channel 0 id 3 lun 0

-- ten-second pause

scsi1:0:3:0: Attempting to abort cmd f7be0e00: 0x0 0x0 0x0 0x0 0x0 0x0
scsi1:0:3:0: Command found on device queue
scsi: device set offline - not ready or command retry failed after bus
reset: host 1 channel 0 id 3 lun 0
 I/O error: dev 0b:00, sector 64
isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
cdrom: open failed.

-- Insert-CD-ROM-and-hang-system-------------------------------


So, the common factors seem to be that we both have Xeon SMP systems
with Intel 750x (i810-like) chipsets, and both find that the
lowest-numbered (0,0,0) SCSI disk works fine, but using other devices
hangs all future attempts to talk to SCSI devices (other than the one
that works OK).


Thanks in advance to anyone who's been patient enough to read all this
and help us.


Cheers


Duncan




^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb
@ 2003-07-24 15:44   ` Thomas Beutin
  2003-07-24 17:02     ` Duncan Gibb
  2003-07-24 21:58     ` Justin T. Gibbs
  2003-07-24 17:47   ` Justin T. Gibbs
  1 sibling, 2 replies; 9+ messages in thread
From: Thomas Beutin @ 2003-07-24 15:44 UTC (permalink / raw)
  To: Duncan Gibb; +Cc: linux-scsi

Hi Duncan,

On Thu, Jul 24, 2003 at 03:36:51PM +0100, Duncan Gibb wrote:
[...]
> Just before I read your post, I built a 2.4.21-bk17 kernel in the hope
> that this would have been fixed some time in the last couple of months. 
> I'm not really in to running experimental kernels, so usually I won't
> stray further than the -ac series.

what ist the Your version of the aic79xx driver in the 2.4.21-bk17
kernel?

----- BEGIN ------
# cat /proc/scsi/aic79xx/0 
Adaptec AIC79xx driver version: 1.3.0
aic7902: Ultra320 Wide Channel A, SCSI Id=7, PCI 33 or 66Mhz, 512 SCBs

Serial EEPROM:
0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 
0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 
0x08f4 0x0146 0x2807 0x0010 0xffff 0xffff 0xffff 0xffff 
0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0x0410 0xb2d7 

Channel A Target 0 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Goal: 160.000MB/s transfers (80.000MHz DT, 16bit)
	Curr: 160.000MB/s transfers (80.000MHz DT, 16bit)
	Transmission Errors 0
	Channel A Target 0 Lun 0 Settings
		Commands Queued 48814
		Commands Active 0
		Command Openings 1
		Max Tagged Openings 0
		Device Queue Frozen Count 0
Channel A Target 1 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Goal: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Curr: 3.300MB/s transfers
	Transmission Errors 0
	Channel A Target 1 Lun 0 Settings
		Commands Queued 963579
		Commands Active 0
		Command Openings 32
		Max Tagged Openings 32
		Device Queue Frozen Count 0
Channel A Target 2 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Goal: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Curr: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Transmission Errors 1
	Channel A Target 2 Lun 0 Settings
		Commands Queued 412536
		Commands Active 0
		Command Openings 32
		Max Tagged Openings 32
		Device Queue Frozen Count 0
Channel A Target 3 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 4 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 5 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 6 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
	Goal: 3.300MB/s transfers
	Curr: 3.300MB/s transfers
	Transmission Errors 0
	Channel A Target 6 Lun 0 Settings
		Commands Queued 1
		Commands Active 0
		Command Openings 1
		Max Tagged Openings 0
		Device Queue Frozen Count 0
Channel A Target 7 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 8 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 9 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 10 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 11 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 12 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 13 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 14 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
Channel A Target 15 Negotiation Settings
	User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit)
----- END ------


(Especially the current transfer rate on Channel A Target 1 is
somewhat less than expected.)

Maybe there is a new driver by Justin Gibbs, but i didn't found
anything for the 2.4.21 kernel in
http://people.freebsd.org/~gibbs/linux/SRC/

Do You think the problem goes away by using a non SMP kernel?
(I have no problem to compile a new kernel, but the machine
is in a server room and i have to make an appointment with
the security guards to enter the room where the server is,
so i don't want to reboot this beast sitting 30 minutes
away from it...)

Greetings from Berlin,
-tb
-- 
Thomas Beutin                             tb@laokoon.IN-Berlin.DE
Beam me up, Scotty. There is no intelligent live down in Redmond.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 15:44   ` Thomas Beutin
@ 2003-07-24 17:02     ` Duncan Gibb
  2003-07-24 21:58     ` Justin T. Gibbs
  1 sibling, 0 replies; 9+ messages in thread
From: Duncan Gibb @ 2003-07-24 17:02 UTC (permalink / raw)
  To: Thomas Beutin; +Cc: linux-scsi

On Thu, 2003-07-24 at 16:44, Thomas Beutin wrote:

DG> I built a 2.4.21-bk17 kernel in the hope that this would
DG> have been fixed

TB> what ist the Your version of the aic79xx driver in the
TB> 2.4.21-bk17 kernel?

It's 1.3.10, which I believe is the most recent.  I didn't want to go
blasting it with Justin's source files as they are only advertised as
working for 2.4.20.

TB> Maybe there is a new driver by Justin Gibbs, but i didn't
TB> found anything for the 2.4.21 kernel in
TB> http://people.freebsd.org/~gibbs/linux/SRC/


I also (eventually) managed to compile 2.6.0-test1-ac3 (that DVB code is
a bit of a mess, isn't it?).  That kernel has aic79xx 1.3.9, but I can
reproduce the problem in a slightly less severe form.

The scanner has gone haywire (magnification seems to be locked at
maximum), so I couldn't do a scan that would transfer enough data to
cause the lockup.

Also putting a CD in the drive no longer hangs the scsi subsytem.  But a
recoverable SCSI death still happens if you try even a simple thing like
"cdrecord dev=1,3,0 -atip"...


-- lockup under 2.6.0-test1-ac3 ------------------------------------

scsi1:0:3:0: Attempting to abort cmd f7c37b00: 0x1b 0x0 0x0 0x0 0x1 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x94 Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0]
SAVED_MODE[0x11] DFFSTAT[0x31]:(CURRFIFO_1|FIFO0FREE|FIFO1FREE)
SCSISIGI[0x48]:(P_DATAIN|SELI) SCSIPHASE[0x0] SCSIBUS[0x88]
LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x10]:(FASTMODE)
SEQINTCTL[0x80]:(INTVEC1DSL)
SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED)
SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0]
SSTAT3[0x0] PERRDIAG[0x8]:(AIPERR)
SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0x0]
SCB Count = 4 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0
qinstart = 8971 qinfifonext = 8971
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
3 FIFO_USE[0x0] SCB_CONTROL[0x44]:(DISCONNECTED|DISCENB)
SCB_SCSIID[0x37]
Total 1
Kernel Free SCB list: 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x3
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x4]:(DIRECTION)
DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
CCSCBCTL[0x4]:(CCSCBDIR)
scsi1: REG0 == 0x3, SINDEX = 0x100, DINDEX = 0x1c0
scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffc3
CDB 1b 0 0 0 1 0
STACK: 0x23 0x14 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
(scsi1:A:3:0): Device is disconnected, re-queuing SCB
Recovery code sleeping
(scsi1:A:3:0): Abort Message Sent
Kernel Free SCB list: 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x3
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x4]:(DIRECTION)
DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
CCSCBCTL[0x4]:(CCSCBDIR)
scsi1: REG0 == 0x3, SINDEX = 0x100, DINDEX = 0x1c0
scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffc3
CDB 1b 0 0 0 1 0
STACK: 0x23 0x14 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
(scsi1:A:3:0): Device is disconnected, re-queuing SCB
Recovery code sleeping
(scsi1:A:3:0): Abort Message Sent

-- five-second pause

Recovery code awake
Timer Expired
Recovery code sleeping

-- five-second pause

Recovery code awake
Timer Expired
scsi1: Device reset returning 0x2003
Recovery SCB completes
Recovery SCB completes

-- ten-second pause

scsi: Device offlined - not ready after error recovery: host 1 channel 0
id 3 lun 0

-- user process unfreezes

-- lockup under 2.6.0-test1-ac3 ------------------------------------


According to dmesg, both the really-scsi and the ide-scsi CD drives
should work:

# dmesg | grep sr
sr0: scsi3-mmc drive: 59x/61x caddy
Attached scsi CD-ROM sr0 at scsi1, channel 0, id 3, lun 0
sr1: scsi3-mmc drive: 0x/0x caddy
Attached scsi CD-ROM sr1 at scsi2, channel 0, id 0, lun 0

# eject /dev/scd1

(the scsi-ide one) works perfectly, but

# eject /dev/scd0
eject: unable to find or open device for: `/dev/scd0'

(dmesg records "cdrom: open failed").


TB> Do You think the problem goes away by using a non SMP kernel?

To be honest, I have tried so many kernels I have forgotten which ones I
tested with SMP disabled.  I will have another go shortly (must get some
real work done).


Cheers


Duncan



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb
  2003-07-24 15:44   ` Thomas Beutin
@ 2003-07-24 17:47   ` Justin T. Gibbs
  2003-07-24 19:38     ` Thomas Beutin
  1 sibling, 1 reply; 9+ messages in thread
From: Justin T. Gibbs @ 2003-07-24 17:47 UTC (permalink / raw)
  To: Duncan Gibb, Thomas Beutin; +Cc: linux-scsi

> If I try to scan something, Xsane hangs for a while, then recovers OK. 
> But if I put a CD in the Yamaha drive, anything which touches the scsi
> subsystem from then on will hang.

It looks like you have a very flakey bus.  The BSY line is probably
glitching.  In the trace you provided, we successfully selected the
device at ID 3 and are waiting for the target to take us to the first
phase.  The REQ for that phase never occurs and in fact the BSY line
is not always driven.  Very, very weird.  The target controls both
the REQ and BSY lines, so this is either a problem with the target,
your cabling, or your termination.

--
Justin


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 17:47   ` Justin T. Gibbs
@ 2003-07-24 19:38     ` Thomas Beutin
  2003-07-24 21:58       ` Justin T. Gibbs
  0 siblings, 1 reply; 9+ messages in thread
From: Thomas Beutin @ 2003-07-24 19:38 UTC (permalink / raw)
  To: linux-scsi

On Thu, Jul 24, 2003 at 11:47:04AM -0600, Justin T. Gibbs wrote:
> It looks like you have a very flakey bus.  The BSY line is probably
> glitching.  In the trace you provided, we successfully selected the
> device at ID 3 and are waiting for the target to take us to the first
> phase.  The REQ for that phase never occurs and in fact the BSY line
> is not always driven.  Very, very weird.  The target controls both
> the REQ and BSY lines, so this is either a problem with the target,
> your cabling, or your termination.

I will give my system a try next time when i have access to the
machine.
Although i cannot believe that: all disks are mounted in a plug-in
for the backplane (SCA HSBP M15), and the first disk works fine
all the times.
Maybe i change the drives in the plug-ins and the position of the
drives (but i cannot use all positions because i have to place the
IBM disk at the first position).

-tb
-- 
Thomas Beutin                             tb@laokoon.IN-Berlin.DE
Beam me up, Scotty. There is no intelligent live down in Redmond.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 15:44   ` Thomas Beutin
  2003-07-24 17:02     ` Duncan Gibb
@ 2003-07-24 21:58     ` Justin T. Gibbs
  1 sibling, 0 replies; 9+ messages in thread
From: Justin T. Gibbs @ 2003-07-24 21:58 UTC (permalink / raw)
  To: Thomas Beutin, Duncan Gibb; +Cc: linux-scsi

> (Especially the current transfer rate on Channel A Target 1 is
> somewhat less than expected.)
> 
> Maybe there is a new driver by Justin Gibbs, but i didn't found
> anything for the 2.4.21 kernel in
> http://people.freebsd.org/~gibbs/linux/SRC/

The 2.4 archive should work on just about any 2.4.X kernel.
I would highly recoment upgrading from 1.3.0.

I'm still not clear on what your problem is.  Can you provide
a full dmesg from the system and any diagnostic messages the
driver generates?

--
Justin


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
  2003-07-24 19:38     ` Thomas Beutin
@ 2003-07-24 21:58       ` Justin T. Gibbs
  0 siblings, 0 replies; 9+ messages in thread
From: Justin T. Gibbs @ 2003-07-24 21:58 UTC (permalink / raw)
  To: Thomas Beutin, linux-scsi

> I will give my system a try next time when i have access to the
> machine.

My response was to Duncan.  I don't have enough information to
say what might be wrong with your system yet.

--
Justin


^ permalink raw reply	[flat|nested] 9+ messages in thread

* RE: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
@ 2003-07-25 13:51 Cress, Andrew R
  0 siblings, 0 replies; 9+ messages in thread
From: Cress, Andrew R @ 2003-07-25 13:51 UTC (permalink / raw)
  To: 'Justin T. Gibbs', Thomas Beutin, Duncan Gibb; +Cc: linux-scsi

Let me second the recommendation to upgrade the aic79xx driver version.
I've seen packetizing problems with 1.3.0 that may apply here, but that was
resolved in 1.3.2 and greater.

Andy

-----Original Message-----
From: Justin T. Gibbs [mailto:gibbs@scsiguy.com] 
Sent: Thursday, July 24, 2003 5:58 PM
To: Thomas Beutin; Duncan Gibb
Cc: linux-scsi@vger.kernel.org
Subject: Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)


> (Especially the current transfer rate on Channel A Target 1 is
> somewhat less than expected.)
> 
> Maybe there is a new driver by Justin Gibbs, but i didn't found
> anything for the 2.4.21 kernel in
> http://people.freebsd.org/~gibbs/linux/SRC/

The 2.4 archive should work on just about any 2.4.X kernel.
I would highly recoment upgrading from 1.3.0.

I'm still not clear on what your problem is.  Can you provide
a full dmesg from the system and any diagnostic messages the
driver generates?

--
Justin

-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2003-07-25 13:37 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-07-24 13:12 HD somtimes hanging Thomas Beutin
2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb
2003-07-24 15:44   ` Thomas Beutin
2003-07-24 17:02     ` Duncan Gibb
2003-07-24 21:58     ` Justin T. Gibbs
2003-07-24 17:47   ` Justin T. Gibbs
2003-07-24 19:38     ` Thomas Beutin
2003-07-24 21:58       ` Justin T. Gibbs
2003-07-25 13:51 Cress, Andrew R

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.