* HD somtimes hanging
@ 2003-07-24 13:12 Thomas Beutin
2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb
0 siblings, 1 reply; 8+ messages in thread
From: Thomas Beutin @ 2003-07-24 13:12 UTC (permalink / raw)
To: linux-scsi
Hi,
Sorry if this question has already been answered, but I cannot find
a searchable mailing list archive. If it has been, it would be nice
if you could give me a pointer to the answers.
I have big problems with my second and third hard disks on an
Adaptec AIC-7902 U320 sitting on an Intel SE7501HG2 board.
The disks are SCA disks FUJITSU Model: MAP3367NC Rev: 0105
plugged into a hot swap back plane SCA HSBP M15 (Vendor
is ESG-SHV).
The first disk is an IBM IC35L036UCPR15-0 (Rev: S70H) and
runs fine.
When I try to access one of these FUJITSU disks, the system
hangs for a couple of minutes, and after that my dmesg command
output is full of "Dump Card State" (see below).
After a while the system (vanilla Linux SMP kernel 2.4.21
running on Slackware 9.0) comes back to normal operation.
The system isn't completely hanging; it seems that a file system
access in an already open shell freezes that shell, while
doing other stuff is OK.
Do you have any idea what's going on? Is the backplane kidding
me (I've never before worked with an intelligent
SCSI-ID-consuming backplane)? I've found that the
disks are available for a short time after a cold reboot (no
power in the machine at all), but later (2..3 hours) the
problem appears again.
I've put some output and messages below. If you need more,
please let me know.
Any help or links to explanations welcome!
Thanks,
-tb
# cat /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: IBM Model: IC35L036UCPR15-0 Rev: S70H
Type: Direct-Access ANSI SCSI revision: 03
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: FUJITSU Model: MAP3367NC Rev: 0105
Type: Direct-Access ANSI SCSI revision: 03
Host: scsi0 Channel: 00 Id: 06 Lun: 00
Vendor: ESG-SHV Model: SCA HSBP M15 Rev: 0.10
Type: Processor ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: FUJITSU Model: MAP3367NC Rev: 0105
Type: Direct-Access ANSI SCSI revision: 03
Host: scsi2 Channel: 00 Id: 00 Lun: 00
Vendor: TEAC Model: CD-552E Rev: T016
Type: CD-ROM ANSI SCSI revision: 02
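The /proc/scsi/scsi listing above can be turned into structured records when comparing setups like the ones in this thread. The following is only a minimal sketch: the function name `parse_proc_scsi` and the `SAMPLE` excerpt are hypothetical, and it assumes the three-line Host / Vendor / Type layout shown in this post.

```python
import re
from typing import Dict, List

# Patterns for the three line kinds seen in the listing (layout assumed from this post).
HOST_RE = re.compile(
    r"Host:\s*(?P<host>\S+)\s+Channel:\s*(?P<channel>\d+)\s+"
    r"Id:\s*(?P<id>\d+)\s+Lun:\s*(?P<lun>\d+)"
)
VENDOR_RE = re.compile(
    r"Vendor:\s*(?P<vendor>.*?)\s+Model:\s*(?P<model>.*?)\s+Rev:\s*(?P<rev>.*)$"
)
TYPE_RE = re.compile(
    r"Type:\s*(?P<type>.*?)\s+ANSI SCSI revision:\s*(?P<ansi>\S+)"
)


def parse_proc_scsi(text: str) -> List[Dict[str, str]]:
    """Return one dict per device; every value is kept as a string."""
    devices: List[Dict[str, str]] = []
    for raw in text.splitlines():
        line = raw.strip()
        match = HOST_RE.match(line)
        if match:
            # A "Host:" line starts a new device record.
            devices.append(dict(match.groupdict()))
            continue
        if not devices:
            continue  # skip the "Attached devices:" banner
        match = VENDOR_RE.match(line) or TYPE_RE.match(line)
        if match:
            devices[-1].update(match.groupdict())
    return devices


# Hypothetical excerpt copied from the listing in this post.
SAMPLE = """Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: IBM Model: IC35L036UCPR15-0 Rev: S70H
Type: Direct-Access ANSI SCSI revision: 03
Host: scsi0 Channel: 00 Id: 06 Lun: 00
Vendor: ESG-SHV Model: SCA HSBP M15 Rev: 0.10
Type: Processor ANSI SCSI revision: 02
"""

if __name__ == "__main__":
    for dev in parse_proc_scsi(SAMPLE):
        print(dev["host"], dev["id"], dev["vendor"], dev["model"], dev["type"])
```

Filtering the records on `type == "Processor"` is one way to spot the backplane, which occupies SCSI ID 6 here.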
# uname -a
Linux stulle 2.4.21 #2 SMP Mon Jun 30 12:46:15 CEST 2003 i686 unknown
# lsmod
Module Size Used by Not tainted
raid1 14412 1
md 60640 1 [raid1]
uhci 27248 0 (unused)
usbcore 66240 1 [uhci]
i810_rng 2820 0 (unused)
reiserfs 185264 1
floppy 51548 0
serial 49028 0 (unused)
rtc 7612 0 (unused)
ide-scsi 10544 0
e1000 54464 1
------ BEGIN dmesg snip ------
DevQ(0:0:0): 0 waiting
DevQ(0:1:0): 0 waiting
DevQ(0:2:0): 0 waiting
DevQ(0:6:0): 0 waiting
scsi0:0:2:0: Cmd aborted from QINFIFO
scsi0:0:2:0: Attempting to abort cmd f14af000
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi0: Dumping Card State at program address 0xb Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
DFFSTAT[0x11] SCSISIGI[0x74] SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
SEQINTCTL[0x0] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0]
SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x80]
LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x81]
SCB Count = 64 CMDS_PENDING = 2 LASTSCB 0x3e CURRSCB 0x3e NEXTSCB 0xff00
qinstart = 53881 qinfifonext = 53881
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
17 SCB_CONTROL[0x60] SCB_SCSIID[0x27] SCB_TAG[0x11]
42 SCB_CONTROL[0x60] SCB_SCSIID[0x27] SCB_TAG[0x2a]
Total 2
Kernel Free SCB list: 62 48 1 3 50 38 22 4 63 55 5 53 49 30 58 28 15 14 19 26 33 10 35 6 46 23 59 24 43 45 16 25 29 8 51 32 41 2 56 44 7 54 13 9 27 34 39 37 21 57 11 40 18 20 47 31 52 36 12 0 61 60
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi0: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0, LJSCB 0xff00
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x0]
scsi0: FIFO1 Active, LONGJMP == 0x247, SCB 0x2a, LJSCB 0x2a
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x28] DFSTATUS[0x1]
SG_CACHE_SHADOW[0x50] SG_STATE[0x3] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0xa] SHADDR = 0x02d693c00, SHCNT = 0x400
HADDR = 0x02d693c00, HCNT = 0x400 CCSGCTL[0x10]
LQIN: 0x4 0x0 0x0 0x2a 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0xf4 0x0 0x0 0x0 0x2 0x0
scsi0: LQISTATE = 0x29, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
SIMODE0[0xc]
CCSCBCTL[0x4]
scsi0: REG0 == 0x3e, SINDEX = 0x133, DINDEX = 0x106
scsi0: SCBPTR == 0x3e, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xffe1
CDB 0 0 0 0 0 0
STACK: 0x120 0x120 0x0 0x247 0x247 0x235 0x2e 0x1
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
------ END dmesg snip ------
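Both pending SCBs in the dump above carry SCB_SCSIID[0x27]. Below is a minimal sketch for decoding that field. It assumes the aic7xxx-family layout, with the target ID in the upper nibble and the adapter's own ID in the lower nibble; that layout is an assumption here, although it agrees with this thread (the aborted command is on scsi0:0:2:0, and the adapter reports SCSI Id=7). The helper names are hypothetical.

```python
import re
from typing import List, Tuple

# Matches the SCB_SCSIID[0x..] fields printed in the "Pending list" section.
SCSIID_RE = re.compile(r"SCB_SCSIID\[0x([0-9a-fA-F]+)\]")


def decode_scsiid(value: int) -> Tuple[int, int]:
    """Return (target_id, initiator_id).

    Assumed layout: bits 7-4 hold the target ID, bits 3-0 the adapter's ID.
    """
    return (value >> 4) & 0x0F, value & 0x0F


def pending_targets(dump: str) -> List[Tuple[int, int]]:
    """Decode every SCB_SCSIID field found in a dump excerpt."""
    return [decode_scsiid(int(hex_digits, 16)) for hex_digits in SCSIID_RE.findall(dump)]


# The two pending-list lines from the dump in this post.
PENDING = """\
17 SCB_CONTROL[0x60] SCB_SCSIID[0x27] SCB_TAG[0x11]
42 SCB_CONTROL[0x60] SCB_SCSIID[0x27] SCB_TAG[0x2a]
"""

if __name__ == "__main__":
    for target, initiator in pending_targets(PENDING):
        print(f"pending command for target {target}, sent by host ID {initiator}")
```

Under that assumption, both stuck commands belong to target 2, one of the two FUJITSU disks.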
--
Thomas Beutin tb@laokoon.IN-Berlin.DE
Beam me up, Scotty. There is no intelligent live down in Redmond.
* AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
2003-07-24 13:12 HD somtimes hanging Thomas Beutin
@ 2003-07-24 14:36 ` Duncan Gibb
2003-07-24 15:44 ` Thomas Beutin
2003-07-24 17:47 ` Justin T. Gibbs
0 siblings, 2 replies; 8+ messages in thread
From: Duncan Gibb @ 2003-07-24 14:36 UTC (permalink / raw)
To: Thomas Beutin; +Cc: linux-scsi

On Thu, 2003-07-24 at 14:12, Thomas Beutin wrote:

TB> I have big problems with my secend an third hard disk
TB> on a Adaptec AIC-7902 U320 sitting on an intel
TB> SE7501HG2 board.

I'm not sure I can help you, but I can certainly tell you you're not
alone. I have very similar problems with the on-board AIC7902 on my
SuperMicro X5DA8 board. This uses the 7505 chipset rather than the
7501, but AFAIK, both are i810-derivatives from Linux's point of view.

Just before I read your post, I built a 2.4.21-bk17 kernel in the hope
that this would have been fixed some time in the last couple of months.
I'm not really into running experimental kernels, so usually I won't
stray further than the -ac series.

TB> The first disk is an IBM IC35L036UCPR15-0 (Rev: S70H)
TB> and runs fine.

In my case /proc/scsi/scsi reads:

Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: COMPAQ Model: ST32171W Rev: 0388
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 02 Lun: 00
Vendor: EPSON SC Model: ANNER GT-6500 Rev: 1.23
Type: Processor ANSI SCSI revision: 01
Host: scsi1 Channel: 00 Id: 03 Lun: 00
Vendor: YAMAHA Model: CRW2100S Rev: 1.0H
Type: CD-ROM ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 00
Vendor: MPL Model: MC-DISK-D-1S[0] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 01
Vendor: MPL Model: MC-DISK-D-1S[0] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 02
Vendor: MPL Model: MC-DISK-D-1S[1] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 03
Vendor: MPL Model: MC-DISK-D-1S[1] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 04
Vendor: MPL Model: MC-DISK-D-1S[1] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 05
Vendor: MPL Model: MC-DISK-D-1S[1] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 06
Vendor: MPL Model: MC-DISK-D-1S[0] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 06 Lun: 07
Vendor: MPL Model: MC-DISK-D-1S[0] Rev: 3.3
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi2 Channel: 00 Id: 00 Lun: 00
Vendor: SONY Model: DVD RW DRU-500A Rev: 2.0c
Type: CD-ROM ANSI SCSI revision: 02

The "Compaq" (ie Seagate) disk is the system swap disk, and it must be
working fine or the whole machine would hang (I presume). Also I think
the MPL device (which is a PC-card/CompactFlash reader) should have
only two LUNs, since that's how many slots it has... "scsi2" is
ide-scsi.

TB> The system is'nt comletly hanging, it seems a file
TB> system access in an already open shell freezes the
TB> shell, while doing other stuff is ok.
If I try to scan something, Xsane hangs for a while, then recovers OK. But if I put a CD in the Yamaha drive, anything which touches the scsi subsystem from then on will hang. Here's what's in /var/log/messages: -- Xsane-scan-recover------------------------------------------ scsi1:0:2:0: Attempting to abort cmd f7be0800: 0x8 0x0 0x0 0x2 0x78 0x0 scsi1: At time of recovery, card was not paused >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< scsi1: Dumping Card State at program address 0x1ae Mode 0x33 Card was paused HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0] LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10] SEQINTCTL[0x0] SEQ_FLAGS[0x40] SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xac] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0] Jul 24 14:07:51 localhost kernel: SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0 qinstart = 9041 qinfifonext = 9042 QINFIFO: 0x4 WAITING_TID_QUEUES: Pending list: Jul 24 14:07:51 localhost kernel: 4 FIFO_USE[0x0] SCB_CONTROL[0x48] SCB_SCSIID[0x27] Jul 24 14:07:51 localhost kernel: 3 FIFO_USE[0x0] SCB_CONTROL[0x40] SCB_SCSIID[0x37] Total 2 Kernel Free SCB list: 2 1 0 Sequencer Complete DMA-inprog list: Sequencer Complete list: Sequencer DMA-Up and Complete list: Jul 24 14:07:51 localhost kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x4 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 
0x0 0x0 0x0 0x0 0x0 scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 SIMODE0[0xc] CCSCBCTL[0x0] scsi1: REG0 == 0x4, SINDEX = 0x133, DINDEX = 0x108 scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffdb CDB 43 0 0 0 0 0 STACK: 0xe2 0x0 0x0 0x0 0x0 0x0 0x0 0x29 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> DevQ(0:2:0): 0 waiting DevQ(0:3:0): 0 waiting DevQ(0:6:0): 0 waiting DevQ(0:6:1): 0 waiting DevQ(0:6:2): 0 waiting DevQ(0:6:3): 0 waiting DevQ(0:6:4): 0 waiting DevQ(0:6:5): 0 waiting DevQ(0:6:6): 0 waiting DevQ(0:6:7): 0 waiting scsi1:0:2:0: Cmd aborted from QINFIFO -- 69-second pause at this point scsi1:0:2:0: Attempting to abort cmd f7be0800: 0x0 0x0 0x0 0x0 0x0 0x0 scsi1: At time of recovery, card was not paused >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< scsi1: Dumping Card State at program address 0x1ae Mode 0x33 Card was paused HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0] LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10] SEQINTCTL[0x0] SEQ_FLAGS[0x40] SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xac] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0] Jul 24 14:08:01 localhost kernel: SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0 qinstart = 9041 qinfifonext = 9042 QINFIFO: 0x4 WAITING_TID_QUEUES: Pending list: Jul 24 14:08:01 localhost kernel: 4 FIFO_USE[0x0] SCB_CONTROL[0x48] SCB_SCSIID[0x27] Jul 24 14:08:01 localhost kernel: 3 FIFO_USE[0x0] SCB_CONTROL[0x40] SCB_SCSIID[0x37] Total 2 Kernel Free SCB list: 2 1 0 Sequencer Complete DMA-inprog list: Sequencer Complete list: Sequencer DMA-Up and Complete list: Jul 24 14:08:01 localhost kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] 
SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x4 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 SIMODE0[0xc] CCSCBCTL[0x0] scsi1: REG0 == 0x4, SINDEX = 0x133, DINDEX = 0x108 scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffdb CDB 43 0 0 0 0 0 STACK: 0xe2 0x0 0x0 0x0 0x0 0x0 0x0 0x29 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> DevQ(0:2:0): 0 waiting DevQ(0:3:0): 0 waiting DevQ(0:6:0): 0 waiting DevQ(0:6:1): 0 waiting DevQ(0:6:2): 0 waiting DevQ(0:6:3): 0 waiting DevQ(0:6:4): 0 waiting DevQ(0:6:5): 0 waiting DevQ(0:6:6): 0 waiting DevQ(0:6:7): 0 waiting scsi1:0:2:0: Cmd aborted from QINFIFO scsi1:0:3:0: Attempting to abort cmd f7be0c00: 0x43 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0xc 0x40 scsi1: At time of recovery, card was not paused >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< scsi1: Dumping Card State at program address 0x1ae Mode 0x33 Card was paused HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0] LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10] SEQINTCTL[0x0] SEQ_FLAGS[0x40] SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xac] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0] Jul 24 14:08:02 localhost kernel: SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0 qinstart = 9041 qinfifonext = 9041 QINFIFO: WAITING_TID_QUEUES: Pending list: Jul 24 14:08:02 localhost kernel: 3 FIFO_USE[0x0] SCB_CONTROL[0x40] SCB_SCSIID[0x37] Total 1 Kernel Free SCB 
list: 4 2 1 0 Sequencer Complete DMA-inprog list: Sequencer Complete list: Sequencer DMA-Up and Complete list: Jul 24 14:08:02 localhost kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x4 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 SIMODE0[0xc] CCSCBCTL[0x0] scsi1: REG0 == 0x4, SINDEX = 0x133, DINDEX = 0x108 scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffdb CDB 43 0 0 0 0 0 STACK: 0xe2 0x0 0x0 0x0 0x0 0x0 0x0 0x29 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> DevQ(0:2:0): 0 waiting DevQ(0:3:0): 0 waiting DevQ(0:6:0): 0 waiting DevQ(0:6:1): 0 waiting DevQ(0:6:2): 0 waiting DevQ(0:6:3): 0 waiting DevQ(0:6:4): 0 waiting DevQ(0:6:5): 0 waiting DevQ(0:6:6): 0 waiting DevQ(0:6:7): 0 waiting scsi1:0:3:0: Unable to deliver message Recovery code sleeping -- four-second pause Recovery code awake Timer Expired scsi1: Device reset returning 0x2003 Recovery code sleeping -- five-second pause Recovery code awake Timer Expired scsi1: Device reset returning 0x2003 Recovery SCB completes -- Xsane-scan-recover------------------------------------------ -- Insert-CD-ROM-and-hang-system------------------------------- scsi1:0:3:0: Attempting to abort cmd f7be0c00: 0x28 0x0 0x0 0x0 0x0 0x10 0x0 0x0 0x1 0x0 scsi1: At time of recovery, card was not paused >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< scsi1: Dumping Card State at program address 0x2c Mode 0x22 Card was 
paused HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x31] SCSISIGI[0x48] SCSIPHASE[0x0] SCSIBUS[0x88] LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10] SEQINTCTL[0x0] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8] SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0] Jul 24 14:19:15 localhost kernel: SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x2 CURRSCB 0x2 NEXTSCB 0x0 qinstart = 18960 qinfifonext = 18960 QINFIFO: WAITING_TID_QUEUES: Pending list: Jul 24 14:19:15 localhost kernel: 2 FIFO_USE[0x0] SCB_CONTROL[0x44] SCB_SCSIID[0x37] Total 1 Kernel Free SCB list: 3 4 1 0 Sequencer Complete DMA-inprog list: Sequencer Complete list: Sequencer DMA-Up and Complete list: Jul 24 14:19:15 localhost kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] scsi1: FIFO1 Free, LONGJMP == 0x80ec, SCB 0x2 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 SIMODE0[0xc] CCSCBCTL[0x4] scsi1: REG0 == 0x2, SINDEX = 0x122, DINDEX = 0x1c0 scsi1: SCBPTR == 0xff02, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0 CDB 2 1 0 0 0 0 STACK: 0x15 0x0 0x0 0x0 0x0 0x0 0x0 0x29 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> DevQ(0:2:0): 0 waiting DevQ(0:3:0): 1 waiting DevQ(0:6:0): 0 waiting DevQ(0:6:1): 0 waiting DevQ(0:6:2): 0 waiting DevQ(0:6:3): 0 waiting DevQ(0:6:4): 0 waiting DevQ(0:6:5): 0 waiting DevQ(0:6:6): 0 waiting DevQ(0:6:7): 0 waiting 
(scsi1:A:3:0): Device is disconnected, re-queuing SCB Recovery code sleeping (scsi1:A:3:0): Abort Message Sent -- five-second pause Recovery code awake Timer Expired scsi1:0:3:0: Attempting to abort cmd f7be0e00: 0x0 0x0 0x0 0x0 0x0 0x0 scsi1:0:3:0: Command found on device queue -- ten-second pause scsi1:0:3:0: Attempting to abort cmd f7be0e00: 0x0 0x0 0x0 0x0 0x0 0x0 scsi1:0:3:0: Command found on device queue Recovery code sleeping -- five-second pause Recovery code awake Timer Expired scsi1: Device reset returning 0x2003 Recovery SCB completes Recovery SCB completes -- 42-second pause scsi1:0:3:0: Attempting to abort cmd f7be0c00: 0x28 0x0 0x0 0x0 0x0 0x10 0x0 0x0 0x1 0x0 scsi1: At time of recovery, card was not paused >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< scsi1: Dumping Card State at program address 0x94 Mode 0x0 Card was paused HS_MAILBOX[0x0] INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x31] SCSISIGI[0x48] SCSIPHASE[0x0] SCSIBUS[0x88] LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10] SEQINTCTL[0x80] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8] SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x0] Jul 24 14:20:17 localhost kernel: SCB Count = 5 CMDS_PENDING = 1 LASTSCB 0x2 CURRSCB 0x2 NEXTSCB 0x0 qinstart = 20221 qinfifonext = 20221 QINFIFO: WAITING_TID_QUEUES: Pending list: Jul 24 14:20:17 localhost kernel: 2 FIFO_USE[0x0] SCB_CONTROL[0x44] SCB_SCSIID[0x37] Total 1 Kernel Free SCB list: 3 4 1 0 Sequencer Complete DMA-inprog list: Sequencer Complete list: Sequencer DMA-Up and Complete list: Jul 24 14:20:17 localhost kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] scsi1: FIFO1 Free, LONGJMP == 0x80ec, SCB 0x2 SEQIMODE[0x3f] 
SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89] SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 SIMODE0[0xc] CCSCBCTL[0x4] scsi1: REG0 == 0x27a5, SINDEX = 0x111, DINDEX = 0x1c0 scsi1: SCBPTR == 0x0, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0 CDB 0 0 0 0 0 0 STACK: 0x26 0x14 0x0 0x0 0x0 0x0 0x0 0x0 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> DevQ(0:2:0): 0 waiting DevQ(0:3:0): 0 waiting DevQ(0:6:0): 0 waiting DevQ(0:6:1): 0 waiting DevQ(0:6:2): 0 waiting DevQ(0:6:3): 0 waiting DevQ(0:6:4): 0 waiting DevQ(0:6:5): 0 waiting DevQ(0:6:6): 0 waiting DevQ(0:6:7): 0 waiting (scsi1:A:3:0): Device is disconnected, re-queuing SCB Recovery code sleeping (scsi1:A:3:0): Abort Message Sent -- four-second pause Recovery code awake Timer Expired scsi: device set offline - not ready or command retry failed after bus reset: host 1 channel 0 id 3 lun 0 -- ten-second pause scsi1:0:3:0: Attempting to abort cmd f7be0e00: 0x0 0x0 0x0 0x0 0x0 0x0 scsi1:0:3:0: Command found on device queue scsi: device set offline - not ready or command retry failed after bus reset: host 1 channel 0 id 3 lun 0 I/O error: dev 0b:00, sector 64 isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16 cdrom: open failed. -- Insert-CD-ROM-and-hang-system------------------------------- So, the common factors seem to be that we both have Xeon SMP systems with Intel 750x (i810-like) chipsets, and both find that the lowest-numbered (0,0,0) SCSI disk works fine, but using other devices hangs all future attempts to talk to SCSI devices (other than the one that works OK). Thanks in advance to anyone who's been patient enough to read all this and help us. Cheers Duncan ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) 2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb @ 2003-07-24 15:44 ` Thomas Beutin 2003-07-24 17:02 ` Duncan Gibb 2003-07-24 21:58 ` Justin T. Gibbs 2003-07-24 17:47 ` Justin T. Gibbs 1 sibling, 2 replies; 8+ messages in thread From: Thomas Beutin @ 2003-07-24 15:44 UTC (permalink / raw) To: Duncan Gibb; +Cc: linux-scsi Hi Duncan, On Thu, Jul 24, 2003 at 03:36:51PM +0100, Duncan Gibb wrote: [...] > Just before I read your post, I built a 2.4.21-bk17 kernel in the hope > that this would have been fixed some time in the last couple of months. > I'm not really in to running experimental kernels, so usually I won't > stray further than the -ac series. what ist the Your version of the aic79xx driver in the 2.4.21-bk17 kernel? ----- BEGIN ------ # cat /proc/scsi/aic79xx/0 Adaptec AIC79xx driver version: 1.3.0 aic7902: Ultra320 Wide Channel A, SCSI Id=7, PCI 33 or 66Mhz, 512 SCBs Serial EEPROM: 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x17c8 0x08f4 0x0146 0x2807 0x0010 0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0x0410 0xb2d7 Channel A Target 0 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Goal: 160.000MB/s transfers (80.000MHz DT, 16bit) Curr: 160.000MB/s transfers (80.000MHz DT, 16bit) Transmission Errors 0 Channel A Target 0 Lun 0 Settings Commands Queued 48814 Commands Active 0 Command Openings 1 Max Tagged Openings 0 Device Queue Frozen Count 0 Channel A Target 1 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Goal: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Curr: 3.300MB/s transfers Transmission Errors 0 Channel A Target 1 Lun 0 Settings Commands Queued 963579 Commands Active 0 Command Openings 32 Max Tagged Openings 32 Device Queue Frozen Count 0 Channel A Target 2 Negotiation Settings User: 320.000MB/s 
transfers (160.000MHz DT|IU|QAS, 16bit) Goal: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Curr: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Transmission Errors 1 Channel A Target 2 Lun 0 Settings Commands Queued 412536 Commands Active 0 Command Openings 32 Max Tagged Openings 32 Device Queue Frozen Count 0 Channel A Target 3 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 4 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 5 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 6 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Goal: 3.300MB/s transfers Curr: 3.300MB/s transfers Transmission Errors 0 Channel A Target 6 Lun 0 Settings Commands Queued 1 Commands Active 0 Command Openings 1 Max Tagged Openings 0 Device Queue Frozen Count 0 Channel A Target 7 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 8 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 9 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 10 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 11 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 12 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 13 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 14 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) Channel A Target 15 Negotiation Settings User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) ----- END ------ (Especially the current transfer rate on Channel A Target 1 is somewhat less than expected.) 
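
The gap between the Goal and Curr rates on Target 1 can also be found mechanically. The sketch below is a rough illustration only: the function names and the `SAMPLE` excerpt are hypothetical, and the regular expressions assume the text layout shown in this post. The small arithmetic helper reflects how the figures in the listing relate to each other (160.000MHz at 16 bit is reported as 320.000MB/s, and 80.000MHz at 16 bit as 160.000MB/s).

```python
import re
from typing import List, Tuple

TARGET_RE = re.compile(r"Channel (\w) Target (\d+) Negotiation Settings")
RATE_RE = re.compile(r"(User|Goal|Curr):\s*([\d.]+)MB/s")


def expected_mb_per_s(rate_mhz: float, width_bits: int) -> float:
    """Rate in MB/s as the listing appears to report it: MHz times bytes per transfer."""
    return rate_mhz * width_bits / 8


def degraded_targets(text: str) -> List[Tuple[str, int, float, float]]:
    """Return (channel, target, goal, current) where the current rate is below the goal."""
    found = []
    # re.split with two capture groups yields [prefix, chan, tgt, body, chan, tgt, body, ...]
    parts = TARGET_RE.split(text)
    for i in range(1, len(parts), 3):
        channel, target, body = parts[i], int(parts[i + 1]), parts[i + 2]
        rates = {name: float(value) for name, value in RATE_RE.findall(body)}
        if "Goal" in rates and "Curr" in rates and rates["Curr"] < rates["Goal"]:
            found.append((channel, target, rates["Goal"], rates["Curr"]))
    return found


# Hypothetical excerpt: targets 0 to 2 from the listing in this post.
SAMPLE = (
    "Channel A Target 0 Negotiation Settings "
    "User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) "
    "Goal: 160.000MB/s transfers (80.000MHz DT, 16bit) "
    "Curr: 160.000MB/s transfers (80.000MHz DT, 16bit) "
    "Channel A Target 1 Negotiation Settings "
    "User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) "
    "Goal: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) "
    "Curr: 3.300MB/s transfers "
    "Channel A Target 2 Negotiation Settings "
    "User: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) "
    "Goal: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) "
    "Curr: 320.000MB/s transfers (160.000MHz DT|IU|QAS, 16bit) "
)

if __name__ == "__main__":
    for channel, target, goal, curr in degraded_targets(SAMPLE):
        print(f"Channel {channel} Target {target}: goal {goal} MB/s, current {curr} MB/s")
```

On this excerpt only Target 1 is reported, which matches the observation above.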

Maybe there is a new driver by Justin Gibbs, but I didn't find
anything for the 2.4.21 kernel in
http://people.freebsd.org/~gibbs/linux/SRC/

Do you think the problem would go away with a non-SMP kernel?
(I have no problem compiling a new kernel, but the machine is in
a server room and I have to make an appointment with the security
guards to enter the room where the server is, so I don't want to
reboot this beast while sitting 30 minutes away from it...)

Greetings from Berlin,
-tb
--
Thomas Beutin tb@laokoon.IN-Berlin.DE
Beam me up, Scotty. There is no intelligent live down in Redmond.
* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
2003-07-24 15:44 ` Thomas Beutin
@ 2003-07-24 17:02 ` Duncan Gibb
2003-07-24 21:58 ` Justin T. Gibbs
1 sibling, 0 replies; 8+ messages in thread
From: Duncan Gibb @ 2003-07-24 17:02 UTC (permalink / raw)
To: Thomas Beutin; +Cc: linux-scsi

On Thu, 2003-07-24 at 16:44, Thomas Beutin wrote:

DG> I built a 2.4.21-bk17 kernel in the hope that this would
DG> have been fixed

TB> what ist the Your version of the aic79xx driver in the
TB> 2.4.21-bk17 kernel?

It's 1.3.10, which I believe is the most recent. I didn't want to go
blasting it with Justin's source files as they are only advertised as
working for 2.4.20.

TB> Maybe there is a new driver by Justin Gibbs, but i didn't
TB> found anything for the 2.4.21 kernel in
TB> http://people.freebsd.org/~gibbs/linux/SRC/

I also (eventually) managed to compile 2.6.0-test1-ac3 (that DVB code
is a bit of a mess, isn't it?). That kernel has aic79xx 1.3.9, but I
can reproduce the problem in a slightly less severe form. The scanner
has gone haywire (magnification seems to be locked at maximum), so I
couldn't do a scan that would transfer enough data to cause the lockup.
Also, putting a CD in the drive no longer hangs the SCSI subsystem. But
a recoverable SCSI death still happens if you try even a simple thing
like "cdrecord dev=1,3,0 -atip"...
-- lockup under 2.6.0-test1-ac3 ------------------------------------
scsi1:0:3:0: Attempting to abort cmd f7c37b00: 0x1b 0x0 0x0 0x0 0x1 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x94 Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0]
SAVED_MODE[0x11] DFFSTAT[0x31]:(CURRFIFO_1|FIFO0FREE|FIFO1FREE)
SCSISIGI[0x48]:(P_DATAIN|SELI) SCSIPHASE[0x0] SCSIBUS[0x88]
LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x10]:(FASTMODE)
SEQINTCTL[0x80]:(INTVEC1DSL) SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED)
SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0]
PERRDIAG[0x8]:(AIPERR) SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0x0]
SCB Count = 4 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0
qinstart = 8971 qinfifonext = 8971
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
  3 FIFO_USE[0x0] SCB_CONTROL[0x44]:(DISCONNECTED|DISCENB) SCB_SCSIID[0x37]
Total 1
Kernel Free SCB list: 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO)
SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0
CCSGCTL[0x10]:(SG_CACHE_AVAIL)
scsi1: FIFO1 Free, LONGJMP == 0x81ec, SCB 0x3
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x4]:(DIRECTION) DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO)
SHADDR = 0x00, SHCNT = 0x0 HADDR = 0x00, HCNT = 0x0
CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]:(ENOVERRUN|ENIOERR) CCSCBCTL[0x4]:(CCSCBDIR)
scsi1: REG0 == 0x3, SINDEX = 0x100, DINDEX = 0x1c0
scsi1: SCBPTR == 0x3, SCB_NEXT == 0xffc0, SCB_NEXT2 == 0xffc3
CDB 1b 0 0 0 1 0
STACK: 0x23 0x14 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
DevQ(0:6:0): 0 waiting
DevQ(0:6:1): 0 waiting
DevQ(0:6:2): 0 waiting
DevQ(0:6:3): 0 waiting
DevQ(0:6:4): 0 waiting
DevQ(0:6:5): 0 waiting
DevQ(0:6:6): 0 waiting
DevQ(0:6:7): 0 waiting
(scsi1:A:3:0): Device is disconnected, re-queuing SCB
Recovery code sleeping
(scsi1:A:3:0): Abort Message Sent
-- five-second pause
Recovery code awake
Timer Expired
Recovery code sleeping
-- five-second pause
Recovery code awake
Timer Expired
scsi1: Device reset returning 0x2003
Recovery SCB completes
Recovery SCB completes
-- ten-second pause
scsi: Device offlined - not ready after error recovery: host 1 channel 0 id 3 lun 0
-- user process unfreezes
-- lockup under 2.6.0-test1-ac3 ------------------------------------
According to dmesg, both the really-scsi and the ide-scsi CD drives
should work:
# dmesg | grep sr
sr0: scsi3-mmc drive: 59x/61x caddy
Attached scsi CD-ROM sr0 at scsi1, channel 0, id 3, lun 0
sr1: scsi3-mmc drive: 0x/0x caddy
Attached scsi CD-ROM sr1 at scsi2, channel 0, id 0, lun 0
# eject /dev/scd1
(the scsi-ide one) works perfectly, but
# eject /dev/scd0
eject: unable to find or open device for: `/dev/scd0'
(dmesg records "cdrom: open failed").
TB> Do You think the problem goes away by using a non SMP kernel?
To be honest, I have tried so many kernels I have forgotten which ones
I tested with SMP disabled. I will have another go shortly (must get
some real work done).
Cheers
Duncan
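For anyone comparing several of these card-state dumps, the register tokens in the dump above follow a regular `NAME[0xNN]:(FLAG|FLAG)` shape and can be pulled out mechanically. The following is only a minimal sketch: the regular expression and the function name `parse_registers` are illustrative assumptions derived from the text quoted in this thread, not from the aic79xx driver source.

```python
import re

# Matches tokens such as "DFFSTAT[0x31]:(CURRFIFO_1|FIFO0FREE|FIFO1FREE)"
# and "SCSIPHASE[0x0]" (no flag list). The pattern is an assumption based
# on the dump text in this thread, not on the driver's print routines.
REG_TOKEN = re.compile(
    r"([A-Z][A-Z0-9_]*)\[0x([0-9a-fA-F]+)\](?::\(([^)]*)\))?"
)

def parse_registers(dump: str) -> dict:
    """Map register name -> (integer value, list of decoded flag names).

    Registers printed more than once (for example once per FIFO) keep
    only their first occurrence.
    """
    regs = {}
    for name, value, flags in REG_TOKEN.findall(dump):
        decoded = flags.split("|") if flags else []
        regs.setdefault(name, (int(value, 16), decoded))
    return regs

sample = "SCSISIGI[0x48]:(P_DATAIN|SELI) SCSIPHASE[0x0] SCSIBUS[0x88]"
for name, (value, flags) in parse_registers(sample).items():
    print(f"{name:10s} 0x{value:02x} {flags}")
```

Keeping only the first occurrence is a deliberate simplification; a per-FIFO comparison would need to split the dump at the `FIFO0`/`FIFO1` lines first.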
* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
2003-07-24 15:44 ` Thomas Beutin
2003-07-24 17:02 ` Duncan Gibb
@ 2003-07-24 21:58 ` Justin T. Gibbs
1 sibling, 0 replies; 8+ messages in thread
From: Justin T. Gibbs @ 2003-07-24 21:58 UTC (permalink / raw)
To: Thomas Beutin, Duncan Gibb; +Cc: linux-scsi
> (Especially the current transfer rate on Channel A Target 1 is
> somewhat less than expected.)
>
> Maybe there is a new driver by Justin Gibbs, but i didn't found
> anything for the 2.4.21 kernel in
> http://people.freebsd.org/~gibbs/linux/SRC/
The 2.4 archive should work on just about any 2.4.X kernel. I would
highly recommend upgrading from 1.3.0.
I'm still not clear on what your problem is. Can you provide a full
dmesg from the system and any diagnostic messages the driver generates?
--
Justin
* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
2003-07-24 14:36 ` AIC7902 lockups on Intel SMP (Re: HD somtimes hanging) Duncan Gibb
2003-07-24 15:44 ` Thomas Beutin
@ 2003-07-24 17:47 ` Justin T. Gibbs
2003-07-24 19:38 ` Thomas Beutin
1 sibling, 1 reply; 8+ messages in thread
From: Justin T. Gibbs @ 2003-07-24 17:47 UTC (permalink / raw)
To: Duncan Gibb, Thomas Beutin; +Cc: linux-scsi
> If I try to scan something, Xsane hangs for a while, then recovers OK.
> But if I put a CD in the Yamaha drive, anything which touches the scsi
> subsystem from then on will hang.
It looks like you have a very flakey bus. The BSY line is probably
glitching. In the trace you provided, we successfully selected the
device at ID 3 and are waiting for the target to take us to the first
phase. The REQ for that phase never occurs and in fact the BSY line
is not always driven. Very, very weird. The target controls both
the REQ and BSY lines, so this is either a problem with the target,
your cabling, or your termination.
--
Justin
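The point about REQ and BSY can be checked against the `SCSISIGI[0x48]` value in the dump quoted earlier in this thread. The sketch below is hedged: the bit layout is assumed from the conventional aic7xxx/aic79xx SCSISIGI register definition (the dump itself only confirms that 0x40 and 0x08 decode to `P_DATAIN` and `SELI`), and the helper name `asserted_signals` is hypothetical.

```python
# Bit layout ASSUMED from the conventional aic7xxx/aic79xx SCSISIGI
# register definition; verify against the driver's register header
# before relying on it. IOI alone is what the dump prints as P_DATAIN.
SCSISIGI_BITS = {
    0x80: "CDI",
    0x40: "IOI",
    0x20: "MSGI",
    0x10: "ATNI",
    0x08: "SELI",
    0x04: "BSYI",
    0x02: "REQI",
    0x01: "ACKI",
}

def asserted_signals(value: int) -> list:
    """Return the names of the bus signals whose bits are set in value."""
    return [name for bit, name in SCSISIGI_BITS.items() if value & bit]

signals = asserted_signals(0x48)  # SCSISIGI value from the quoted dump
print(signals)
print("BSY asserted:", "BSYI" in signals)
print("REQ asserted:", "REQI" in signals)
```

Under that assumed layout, neither the BSY nor the REQ bit is set in 0x48, which is consistent with the description of a target that was selected but never drove the bus into its first phase.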
* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
2003-07-24 17:47 ` Justin T. Gibbs
@ 2003-07-24 19:38 ` Thomas Beutin
2003-07-24 21:58 ` Justin T. Gibbs
0 siblings, 1 reply; 8+ messages in thread
From: Thomas Beutin @ 2003-07-24 19:38 UTC (permalink / raw)
To: linux-scsi
On Thu, Jul 24, 2003 at 11:47:04AM -0600, Justin T. Gibbs wrote:
> It looks like you have a very flakey bus. The BSY line is probably
> glitching. In the trace you provided, we successfully selected the
> device at ID 3 and are waiting for the target to take us to the first
> phase. The REQ for that phase never occurs and in fact the BSY line
> is not always driven. Very, very weird. The target controls both
> the REQ and BSY lines, so this is either a problem with the target,
> your cabling, or your termination.
I will give my system a try the next time I have access to the
machine, although I cannot believe that: all disks are mounted in
a plug-in for the backplane (SCA HSBP M15), and the first disk works
fine all the time.
Maybe I will change the drives in the plug-ins and the position of the
drives (but I cannot use all positions because I have to place the
IBM disk at the first position).
-tb
--
Thomas Beutin tb@laokoon.IN-Berlin.DE
Beam me up, Scotty. There is no intelligent live down in Redmond.
* Re: AIC7902 lockups on Intel SMP (Re: HD somtimes hanging)
2003-07-24 19:38 ` Thomas Beutin
@ 2003-07-24 21:58 ` Justin T. Gibbs
0 siblings, 0 replies; 8+ messages in thread
From: Justin T. Gibbs @ 2003-07-24 21:58 UTC (permalink / raw)
To: Thomas Beutin, linux-scsi
> I will give my system a try next time when i have access to the
> machine.
My response was to Duncan. I don't have enough information to say
what might be wrong with your system yet.
--
Justin