AIC7902 w/ seagate U320 drive issue on releng-4 (and current)

Don Bowman don at sandvine.com
Sat Jul 26 12:38:04 PDT 2003


have tried merging driver from current, same affects...
Machine is running releng_4.

I am seeing the following messages (below). In my config, I
have:

options         AHD_REG_PRETTY_PRINT
options         AHD_DEBUG
options         AHD_DEBUG_OPTS=0x1f

the 'test' is to do dd if=/dev/da0 of=/dev/null bs=20k
while letting other things run (in this case, tcpreplay,
but i suspect it doesn't matter).

The systems have a single seagate cheetah 15KRPM 18G drive,
as below:

# camcontrol inq 0:0:0
pass0: <SEAGATE ST318453LW 0004> Fixed Direct Access SCSI-3 device 
pass0: Serial Number 3JY0A97X000073370XXA
pass0: 320.000MB/s transfers (160.000MHz, offset 63, 16bit), Tagged Queueing
Enabled

Interestingly, a set of otherwise equal systems with the U160 version
of this drive:
# camcontrol inq 0:0:0
pass0: <SEAGATE ST318452LW 0004> Fixed Direct Access SCSI-3 device 
(pass0:ahd0:0:0:0): Handled Residual of 231 bytes
pass0: Serial Number 3EV02CLF00007236P39P
pass0: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing
Enabled

do not have any problems with this same kernel. Dropping the speed to 
160 on the problem systems does not affect the behaviour: they still crash.
(ie the ST318452LW works, the ST318453LW does not).

Once the messages below come out, i lose access to the drive (files
don't exist etc). If I reboot, the drive is no longer present on the 
scsi chain, I must power cycle the device. This is very surprising to
me. I even tried hitting the reset jumper on the motherboard, same 
affect, when in the 'bad' state, the drive must be power cycled to
restore.

System is supermicro x5dpr-8g2+ motherboard, which has onboard
AIC7902. I have two different revs of these (0x005f9005, and
0x005e9005). It doesn't seem to matter which of these rev of chips
I am using.

I would appreciate any suggestions on how to proceed.

Messages follow.

# (da0:ahd0:0:0:0): SCB 0x2 - timed out
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
ahd0: Dumping Card State at program address 0x6 Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0]
SAVED_MODE[0x11] DFFSTAT[0x33]:(CURRFIFO_NONE|FIFO0FREE|FIFO1FREE)
SCSISIGI[0x0]:(P_DATAOUT) SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x0] SEQINTCTL[0x0]
SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x8]:(BUSFREE)
SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0]
SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELT
IMO)
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0xe1]:(LQOSTOP0|LQOPKT)
   
SCB Count = 32 CMDS_PENDING = 7 LASTSCB 0x8 CURRSCB 0x0 NEXTSCB 0xff00
qinstart = 64391 qinfifonext = 64391
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
  0 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
  4 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
  9 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
  8 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
 12 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
 10 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
  2 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7]
Total 7
Kernel Free SCB list: 13 14 11 1 15 3 7 5 6 31 30 29 28 27 26 25 22 23 24 21
20 
19 18 17 16
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
   
ahd0: FIFO0 Free, LONGJMP == 0x8239, SCB 0xc
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENS
AVEP
TRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
ahd0: FIFO1 Free, LONGJMP == 0x8063, SCB 0xf
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENS
AVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x8 0x0 0x0 0xc 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0
ahd0: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x52
ahd0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
   
SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
CCSCBCTL[0x4]:(CCSCBDIR)
ahd0: REG0 == 0x9, SINDEX = 0x102, DINDEX = 0x102
ahd0: SCBPTR == 0x0, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xff98
CDB 28 0 2 3 b7 56
STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
ahd0: Handle Seqint Called for code 7
Saw Busfree.  Busfreetime = 0x0.
ahd0: Handle Seqint Called for code 26
(da0:ahd0:0:0:0): SCB 0x2 Received PKT Status of 0x2
        flags = 0x2, sense len = 0x12, pktfail = 0x0
Sense data available
Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa
0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
Jul 26 19:27:20  /kernel: <<<<<<<<<<<<<<<< Dump Card State Ends
>>>>>>>>>>>>>>>>>> 
   
Message from syslogd@ at Sat Jul 26 19:27:20 2003 ...
 /kernel: <<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>

ahd0: Setting up iocell workaround
ahd0: iocell first selection
ahd0: BYPASS now disabled
(da0:ahd0:0:0:0): Invalidating pack
ahd0: Single stepping at 0x93
ahd0: Single stepping at 0x93
(da0:ahd0:0:0:0): Invalidating pack
ahd0: Single stepping at 0x93
ahd0: Single stepping at 0x93
(da0:ahd0:0:0:0): Invalidating pack
(da0:ahd0:0:0:0): Invalidating pack
(da0:ahd0:0:0:0): Invalidating pack

>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
ahd0: Dumping Card State at program address 0x27 Mode 0x22
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0] 
SAVED_MODE[0x11] DFFSTAT[0x30]:(CURRFIFO_0|FIFO0FREE|FIFO1FREE) 
SCSISIGI[0x0]:(P_DATAOUT) SCSIPHASE[0x0] SCSIBUS[0x0] 
LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0] 
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x10]:(FASTMODE) 
SEQINTCTL[0x0] SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0] SSTAT0[0x0] 
SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8]:(AIPERR) 
SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO) LQISTAT0[0x0] 
LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] 
LQOSTAT2[0x1]:(LQOSTOP0) 

SCB Count = 32 CMDS_PENDING = 4 LASTSCB 0x9 CURRSCB 0x9 NEXTSCB 0xffc0
qinstart = 7342 qinfifonext = 7342
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
  9 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7] 
 13 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7] 
 12 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7] 
  2 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0x7] 
Total 4
Kernel Free SCB list: 4 11 15 5 8 6 3 10 7 22 14 0 31 1 30 29 27 26 25 28 23
24 21 20 19 18 17 16 
Sequencer Complete DMA-inprog list: 
Sequencer Complete list: 
Sequencer DMA-Up and Complete list: 

ahd0: FIFO0 Free, LONGJMP == 0x8277, SCB 0xd
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENS
AVEPTRS) 
SEQINTSRC[0x0] DFCNTRL[0x4]:(DIRECTION)
DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL) 
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0] 
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0 
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL) 
ahd0: FIFO1 Free, LONGJMP == 0x8057, SCB 0xb
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENS
AVEPTRS) 
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL) 
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0] 
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0 
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL) 
LQIN: 0x55 0x0 0x0 0xd 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 
ahd0: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
ahd0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1

SIMODE0[0xc]:(ENOVERRUN|ENIOERR) 
CCSCBCTL[0x4]:(CCSCBDIR) 
ahd0: REG0 == 0x9, SINDEX = 0x122, DINDEX = 0x102
ahd0: SCBPTR == 0xff0d, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0
CDB d 1 0 0 0 0
STACK: 0x15 0x125 0x0 0x0 0x257 0x257 0x17a 0x29
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
ahd0: Setting up iocell workaround
ahd0: iocell first selection
ahd0: BYPASS now disabled
(da0:ahd0:0:0:0): Invalidating pack
ahd0: Single stepping at 0x93
ahd0: Single stepping at 0x93
(da0:ahd0:0:0:0): Invalidating pack
ahd0: Single stepping at 0x93
ahd0: Single stepping at 0x93
(da0:ahd0:0:0:0): Invalidating pack
(da0:ahd0:0:0:0): Invalidating pack
(da0:ahd0:0:0:0): Invalidating pack
Jul 26 19:29:34  /kernel: <<<<<<<<<<<<<<<< Dump Card State Ends
>>>>>>>>>>>>>>>>>>

Message from syslogd@ at Sat Jul 26 19:29:34 2003 ...
 /kernel: <<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<


More information about the aic7xxx mailing list