DDS trouble - device hanging

Volker volker at vwsoft.com
Wed Aug 10 19:15:03 GMT 2005


Hi guys,

On a system running 5.4-STABLE I'm experiencing trouble with a DAT drive.

I used a Sony SDT-7000 (DDS-2) attached to an Adaptec 3985 for several
years without trouble (narrow SCSI).

A year ago I changed the controller to an Adaptec AAA-133B. When I
tried to run a backup some time later, it hung partway through, and I
assumed the SDT-7000 was faulty.

Now I've replaced the tape drive with a Sony SDT-11000 (DDS-4) and the
same thing happens: the backup stops _in the middle_ (whether I use tar
or bacula makes no difference).

The cabling was replaced along with the tape drive. SCSI bus
termination is OK, and the tape drive firmware and jumper settings have
been double-checked. When the error occurs, the tape drive doesn't
respond to any commands sent to it (camcontrol) and will not eject the
media (emergency eject).

In the meantime (a year ago) I also moved the server from RELENG_4 to
RELENG_5, but because the backup was disabled for months I can't say
for sure whether the FreeBSD version update or the hardware change
caused the fault.

The following error messages were taken from the console after the
backup was aborted. Please note the manual `camcontrol reset 1:6:0' at
the end (which did _not_ unfreeze the device).

How do I debug this? What can be read from the card state dump or the
debug messages? Is the controller at fault? Any hints?

I'm close to pulling out the controller and going single-channel (that
would be possible in my setup without causing additional trouble).

Thanks,

Volker

#uname -v
FreeBSD 5.4-STABLE #10: Fri May 13 16:12:28 CEST 2005

ahc2: Recovery Initiated
>>>>>>>>>>>>>>>>>> Dump Card State Begins
>>>>>>>>>>>>>>>>>> <<<<<<<<<<<<<<<<<
ahc2: Dumping Card State while idle, at SEQADDR 0x7
Card was paused
ACCUM = 0xd9, SINDEX = 0x67, DINDEX = 0x27, ARG_2 = 0x3
HCNT = 0x0 SCBPTR = 0x0
SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0x1]:(P_BUSFREE)
SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) SBLKCTL[0x2]:(SELWIDE)
SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE)
SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED)
SSTAT0[0x5]:(DMADONE|SDONE) SSTAT1[0xa]:(PHASECHG|BUSFREE)
SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x0]
SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
SXFRCTL0[0x80]:(DFON) DFCNTRL[0x0]
DFSTATUS[0x2d]:(FIFOEMP|DFTHRESH|HDONE|FIFOQWDEMP)
STACK: 0xcc 0x151 0x192 0x3
SCB count = 20
Kernel NEXTQSCB = 17
Card NEXTQSCB = 17
QINFIFO entries:
Waiting Queue entries:
Disconnected Queue entries: 0:14
QOUTFIFO entries:
Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Sequencer SCB Info:
  0 SCB_CONTROL[0x4c]:(DISCONNECTED|ULTRAENB|DISCENB) SCB_SCSIID[0x67]
SCB_LUN[0x0] SCB_TAG[0xe]
  1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
  9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
 10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
 11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
 12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
 13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
 14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
 15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
Pending list:
 14 SCB_CONTROL[0x48]:(ULTRAENB|DISCENB) SCB_SCSIID[0x67] SCB_LUN[0x0]
Kernel Free SCB list: 18 9 8 6 5 3 2 0 19 16 15 1 7 4 13 12 11 10
Untagged Q(6): 14

<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
(sa0:ahc2:0:6:0): SCB 0xe - timed out
sg[0] - Addr 0x219d028 : Length 4056
sg[1] - Addr 0x96de000 : Length 4096
sg[2] - Addr 0x15d3f000 : Length 4096
sg[3] - Addr 0x3c00000 : Length 4096
sg[4] - Addr 0xbe61000 : Length 4096
sg[5] - Addr 0xb9e2000 : Length 4096
sg[6] - Addr 0xeca3000 : Length 4096
sg[7] - Addr 0xc064000 : Length 4096
sg[8] - Addr 0x7485000 : Length 4096
sg[9] - Addr 0x41e6000 : Length 4096
sg[10] - Addr 0xdde7000 : Length 4096
sg[11] - Addr 0xb488000 : Length 4096
sg[12] - Addr 0x7689000 : Length 4096
sg[13] - Addr 0xb7ca000 : Length 4096
sg[14] - Addr 0xceb000 : Length 4096
sg[15] - Addr 0x3a4c000 : Length 3112
(sa0:ahc2:0:6:0): Queuing a BDR SCB
(sa0:ahc2:0:6:0): Bus Device Reset Message Sent
ahc2: Timedout SCBs already complete. Interrupts may not be functioning.
(sa0:ahc2:0:6:0): no longer in timeout, status = 24b
ahc2: Bus Device Reset on A:6. 1 SCBs aborted
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x0
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): NO SENSE ILI (length mismatch): -2048 asc:0,0
(sa0:ahc2:0:6:0): No additional sense information
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): NO SENSE ILI (length mismatch): -2048 asc:0,0
(sa0:ahc2:0:6:0): No additional sense information
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): NO SENSE ILI (length mismatch): -2048 asc:0,0
(sa0:ahc2:0:6:0): No additional sense information
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x0
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x0
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(sa0:ahc2:0:6:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0
(sa0:ahc2:0:6:0): Sense Error Code 0x75
(pass3:ahc2:0:6:0): Bus Device Reset Message Sent
ahc2: Bus Device Reset on A:6. 1 SCBs aborted
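
As a rough aid for reading the dump above, here is a minimal Python
sketch (not part of the console output) that totals the scatter/gather
segments of the timed-out SCB and splits the SCB_SCSIID value into its
two IDs. The function names sg_total and decode_scsiid are made up for
illustration, and the nibble layout of SCB_SCSIID (target ID in the
upper four bits, the adapter's own ID in the lower four) is an
assumption about a non-twin-channel aic7xxx controller, not something
the log itself states.

```python
import re

# Scatter/gather lines copied verbatim from the console output above.
SG_LINES = """\
sg[0] - Addr 0x219d028 : Length 4056
sg[1] - Addr 0x96de000 : Length 4096
sg[2] - Addr 0x15d3f000 : Length 4096
sg[3] - Addr 0x3c00000 : Length 4096
sg[4] - Addr 0xbe61000 : Length 4096
sg[5] - Addr 0xb9e2000 : Length 4096
sg[6] - Addr 0xeca3000 : Length 4096
sg[7] - Addr 0xc064000 : Length 4096
sg[8] - Addr 0x7485000 : Length 4096
sg[9] - Addr 0x41e6000 : Length 4096
sg[10] - Addr 0xdde7000 : Length 4096
sg[11] - Addr 0xb488000 : Length 4096
sg[12] - Addr 0x7689000 : Length 4096
sg[13] - Addr 0xb7ca000 : Length 4096
sg[14] - Addr 0xceb000 : Length 4096
sg[15] - Addr 0x3a4c000 : Length 3112
"""

# One sg[] line: index, physical address, length in bytes.
SG_RE = re.compile(r"^sg\[(\d+)\] - Addr (0x[0-9a-f]+) : Length (\d+)$", re.M)


def sg_total(text):
    """Return (segment count, total bytes) for the sg[] lines in text."""
    lengths = [int(m.group(3)) for m in SG_RE.finditer(text)]
    return len(lengths), sum(lengths)


def decode_scsiid(value):
    """Split an SCB_SCSIID byte into (target, initiator).

    Assumption: target ID in the upper nibble, the adapter's own ID in
    the lower nibble (non-twin-channel layout).
    """
    return (value >> 4) & 0x0F, value & 0x0F


if __name__ == "__main__":
    count, total = sg_total(SG_LINES)
    print(f"{count} segments, {total} bytes ({total / 1024:g} KiB)")
    target, initiator = decode_scsiid(0x67)
    print(f"SCB_SCSIID 0x67 -> target {target}, initiator {initiator}")
    print(f"SCB_TAG 0xe -> SCB {int('0xe', 16)}")
```

If those assumptions hold, the timed-out command was a single transfer
of 64512 bytes (63 KiB) in 16 segments addressed to target 6, and SCB
0xe is the same SCB 14 that appears in the pending list and the
untagged queue.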

-- 
GPG/PGP fingerprint:
FF93 13A1 2477 B631 E953 06DF 4C49 ADD9 E4BF 79B1



More information about the freebsd-scsi mailing list