MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

Andy Farkas chuzzwassa at gmail.com
Tue Apr 27 08:43:55 UTC 2010


Hi, firstly:

RELENG_8 csup'd with date=2010.02.14.00.00 works perfectly for days.

RELENG_8 csup'd with date=2010.02.15.00.00 dead-locks the disk I/O
subsystem. Network still operational but anything needing disk hangs.
Power-cycle required.

kernel config is GENERIC with KDB, DDB and BREAK_TO_DEBUGGER options added.

hardware:
ahc0: <Adaptec 29160 Ultra160 SCSI adapter> port 0x4000-0x40ff mem
0xefa00000-0xefa00fff irq 16 at device 0.0 on pci10
ahc0: [ITHREAD]
aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs

da0: <SEAGATE ST3146707LW 0005> Fixed Direct Access SCSI-3 device
da1: <SEAGATE ST3146707LW 0005> Fixed Direct Access SCSI-3 device


The dead-lock can happen at any time, but I can provoke it by running
a bonnie++ disk test. It happens doing rm -rf /usr/obj/usr and it has
happened doing a make installworld. It can survive a make buildworld
(the system runs normally until it decides to dead-lock).

The box (HP ProLiant ML 110) has 2 scsi disks and 4 sata disks. The
2010.02.15 kernel will run perfectly for days on the SATA disks. *Only*
when the scsi disks are accessed will the system dead-lock. Note that
the SATA disks do not work either if the system has dead-locked.

I can provide more details and a vmcore.0 if anyone is interested.

-andyf


More information about the freebsd-scsi mailing list