MFC of "Large set of CAM improvements" breaks I/O to Adaptec
29160 SCSI controller
Scott Long
scottl at samsco.org
Tue Apr 27 20:26:17 UTC 2010
On Apr 27, 2010, at 2:20 PM, Andy Farkas wrote:
> On Wed, Apr 28, 2010 at 6:12 AM, Matthew Jacob <mj at feral.com> wrote:
>
>> Does anything time out (eventually)?
>
> No. I left it sitting overnight and it was still deadlocked
> in the morning...
>
A couple of possible scenarios here:
1. A command completed with an error, that error was reported up to the periph layer, and the periph failed to properly handle it, leading to a lost command that eventually livelocked the VM/block layer.
2. An error happened the transport layer, and the aic7xxx tried to freeze the CAM queues to perform error recovery. Something broke in the freeze/unfreeze API, so the aic7xxx was left stranded.
The more I think about it, it's likely case 2, since I know that Alexander has been working in or near that code.
Scott
More information about the freebsd-scsi
mailing list