6.1RC-2 ciss Driver hangs on Rebuild for Internal Drives with
external MSA20 attached
Karl Pielorz
kpielorz at tdx.co.uk
Mon May 8 08:45:34 UTC 2006
Hi All,
We've recently added an MSA20 external SATA enclosure to our HP Proliant
DL380 server.
While testing, we found the following problem:
If you fail an internal RAID array, when the system starts rebuilding it -
any disk access to ciss0 will 'hang' - killing the server.
The rebuild does complete OK (guess that's the controller continuing to do
it in-background) - but the machine never recovers from the hang.
If you remove the MSA20 - internal drives can be failed, and will rebuild
fine. Additionally - if you fail a drive in the MSA20 when it's attached -
it will fail, and rebuild perfectly Ok (no hangs).
I've also noticed the following appears logged, when the machine is going
to hang:
"
ciss0: ** Hot-plug drive inserted: SCSI port 2 ID 5
ciss0: ** State change, logical drive 2
[server hangs at this point - after ~30 sec you get...]
ciss0: error sending 195 LUN command (35) <---- Presumably not good :)
ciss0: Warning, cannot get physical lun list
ciss0: logical drive 2 (da2) changed status interim recovery->ready for
recovery, spare status 0x0
"
Any suggestions? - We've checked the firmware on both the server, and the
MSA20 is the latest. The cable is the HP supplied one.
Thanks,
-Karl
You can find a full verbose boot for this machine, admittedly under 6.1RC1
(which had the same problem) at: http://www.tdx.com/verbose_6.1rc1.txt
More information about the freebsd-scsi
mailing list