mps0-troubles

Kenneth D. Merry ken at freebsd.org
Thu Jan 13 21:07:24 UTC 2011


On Thu, Jan 13, 2011 at 01:14:50 +0100, Joachim Tingvold wrote:
> On Wed, Jan 12, 2011, at 23:29:53PM GMT+01:00, Joachim Tingvold wrote:
> >If I were copying from the AHCI-attached disk to the mps controller,
> >and the AHCI-attached disk timed out, wouldn't this cause the disks
> >on the mps controller to time out as well?
> 
> Now it happened again (while copying from 'zroot' to 'storage'). This
> time only mps0 produced errors;
> <http://home.komsys.org/~jocke/dmesg_mps0_freebsd-scsi_2.txt>. As the
> timeout seems to be over quickly, I find it strange that whatever process
> accessed the disks (in my case, 'mv') doesn't continue once the
> disks are available -- or is this some kind of safeguard against corrupted
> data?

Did the system recover this time?

The 'out of chain frames' messages are somewhat worrisome.  From looking at
the logic in the driver (mpssas_action_scsiio() and mps_data_cb()), it
looks like if it runs out of chain frames, it won't cancel the timeout on
the command.  So you'll wind up getting timeouts.  But sending an abort for
a command that hasn't gone down to the chip is rather pointless.
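
To make the suspected failure mode concrete, here is a toy model in C.  It
is not driver code: toy_cmd, toy_submit(), toy_spurious_timeout() and
TOY_CHAIN_FRAMES are hypothetical names, and the ordering (timeout armed
before the chain frame allocation is attempted) is an assumption based on
the reading of the driver described above.

```c
#include <stdbool.h>
#include <stdio.h>

#define TOY_CHAIN_FRAMES 2	/* hypothetical pool size, tiny on purpose */

struct toy_cmd {
	bool timeout_armed;	/* a timeout is pending for this command */
	bool sent_to_chip;	/* the command actually reached the hardware */
};

int toy_free_chain_frames = TOY_CHAIN_FRAMES;

/* Hypothetical stand-in for the I/O submission path. */
void
toy_submit(struct toy_cmd *cm, int frames_needed)
{
	cm->timeout_armed = true;	/* assumed: timeout is armed first */
	cm->sent_to_chip = false;

	if (frames_needed > toy_free_chain_frames) {
		printf("out of chain frames\n");
		/* The timeout is left armed: the suspected problem. */
		return;
	}
	toy_free_chain_frames -= frames_needed;
	cm->sent_to_chip = true;
}

/* True if a timeout would fire for a command the chip never saw. */
bool
toy_spurious_timeout(const struct toy_cmd *cm)
{
	return (cm->timeout_armed && !cm->sent_to_chip);
}
```

In this model, submitting one command that needs 2 frames and then a second
that needs 1 leaves the second with its timeout armed even though it never
went down to the chip, so any abort sent for it has nothing to act on.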

Did you see any other messages before the 'out of chain frames' messages
popped up?

Try editing sys/dev/mps/mpsvar.h, and change MPS_CHAIN_FRAMES from 1024 to
2048 and see if that helps things any.

That won't fix the underlying problem, but it may help you avoid running
out of that resource.
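
The edit would look roughly like the fragment below.  This is a sketch
only: it assumes the constant is a plain #define in that header, and the
surrounding lines are not shown.  The kernel (or the mps module) has to be
rebuilt for the new value to take effect.

```c
/* sys/dev/mps/mpsvar.h (fragment; surrounding definitions omitted) */
#define MPS_CHAIN_FRAMES	2048	/* was 1024 */
```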

Ken
-- 
Kenneth Merry
ken at FreeBSD.ORG

