7.1 Panic on degraded disk w/mpt

Charles Sprickman spork at bway.net
Mon Feb 9 22:49:11 PST 2009


On Mon, 9 Feb 2009, Scott Long wrote:

> Charles Sprickman wrote:
>> (posted on -stable already, no takers - added info: full dmesg, crash info 
>> from panic when array finished rebuilding, some comments on dmesg)
>> 
>> Howdy,
>> 
>> I dug around and can't find a PR on this, and the only other report I saw 
>> was in this mailing list post that has no replies:
>> 
>> http://www.nabble.com/7.1-BETA2-panic-on-mpt-degrade-td20183173.html
>> 
>> The hardware is a Dell PowerEdge 860 with the Dell/LSI SAS5 controller:
>> 
>> mpt0: <LSILogic SAS/SATA Adapter> port 0xec00-0xecff mem 
>> 0xfe9fc000-0xfe9fffff,0xfe9e0000-0xfe9effff irq 16 at device 8.0 on pci2
>> mpt0: MPI Version=1.5.13.0
>> 
>> The panic is repeatable by forcing the array into a degraded state.  When 
>> the array finishes rebuilding, the box also panics.
>> 
>> Here's my best shot at getting info out of kgdb (panic on array going to 
>> degraded state):
>
> I wonder if the MPT card is temporarily detaching and then reattaching
> the logical drive when the rebuild completes.

IIRC, just before the panic there is a bunch of CAM debug splattered 
across the monitor.  I can run down to the garage and snap a few pics of 
the monitor after detaching a drive.

> The info you posted is inconclusive here.  CAM (the FreeBSD SCSI layer) 
> has had some problems handling device detaches, but we've been very 
> fortunate to have someone examining and fixing this recently.

Yeah, I was looking at the commit log for cam_xpt.c and someone has been 
very busy...

> Would it be possible for you to upgrade to the most recent 8-CURRENT 
> tree, and re-run your test?  If not, I'll see about generating a 
> patchset against 7.1.

Can I get away with just updating the kernel, or is there a simple way to 
build a live-cd?  I don't want to screw with userland, but I'd boot a 
kernel if that's not too rough - but I suppose my 7.1 kgdb would not know 
what to do with the dump, right?

On the bright side, the controller is not getting so scrambled by the 
panic that it can no longer write the crashdump.  That's a positive!

I'm going to go panic it again, I'm getting curious about the messages 
before the panic...

Thanks,

Charles

> Scott
>


More information about the freebsd-scsi mailing list