mps0-troubles
Joachim Tingvold
joachim at tingvold.com
Wed Feb 23 03:58:19 UTC 2011
On Mon, Feb 21, 2011, at 22:45:44PM GMT+01:00, Kenneth D. Merry wrote:
>>> Okay, good. It looks like it is running as designed.
>> It is? It still terminating the commands, which I guess it shouldn't?
>>
>> mps0: (0:40:0) terminated ioc 804b scsi 0 state c xfer 0
> Sorry, I missed that, I was just looking at the first part.
No worries. (-:
> I'm still waiting for LSI to look at the SAS analyzer trace I sent
> them for
> the "IOC terminated" bug.
>
> It appears to be (at least on my hardware) a backend issue of some
> sort,
> and probably not anything we can fix in the driver.
I see. Good to know that you're able to reproduce it, since I with
good possibility can rule out that it's a hardware-issue on my
controller.
> Since you've got an HP branded expander, that makes it a little more
> difficult to determine whether it's an LSI, Maxim, or some other
> expander.
> Can you try the following on your system? You'll need the sg3_utils
> port:
>
> sg_inq -i ses0
>
> (I need to update camcontrol to parse page 0x83 output.)
>
> [...]
>
> Maxim expanders seem to report LUN descriptors in VPD page 0x83
> instead of
> target port descriptors. We might get a slight clue from the
> output, but
> it's hard to say for certain since HP could have customized the page
> 0x83
> values in the expander firmware.
VPD INQUIRY: Device Identification page
Designation descriptor number 1, descriptor length: 12
transport: Serial Attached SCSI (SAS)
designator_type: NAA, code_set: Binary
associated with the target port
NAA 5, IEEE Company_id: 0x1438
Vendor Specific Identifier: 0x101a2865
[0x50014380101a2865]
Designation descriptor number 2, descriptor length: 8
transport: Serial Attached SCSI (SAS)
designator_type: Relative target port, code_set: Binary
associated with the target port
Relative target port: 0x1
>> It just doesn't display the 'out of chain'-errors, that's all I
>> think.
>
> Well, if you don't see the 'out of chain' errors with 2048 chain
> buffers,
> that means the condition isn't happening.
>
> The cost of going from 1024 to 2048 is only 32K of extra memory,
> which is
> not a big deal, so I think I'll go ahead and bump the limit up and
> remove
> the printfs. We've now proven the recovery strategy, so it'll just
> slow
> things down slightly if anyone runs into that issue again.
Good. It has such a small impact, yes, so it shouldn't trouble anyone.
>>> What filesystem are you using by the way?
>> ZFS.
> Interesting. I haven't been able to run out of chain elements with
> ZFS,
> but I can use quite a few with UFS. I had to artificially limit the
> number
> of chain elements to test the change.
Maybe it's because the amount of disks in the same pool that I have?
Or that I have two un-even raidz2 vdev's in the same pool? The latter
has to be forced when adding it to the pool, so I guess it's not an
"ideal" solution... (but "everyone" does it, it seems).
--
Joachim
More information about the freebsd-scsi
mailing list