tws bug ? (LSI SAS 9750)

Thomas Mueller mueller23 at insightbb.com
Sat Sep 22 08:35:01 UTC 2012


On Fri, Sep 21, 2012 at 5:37 PM, Mike Tancsa <mike at sentex.net> wrote:
> On 9/21/2012 8:03 PM, Jim Harris wrote:
>>> .
>>> then a lot of
>>> .
>>> (probe65:tws0:0:65:0): INQUIRY. CDB: 12 0 0 0 24 0
>>> (probe65:tws0:0:65:0): CAM status: Invalid Target ID
>>> (probe65:tws0:0:65:0): Error 22, Unretryable error
>>> (probe1:tws0:0:1:0): INQUIRY. CDB: 12 0 0 0 24 0
>>> (probe1:tws0:0:1:0): CAM status: Invalid Target ID
>>> (probe1:tws0:0:1:0): Error 22, Unretryable error
>>> (probe2:tws0:0:2:0): INQUIRY. CDB: 12 0 0 0 24 0
>>> (probe2:tws0:0:2:0): CAM status: Invalid Target ID
>>> .
>>> .
>>> .
>>> (probe63:tws0:0:63:0): INQUIRY. CDB: 12 0 0 0 24 0
>>> (probe63:tws0:0:63:0): CAM status: Invalid Target ID
>>> (probe63:tws0:0:63:0): Error 22, Unretryable error
>>> (probe64:tws0:0:64:0): INQUIRY. CDB: 12 0 0 0 24 0
>>> (probe64:tws0:0:64:0): CAM status: Invalid Target ID
>>> (probe64:tws0:0:64:0): Error 22, Unretryable error

>> These can be ignored.  CAM is just telling you that there are no
>> devices attached at these target IDs.

> What about a change similar to what Alexander Motin did in

> http://lists.freebsd.org/pipermail/svn-src-head/2012-June/038196.html

Jim Harris <jimharris at freebsd.org> responded:

> Ah, yes.  I was thinking you had CAM_DEBUG enabled which is why you
> were seeing this spew - but that's not the case.  This indeed should
> be fixed and not just ignored.

> Seeing the attributions on Alexander's commit, you certainly seem to
> have a monopoly on controllers that exhibit this problem on FreeBSD.
> :)

> I believe the CAM_LUN_INVALID here should be fixed as well, similar to
> the twa commit.  If you send me a revised patch I will commit it.

The specific subject of this thread is not my issue, but I did notice 
problems apparently related to CAM on a SATA hard drive.

I use one UFS partition, with FreeBSD 9.0-BETA1 installed (subsequently 
updated on another partition, using GPT as opposed to MBR), for ports tree
and also NetBSD pkgsrc and NetBSD source code.  I built NetBSD 5.1_STABLE i386
from FreeBSD and also built xorg-modular on the new NetBSD installation from
pkgsrc.  Going into and out of the newly installed Xorg resulted in some
crashes with the FreeBSD 9.0-BETA1 partition mounted and not cleanly 
unmounted.  File system was damaged, and FreeBSD fsck_ffs wouldn't fix it,
went into a loop:


Script started on Wed Sep 19 04:15:02 2012
fsck_ffs /dev/ada0p9
** /dev/ada0p9
** Last Mounted on /BETA1
** Phase 1 - Check Blocks and Sizes

CANNOT READ BLK: 7584192
CONTINUE? [yn] y

THE FOLLOWING DISK SECTORS COULD NOT BE READ: 7584318, 7584319,
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
1475900 files, 4638292 used, 21162419 free (61643 frags, 2637597 blocks, 0.2% fragmentation)

***** FILE SYSTEM STILL DIRTY *****

***** PLEASE RERUN FSCK *****

Script done on Wed Sep 19 04:17:27 2012


This happened repeatedly, meaning an impasse.

I didn't get to record preceding error messages relating to ATA and CAM but,
seeing this last message, wonder if there are some bugs in the CAM.

I booted that new NetBSD 5.1_STABLE i386 installation, on a USB stick, was
able to mount that partition and see it wasn't trashed though there was a 
message about the dirty flag.  I then umounted and ran NetBSD fsck_ffs
successfully, just a few files were lost, and FreeBSD can access that
partition again.

I still intend to be more cautious when in NetBSD, not mounting a FreeBSD
partition unnecessarily when doing something crash-prone on my system in 
NetBSD, such as going into and out of X.

Tom


More information about the freebsd-stable mailing list