[Bug 257094] SSD detach / reattach 2-3 times a day, WRITE(10). CDB: 2a 00 06 15 ec 28 00 00 08 00 / CAM status: SCSI Status Error

From: <bugzilla-noreply_at_freebsd.org>
Date: Sat, 10 Jul 2021 10:51:19 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=257094

            Bug ID: 257094
           Summary: SSD detach / reattach 2-3 times a day, WRITE(10). CDB:
                    2a 00 06 15 ec 28 00 00 08 00 / CAM status: SCSI
                    Status Error
           Product: Base System
           Version: 12.2-RELEASE
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: nerozero@gmail.com

Hello, 

facing issue for quite a while. SSD detach/reattach again in a 3-10 sec. Two
identical SSD drives are in the ZFS mirror used as a rootfs.

I had similar issues on the other machine with generic hard drives which was
fixed by disabling hard drive built-in EPC/APM. But this SSD doesn't seems to
support EPC, and disabling or setting it to 254 (maximum performance) has
little to no effect. The system is literally 4-5 month old. SSD  drives long
smart test shows no errors. Replacing SSD drives with same new ones produces
same result... Tested SSD on a linux desktop pc basic stand-by for a week - no
issues observed. 

I also have tried to read 100MB blocks from drive with dd every 2 hours which
reduced detach / reattach frequency from 3-5 times per 24 hour to 1-2.

also camcontrol failng to set APM values while smartctl has no issues doing the
same:

# camcontrol apm /dev/da1 -l 254
camcontrol: ATA SETFEATURES ENABLE APM via pass_16 failed

So i have a strong filling that this looks like a bug... 

logs: 
----- 8< -----
/boot/loader.conf
    mrsas_load=YES
    hw.mfi.mrsas_enable=1



Hardware:
    Platform: Dell PowerEdge R540
    Storage Controller: PERC H730P Adapter / JBOD
    Drive - SSDx2: KINGSTON SA400S3 120GB
    Drive - SASx4: TOSHIBA MG04SCA40ENY 


Kernel messages:
    Jul  8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS
THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00 
    Jul  8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul  8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul  8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
    Jul  8 08:17:56 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  8 08:17:56 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul  8 08:17:56 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
    Jul  8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
    Jul  8 08:18:13 vmhost kernel: mrsas0: System PD created target ID: 0x1
    Jul  8 08:18:13 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot:
2 phys at slot 1
    Jul  8 08:18:13 vmhost kernel: ses2:  phy 0: SATA device
    Jul  8 08:18:13 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c1
    Jul  8 08:18:13 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul  8 08:18:13 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul  8 08:18:13 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  8 08:18:13 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul  8 08:18:13 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul  8 08:18:13 vmhost kernel: da1: 150.000MB/s transfers
    Jul  8 08:18:13 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
    Jul  8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS
THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00 
    Jul  8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul  8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul  8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
    Jul  8 19:29:39 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  8 19:29:39 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul  8 19:29:39 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
    Jul  8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
    Jul  8 19:29:54 vmhost kernel: mrsas0: System PD created target ID: 0x1
    Jul  8 19:29:54 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot:
2 phys at slot 1
    Jul  8 19:29:54 vmhost kernel: ses2:  phy 0: SATA device
    Jul  8 19:29:54 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c1
    Jul  8 19:29:54 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul  8 19:29:54 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul  8 19:29:54 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  8 19:29:54 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul  8 19:29:54 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul  8 19:29:54 vmhost kernel: da1: 150.000MB/s transfers
    Jul  8 19:29:54 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
    Jul  9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 06
1c 38 58 00 00 08 00 
    Jul  9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul  9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul  9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
    Jul  9 09:47:07 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  9 09:47:07 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul  9 09:47:07 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
    Jul  9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
    Jul  9 09:47:23 vmhost kernel: mrsas0: System PD created target ID: 0x1
    Jul  9 09:47:23 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  9 09:47:23 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul  9 09:47:23 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul  9 09:47:23 vmhost kernel: da1: 150.000MB/s transfers
    Jul  9 09:47:23 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
    Jul  9 09:47:23 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot:
2 phys at slot 1
    Jul  9 09:47:23 vmhost kernel: ses2:  phy 0: SATA device
    Jul  9 09:47:23 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c1
    Jul  9 09:47:23 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul  9 09:47:23 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul  9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): ATA COMMAND PASS
THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00 
    Jul  9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): CAM status: SCSI Status
Error
    Jul  9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): SCSI status: OK
    Jul  9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): Invalidating pack
    Jul  9 10:35:42 vmhost kernel: da0 at mrsas0 bus 1 scbus17 target 0 lun 0
    Jul  9 10:35:42 vmhost kernel: da0: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul  9 10:35:42 vmhost kernel: mrsas0: 
    Jul  9 10:35:42 vmhost kernel: 
    Jul  9 10:35:42 vmhost kernel: System PD deleted target ID: 0x0
    Jul  9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): Periph destroyed
    Jul  9 10:35:57 vmhost kernel: mrsas0: System PD created target ID: 0x0
    Jul  9 10:35:57 vmhost kernel: ses2: pass2,da0 in 'Drive Slot 0', SAS Slot:
2 phys at slot 0
    Jul  9 10:35:57 vmhost kernel: ses2:  phy 0: SATA device
    Jul  9 10:35:57 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c0
    Jul  9 10:35:57 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul  9 10:35:57 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul  9 10:35:57 vmhost kernel: da0 at mrsas0 bus 1 scbus17 target 0 lun 0
    Jul  9 10:35:57 vmhost kernel: da0: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul  9 10:35:57 vmhost kernel: da0: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul  9 10:35:57 vmhost kernel: da0: 150.000MB/s transfers
    Jul  9 10:35:57 vmhost kernel: da0: 114473MB (234441648 512 byte sectors)
    Jul  9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS
THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00 
    Jul  9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul  9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul  9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
    Jul  9 14:49:31 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  9 14:49:31 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul  9 14:49:32 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
    Jul  9 14:49:32 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
    Jul  9 14:49:44 vmhost kernel: mrsas0: System PD created target ID: 0x1
    Jul  9 14:49:45 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot:
2 phys at slot 1
    Jul  9 14:49:45 vmhost kernel: ses2:  phy 0: SATA device
    Jul  9 14:49:45 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c1
    Jul  9 14:49:45 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul  9 14:49:45 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul  9 14:49:45 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  9 14:49:45 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul  9 14:49:45 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul  9 14:49:45 vmhost kernel: da1: 150.000MB/s transfers
    Jul  9 14:49:45 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
    Jul  9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS
THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00 
    Jul  9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul  9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul  9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
    Jul  9 19:54:58 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  9 19:54:58 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul  9 19:54:58 vmhost kernel: mrsas0: 
    Jul  9 19:54:58 vmhost kernel: 
    Jul  9 19:54:58 vmhost kernel: System PD deleted target ID: 0x1
    Jul  9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
    Jul  9 19:55:13 vmhost kernel: mrsas0: System PD created target ID: 0x1
    Jul  9 19:55:13 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot:
2 phys at slot 1
    Jul  9 19:55:13 vmhost kernel: ses2:  phy 0: SATA device
    Jul  9 19:55:13 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c1
    Jul  9 19:55:13 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul  9 19:55:13 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul  9 19:55:13 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul  9 19:55:13 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul  9 19:55:13 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul  9 19:55:13 vmhost kernel: da1: 150.000MB/s transfers
    Jul  9 19:55:13 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
    Jul  9 21:49:13 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 07
19 f9 78 00 00 08 00 
    Jul  9 21:49:13 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul  9 21:49:13 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul 10 04:01:18 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 06
0f 73 88 00 00 08 00 
    Jul 10 04:01:18 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul 10 04:01:18 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 06
15 ec 28 00 00 08 00 
    Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status
Error
    Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
    Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
    Jul 10 12:58:21 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul 10 12:58:21 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2>  s/n
xxxxxxxxxxxxxxxxxx     detached
    Jul 10 12:58:21 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
    Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
    Jul 10 12:58:41 vmhost kernel: mrsas0: System PD created target ID: 0x1
    Jul 10 12:58:41 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot:
2 phys at slot 1
    Jul 10 12:58:41 vmhost kernel: ses2:  phy 0: SATA device
    Jul 10 12:58:41 vmhost kernel: ses2:  phy 0: parent 500056b36d81e5ff addr
500056b36d81e5c1
    Jul 10 12:58:41 vmhost kernel: ses2:  phy 1: SAS device type 0 phy 0
    Jul 10 12:58:41 vmhost kernel: ses2:  phy 1: parent 0 addr 0
    Jul 10 12:58:41 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
    Jul 10 12:58:41 vmhost kernel: da1: <ATA KINGSTON SA400S3 B1E2> Fixed
Direct Access SPC-4 SCSI device
    Jul 10 12:58:41 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx    
    Jul 10 12:58:41 vmhost kernel: da1: 150.000MB/s transfers
    Jul 10 12:58:41 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)


Thanks

-- 
You are receiving this mail because:
You are the assignee for the bug.