A little story of failed raid5 (3ware 8000 series)

Sat Aug 25 02:27:49 PDT 2007

>   This isn't really accurate.  First of all, if the RAID 
> controller isn't confirming checksums before giving the data to 
> the OS, what is the checksum for exactly?

The checksum is used to recover the data in the event one piece of the data is lost. With all of the data but one piece, and the checksum, the data can be recovered. Confirming the checksum on every read would be a waste of time since the individual drives already checks the data for errors.

> It is supposed to be 
> for detecting data corruption, so if the card isn't using the 
> checksum, its kinda of useless.

You are confused. Checking for data corruption is done, by checking if the *DATA* is corrupt. This does not require looking at the RAID5 checksum since the data has its own data checksum.

> I know some RAID systems do fake 
> their checksums, as they don't actually validate data against the 
> checksums during normal reads because they don't have the 
> processing power.  I'd stay away from these type of systems 
> (cough ... Blue Arc ... cough).

It has nothing to do with processing power. It's simply a waste. The RAID 5 checksum isn't for verifying the data, it's for recovering the data if it can't be read.

> Second, most RAID systems don't use their own checksums 
> anymore.  Netapp is quite famous for their ZCS (zone checksum) 
> drives, and still uses a variation of this technology on their 
> newer systems (which are using 512 sectors).  But most RAID 
> vendors just rely on the drives own error detection and 
> correction systems (hamming code based usually, which is actually 
> pretty solid).  I'm pretty sure that that 3ware doesn't use any checksums.

You are seriously confused. You are confusing the RAID 5 checksum with the drive data checksum. We are talking about making sure the RAID 5 checksums are readable so that, if a drive fails, the data can be reconstructed from the checksum.

> However, in this particular case, validating checksums would 
> have been unhelpful, since the disk was unreadable.  diskcheckd 
> would have detected this issue.  It would probably have prevented 
> the problem, if it had been running previously.

No, it would have saved him. The problem was he lost a drive, and checksums *ON* *OTHER* *DRIVES* were unreadable. Quite possibly they had been unreabable for months, but were never checked, since they are only *needed* to reconstruct the data.

> ZFS is also a good option.  It has file level checksumming.  
> ZFS never trusts the disks, and is super paranoid.  And ZFS can 
> do background scrubbing too.  I can't wait for ZFS in FreeBSD 7, 
> because ZFS in software is going to 10 x better than anything 3ware has.

That wouuld not have helped him one bit. When the drive failed, the RAID 5 checksums on the other drives still would not have been scrubbed. The RAID 5 checksum (technically an XOR) is only needed to recover the RAID 5 array if a drive (or sector) fails.

The only thing that will fix this is specifically verifying the RAID 5 checksum blocks. If a controller provides no way to do this, it is badly broken. If a verify operation does not verify the checksum blocks, it is broken.

DS