Certification Failu...
 
Notifications
Clear all

Certification Failure 6.3

(@j-a-duke)
Active Member Customer

I've now had 4 of 5 drives of the same model fail certification with "4 errors".

What has me posting here is that the errors, from my reading of the log, are flagging the same blocks on all 4 drives.  The 5th drive is still running, but I'm expecting it to turn up with the same problem as the errors seem to turn up at around 26 hours and the 5th is only at 20.

I installed 6.3 release over 6.3b20 prior to starting the certify of the drives.  

Is this an issue with SR or really just coincidence?

I've pasted the relevant log entry below.  I've also attached a tech support report as I'm running on Monterey 12.4 here.

Thanks.

Cheers,

Jon

Jul 19 1505 - SoftRAID Application: The certify disk command for disk disk9, SN: 61H0A1WHFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 16,777,216, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1505 - SoftRAID Application: The certify disk command for disk disk9, SN: 61H0A1WHFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 33,554,432, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1505 - SoftRAID Application: The certify disk command for disk disk9, SN: 61H0A1WHFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 50,331,648, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1505 - SoftRAID Application: The certify disk command for disk disk9, SN: 61H0A1WHFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 67,108,864, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1506 - SoftRAID Application: The certify disk command for disk disk9, SN: 61H0A1WHFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) failed because this disk has unreliable sectors. It should be replaced immediately (error number = 206).
Jul 19 1757 - SoftRAID Application: The certify disk command for disk disk11, SN: 61H0A2NLFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 16,777,216, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1757 - SoftRAID Application: The certify disk command for disk disk11, SN: 61H0A2NLFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 33,554,432, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1757 - SoftRAID Application: The certify disk command for disk disk11, SN: 61H0A2NLFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 50,331,648, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1757 - SoftRAID Application: The certify disk command for disk disk11, SN: 61H0A2NLFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 67,108,864, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1758 - SoftRAID Application: The certify disk command for disk disk11, SN: 61H0A2NLFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) failed because this disk has unreliable sectors. It should be replaced immediately (error number = 206).
Jul 19 1723 - SoftRAID Application: The certify disk command for disk disk10, SN: 61H0A2AMFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 16,777,216, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1723 - SoftRAID Application: The certify disk command for disk disk10, SN: 61H0A2AMFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 33,554,432, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1723 - SoftRAID Application: The certify disk command for disk disk10, SN: 61H0A2AMFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 50,331,648, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1723 - SoftRAID Application: The certify disk command for disk disk10, SN: 61H0A2AMFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 67,108,864, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1723 - SoftRAID Application: The certify disk command for disk disk10, SN: 61H0A2AMFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) failed because this disk has unreliable sectors. It should be replaced immediately (error number = 206).
Jul 19 1736 - SoftRAID Application: The certify disk command for disk disk12, SN: 61H0A1GUFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 16,777,216, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1736 - SoftRAID Application: The certify disk command for disk disk12, SN: 61H0A1GUFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 33,554,432, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1736 - SoftRAID Application: The certify disk command for disk disk12, SN: 61H0A1GUFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 50,331,648, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1736 - SoftRAID Application: The certify disk command for disk disk12, SN: 61H0A1GUFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) encountered a verify error (offset 67,108,864, i/o block size = 16,777,216). Error during pass number = 1. This disk should be replaced immediately.
Jul 19 1737 - SoftRAID Application: The certify disk command for disk disk12, SN: 61H0A1GUFVGG, SATA bus 0, id 0, lun 0 (Thunderbolt) failed because this disk has unreliable sectors. It should be replaced immediately (error number = 206).

Quote
Topic starter Posted : 20/07/2022 8:31 am
(@softraid-support)
Member Admin

Please avoid posting long strings. better is save in text edit (make plain text), then post the file.

What if you "verify disk". This appears to be a read problem.

ReplyQuote
Posted : 20/07/2022 11:35 am
(@j-a-duke)
Active Member Customer
Posted by: @softraid-support

What if you "verify disk". This appears to be a read problem.

If I verify the disk, it completes successfully (at least for a single drive that I ran).  I've queued the remaining disks for a verify and will report back when they complete or error out.

The remaining certify process is proceeding without error so far.

If it matters, I was trying to certify 5 drives simultaneously.  I've done that using these enclosures (2 x CalDigit T3) previously, so I don't think it's the enclosure or cabling.

Should I retry the certification again?

Thanks.

Cheers,
Jon

ReplyQuote
Topic starter Posted : 21/07/2022 11:01 am
(@softraid-support)
Member Admin

@j-a-duke 

You can easily do 8 drives without affecting certify times.

The certify process is a macOS "script" essentially, it does not use the SoftRAID driver, it is completely a macOS technology performing the writes and reads. So errors should essentially not happen, without a hardware failure somewhere.

ReplyQuote
Posted : 21/07/2022 12:54 pm
(@j-a-duke)
Active Member Customer

Just to make things more interesting, I started a fresh certification round with all 5 drives and all 5 passed. 

Might have updated to the release of 6.3, but that would have been the only change.

Cheers,

Jon

ReplyQuote
Topic starter Posted : 09/08/2022 3:03 pm
(@softraid-support)
Member Admin

@j-a-duke 

Certify is a low level macOS script process. Its direct to wire, hardware testing.

Anyway, great!

ReplyQuote
Posted : 09/08/2022 3:06 pm
Share:
close
open