Errors in Predicting Disk Failure
Since the upgrade to version 6 I have been getting erroneous alerts that a disk is about to fail. I ran validation on my RAID volume on April 24 and again yesterday. Twice a disk failure alert has popped up when the RAID volume had not been mounted, and the app had not been running, for more than a week. When I launched SoftRAID moments ago, the Volume pane showed for about one second that a disk was about to fail, and then switched to no errors. The SoftRAID icon in the Mac menu bar is still yellow. Any suggestions? Thank you!
Thanks for attaching the support file; it made this easier.
There is a bug in the SoftRAID user interface: the disk tiles cannot display SMART data for disks that are part of a Fusion drive configuration.
The failing drive is your internal 2TB HDD. It is just starting to fail: it has 8 uncorrectable sectors and 8 pending reallocations.
When you have time, you can try using SoftRAID to "certify" the internal drive, since you have a Pro license. Certify wipes all data on a drive, but it may clarify the drive's condition. It's not easy to replace HDDs in an iMac, but it is possible to do it yourself. Or you can wait until it fails completely, but you may start getting kernel panics, and one day your system will not boot and all data on your internal drive will be lost, as data cannot be recovered from a Fusion configuration very easily.
This is what is in the log. Disk1 is your HDD internal drive.
This prediction is based on SMART data retrieved from the disk. This disk should be replaced soon. SMART Attributes used for failure prediction are: ID 5 (Reallocated Sector Count) = 0, ID 196 (Reallocated Event Count) = 0, ID 197 (Reallocation Pending Sector Count) = 8, ID 198 (Uncorrectable Sector Count) = 8.
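To make the log entry easier to read, here is a minimal Python sketch of how raw values like these could be screened. It is an illustration only, not SoftRAID's actual prediction logic: the function name `failing_attributes`, the `WATCHED_ATTRIBUTES` table, and the "anything above zero is a warning" threshold are all assumptions made for this example. Only the attribute IDs, names, and values come from the log above.

```python
# Hypothetical sketch: NOT SoftRAID's real algorithm.
# Attribute IDs and names are taken from the log entry quoted above.
WATCHED_ATTRIBUTES = {
    5: "Reallocated Sector Count",
    196: "Reallocated Event Count",
    197: "Reallocation Pending Sector Count",
    198: "Uncorrectable Sector Count",
}


def failing_attributes(raw_values, threshold=0):
    """Return (id, name, value) for each watched attribute above threshold.

    The default threshold of 0 is an assumption for illustration; a real
    tool may weigh these attributes differently.
    """
    return [
        (attr_id, name, raw_values.get(attr_id, 0))
        for attr_id, name in WATCHED_ATTRIBUTES.items()
        if raw_values.get(attr_id, 0) > threshold
    ]


# Raw values reported for Disk1 in the log above.
disk1 = {5: 0, 196: 0, 197: 8, 198: 8}

flagged = failing_attributes(disk1)
for attr_id, name, value in flagged:
    print(f"ID {attr_id} ({name}) = {value}")
```

With the values from the log, only IDs 197 and 198 are flagged, which matches the 8 pending reallocations and 8 uncorrectable sectors mentioned earlier.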
Thank you! That is very good information. Since it will cost me $400 or more from OWC to replace the fusion drive, and I hope to replace this 2017 iMac in a year or two, it seems to make more sense to use an external SSD drive, yes? Will SoftRAID continue to throw up alert notices for the failing fusion drive, or can that be disabled? If the alerts cannot be disabled, it seems I have to replace the internal drive. I currently do daily backups with Carbon Copy Cloner. Any suggestions are helpful. 😎
I do not think we can disable SMART on the fusion drive alone. That is a SoftRAID bug; the only alternative is to disable SMART checks altogether.
It's something we hope to fix when we support APFS volumes in a couple of months.
@softraid-support As a stopgap measure I will create an external SSD boot drive. Once I have done that I can Certify the Apple fusion drive as you suggested. Yes, I am currently unable to disable SMART on the fusion drive. Will it ever be an option to disable SMART reporting once the fusion drive fails?
DriveDX does confirm the failing status.
Thank you for your help!
I hope so. Once we support APFS, I am hoping the same changes will let us handle fusion drives better. Something always comes up ahead of this fix.
@softraid-support In the last few days I have installed Big Sur 11.6 on an external SSD and transferred all files. Booting from the external drive seems to be working fine. Disk Utility reports that it can unmount the internal drives. Is that a good way to put an end to SMART reporting? I still have not tried to certify the internal fusion drive, as was suggested in May. Is the bug in the user interface of SoftRAID still present?
We have not fixed the Fusion bug yet, no. What you have seems to work, so great. I would try at some point to certify the disk, however. Also, once it is "not" a Fusion drive, then SoftRAID can see all the disks OK and you can disable SMART reporting on the one disk.
I attempted to certify my Fusion drive, and immediately got an Error #0. How to proceed?
We cannot certify a "fusion" drive. If you delete the Fusion volume and leave it as two separate disks, then we can certify both. Fusion drives are in an unusual state: macOS treats them kind of like one drive, but not quite. We did not do the engineering work to be able to work with them, so you need to delete the volume first.
How do I delete the Fusion volume? Control-clicking on the Love Again volume shows a Delete option that is grayed out.
@softraid-support The Fusion volume was deleted. Only the Data volume remains. I attempted to Certify the Data volume, and am again getting Error #0.