Notifications
Clear all

I/O Error With No Additional Errors On Validation

(@jkkuehne)
New Member

Environment:

MacOS Big Sur 11.4

SoftRAID 6.0.5

Enclosure: OWC Express 4M2 4-Slot M.2 NVMe SSD Enclosure.

Drives: Sabrent RocketQ 2TB

Formatting: APFS Raid 0 Stripe

Scenario:

While performing some pretty intensive video conversion activities SoftRAID indicated an I/O error on the enclosure.  It subsequently took the array offline.  Shutting down the machine and resetting the cables (just in case) and restarting produced no errors.  I also ran a validation on the array for additional confirmation.

Issue:

The original error message on screen (attached) did not indicate which drive had the i/o error. Validation did not produce any errors.  Error log at the time of the issue as follows:

Jul 09 0626 - SoftRAID Driver: SoftRAID driver loaded, version 6.0.5.
Jul 09 0633 - SoftRAID Monitor: Starting SMART test on all disks which support SMART.
Jul 09 0638 - SoftRAID Monitor: Finished SMART test on all disks. No disks failed the SMART test.
Jul 09 0636 - SoftRAID Monitor: Starting SMART test on all disks which support SMART.
Jul 09 0638 - SoftRAID Monitor: SoftRAID Pro license activated for all users on this computer (nickname “iMac Pro”). This license is for all disks.
Jul 09 0642 - SoftRAID Monitor: Starting SMART test on all disks which support SMART.
Jul 09 0642 - SoftRAID Monitor: Finished SMART test on all disks. No disks failed the SMART test.
Jul 09 1604 - SoftRAID Driver: A disk (disk2, SoftRAID ID: 08BFE7D3990AD780) for the SoftRAID volume "RocketQ-RAID" (disk12) was removed or stopped responding while the volume was mounted and in use.
Jul 09 1640 - SoftRAID Application: Launching application: SoftRAID version 6.0.5
Jul 09 1603 - SoftRAID Application: Quitting application
Jul 09 1738 - SoftRAID Monitor: Starting SMART test on all disks which support SMART.
Jul 09 1738 - SoftRAID Monitor: Finished SMART test on all disks. No disks failed the SMART test.
Jul 09 1747 - SoftRAID Monitor: Starting SMART test on all disks which support SMART.
Jul 09 1753 - SoftRAID Monitor: Starting SMART test on all disks which support SMART.
Jul 09 1753 - SoftRAID Monitor: Finished SMART test on all disks. No disks failed the SMART test.
Jul 09 1705 - SoftRAID Application: Launching application: SoftRAID version 6.0.5
Jul 09 1705 - SoftRAID Monitor: SoftRAID Pro license activated for all users on this computer (nickname “iMac Pro”). This license is for all disks.
Jul 09 1747 - SoftRAID Driver: SoftRAID driver loaded, version 6.0.5.
Jul 09 1751 - SoftRAID Driver: The SoftRAID volume "RocketQ-RAID" () previously encountered a read or write error. Please use the SoftRAID application to determine which disks needs to be replaced.
Jul 09 1901 - SoftRAID Driver: The volume "RocketQ-RAID" (disk13) has started validating.
Jul 09 2031 - SoftRAID Driver: The volume "RocketQ-RAID" (disk13) validated successfully.
Jul 09 2313 - SoftRAID Application: Quitting application

I am thinking  to reformat the array to HFS+ since SoftRAID has limited capabilities with APFS and shows the volume as "unmounted".

Not sure if I should suspect a drive or the array on-board interface as having an intermittent issue.

Any suggestions appreciated.

thanks

John

SoftRAID Error
SoftRAID Status
Quote
Topic starter Posted : 10/07/2021 12:37 pm
Topic Tags
(@softraid-support)
Member Admin

If there is an error on the volume, and not on any disk, then the error originated reading/writing to the volume. (a failure to communicate).

A disk did not report an error. If one had, it would have logged it in this case. These kinds of errors can also be from the file system, SoftRAID is going to report any errors, not ignore them.

But if the array also went off line, I would guess it is the enclosure that hung. that enclosure does not have activity lights to help out in this case. See if it happens again.

ReplyQuote
Posted : 10/07/2021 8:00 pm
(@jkkuehne)
New Member

@softraid-support Thanks for the reply.

Just to be safe I went ahead and did the following:

1. Moved everything off of the volume.

2. Erased the volume which created a HFS+ volume

3. Deleted the volume.

4. Built a new RAID 4 volume.

5. Performed a Verify - no errors.

I was ok to take a bit of a speed hit to provide a little safety since this is where I work on "current" projects.

Will probably order a spare NVme drive as well.

ReplyQuote
Topic starter Posted : 10/07/2021 9:27 pm
Share:
close
open