[OmniOS-discuss] ZFS data corruption

wuffers moo at wuffers.net
Fri Aug 14 16:08:05 UTC 2015


A few weeks ago (while I was away on vacation), both of my VMware hosts
PSOD within a day of each other. The first time DR kicked in and VMs
restarted smoothly, but my backup didn't notice that the rebooted host
didn't reconnect to the SAN, so when the second host PSODed everything went
down. He rebooted the SAN and the hosts and everything seemed okay.

I came back and saw this on my pool:

  pool: tank
 state: ONLINE
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: http://illumos.org/msg/ZFS-8000-8A
  scan: scrub in progress since Tue Jul 28 14:10:23 2015
    19.6T scanned out of 46.3T at 80.0M/s, 97h9m to go
    0 repaired, 42.30% done

[snip config]

errors: Permanent errors have been detected in the following files:

        tank/vmware-64k-5tb-7:<0x1>

I moved all the VMs off that datastore, and had to repair an Exchange
database that was reporting some issues. I then started a scrub (as seen
above).

My plan was to delete this block device, and recreate a new datastore but
the scrub completed and now it shows:

errors: No known data errors

Should I trust this? I suppose that now that I've moved all the data on it
there can be no corruption at ZFS level (since I didn't find any hardware
issues in iostat or fmdump logs). Or would the consensus be to delete this,
recreate it and present it to VMware again?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://omniosce.org/ml-archive/attachments/20150814/d53bb693/attachment-0001.html>


More information about the OmniOS-discuss mailing list