[OmniOS-discuss] Hung ZFS Pool

Brian Hechinger wonko at 4amlunch.net
Wed Dec 9 16:21:01 UTC 2015


Also, I would expect the other slice to be affected as well?  It’s been humming along just fine as SLOG with no errors:

        logs
          mirror-3    ONLINE       0     0     0
            c4t1d0s0  ONLINE       0     0     0
            c5t1d0s0  ONLINE       0     0     0

> On Dec 9, 2015, at 11:17 AM, Dan McDonald <danmcd at omniti.com> wrote:
> 
> 
>> On Dec 9, 2015, at 11:13 AM, Brian Hechinger <wonko at 4amlunch.net> wrote:
>> 
>> I didn’t know about pgrep, no. :)
> 
> The Solaris/illumos ptools are a huge win.  Learn about 'em.  :)
> 
> Back to the main discussion...
> 
>> So the ‘zpool clear’ has fixed things a bit. The touch processes have all exited.
>> 
>> I can now touch a file on that pool.
>> 
>> A zpool scrub later and this is the status:
>> 
>> pool: zoom
>> state: ONLINE
>> status: One or more devices has experienced an unrecoverable error.  An
>>       attempt was made to correct the error.  Applications are unaffected.
>> action: Determine if the device needs to be replaced, and clear the errors
>>       using 'zpool clear' or replace the device with 'zpool replace'.
>>  see: http://illumos.org/msg/ZFS-8000-9P
>> scan: scrub repaired 6K in 0h0m with 0 errors on Wed Dec  9 10:25:33 2015
>> config:
>> 
>>       NAME          STATE     READ WRITE CKSUM
>>       zoom          ONLINE       0     0     0
>>         mirror-0    ONLINE       0     0     0
>>           c4t1d0s1  ONLINE       0     0     0
>>           c5t1d0s1  ONLINE       0     0     2
>> 
>> errors: No known data errors
>> 
>> I’m going to try to re-run iozone later and see if I can’t get it to happen again.
>> 
>> This is concerning.
> 
> I see this, and I think "c5t1d0" is broken HW and needs to be replaced.
> 
> Combine that with "unrecoverable IO failures" and you really should be planning to replace that drive.
> 
> Dan
> 



More information about the OmniOS-discuss mailing list