[OmniOS-discuss] Multiple faulty SSD's ?

Piotr Jasiukajtis estibi at me.com
Tue Jul 26 18:30:57 UTC 2016


I don’t know a root cause, but it’s better to have a workaround than a corrupted pools. 

--
Piotr Jasiukajtis

> On 26 Jul 2016, at 20:06, Dan McDonald <danmcd at omniti.com> wrote:
> 
> I wonder if those sd.conf changes should be upstreamed or not?
> 
> Dan
> 
> Sent from my iPhone (typos, autocorrect, and all)
> 
>> On Jul 26, 2016, at 1:28 PM, Piotr Jasiukajtis <estibi at me.com> wrote:
>> 
>> You may want to force the driver to use 4k instead of 512b for those drivers and create a new pool:
>> 
>> https://github.com/joyent/smartos-live/commit/dd25937d2f9725def16f5e8dbb16a8bcbc2213d5
>> 
>> --
>> Piotr Jasiukajtis
>> 
>>> On 26 Jul 2016, at 02:24, Shaun McGuane <shaun at rackcentral.com> wrote:
>>> 
>>> Hi List,
>>> 
>>> I want to report very strange SSD behaviour on a new pool I setup.
>>> 
>>> The hardware is a HP DL180 G6 Server with the LSI 9207-8i Card
>>> And 8x 1TB Samsung SSD Pro drives. Running omnios-10b9c79
>>> 
>>> All the drives are brand spanking new setup in a raidz2 array.
>>> 
>>> Within 2 months the below has happened and there has been very
>>> Little use on this array. 
>>> 
>>> pool: SSD-TANK
>>> state: DEGRADED
>>> status: One or more devices are faulted in response to persistent errors.
>>>       Sufficient replicas exist for the pool to continue functioning in a
>>>       degraded state.
>>> action: Replace the faulted device, or use 'zpool clear' to mark the device
>>>       repaired.
>>> scan: scrub repaired 23K in 1h12m with 0 errors on Mon Jul 25 20:13:04 2016
>>> config:
>>> 
>>>       NAME                       STATE     READ WRITE CKSUM
>>>       SSD-TANK                   DEGRADED     1    67    35
>>>         raidz2-0                 DEGRADED     4    72   113
>>>           c5t500253884014D0D3d0  ONLINE       0     0     2
>>>           c5t50025388401F767Ad0  DEGRADED     0     0    19  too many errors
>>>           c5t50025388401F767Bd0  FAULTED      0     0     0  too many errors
>>>           c5t50025388401F767Dd0  ONLINE       0     0     0
>>>           c5t50025388401F767Fd0  ONLINE       0     0     1
>>>           c5t50025388401F7679d0  ONLINE       0     0     2
>>>           c5t50025388401F7680d0  REMOVED      0     0     0
>>>           c5t50025388401F7682d0  ONLINE       0     0     1
>>> 
>>> Can anyone suggest why I would have this problem where I am seeing CKSUM errors
>>> On most disks and while only one has faulted others have been degraded or removed.
>>> 
>>> Thanks
>>> Shaun
>>> _______________________________________________
>>> OmniOS-discuss mailing list
>>> OmniOS-discuss at lists.omniti.com
>>> http://lists.omniti.com/mailman/listinfo/omnios-discuss
>> 
>> _______________________________________________
>> OmniOS-discuss mailing list
>> OmniOS-discuss at lists.omniti.com
>> http://lists.omniti.com/mailman/listinfo/omnios-discuss



More information about the OmniOS-discuss mailing list