[OmniOS-discuss] iscsi timeouts
Saso Kiselkov
skiselkov.ml at gmail.com
Tue Jan 21 22:09:55 UTC 2014
On 1/21/14, 10:01 PM, Tobias Oetiker wrote:
> Hi Nld,
>
> Today Narayan Desai wrote:
>
>> Sorry, I should have given the requisite "yes, I know that this is a recipe
>> for sadness, for I too have experienced said sadness".
>>
>> That said, we've seen this kind of problem when there was a device in a
>> vdev that was dying a slow death. There wouldn't necessarily be any sign,
>> aside from insanely high service times on an individual device in the pool.
>> From this, I assume that ZFS is still sensitive to variation in underlying
>> drive performance.
>>
>> Tobi, what do your drive service times look like?
>> -nld
>
> the drives seem fine, smart is not reporting anything out of the
> ordinary and also iostat -En shows 0 on all counts
>
> I don't think it is a disk issue, but rather something connected
> with the network ...
>
> On times the machine becomes unreachable for some time, and then it
> is possible to login via console and all seems well internally.
> setting the network interface offline and then online again using
> the dladm tool brings the connectivity back immediatly. waiting
> helps as well ... since the problem sorts itself out after a few
> seconds to minutes ...
>
> we just had another 'off the net' periode for 30 minutes
>
> unfortunately omnios itself does not seem to realize that something
> is off, at least dmesg does not show any kernel messages about this
> problem ...
>
> we have several systems running on the S2600CP MB ... this is the
> only one showing problems ...
>
> the next thing I intend todo is to upgrade the MB firmware since I
> found that this box has an older version than the other ones ...
>
> System Configuration: Intel Corporation S2600CP
> BIOS Configuration: Intel Corp. SE5C600.86B.01.06.0002.110120121539 11/01/2012
>
> other ideas, most welcome !
You mentioned a couple of e-mails back that you're using Intel I350s.
Can you verify that your kernel has:
commit 43ae55058ad99c869a9ae39d039490e8a3680520
Author: Dan McDonald <danmcd at nexenta.com>
Date: Thu Feb 7 19:27:18 2013 -0500
3534 Disable EEE support in igb for I350
Reviewed by: Robert Mustacchi <rm at joyent.com>
Reviewed by: Jason King <jason.brian.king at gmail.com>
Reviewed by: Marcel Telka <marcel at telka.sk>
Reviewed by: Sebastien Roy <sebastien.roy at delphix.com>
Approved by: Richard Lowe <richlowe at richlowe.net>
I guess you can check for this string at runtime:
$ strings /kernel/drv/amd64/igb | grep _eee_support
If it is missing, then it could be the buggy EEE support that's throwing
your link out of whack here.
Cheers,
--
Saso
More information about the OmniOS-discuss
mailing list