[OmniOS-discuss] questions
Ian Kaufman
ikaufman at eng.ucsd.edu
Thu Sep 14 15:07:06 UTC 2017
Some other things you need to take into account:
QDR Infiniband is 40Gbps, not 40GB/s. That is a factor of 8 difference.
That is also a theoretical maximum throughput, there is some overhead. In
reality, you will never see 40Gbps.
My system tested out at 6Gbps - 8Gbps using NFS over IPoIB, with DDR
(20Gbps) nodes and a QDR (40Gbps) storage server. IPoIB drops the
theoretical max rates to 18Gbps and 36Gbps respectively.
If you are getting 185MB/s, you are seeing 1.48Gbps.
Keep your B's and b's straight. Did you play with your frame size at all?
Ian
On Thu, Sep 14, 2017 at 7:10 AM, Jim Klimov <jimklimov at cos.ru> wrote:
> On September 14, 2017 2:26:13 PM GMT+02:00, Dirk Willems <
> dirk.willems at exitas.be> wrote:
> >Hello,
> >
> >
> >I'm trying to understand something let me explain.
> >
> >
> >Oracle always told to me that if you create a etherstub switch it has
> >infiniband speed 40GB/s.
> >
> >But I have a customer running on Solaris (Yeah I know but let me
> >explain) who is copy from 1 NGZ to another NGZ on the same GZ over Lan
> >(I know told him to to use etherstub).
> >
> >The copy witch is performed for a Oracle database with sql command, the
> >
> >DBA witch have 5 streams say it's waiting on the disk, the disk are 50
> >-
> >60 % busy the speed is 30 mb/s.
> >
> >
> >So I did some test just to see and understand if it's the database or
> >the system, but with doing my tests I get very confused ???
> >
> >
> >On another Solaris at my work copy over etherstub switch => copy speed
> >is 185MB/s expected much more of infiniband speed ???
> >
> >
> >root at test1:/export/home/Admin# scp test10G
> >Admin at 192.168.1.2:/export/home/Admin/
> >Password:
> >test10G 100%
> >|****************************************************************|
> >10240
> >MB 00:59
> >
> >
> >root at test2:~# dlstat -i 2
> >
> > LINK IPKTS RBYTES OPKTS OBYTES
> > net1 25.76K 185.14M 10.08K 2.62M
> > net1 27.04K 187.16M 11.23K 3.22M
> > net1 26.97K 186.37M 11.24K 3.23M
> > net1 26.63K 187.67M 10.82K 2.99M
> > net1 27.94K 186.65M 12.17K 3.75M
> > net1 27.45K 187.46M 11.70K 3.47M
> > net1 26.01K 181.95M 10.63K 2.99M
> > net1 27.95K 188.19M 12.14K 3.69M
> > net1 27.91K 188.36M 12.08K 3.64M
> >
> >The disks are all separate luns with all separated pools => disk are 20
> >
> >- 30% busy
> >
> >
> >On my OmniOSce at my lab over etherstub
> >
> >
> >root at GNUHealth:~# scp test10G witte at 192.168.20.3:/export/home/witte/
> >Password:
> >test10G 76% 7853MB 116.4MB/s
> >
> >
> >=> copy is 116.4 MB/s => expected much more from infiniband speed is
> >just the same as Lan ???
> >
> >
> >Is not that my disk can not follow 17% busy there sleeping ...
> >
> > extended device statistics
> > r/s w/s Mr/s Mw/s wait actv wsvc_t asvc_t %w %b device
> > 0,0 248,4 0,0 2,1 0,0 1,3 0,0 5,3 0 102 c1
> > 0,0 37,5 0,0 0,7 0,0 0,2 0,0 4,7 0 17 c1t0d0 =>
> >rpool
> > 0,0 38,5 0,0 0,7 0,0 0,2 0,0 4,9 0 17 c1t1d0 =>
> >rpool
> > 0,0 40,5 0,0 0,1 0,0 0,2 0,0 5,6 0 17 c1t2d0 =>
> >data pool
> > 0,0 43,5 0,0 0,2 0,0 0,2 0,0 5,4 0 17 c1t3d0 =>
> >data pool
> > 0,0 44,5 0,0 0,2 0,0 0,2 0,0 5,5 0 18 c1t4d0 =>
> >data pool
> > 0,0 44,0 0,0 0,2 0,0 0,2 0,0 5,4 0 17 c1t5d0 =>
> >data pool
> > 0,0 76,0 0,0 1,5 7,4 0,4 97,2 4,9 14 18 rpool
> > 0,0 172,4 0,0 0,6 2,0 0,9 11,4 5,5 12 20 DATA
> >
> >
> >
> >root at NGINX:/root# dlstat show-link NGINX1 -i 2
> >
> > LINK TYPE ID INDEX PKTS BYTES
> > NGINX1 rx bcast -- 0 0
> > NGINX1 rx sw -- 0 0
> > NGINX1 tx bcast -- 0 0
> > NGINX1 tx sw -- 9.26K 692.00K
> > NGINX1 rx local -- 26.00K 216.32M
> > NGINX1 rx bcast -- 0 0
> > NGINX1 rx sw -- 0 0
> > NGINX1 tx bcast -- 0 0
> > NGINX1 tx sw -- 7.01K 531.38K
> > NGINX1 rx local -- 30.65K 253.73M
> > NGINX1 rx bcast -- 0 0
> > NGINX1 rx sw -- 0 0
> > NGINX1 tx bcast -- 0 0
> > NGINX1 tx sw -- 8.95K 669.32K
> > NGINX1 rx local -- 29.10K 241.15M
> >
> >
> >On the other NGZ I receive 250MB/s ????
> >
> >
> >- So my question is how comes that the speed is equal to Lan 100MB/s on
> >
> >OmniOSce but i receive 250MB/s ?
> >
> >- Why is etherstub so slow if infiniband speed is 40GB/s ???
> >
> >
> >I'm very confused right now ...
> >
> >
> >And want to know for sure how to understand and see this in the right
> >way, because this customer will be the first customer from my who gonna
> >
> >switch complety over to OmniOSce on production and because this
> >customer
> >is one or the biggest company's in Belgium I really don't want to mess
> >up !!!
> >
> >
> >So any help and clarification will be highly appreciate !!!
> >
> >
> >Thank you very much.
> >
> >
> >Kind Regards,
> >
> >
> >Dirk
>
> I am not sure where the infiniband claim comes from, but copying data disk
> to disk, you involve the slow layers like disk, skewed by faster layers
> like cache of already-read data and delayed writes :)
>
> If you have a wide pipe that you may fill, it doesn't mean you do have the
> means to fill it with a few disks.
>
> To estimate the speeds, try pure UDP streams from process to process (no
> disk), large-packet floodping, etc.
>
> I believe etherstub is not constrained artificially, and defaults to jumbo
> frames. Going to LAN and back can in fact use external hardware (IIRC there
> may be a system option to disable that, not sure) and so is constrained by
> that.
>
> Jim
> --
> Typos courtesy of K-9 Mail on my Android
> _______________________________________________
> OmniOS-discuss mailing list
> OmniOS-discuss at lists.omniti.com
> http://lists.omniti.com/mailman/listinfo/omnios-discuss
>
--
Ian Kaufman
Research Systems Administrator
UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://omniosce.org/ml-archive/attachments/20170914/0b30fd66/attachment.html>
More information about the OmniOS-discuss
mailing list