To get an idea of the performance of this library, use
	ssleay speed

	perl util/sp-diff.pl file1 file2
will print out the relative differences between the two files, which are
expected to be output from the speed program.

The performance of the library is very dependent on compiler quality
and the flags used to build it.
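
As a rough illustration of what a 'relative difference' between two result
files means, here is a minimal sketch in Python. It is not the logic of
sp-diff.pl, and the two-column '<label> <number>' layout it parses is a
hypothetical stand-in for the real output of the speed program. The function
names are invented for the example; the sample figures are the RSAref and
SSLeay linux throughput numbers quoted later in this file.

```python
def parse_results(text):
    """Parse lines of the form '<label> <number>' into a dict.

    This layout is a hypothetical simplification, not the real
    output format of the speed program.
    """
    results = {}
    for line in text.splitlines():
        parts = line.rsplit(None, 1)
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        label, value = parts
        try:
            results[label.strip()] = float(value)
        except ValueError:
            continue  # skip lines whose last field is not a number
    return results


def relative_diff(base, new):
    """Percent change of each figure in 'new' relative to 'base'."""
    return {k: (new[k] - base[k]) / base[k] * 100.0
            for k in base if k in new and base[k] != 0}


# Throughput in k/s (RSAref linux vs SSLeay linux, from this file).
file1 = "DES CBC 830\nEDE-DES 302\nMD5 4390\n"
file2 = "DES CBC 855\nEDE-DES 319\nMD5 10025\n"

diffs = relative_diff(parse_results(file1), parse_results(file2))
for label, pct in diffs.items():
    print("%-8s %+7.1f%%" % (label, pct))
```

A positive figure means the second file reports higher throughput than the
first for that test.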
---
These are some numbers I collected comparing RSAref and SSLeay on a Pentium 100.
[ These numbers are all out of date; as of SSLeay 0.6.1 the RSA
operations are about 2 times faster, so check the version number ]
RSA performance.
SSLeay 0.6.0
Pentium 100, 32meg, Windows NT Workstation 3.51
linux - gcc v 2.7.0 -O3 -fomit-frame-pointer -m486
and
Windows NT   - Windows NT 3.51 - Visual C++ 4.1   - 586 code + 32bit assembler
Windows 3.1  - Windows NT 3.51 - Visual C++ 1.52c - 286 code + 32bit assembler
NT Dos Shell - Windows NT 3.51 - Visual C++ 1.52c - 286 code + 16bit assembler
Times are how long it takes to do an RSA private key operation.
                512bits  1024bits
-------------------------------
SSLeay NT dll   0.042s   0.202s   see above
SSLeay linux    0.046s   0.218s   Assembler inner loops (normal build)
SSLeay linux    0.067s   0.380s   Pure C code with BN_LLONG defined
SSLeay W3.1 dll 0.108s   0.478s   see above
SSLeay linux    0.109s   0.713s   C without BN_LLONG.
RSAref2.0 linux 0.149s   0.936s
SSLeay MS-DOS   0.197s   1.049s   see above
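
The gap between the 'BN_LLONG' and 'without BN_LLONG' rows presumably comes
from how the product of two machine words is formed. A minimal sketch,
assuming 32-bit words: with a double-width type the full 64-bit product is
available directly, while without one the operands are split into 16-bit
halves and the partial products are combined with explicit carries. It is
written in Python, with masking standing in for C unsigned arithmetic. The
function names are made up for the illustration and are not the library's.

```python
MASK32 = 0xFFFFFFFF
MASK16 = 0xFFFF


def mul_wide(a, b):
    """32x32 -> 64 using a double-width intermediate; returns (high, low)."""
    p = a * b  # stands in for a product held in a 64-bit 'long long'
    return (p >> 32) & MASK32, p & MASK32


def mul_halves(a, b):
    """Same product, using only values that fit in a 32-bit word."""
    al, ah = a & MASK16, a >> 16
    bl, bh = b & MASK16, b >> 16

    ll = al * bl  # each partial product is below 2^32
    lh = al * bh
    hl = ah * bl
    hh = ah * bh

    # Add the two middle products; the sum can overflow 32 bits by one bit.
    mid = (lh + hl) & MASK32
    carry_mid = 1 if mid < lh else 0
    hh = (hh + (carry_mid << 16)) & MASK32

    # The low half of 'mid' lands in the top 16 bits of the low word.
    low = (ll + ((mid & MASK16) << 16)) & MASK32
    carry_low = 1 if low < ll else 0

    # The high half of 'mid' lands in the bottom 16 bits of the high word.
    high = (hh + (mid >> 16) + carry_low) & MASK32
    return high, low


print("%08x %08x" % mul_halves(0xFFFFFFFF, 0xFFFFFFFF))  # fffffffe 00000001
```

The second form needs four multiplies plus carry handling for every word
product, which is consistent with the C build without BN_LLONG being the
slowest of the three linux rows.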
486DX66, 32meg, Windows NT Server 3.51
                512bits  1024bits
-------------------------------
SSLeay NT dll   0.084s   0.495s   <- SSLeay 0.6.3
SSLeay NT dll   0.154s   0.882s
SSLeay W3.1 dll 0.335s   1.538s
SSLeay MS-DOS   0.490s   2.790s
What I find cute is that I'm still faster than RSAref when using standard C,
without using the 'long long' data type :-), 35% faster for 512bit and we
scale up to 3.2 times faster for the 'default linux' build. I should mention
that people should 'try' to use either x86-lnx.s (elf), x86-lnxa.s or
x86-sol.s for any x86 based unix they are building on. The only problems
will be with syntax, but the performance gain is quite large, especially for
servers. The code is very simple; you just need to modify the 'header'.
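
The ratios quoted in this paragraph can be rechecked from the Pentium 100
table. A small sketch in Python, using only the linux figures from that
table; the row labels and the helper name are chosen for the illustration.

```python
# Seconds per RSA private key operation on the Pentium 100 (linux rows).
timings = {
    "SSLeay asm":        {512: 0.046, 1024: 0.218},
    "SSLeay C BN_LLONG": {512: 0.067, 1024: 0.380},
    "SSLeay C":          {512: 0.109, 1024: 0.713},
    "RSAref2.0":         {512: 0.149, 1024: 0.936},
}


def speedup(baseline, other, bits):
    """How many times faster 'other' is than 'baseline' at this key size."""
    return timings[baseline][bits] / timings[other][bits]


for name in ("SSLeay C", "SSLeay C BN_LLONG", "SSLeay asm"):
    for bits in (512, 1024):
        ratio = speedup("RSAref2.0", name, bits)
        print("%-18s %4d bits  %.2fx RSAref" % (name, bits, ratio))
```

On these figures the plain C build comes out roughly a third faster than
RSAref at 512 bits, and the assembler build a little over three times faster.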
The message is, if you are stuck using RSAref, the RSA performance will be
bad. Considering the code was compiled for a Pentium, the 486DX66 number
would indicate 'Use RSAref and turn your Pentium 100 into a 486DX66' :-).
[ As of version 0.6.1, it would be correct to say 'turn your Pentium 100
into a 486DX33' :-) ]
I won't tell people if the DLL's are using RSAref or my stuff if no-one
asks :-).
eric
PS while I know I could speed things up further, I will probably not do
so due to the effort involved. I did do some timings on the
SSLeay bignum format -> RSAref number format conversion that occurs
each time RSAref is used by SSLeay, and the numbers are trivial.
0.00012s a call for 512bit vs 0.149s for the time spent in the function.
0.00018s for 1024bit vs 0.938s. Insignificant.
So the 'way to go', to support faster RSA libraries, if people are keen,
is to write 'glue' code in a similar way that I do for RSAref and send it
to me :-).
My base library still has the advantage of being able to operate on
any size numbers, and is not that far from the performance of the
leaders in the field. (-30%?)
[ Well, as of 0.6.1 I am now the leader in the field on x86 (well, at
least very close :-) ]
I suppose I should also mention some other RSAref numbers, again
on my Pentium.
                DES CBC    EDE-DES   MD5
RSAref linux     830k/s    302k/s     4390k/s
SSLeay linux     855k/s    319k/s    10025k/s
SSLeay NT       1158k/s    410k/s    10470k/s
SSLeay w31       378k/s    143k/s     2383k/s  (fully 16bit)
Got to admit that Visual C++ 4.[01] is a damn fine compiler :-)
--
Eric Young | BOOL is tri-state according to Bill Gates.
AARNet: eay@cryptsoft.com | RTFM Win32 GetMessage().