rc4-x86_64.pl: fix due credit.
This commit is contained in:
parent
986289604e
commit
f44cb15fab
@ -78,12 +78,13 @@
|
|||||||
|
|
||||||
# May 2011
|
# May 2011
|
||||||
#
|
#
|
||||||
# The only code path that was not modified is P4-specific one. New
|
# The only code path that was not modified is P4-specific one. Non-P4
|
||||||
# AMD code path is inspired by and Intel optimization is heavily
|
# Intel code path optimization is heavily based on submission by Maxim
|
||||||
# based on submission from Maxim Locktyukhin of Intel. Current
|
# Perminov, Maxim Locktyukhin and Jim Guilford of Intel. I've used
|
||||||
# performance in cycles per processed byte (less is better) and
|
# some of the ideas even in attempt to optmize the original RC4_INT
|
||||||
# improvement coefficients relative to previous version of this
|
# code path... Current performance in cycles per processed byte (less
|
||||||
# module are:
|
# is better) and improvement coefficients relative to previous
|
||||||
|
# version of this module are:
|
||||||
#
|
#
|
||||||
# Opteron 5.3/+0%
|
# Opteron 5.3/+0%
|
||||||
# P4 6.5
|
# P4 6.5
|
||||||
@ -92,9 +93,10 @@
|
|||||||
# Sandy Bridge 4.2/+120%
|
# Sandy Bridge 4.2/+120%
|
||||||
# Atom 9.3/+80%
|
# Atom 9.3/+80%
|
||||||
#
|
#
|
||||||
# (*) Note that this result is ~15% lower than result for 32-bit
|
# (*) Note that Core2 result is ~15% lower than corresponding result
|
||||||
# code, meaning that it's possible to improve it, but it's
|
# for 32-bit code, meaning that it's possible to improve it,
|
||||||
# more than likely at the cost of the others...
|
# but more than likely at the cost of the others (see rc4-586.pl
|
||||||
|
# to get the idea)...
|
||||||
|
|
||||||
$flavour = shift;
|
$flavour = shift;
|
||||||
$output = shift;
|
$output = shift;
|
||||||
|
Loading…
x
Reference in New Issue
Block a user