rc4-x86_64.pl: fix due credit.

2011-05-27 18:58:37 +00:00
commit f44cb15fab
parent 986289604e
1 changed files with 11 additions and 9 deletions
--- a/crypto/rc4/asm/rc4-x86_64.pl
+++ b/crypto/rc4/asm/rc4-x86_64.pl
@@ -78,12 +78,13 @@
 # May 2011
 #
-# The only code path that was not modified is P4-specific one. New
-# AMD code path is inspired by and Intel optimization is heavily
-# based on submission from Maxim Locktyukhin of Intel. Current
-# performance in cycles per processed byte (less is better) and
-# improvement coefficients relative to previous version of this
-# module are:
+# The only code path that was not modified is P4-specific one. Non-P4
+# Intel code path optimization is heavily based on submission by Maxim
+# Perminov, Maxim Locktyukhin and Jim Guilford of Intel. I've used
+# some of the ideas even in attempt to optmize the original RC4_INT
+# code path... Current performance in cycles per processed byte (less
+# is better) and improvement coefficients relative to previous
+# version of this module are:
 #
 # Opteron	5.3/+0%
 # P4		6.5
@@ -92,9 +93,10 @@
 # Sandy Bridge	4.2/+120%
 # Atom		9.3/+80%
 #
-# (*)	Note that this result is ~15% lower than result for 32-bit
-#	code, meaning that it's possible to improve it, but it's
-#	more than likely at the cost of the others...
+# (*)	Note that Core2 result is ~15% lower than corresponding result
+#	for 32-bit code, meaning that it's possible to improve it,
+#	but more than likely at the cost of the others (see rc4-586.pl
+#	to get the idea)...
 $flavour = shift;
 $output  = shift;