Andy Polyakov
|
47edeb9f59
|
sha256-586.pl: code refresh.
|
2013-02-15 11:23:06 +01:00 |
|
Andy Polyakov
|
35c77b7303
|
sha1-586.pl: code refresh.
|
2013-02-14 22:20:17 +01:00 |
|
Andy Polyakov
|
7c9e81be40
|
[aesni-]sha1-x86_64.pl: code refresh.
|
2013-02-14 16:14:02 +01:00 |
|
Andy Polyakov
|
c4558efbf3
|
sha512-x86_64.pl: add AVX2 code path.
|
2013-02-14 15:39:42 +01:00 |
|
Andy Polyakov
|
46bf83f07a
|
x86_64 assembly pack: make Windows build more robust.
PR: 2963 and a number of others
|
2013-01-22 22:27:28 +01:00 |
|
Andy Polyakov
|
1598af9ac9
|
sha512-ppc.pl: minimize stack frame.
|
2013-01-22 21:09:14 +01:00 |
|
Andy Polyakov
|
d4571f43e8
|
sha512-ppc.pl: add PPC32 code, >2x improvement on in-order cores.
|
2013-01-19 17:22:05 +01:00 |
|
Andy Polyakov
|
904732f68b
|
C64x+ assembly pack: improve EABI support.
|
2012-11-28 13:19:10 +00:00 |
|
Andy Polyakov
|
1efd583085
|
SPARCv9 assembly pack: harmonize ABI handling (so that it's handled in one
place at a time, by pre-processor in .S case and by perl in .s case).
|
2012-10-25 12:07:32 +00:00 |
|
Andy Polyakov
|
aea4126e4e
|
[md5|sha1|sha512]-sparcv9.pl: "cooperative" optimizations based on
suggestions from David Miller.
|
2012-10-14 14:46:56 +00:00 |
|
Andy Polyakov
|
3ed6e22771
|
sha[1|512]-sparcv9.pl: add hardware SPARC T4 support.
Submitted by: David Miller
|
2012-09-28 09:35:39 +00:00 |
|
Andy Polyakov
|
5cc2159526
|
MIPS assembly pack: add support for SmartMIPS ASE.
|
2012-09-18 12:52:23 +00:00 |
|
Andy Polyakov
|
8df5518bd9
|
MIPS assembly pack: add MIPS[32|64]R2 code.
|
2012-09-15 11:18:20 +00:00 |
|
Andy Polyakov
|
17e820aec3
|
sha512-sparcv9.pl: fix binutils compilation error.
Submitted by: David Miller
|
2012-09-15 08:53:25 +00:00 |
|
Andy Polyakov
|
f26328c2f3
|
sha512-armv4.pl: optimize for Snapdragon S4.
|
2012-09-04 08:25:37 +00:00 |
|
Andy Polyakov
|
1a9d60d2e3
|
sha1-armv4-large.pl: comply with ABI.
|
2012-08-17 19:57:04 +00:00 |
|
Andy Polyakov
|
1a002d88ad
|
MIPS assembly pack: assign default value to $flavour.
|
2012-08-17 09:10:31 +00:00 |
|
Andy Polyakov
|
f6ff1aa8e0
|
sha512-x86_64.pl: revert previous change and solve the problem through
perlasm/x86_64-xlate.pl instead.
|
2012-08-13 12:34:36 +00:00 |
|
Andy Polyakov
|
3a5485a9f8
|
sha512-x86_64.pl: minimum gas requirement for AMD XOP.
|
2012-08-13 11:01:44 +00:00 |
|
Andy Polyakov
|
69f45c520c
|
sha1-[586|x86_64].pl: shave off one instruction from body_40_59, it's
2% fewer instructions in SIMD code paths, so 2% improvement on average:-)
|
2012-07-15 20:33:30 +00:00 |
|
Andy Polyakov
|
ee923b4cef
|
sha512-s390x.pl: lingering comment update.
|
2012-07-15 13:36:57 +00:00 |
|
Andy Polyakov
|
46a2b3387a
|
sha512-ia64.pl: 15-20% performance improvement.
|
2012-07-15 13:36:25 +00:00 |
|
Andy Polyakov
|
e09039c01c
|
sha256-armv4.pl: 4% performance improvement.
|
2012-07-15 13:35:10 +00:00 |
|
Andy Polyakov
|
da3bd2779d
|
sha1-s390x.pl: lingering comment update.
|
2012-07-15 13:34:26 +00:00 |
|
Andy Polyakov
|
660164a9ed
|
sha512-586.pl: optimize SSE2 code path, +5-7% on most CPUs, +13% on AMD K8.
|
2012-07-15 13:28:15 +00:00 |
|
Andy Polyakov
|
367b126491
|
sha1-586.pl: let masm compile AVX code.
|
2012-07-15 13:25:31 +00:00 |
|
Andy Polyakov
|
2dce10c56d
|
sha256-586.pl: fix typos.
|
2012-07-01 08:46:38 +00:00 |
|
Andy Polyakov
|
6251989eb6
|
x86_64 assembly pack: make it possible to compile with Perl located on
path with spaces.
PR: 2835
|
2012-06-27 10:08:23 +00:00 |
|
Andy Polyakov
|
faee82c1bc
|
sha512-x86_64.pl: fix typo.
|
2012-06-25 17:13:15 +00:00 |
|
Andy Polyakov
|
a8f3b8b519
|
sha512-x86_64.pl: add SIMD code paths.
|
2012-06-24 19:22:06 +00:00 |
|
Andy Polyakov
|
ad880dc469
|
sha512-x86_64.pl: fix typo.
|
2012-06-19 07:50:10 +00:00 |
|
Andy Polyakov
|
42a36658c1
|
sha256-586.pl: fix linking error.
|
2012-06-19 07:49:36 +00:00 |
|
Andy Polyakov
|
f3eac74bc5
|
sha256-586.pl: add AVX and XOP code paths.
|
2012-06-12 14:40:11 +00:00 |
|
Andy Polyakov
|
3a9b3852c6
|
sha256-586.pl: squeeze some more, most notably ~10% on Nehalem.
|
2012-06-12 14:38:01 +00:00 |
|
Andy Polyakov
|
d2e1803197
|
x86[_64] assembly pack: update benchmark results.
|
2012-06-12 14:18:21 +00:00 |
|
Andy Polyakov
|
f889bb0384
|
sha256-586.pl: full unroll to deliver additional ~16%, add Sandy Bridge-
specific code path.
|
2012-05-28 17:50:57 +00:00 |
|
Andy Polyakov
|
83698d3191
|
sha512-x86_64.pl: >5% better performance.
|
2012-05-28 17:47:15 +00:00 |
|
Andy Polyakov
|
d4bb6bddf8
|
sha256-586.pl: tune away regression on Nehalem core and incidentally
improve performance on Atom and P4.
|
2012-05-24 07:39:04 +00:00 |
|
Andy Polyakov
|
ee9bf3eb6c
|
sha256-586.pl optimization.
|
2012-05-19 10:10:30 +00:00 |
|
Andy Polyakov
|
f9c5e5d92e
|
perlasm: fix symptom-less bugs, missing semicolons and 'my' declarations.
|
2012-04-28 10:36:58 +00:00 |
|
Andy Polyakov
|
3e181369dd
|
C64x+ assembler pack. linux-c64xplus build is *not* tested nor can it be
tested, because the kernel is not in shape to handle it *yet*. The code is
committed mostly to stimulate the kernel development.
|
2012-04-18 13:01:36 +00:00 |
|
Andy Polyakov
|
482a7d80cf
|
sha512-armv4.pl: optimize NEON code path by utilizing vbsl, bitwise select.
|
2012-03-29 18:20:11 +00:00 |
|
Andy Polyakov
|
adb5a2694a
|
sha512-sparcv9.pl: work around V8+ warning.
|
2012-01-13 09:18:05 +00:00 |
|
Andy Polyakov
|
23b93b587b
|
aes-ppc.pl, sha512-ppc.pl: comply even with Embedded ABI specification
(most restrictive about r2 and r13 usage).
|
2012-01-13 09:16:52 +00:00 |
|
Andy Polyakov
|
d528caa725
|
sha1-mips.pl: fix typo.
|
2011-10-20 08:39:29 +00:00 |
|
Andy Polyakov
|
227a822ab6
|
vxworks-mips: unify and add assembler.
|
2011-10-19 21:49:20 +00:00 |
|
Andy Polyakov
|
7470276a25
|
sha256-586.pl: minor optimization, +0-2% on all CPUs, +7% on Westmere.
|
2011-09-17 12:57:33 +00:00 |
|
Andy Polyakov
|
d2fd65f6f6
|
sha512-x86_64.pl: +15% better performance on Westmere and incidentally Atom.
Other Intel processors +5%, Opteron -2%.
|
2011-09-17 11:30:28 +00:00 |
|
Andy Polyakov
|
9df286b13a
|
sha512-sparcv9.pl: minor optimization of sha256.
|
2011-07-18 11:34:07 +00:00 |
|
Andy Polyakov
|
87873f4328
|
ARM assembler pack: add platform run-time detection.
|
2011-07-17 17:40:29 +00:00 |
|