
These inline assembly versions are about as fast as what the compiler itself can do. However, when the compiler emits it by itself instead of via inline assembly, it can interleave the instructions with other instructions, improving scheduling. Silence the warnings in fixmul.h though, since the compiler is able to produce good enough code for the C version of these functions. This gives a speedup on both GCC and clang.
Description
No description provided
Languages
C++
74.1%
C
25.7%
Makefile
0.2%