isa-l/mem
Nicola Torracca 0e65117138 mem_zero_detect_avx: OR multiple vector and test for non zero on the result
Micro-optimizations: vpcmpeqb+vpmaskmov is faster than vptest according
to uops.info; make usually-untaken branches target forward.
Reduce the number of data-dependent branches and the code size.

Change-Id: Ie70b4bc99685368e5131f23344348bfaf7c27d3e
Signed-off-by: Nicola Torracca <shark@bitchx.it>
2021-09-30 16:55:30 -07:00
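
The idea named in the commit title (OR several loaded vectors together, then test the combined result once) can be illustrated in portable C. This is only a sketch under stated assumptions, not the code in mem_zero_detect_avx.asm: the function name `zero_detect_sketch` is hypothetical, 64-bit words stand in for AVX registers, and the return convention (0 when every byte is zero, nonzero otherwise) is an assumption of the sketch.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper, not the isa-l implementation.
 * Assumed convention: returns 0 if all len bytes are zero, 1 otherwise. */
static int zero_detect_sketch(const void *buf, size_t len)
{
    const unsigned char *p = buf;
    unsigned char acc = 0;
    uint64_t w[4];

    /* Main loop: load four 64-bit words, OR them together, and take a
     * single data-dependent branch per 32-byte block. */
    while (len >= sizeof(w)) {
        memcpy(w, p, sizeof(w));    /* memcpy keeps unaligned input safe */
        if ((w[0] | w[1] | w[2] | w[3]) != 0)
            return 1;
        p += sizeof(w);
        len -= sizeof(w);
    }

    /* Tail: accumulate the remaining bytes and test once at the end. */
    while (len--)
        acc |= *p++;
    return acc != 0;
}
```

Combining the loads before testing means one compare-and-branch per block instead of one per word, which is the same trade the commit message describes for vector registers.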
..
aarch64                         multibinary: Add run-time cpu feature detect for aarch64                     2019-08-26 17:58:42 +08:00
Makefile.am                     build: Remove unneeded file types.h                                          2021-06-10 09:35:43 -07:00
mem_multibinary.asm             mem: Add zero detect memory functions                                        2018-09-25 14:33:31 -07:00
mem_zero_detect_avx.asm         mem_zero_detect_avx: OR multiple vector and test for non zero on the result  2021-09-30 16:55:30 -07:00
mem_zero_detect_base_aliases.c  mem: Add zero detect memory functions                                        2018-09-25 14:33:31 -07:00
mem_zero_detect_base.c          mem: Remove unaligned loads in base function                                 2019-03-07 09:27:50 -07:00
mem_zero_detect_perf.c          build: Remove unneeded file types.h                                          2021-06-10 09:35:43 -07:00
mem_zero_detect_sse.asm         x86: Add ENDBR32/ENDBR64 at function entries for Intel CET                   2020-05-26 09:16:49 -07:00
mem_zero_detect_test.c          build: Remove unneeded file types.h                                          2021-06-10 09:35:43 -07:00