James Almer 15574c505b x86/hevcdsp: add ff_hevc_sao_edge_filter_{10,12}_{sse2,avx2}
Original x86 intrinsics code by Pierre-Edouard Lepere.
Yasm port, refactoring and optimizations by James Almer.

Benchmarks of BQTerrace_1920x1080_60_qp22.bin with an Intel Core i5-4200U

Width 32
342694 decicycles in sao_edge_filter_10, 16384 runs, 0 skips
29476 decicycles in ff_hevc_sao_edge_filter_32_10_ssse3, 16384 runs, 0 skips
13996 decicycles in ff_hevc_sao_edge_filter_32_10_avx2, 16381 runs, 3 skips

Width 64
581163 decicycles in sao_edge_filter_10, 8192 runs, 0 skips
59774 decicycles in ff_hevc_sao_edge_filter_64_10_ssse3, 8192 runs, 0 skips
28383 decicycles in ff_hevc_sao_edge_filter_64_10_avx2, 8191 runs, 1 skips

Signed-off-by: James Almer <jamrial@gmail.com>
2015-02-05 15:02:33 -03:00
..
2014-04-11 12:09:09 +02:00
2014-08-06 19:24:57 +02:00
2014-08-06 19:25:08 +02:00
2014-05-21 03:25:08 +02:00
2014-12-03 23:03:13 +01:00
2014-08-06 19:24:49 +02:00
2014-04-19 09:56:01 +02:00
2015-01-25 18:20:43 -03:00
2014-05-21 03:25:08 +02:00
2014-12-15 02:34:05 +01:00
2014-04-19 09:56:01 +02:00