vpx/vp9/encoder/x86
levytamar82 ea14909687 AVX2 SubPixel AVG Variance Optimization
Optimizing 2 functions to process 32 elements in parallel instead of 16:
1. vp9_sub_pixel_avg_variance64x64
2. vp9_sub_pixel_avg_variance32x32
both of those function were calling vp9_sub_pixel_avg_variance16xh_ssse3
instead of calling that function, it calls vp9_sub_pixel_avg_variance32xh_avx2
that is written in avx2 and process 32 elements in parallel.
This Optimization gave 80% function level gain and 2% user level gain

Change-Id: Iea694654e1b7612dc6ed11e2626208c2179502c8
2014-02-28 22:51:04 -07:00
..
vp9_dct32x32_avx2.c vp9_short_fdct32x32_rd vp9_short_fdct32x32 optimized for AVX2 2013-11-21 14:19:49 -08:00
vp9_dct32x32_sse2.c Making input pointer constant for all fdct/fht functions. 2013-10-24 11:48:25 -07:00
vp9_dct_avx2.c minor spelling cleanup in comments 2014-02-12 16:32:51 -08:00
vp9_dct_sse2.c minor spelling cleanup in comments 2014-02-12 16:32:51 -08:00
vp9_error_sse2.asm Make coefficient skip condition an explicit RD choice. 2013-06-28 10:28:49 -07:00
vp9_mcomp_x86.h vp9/encoder: add extern "C" to headers 2014-01-23 16:21:24 -08:00
vp9_quantize_ssse3.asm Bug fix in ssse3 quantize function 2014-02-07 14:32:30 -08:00
vp9_sad4d_sse2.asm Implement SSE version for sad4x8x4d and SSE2 version for sad8x4x4d. 2013-06-12 17:40:01 -04:00
vp9_sad_mmx.asm add private to assembly files to insure proper chromebuild 2012-12-20 09:40:18 -08:00
vp9_sad_sse2.asm Add averaging-SAD functions for 8-point comp-inter motion search. 2013-06-25 12:57:28 -07:00
vp9_sad_sse3.asm Merge master branch into experimental 2013-03-01 11:06:05 -08:00
vp9_sad_sse4.asm this commit converts all sad ptrs to uint32 2013-02-28 08:46:35 -08:00
vp9_sad_ssse3.asm add private to assembly files to insure proper chromebuild 2012-12-20 09:40:18 -08:00
vp9_ssim_opt.asm add private to assembly files to insure proper chromebuild 2012-12-20 09:40:18 -08:00
vp9_subpel_variance_impl_intrin_avx2.c AVX2 SubPixel AVG Variance Optimization 2014-02-28 22:51:04 -07:00
vp9_subpel_variance_impl_sse2.asm Implement sse2 and ssse3 versions for all sub_pixel_variance sizes. 2013-06-20 09:34:25 -07:00
vp9_subpel_variance.asm Remove TEXTREL from 32bit encoder 2013-11-07 13:39:40 -08:00
vp9_subtract_sse2.asm Remove emms - that shouldn't be there. 2013-06-21 14:45:04 -07:00
vp9_temporal_filter_apply_sse2.asm Fix --as=nasm compatibility for new asm code. 2013-02-27 09:55:38 -08:00
vp9_variance_avx2.c AVX2 SubPixel AVG Variance Optimization 2014-02-28 22:51:04 -07:00
vp9_variance_impl_intrin_avx2.c AVX2 Variance Optimization 2014-01-08 12:05:53 -07:00
vp9_variance_impl_mmx.asm Fix the reading of too many input pixels 2013-08-26 12:35:27 -07:00
vp9_variance_impl_sse2.asm Implement sse2 and ssse3 versions for all sub_pixel_variance sizes. 2013-06-20 09:34:25 -07:00
vp9_variance_mmx.c cpplint issues resolved in vp9_variance_mmx.c 2013-10-04 14:22:08 -07:00
vp9_variance_sse2.c cpplint vp9_variance_sse2.c 2013-10-04 14:15:06 -07:00