levytamar82 52dac5d1cb AVX2 SubPixel Variance Optimization
Optimizing 2 functions to process 32 elements in parallel instead of 16:
1. vp9_sub_pixel_variance64x64
2. vp9_sub_pixel_variance32x32
both of those function were calling vp9_sub_pixel_variance16xh_ssse3
instead of calling that function, it calls vp9_sub_pixel_variance32xh_avx2
that is written in avx2 and process 32 elements in parallel.
This Optimization gave 70% function level gain and 2% user level gain

Change-Id: I4f5cb386b346ff6c878a094e1c3b37e418e50bde
2014-02-14 16:59:11 -07:00
..
2014-01-24 15:53:12 -08:00
2014-01-31 16:30:04 -08:00
2014-02-09 20:04:54 -08:00
2013-09-29 19:29:58 -07:00
2013-12-16 17:27:48 -08:00
2012-11-27 14:12:30 -08:00
2013-02-22 11:03:14 -08:00