levytamar82 0fa8b668c1 AVX2 SAD Optimization:
Two functions were optimized for AVX2 by using the full 256-bit registers,
so that 32 elements are handled in parallel instead of only 16:
1. vp9_sad32x32x4d
2. vp9_sad64x64x4d

The function-level gain is 66% and the user-level gain is ~1%.
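
A minimal sketch of the idea, not the code from this commit. It assumes the
x4d suffix means one source block is compared against four candidate reference
blocks, and that one 256-bit load covers a full 32-pixel row. The names
sad32xh_x4d_scalar and sad32xh_x4d_wide are hypothetical. The wide path is only
compiled when the compiler defines __AVX2__ (for example with -mavx2);
otherwise it falls back to the scalar loop.

```c
#include <stdint.h>
#include <stdlib.h>
#if defined(__AVX2__)
#include <immintrin.h>
#endif

/* Scalar reference: SAD of one 32-wide source block against four candidate
 * reference blocks. All four references share the same stride. */
void sad32xh_x4d_scalar(const uint8_t *src, int src_stride,
                        const uint8_t *const ref[4], int ref_stride,
                        int height, unsigned int sad[4]) {
  for (int k = 0; k < 4; ++k) {
    unsigned int sum = 0;
    for (int y = 0; y < height; ++y) {
      for (int x = 0; x < 32; ++x) {
        sum += (unsigned int)abs(src[y * src_stride + x] -
                                 ref[k][y * ref_stride + x]);
      }
    }
    sad[k] = sum;
  }
}

/* Wide variant (hypothetical): one 256-bit load covers 32 pixels, so each
 * source row is loaded once and reused for all four references. */
void sad32xh_x4d_wide(const uint8_t *src, int src_stride,
                      const uint8_t *const ref[4], int ref_stride,
                      int height, unsigned int sad[4]) {
#if defined(__AVX2__)
  __m256i acc[4];
  for (int k = 0; k < 4; ++k) acc[k] = _mm256_setzero_si256();

  for (int y = 0; y < height; ++y) {
    const __m256i s =
        _mm256_loadu_si256((const __m256i *)(src + y * src_stride));
    for (int k = 0; k < 4; ++k) {
      const __m256i r =
          _mm256_loadu_si256((const __m256i *)(ref[k] + y * ref_stride));
      /* _mm256_sad_epu8 yields four 64-bit partial sums, one per 8 bytes. */
      acc[k] = _mm256_add_epi64(acc[k], _mm256_sad_epu8(s, r));
    }
  }

  for (int k = 0; k < 4; ++k) {
    uint64_t lanes[4];
    _mm256_storeu_si256((__m256i *)lanes, acc[k]);
    sad[k] = (unsigned int)(lanes[0] + lanes[1] + lanes[2] + lanes[3]);
  }
#else
  /* No AVX2 at compile time: fall back to the scalar reference. */
  sad32xh_x4d_scalar(src, src_stride, ref, ref_stride, height, sad);
#endif
}
```

Both functions should return identical sums for the same inputs, so the scalar
version can serve as a check on the wide one. A 64-wide block could be handled
the same way with two 256-bit loads per row.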

Change-Id: I4efbb3bc7d8bc03b64b6c98f5cd5c4a9dd3212cb
2014-03-21 13:53:32 -07:00