Johann 28a8622143 vpx_comp_avg_pred: sse2 optimization
Provides over 15x speedup for width > 8.

Due to smaller loads and shifting for width == 8 it gets about 8x
speedup.

For width == 4 it's only about 4x speedup because there is a lot of
shuffling and shifting to get the data properly situated.

BUG=webm:1390

Change-Id: Ice0b3dbbf007be3d9509786a61e7f35e94bdffa8
2017-04-13 08:44:52 -07:00
..
2016-07-25 14:14:19 -07:00
2016-07-25 14:14:19 -07:00
2016-07-25 14:14:19 -07:00
2015-08-07 16:44:44 -07:00
2015-08-07 16:44:44 -07:00
2016-07-25 14:14:19 -07:00
2015-12-17 17:43:46 -08:00
2016-07-25 14:14:19 -07:00
2015-05-06 16:58:20 -07:00
2015-05-06 16:58:20 -07:00
2015-05-06 16:58:20 -07:00
2015-08-07 16:44:44 -07:00
2016-07-25 14:14:19 -07:00
2016-07-25 14:14:19 -07:00