Yunqing Wang 6344c84c82 Optimize 8x8 idct function
Wrote sse2 functions of vp9_short_idct8x8 and vp9_short_idct10_8x8.
Compared to c version, the sse2 version is 2X faster. The decoder
test didn't show noticeable gain since 8x8 idct doesn't take much
of decoding time (less than 1% in my test).

Change-Id: I56313e18cd481700b3b52c4eda5ca204ca6365f3
2013-03-18 15:34:14 -07:00
..
2013-03-13 08:35:46 -07:00
2013-03-18 15:34:14 -07:00
2013-03-11 17:02:27 -07:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00
2013-03-13 08:35:46 -07:00
2013-03-18 15:34:14 -07:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00
2013-03-13 19:08:06 -07:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00
2013-03-18 15:34:14 -07:00
2013-01-28 17:28:04 +00:00
2012-11-27 14:12:30 -08:00
2013-02-22 11:03:14 -08:00
2013-02-22 11:03:14 -08:00
2013-03-05 14:12:16 -08:00
2013-03-05 14:12:16 -08:00