vpx/vp9/common/x86
Yunqing Wang ed36720b66 Do vertical loopfiltering in parallel
This patch follows the "Add filter_selectively_vert_row2 to enable
parallel loopfiltering" commit and adds an x86 SSE2 optimization
that filters 16 pixels in parallel. For the other optimizations
(neon and dspr2), the current 16-pixel functions are implemented by
calling the 8-pixel functions twice; true 16-pixel functions can be
added later.
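
The "call the 8-pixel function twice" fallback described above can be sketched roughly as follows. This is only an illustration: the names toy_filter_vertical_8 and toy_filter_vertical_16 are hypothetical, the filter itself is a toy averaging step rather than the VP9 loop filter, and the assumption that each 8-row half carries its own threshold is ours, not something stated in the commit.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical toy stand-in for an 8-row vertical-edge filter.
 * s points at the first pixel to the right of the edge in the first row.
 * For each of 8 rows, the two pixels adjacent to the edge are replaced by
 * their rounded average when they differ by less than thresh.
 * This is NOT the VP9 filter; it only gives the wrapper something to call. */
static void toy_filter_vertical_8(uint8_t *s, int pitch, uint8_t thresh) {
  assert(pitch > 0);
  for (int row = 0; row < 8; ++row) {
    uint8_t *p = s + row * pitch;
    const int left = p[-1];
    const int right = p[0];
    const int diff = left > right ? left - right : right - left;
    if (diff < thresh) {
      const uint8_t avg = (uint8_t)((left + right + 1) >> 1);
      p[-1] = avg;
      p[0] = avg;
    }
  }
}

/* Hypothetical 16-row wrapper: no real 16-wide kernel, just two calls to
 * the 8-row function, one per half (assumed: a separate threshold each). */
static void toy_filter_vertical_16(uint8_t *s, int pitch,
                                   uint8_t thresh0, uint8_t thresh1) {
  toy_filter_vertical_8(s, pitch, thresh0);             /* rows 0..7  */
  toy_filter_vertical_8(s + 8 * pitch, pitch, thresh1); /* rows 8..15 */
}
```

A real 16-pixel SIMD version would instead load and filter all 16 rows in one pass, which is the kind of work the SSE2 change does.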

Decoder speedup:
tulip clip:     2% speed gain;
old_town_cross: 1.2% speed gain;
bus:            2% speed gain.

Change-Id: I4818a0c72f84b34f5fe678e496cf4a10238574b7
2013-11-22 10:04:51 -08:00
..
vp9_asm_stubs.c Fix a few indent format issues in buffer defs 2013-10-15 11:51:09 -07:00
vp9_copy_sse2.asm Replace copy_memNxM functions with a generic copy/avg function. 2013-07-10 18:27:24 -07:00
vp9_idct_intrin_sse2.c Improve vp9_iht4x4_16_add_sse2 (x1.341) 2013-11-18 20:53:13 -05:00
vp9_intrapred_sse2.asm Fix x86inc.asm to build PIC code correctly 2013-09-18 13:45:46 -07:00
vp9_intrapred_ssse3.asm vp9 ssse3 d207_predictor_32x32: add missing GLOBAL() 2013-11-01 20:33:22 -07:00
vp9_loopfilter_intrin_avx2.c mb_lpf_horizontal_edge AVX2 optimization 2013-10-31 10:26:15 -06:00
vp9_loopfilter_intrin_sse2.c Do vertical loopfiltering in parallel 2013-11-22 10:04:51 -08:00
vp9_loopfilter_mmx.asm Removing unused simple loopfilter code. 2013-05-10 11:04:43 -07:00
vp9_postproc_mmx.asm Code cleanup: lower case variable names. 2013-03-20 16:41:30 -07:00
vp9_postproc_sse2.asm Code cleanup: lower case variable names. 2013-03-20 16:41:30 -07:00
vp9_postproc_x86.h fixed cpp lint issue in vp9_postproc_x86 2013-09-29 18:44:58 -07:00
vp9_subpixel_8t_sse2.asm SSE2 8-tap sub-pixel filter optimization 2013-10-10 14:12:47 -07:00
vp9_subpixel_8t_ssse3.asm Correct ssse3 8/16-pixel wide sub-pixel filter calculation 2013-11-20 12:52:56 -08:00