vpx/vp9/common/arm/neon
Yunqing Wang 64f728caef Do horizontal loopfiltering in parallel
This patch followed the "Rewrite filter_selectively_horiz for parallel
loopfiltering" commit and added an x86 SSE2 optimization to do
16-pixel filtering in parallel. It also corrected the declaration
of aligned arrays. For the 8-pixel-in-parallel case, it improved the
calculation of the masks and filters. It updated the threshold loading
since the thresholds were already duplicated. It updated the NEON C
functions to call the NEON loopfilters twice.

Tests using the tulip clip showed a ~1.5% decoder speed gain.

Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35
2013-11-15 16:18:43 -08:00
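
The commit message above says the NEON C functions were updated to call the NEON loopfilters twice. A minimal sketch of that wrapper pattern is shown here in portable C, under stated assumptions: the names toy_filter8_horizontal and toy_filter16_horizontal are hypothetical, and the 8-wide kernel is a toy blend, not the VP9 loop filter or the code in vp9_loopfilter_16_neon.c.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for an 8-pixel-wide horizontal-edge kernel.
 * It is NOT the VP9 loop filter: it only blends the row above the edge
 * (s - pitch) with the row on the edge (s) to keep the example short. */
static void toy_filter8_horizontal(uint8_t *s, ptrdiff_t pitch) {
  for (int i = 0; i < 8; ++i) {
    const int p0 = s[i - pitch];          /* pixel above the edge */
    const int q0 = s[i];                  /* pixel below the edge */
    const int avg = (p0 + q0 + 1) >> 1;   /* rounded midpoint */
    s[i - pitch] = (uint8_t)((p0 + avg + 1) >> 1);
    s[i] = (uint8_t)((q0 + avg + 1) >> 1);
  }
}

/* 16-pixel-wide entry point built from two calls to the 8-wide kernel:
 * the first call covers columns 0..7, the second covers columns 8..15. */
void toy_filter16_horizontal(uint8_t *s, ptrdiff_t pitch) {
  assert(s != NULL);
  toy_filter8_horizontal(s, pitch);
  toy_filter8_horizontal(s + 8, pitch);
}
```

The caller passes a pointer to the first pixel of the row just below the edge, so at least one row of 16 pixels must exist above it. A wrapper like this keeps the 16-wide interface available on a platform that only has an 8-wide kernel, at the cost of two calls instead of one fused pass.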
..
vp9_avg_neon.asm vp9: neon: add vp9_convolve_avg_neon 2013-08-14 16:27:55 +01:00
vp9_convolve8_avg_neon.asm vp9: neon: use aligned stores in convolve functions 2013-08-16 14:25:08 +01:00
vp9_convolve8_neon.asm vp9: neon: use aligned stores in convolve functions 2013-08-16 14:25:08 +01:00
vp9_convolve_neon.c vp9: neon: use aligned stores in convolve functions 2013-08-16 14:25:08 +01:00
vp9_copy_neon.asm vp9: neon: add vp9_convolve_copy_neon 2013-08-14 16:27:55 +01:00
vp9_dc_only_idct_add_neon.asm Add neon optimize vp9_dc_only_idct_add. 2013-07-11 10:30:47 -07:00
vp9_idct16x16_neon.c Making input pointer of any inverse transform constant. 2013-10-11 18:27:12 -07:00
vp9_loopfilter_16_neon.c Do horizontal loopfiltering in parallel 2013-11-15 16:18:43 -08:00
vp9_loopfilter_neon.asm Speedup loopfilter neon code. 2013-07-22 17:00:01 -07:00
vp9_mb_lpf_neon.asm vp9: neon: optimise vp9_wide_mbfilter_neon 2013-08-15 19:07:22 +01:00
vp9_save_reg_neon.asm Properly save neon registers. 2013-09-27 14:25:33 -07:00
vp9_short_idct4x4_1_add_neon.asm Giving consistent names to IDCT/IWHT functions. 2013-10-04 14:17:06 -07:00
vp9_short_idct4x4_add_neon.asm Giving consistent names to IDCT/IWHT functions. 2013-10-04 14:17:06 -07:00
vp9_short_idct8x8_1_add_neon.asm Giving consistent names to IDCT 8x8 functions. 2013-10-06 00:24:09 -07:00
vp9_short_idct8x8_add_neon.asm Giving consistent names to IDCT 8x8 functions. 2013-10-06 00:24:09 -07:00
vp9_short_idct16x16_1_add_neon.asm Giving consistent names to IDCT 16x16 functions. 2013-10-07 14:31:10 -07:00
vp9_short_idct16x16_add_neon.asm Giving consistent names to IDCT 16x16 functions. 2013-10-07 14:31:10 -07:00
vp9_short_idct32x32_1_add_neon.asm Fix a bug in the assembly code. 2013-11-11 12:49:12 -08:00
vp9_short_idct32x32_add_neon.asm Use lowercase 'b' to branch 2013-11-12 10:41:06 -08:00
vp9_short_iht4x4_add_neon.asm Consistent names for inverse hybrid transforms (1 of 2). 2013-10-11 13:31:32 -07:00
vp9_short_iht8x8_add_neon.asm Consistent names for inverse hybrid transforms (1 of 2). 2013-10-11 13:31:32 -07:00