vpx/vp9/common/arm/neon
Mans Rullgard b84dc949c8 vp9: neon: optimise convolve8_horiz functions
Each iteration of the horizontal loop reuses 7 of the 11 source
values.  Loading only the 4 new values saves some time.

Also add preload for source data.

Overall 4% faster on Chromebook.

Change-Id: I8f69e749f2b7f79e9734620dcee51dbfcd716b44
2013-08-11 16:21:55 +01:00
..
vp9_convolve8_avg_neon.asm vp9: neon: optimise convolve8_horiz functions 2013-08-11 16:21:55 +01:00
vp9_convolve8_neon.asm vp9: neon: optimise convolve8_horiz functions 2013-08-11 16:21:55 +01:00
vp9_convolve_neon.c vp9_convolve8_neon placeholder 2013-07-17 08:39:27 -07:00
vp9_dc_only_idct_add_neon.asm Add neon optimize vp9_dc_only_idct_add. 2013-07-11 10:30:47 -07:00
vp9_loopfilter_neon.asm Speedup loopfilter neon code. 2013-07-22 17:00:01 -07:00
vp9_mb_lpf_neon.asm vp9: neon: add vp9_mb_lpf_* functions 2013-08-02 08:10:50 -07:00
vp9_short_idct4x4_add_neon.asm Neon version of vp9_short_idct4x4_add. 2013-08-06 18:41:27 -07:00
vp9_short_idct8x8_add_neon.asm Fix some format error and code error in neon code. 2013-07-26 14:14:57 -07:00