vpx/vpx_dsp
Johann 13234d3c43 Remove neon assembly for idct 16x16 and 8x8
Tested using test/partial_idct_test.cc:DISABLED_Speed

Both gcc 4.9 and clang 3.8 from the r13 Android NDK offer improvements
using the intrinsics:
<function>    <clang asm> <gcc asm> <clang intrin> <gcc intrin>
idct16x16_256  1720ms      1703ms    1546ms         1554ms
idct16x16_10   1320ms      1247ms     518ms          488ms
idct16x16_1     107ms       108ms      64ms           68ms
idct8x8_64      924ms       931ms     866ms          989ms
idct8x8_12      826ms       824ms     519ms          514ms
idct8x8_1       172ms       166ms     110ms          125ms

idct8x8_64 isn't quite perfect (slight regression with gcc intrinsics)
but as a counter example idct16x16_10 goes from ~1300ms to ~500ms

On a sample clip, clang improved from 48.5 to 49fps and gcc stayed roughly
stable.

BUG=webm:1303

Change-Id: I9d4fd2b41b46ea6174a887b40a82c8e6e4769ed4
2017-01-19 12:27:31 -08:00
..
arm Remove neon assembly for idct 16x16 and 8x8 2017-01-19 12:27:31 -08:00
mips Merge "Fix mips dspr2 idct32x32 functions for large coefficient input" 2017-01-09 19:47:47 +00:00
x86 Rework 8x8 transpose SSSE3 for avg computation 2017-01-12 15:16:07 -08:00
add_noise.c add_noise,vpx_setup_noise: correct 'char_dist' type 2016-08-24 21:46:26 -07:00
avg.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
bitreader_buffer.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
bitreader_buffer.h vp10: code sign bit before absolute value in non-arithcoded header. 2015-09-16 19:35:03 -04:00
bitreader.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
bitreader.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
bitwriter_buffer.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
bitwriter_buffer.h vp10: code sign bit before absolute value in non-arithcoded header. 2015-09-16 19:35:03 -04:00
bitwriter.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
bitwriter.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
deblock.c Change C/MSA post proc to match SSE2. 2016-11-23 10:42:11 -08:00
fastssim.c Add compiler warning flag -Wextra and fix related warnings. 2016-09-27 12:05:01 -07:00
fwd_txfm.c Fix warnings reported by -Wshadow: Part1: vpx_dsp directory 2016-10-17 19:25:19 -07:00
fwd_txfm.h Replace vp9_ prefix in 2D-DCT functions with vpx_ 2015-07-28 16:06:44 -07:00
intrapred.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
inv_txfm.c Cosmetics by unifying dest_stride to stride in idct 2016-12-12 15:13:22 -08:00
inv_txfm.h Replace highbd_dct_const_round_shift with dct_const_round_shift 2016-11-04 16:15:02 -07:00
loopfilter.c [vpx highbd lpf NEON 1/6] horizontal 4 2016-10-06 14:03:04 -07:00
postproc.h postproc : fix function parameters for noise functions. 2016-07-15 08:27:34 -07:00
prob.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
prob.h vpx_dsp/get_prob: relocate den == 0 test 2016-09-28 17:42:49 -07:00
psnr.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
psnr.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
psnrhvs.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
quantize.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
quantize.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
sad.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
ssim.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
ssim.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
subtract.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
sum_squares.c Support measure distortion in the pixel domain 2016-07-06 18:25:17 -07:00
txfm_common.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
variance.c Remove halfpix specialization 2016-08-23 17:05:39 -07:00
variance.h Remove halfpix specialization 2016-08-23 17:05:39 -07:00
vpx_convolve.c add vpx high bitdepth convolve8 NEON intrinsics optimization 2016-10-17 15:23:54 -07:00
vpx_convolve.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
vpx_dsp_common.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
vpx_dsp_rtcd_defs.pl postproc: vpx_mbpost_proc_down_neon 2017-01-09 10:21:56 -08:00
vpx_dsp_rtcd.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
vpx_dsp.mk Remove neon assembly for idct 16x16 and 8x8 2017-01-19 12:27:31 -08:00
vpx_filter.h add vpx high bitdepth convolve8 NEON intrinsics optimization 2016-10-17 15:23:54 -07:00