vpx/vpx_dsp at 13234d3c433b6379ee1cc0de398dd71cef3f5907 - vpx

History

Johann 13234d3c43 Remove neon assembly for idct 16x16 and 8x8 Tested using test/partial_idct_test.cc:DISABLED_Speed Both gcc 4.9 and clang 3.8 from the r13 Android NDK offer improvements using the intrinsics: <function> <clang asm> <gcc asm> <clang intrin> <gcc intrin> idct16x16_256 1720ms 1703ms 1546ms 1554ms idct16x16_10 1320ms 1247ms 518ms 488ms idct16x16_1 107ms 108ms 64ms 68ms idct8x8_64 924ms 931ms 866ms 989ms idct8x8_12 826ms 824ms 519ms 514ms idct8x8_1 172ms 166ms 110ms 125ms idct8x8_64 isn't quite perfect (slight regression with gcc intrinsics) but as a counter example idct16x16_10 goes from ~1300ms to ~500ms On a sample clip, clang improved from 48.5 to 49fps and gcc stayed roughly stable. BUG=webm:1303 Change-Id: I9d4fd2b41b46ea6174a887b40a82c8e6e4769ed4		2017-01-19 12:27:31 -08:00
..
arm	Remove neon assembly for idct 16x16 and 8x8	2017-01-19 12:27:31 -08:00
mips	Merge "Fix mips dspr2 idct32x32 functions for large coefficient input"	2017-01-09 19:47:47 +00:00
x86	Rework 8x8 transpose SSSE3 for avg computation	2017-01-12 15:16:07 -08:00
add_noise.c	add_noise,vpx_setup_noise: correct 'char_dist' type	2016-08-24 21:46:26 -07:00
avg.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
bitreader_buffer.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
bitreader_buffer.h	vp10: code sign bit before absolute value in non-arithcoded header.	2015-09-16 19:35:03 -04:00
bitreader.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
bitreader.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
bitwriter_buffer.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
bitwriter_buffer.h	vp10: code sign bit before absolute value in non-arithcoded header.	2015-09-16 19:35:03 -04:00
bitwriter.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
bitwriter.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
deblock.c	Change C/MSA post proc to match SSE2.	2016-11-23 10:42:11 -08:00
fastssim.c	Add compiler warning flag -Wextra and fix related warnings.	2016-09-27 12:05:01 -07:00
fwd_txfm.c	Fix warnings reported by -Wshadow: Part1: vpx_dsp directory	2016-10-17 19:25:19 -07:00
fwd_txfm.h	Replace vp9_ prefix in 2D-DCT functions with vpx_	2015-07-28 16:06:44 -07:00
intrapred.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
inv_txfm.c	Cosmetics by unifying dest_stride to stride in idct	2016-12-12 15:13:22 -08:00
inv_txfm.h	Replace highbd_dct_const_round_shift with dct_const_round_shift	2016-11-04 16:15:02 -07:00
loopfilter.c	[vpx highbd lpf NEON 1/6] horizontal 4	2016-10-06 14:03:04 -07:00
postproc.h	postproc : fix function parameters for noise functions.	2016-07-15 08:27:34 -07:00
prob.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
prob.h	vpx_dsp/get_prob: relocate den == 0 test	2016-09-28 17:42:49 -07:00
psnr.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
psnr.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
psnrhvs.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
quantize.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
quantize.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
sad.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
ssim.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
ssim.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
subtract.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
sum_squares.c	Support measure distortion in the pixel domain	2016-07-06 18:25:17 -07:00
txfm_common.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
variance.c	Remove halfpix specialization	2016-08-23 17:05:39 -07:00
variance.h	Remove halfpix specialization	2016-08-23 17:05:39 -07:00
vpx_convolve.c	add vpx high bitdepth convolve8 NEON intrinsics optimization	2016-10-17 15:23:54 -07:00
vpx_convolve.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
vpx_dsp_common.h	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
vpx_dsp_rtcd_defs.pl	postproc: vpx_mbpost_proc_down_neon	2017-01-09 10:21:56 -08:00
vpx_dsp_rtcd.c	vpx_dsp: apply clang-format	2016-07-25 14:14:19 -07:00
vpx_dsp.mk	Remove neon assembly for idct 16x16 and 8x8	2017-01-19 12:27:31 -08:00
vpx_filter.h	add vpx high bitdepth convolve8 NEON intrinsics optimization	2016-10-17 15:23:54 -07:00