ffmpeg

Author	SHA1	Message	Date
Ben Avison	15a29c39d9	truehd: add hand-scheduled ARM asm version of mlp_filter_channel. Profiling results for overall audio decode and the mlp_filter_channel(_arm) function in particular are as follows (mean/StdDev before, mean/StdDev after, confidence, change): 6:2 total: 380.4/22.0, 370.8/17.0, 87.4%, +2.6% (insignificant); 6:2 function: 60.7/7.2, 36.6/8.1, 100.0%, +65.8%; 8:2 total: 357.0/17.5, 343.2/19.0, 97.8%, +4.0% (insignificant); 8:2 function: 60.3/8.8, 37.3/3.8, 100.0%, +61.8%; 6:6 total: 717.2/23.2, 658.4/15.7, 100.0%, +8.9%; 6:6 function: 140.4/12.9, 81.5/9.2, 100.0%, +72.4%; 8:8 total: 981.9/16.2, 896.2/24.5, 100.0%, +9.6%; 8:8 function: 193.4/15.0, 103.3/11.5, 100.0%, +87.2%. Experiments with adding preload instructions to this function yielded no useful benefit, so these have not been included. The assembly version has also been tested with a fuzz tester to ensure that any combinations of inputs not exercised by my available test streams still generate mathematically identical results to the C version. Signed-off-by: Martin Storsjö <martin@martin.st>	2014-03-26 19:53:52 +02:00
Michael Niedermayer	011d83de48	Merge commit '0e083d7e43805db1a978cb57bfa25fda62e8ff18' * commit '0e083d7e43805db1a978cb57bfa25fda62e8ff18': build: Group general components separate from de/encoders in arch Makefiles Conflicts: libavcodec/arm/Makefile libavcodec/x86/Makefile Merged-by: Michael Niedermayer <michaelni@gmx.at>	2014-03-20 22:26:31 +01:00
Diego Biurrun	0e083d7e43	build: Group general components separate from de/encoders in arch Makefiles This is in line with how the top-level libavcodec Makefile is structured.	2014-03-20 05:03:23 -07:00
James Darnley	623f380a18	lavc: fix flac encoder and decoder dependencies Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-02-13 21:00:32 +01:00
Martin Storsjö	44a0a98f92	arm: Add an option for making sure NEON registers aren't clobbered This is pretty much based on the same test for XMM registers. Signed-off-by: Martin Storsjö <martin@martin.st>	2014-01-11 00:03:00 +02:00
Mason Carter	832e190632	vc1: arm: Add NEON assembly For: ff_vc1_inv_trans_{8,4}x{8,4}_{dc_,}neon ff_put_pixels8x8_neon ff_put_vc1_mspel_mc{0,1,2,3}{0,1,2,3}_neon (except for 00) Based on ARM assembly code in libavcodec/arm by Rob Clark and Mans Rullgard. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-12-20 14:53:39 +02:00
Diego Biurrun	f0389eb777	arm: fmtconvert: Split armv6 fmtconvert code off from vfp code	2013-08-29 11:24:14 +02:00
Diego Biurrun	8506ff97c9	vp56: Mark VP6-only optimizations as such. Most of our VP56 optimizations are VP6-only and will stay that way. So avoid compiling them for VP5-only builds.	2013-08-23 14:42:19 +02:00
Ben Avison	45e10e5c8d	arm: Add assembly version of h264_find_start_code_candidate. Before vs. after (mean/StdDev before, mean/StdDev after, change): This function: 508.8/23.4, 185.4/9.0, +174.4%; Overall: 3068.5/31.7, 2752.1/29.4, +11.5%. In combination with the preceding patch: Overall: 2925.6/26.2, 2752.1/29.4, +6.3%. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-08-08 12:08:34 +03:00
Martin Storsjö	8b9eba664e	arm: Add VFP-accelerated version of fft16. Before vs. after (mean/StdDev before, mean/StdDev after, change): This function: 1389.3/4.2, 967.8/35.1, +43.6%; Overall: 15577.5/83.2, 15400.0/336.4, +1.2%. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-07-22 10:15:41 +03:00
Martin Storsjö	ba6836c966	arm: Add VFP-accelerated version of dca_lfe_fir. Before vs. after (mean/StdDev before, mean/StdDev after, change): This function: 868.2/33.5, 436.0/27.0, +99.1%; Overall: 15973.0/223.2, 15577.5/83.2, +2.5%. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-07-22 10:15:39 +03:00
Martin Storsjö	b63bb251ea	arm: Add VFP-accelerated version of imdct_half. Before vs. after (mean/StdDev before, mean/StdDev after, change): This function: 2653.0/28.5, 1108.8/51.4, +139.3%; Overall: 17049.5/408.2, 15973.0/223.2, +6.7%. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-07-22 10:15:37 +03:00
Ben Avison	41ef1d360b	arm: Add VFP-accelerated version of synth_filter_float. Before vs. after (mean/StdDev before, mean/StdDev after, change): This function: 9295.0/114.9, 4853.2/83.5, +91.5%; Overall: 23699.8/397.6, 19285.5/292.0, +22.9%. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-07-22 10:15:17 +03:00
Martin Storsjö	86113667c0	arm: Include hpeldsp_neon.o if h264qpel is enabled A few of the h264qpel neon functions are shared with other hpeldsp functions in this file. This fixes standalone compilation of the h264 decoder on arm. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-05-30 02:17:37 +03:00
Martin Storsjö	efb7968cfe	arm: Don't unconditionally build dsputil files Signed-off-by: Martin Storsjö <martin@martin.st>	2013-05-30 02:17:35 +03:00
Martin Storsjö	36a7df8cf1	arm: Only build the FFT init files if FFT is enabled This fixes build errors in cases where FFT is disabled. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-05-30 02:17:33 +03:00
Diego Biurrun	186599ffe0	build: cosmetics: Place unconditional before conditional OBJS lines Signed-off-by: Martin Storsjö <martin@martin.st>	2013-05-30 02:17:31 +03:00
Diego Biurrun	9b9b2e9f30	build: arm: cosmetics: Place all OBJS declarations in alphabetical order Signed-off-by: Martin Storsjö <martin@martin.st>	2013-05-30 02:17:27 +03:00
Ronald S. Bultje	7384b7a713	arm: hpeldsp: Move half-pel assembly from dsputil to hpeldsp Signed-off-by: Martin Storsjö <martin@martin.st>	2013-04-19 23:19:08 +03:00
Diego Biurrun	79dad2a932	dsputil: Separate h264chroma	2013-02-06 11:30:53 +01:00
Diego Biurrun	33552a5f7b	arm: Add mathops.h to ARCH_HEADERS list It is an arch-specific header not suitable for standalone compilation.	2013-01-24 20:59:22 +01:00
Mans Rullgard	e9d817351b	dsputil: Separate h264 qpel The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit. Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2013-01-24 10:44:43 +01:00
Ronald S. Bultje	42d3246948	floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.	2013-01-22 11:55:42 -08:00
Ronald S. Bultje	fef906c77c	Move vorbis_inverse_coupling from dsputil to vorbisdspcontext. Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.	2013-01-19 22:21:10 -08:00
Ronald S. Bultje	8c53d39e7f	lavc: introduce VideoDSPContext Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2012-12-20 13:40:45 +01:00
Mans Rullgard	b326755989	arm: rename ARMVFP config symbol to VFP This is consistent with usual ARM nomenclature as well as with the VFPV3 and NEON symbols which both lack the ARM prefix. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-12-07 16:54:04 +00:00
Jean-Baptiste Kempf	507dce2536	arm: call arm-specific rv34dsp init functions under if (ARCH_ARM) Assign NEON specific function pointers after runtime check via av_get_cpu_flags(). Signed-off-by: Janne Grunau <janne-libav@jannau.net>	2012-10-10 15:28:50 +02:00
Diego Biurrun	ac56ff9cc9	build: non-x86: Only compile mpegvideo optimizations when necessary	2012-10-09 14:45:59 +02:00
Mans Rullgard	7689eea49a	flacdsp: arm optimised lpc filter	2012-09-15 23:54:21 +01:00
Mans Rullgard	28f9ab7029	vp3: move idct and loop filter pointers to new vp3dsp context This moves all VP3-specific function pointers from dsputil to a new vp3dsp context. There is no reason to ever use the VP3 IDCT where an MPEG2 IDCT is expected or vice versa. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-07-18 10:32:19 +01:00
Mans Rullgard	ab9f987661	build: add CONFIG_VP3DSP, reduce repetition in OBJS lists Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-07-18 10:32:18 +01:00
Mans Rullgard	96f7590efd	aacps: NEON optimisations Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-05-05 22:04:21 +01:00
Mans Rullgard	b692d246ea	vp8: arm: separate ARMv6 functions from NEON This is a preparation for complete ARMv6 optimisations. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-04-25 21:41:39 +01:00
Diego Biurrun	7bb3a302fe	build: Consistently handle conditional compilation for all optimization OBJS.	2012-04-12 09:00:49 +02:00
Janne Grunau	363bd1c62c	remove iwmmxt optimizations They were broken since August of 2010 without anyone noticing until three weeks ago. Nobody cares about it anymore and hopefully Marvell will support NEON like in the PXA978 from now on.	2012-03-12 22:46:56 +01:00
Mans Rullgard	be822d77b6	aacsbr: ARM NEON optimised sbrdsp functions Overall speedup of HE-AAC decoding 2.3x on Cortex-A8, 1.2x on A9. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-01-28 14:56:18 +00:00
Janne Grunau	6c88988866	rv40: NEON optimised weighted prediction Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-12-06 13:48:25 +00:00
Janne Grunau	f5c05b9aa5	rv40: NEON optimised chroma MC Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-12-06 13:48:25 +00:00
Mans Rullgard	f054a82727	ARM: move NEON H264 chroma mc to a separate file This allows sharing code with the rv40 version of these functions. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-12-06 13:48:24 +00:00
Janne Grunau	42d32cf53c	rv34: NEON optimised inverse transform functions Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-12-06 13:48:24 +00:00
Mans Rullgard	5c46ad1da0	ARM: optimised mpadsp_apply_window_fixed Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-06-13 11:33:44 +01:00
Mans Rullgard	8e112df409	ARM: ac3dsp: optimised update_bap_counts() Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-06-01 15:45:13 +01:00
Mans Rullgard	edfa89b260	ARM: unbreak build Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-05-28 18:41:20 +01:00
Mans Rullgard	f7653904c8	ARM: NEON fixed-point forward MDCT Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-04-03 22:39:52 +01:00
Mans Rullgard	dba9852935	ARM: NEON fixed-point FFT Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-04-03 22:39:52 +01:00
Mans Rullgard	aa05f2126e	ac3enc: ARM optimised ac3_compute_matissa_size Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-04-01 22:46:21 +01:00
Mans Rullgard	182826c884	ac3: armv6 optimised bit_alloc_calc_bap Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-04-01 22:46:05 +01:00
Mans Rullgard	f4855a904e	ac3enc: NEON optimised ac3_max_msb_abs_int16 and ac3_exponent_min	2011-03-24 16:30:49 +00:00
Mans Rullgard	a7878c9f73	VP8: ARM optimised decode_block_coeffs_internal Approximately 5% faster on Cortex-A8. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-02-11 15:48:11 +00:00
Mans Rullgard	a1c1d3c003	VP8: ARM NEON optimisations for dsp functions This adds NEON optimised versions of all functions in VP8DSPContext. Based on initial work by Rob Clark. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-02-07 16:08:23 +00:00
Justin Ruggles	c73d99e672	Separate format conversion DSP functions from DSPContext. This will be beneficial for use with the audio conversion API without requiring it to depend on all of dsputil. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-02-02 02:44:53 +00:00
Jason Garrett-Glaser	4a384de5b8	Split h264dsp and h264pred in configure. Many H.264 derivatives, like RV40 and VP8, use the H.264 prediction functions but not the weight/loopfilter functions. This should reduce the size of builds with one of these derivatives but without H.264 decoding itself. Originally committed as revision 24741 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-08-07 23:10:25 +00:00
Aurelien Jacobs	42d1e7a287	fix VP5/6 neon dependencies Originally committed as revision 24160 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-10 14:26:37 +00:00
Måns Rullgård	41331b65f2	ARM: NEON optimised dct_unquantize_h263_{intra,inter} Originally committed as revision 23386 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-05-29 15:29:40 +00:00
Måns Rullgård	5635985c26	ARM: NEON optimised VP6 edge filter Originally committed as revision 22993 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-04-30 21:30:27 +00:00
Måns Rullgård	b591c7af31	10l: fix build on non-NEON ARM Originally committed as revision 22867 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-04-13 00:48:49 +00:00
Måns Rullgård	08255107cf	DCA: ARM/NEON optimised lfe_fir Originally committed as revision 22863 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-04-12 20:45:33 +00:00
Måns Rullgård	e73d1a5efc	ARM: NEON optimised synth_filter_float 2.7x faster DCA decoding on Cortex-A8 Originally committed as revision 22828 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-04-10 16:27:56 +00:00
Måns Rullgård	a8bb9ea532	ARM: NEON optimised RDFT Originally committed as revision 22641 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-03-23 03:35:02 +00:00
Måns Rullgård	3bd74e9243	Simplify arch-specific object file lists Originally committed as revision 22570 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-03-16 21:23:03 +00:00
Måns Rullgård	43f60eba19	Move arch-specific makefile parts into $arch/Makefile Originally committed as revision 22569 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-03-16 21:22:59 +00:00

111 Commits