ffmpeg

Author	SHA1	Message	Date
Kostya Shishkov	f399e406af	altivec: perform an explicit unaligned load Implicit vector loads on POWER7 hardware can use the VSX instruction set instead of classic Altivec/VMX. Let's force a VMX load in this case. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-08-16 10:08:47 +03:00
Diego Biurrun	3ac7fa81b2	Consistently use "cpu_flags" as variable/parameter name for CPU flags	2013-07-18 00:31:35 +02:00
Christophe Gisquet	b6293e2798	fmtconvert: Explicitly use int32_t instead of int Signed-off-by: Martin Storsjö <martin@martin.st>	2013-07-17 11:02:47 +03:00
Kostya Shishkov	0418cbf081	fix scalarproduct_and_madd_int16_altivec() for orders > 16 the second and third sources were incremented only by half of the needed size	2013-05-26 16:10:47 +02:00
Diego Biurrun	a650c906cb	ppc: Only compile AltiVec FFT assembly when AltiVec is enabled	2013-05-02 10:25:30 +02:00
Diego Biurrun	7f75f2f2bd	ppc: Drop unnecessary ff_ name prefixes from static functions	2013-04-30 16:10:06 +02:00
Diego Biurrun	38282149b6	ppc: More consistent arch initialization	2013-04-30 12:19:45 +02:00
Diego Biurrun	a053dbfcfb	ppc: Move AltiVec utility headers out of AltiVec ifdefs Now that the headers themselves have ifdef protection this is no longer necessary and more consistent with normal include handling.	2013-04-30 12:19:44 +02:00
Diego Biurrun	6b110d3a73	ppc: More consistent names for H.264 optimizations files	2013-04-30 12:19:43 +02:00
Diego Biurrun	643e433bf7	mpegaudiosp: More consistent names for ppc/x86 optimization files	2013-04-30 12:19:43 +02:00
Martin Storsjö	6d0fbebf94	ppc: hpeldsp: Include attributes.h This fixes building in configurations where altivec is disabled. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-04-20 16:43:01 +03:00
Ronald S. Bultje	47e5a98174	ppc: hpeldsp: Move half-pel assembly from dsputil to hpeldsp Signed-off-by: Martin Storsjö <martin@martin.st>	2013-04-19 23:18:59 +03:00
Ronald S. Bultje	015821229f	vp3: Use full transpose for all IDCTs This way, the special IDCT permutations are no longer needed. This is similar to how H264 does it, and removes the dsputil dependency imposed by the scantable code. Also remove the unused type == 0 cases from the plain C version of the idct. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-04-15 12:32:05 +03:00
Ronald S. Bultje	62844c3fd6	h264: Integrate clear_blocks calls with IDCT The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-04-10 11:03:06 +03:00
Luca Barbato	a8b6015823	dsputil: convert remaining functions to use ptrdiff_t strides Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2013-03-12 18:26:42 +01:00
Diego Biurrun	c242bbd8b6	Remove unnecessary dsputil.h #includes	2013-02-26 00:51:34 +01:00
Diego Biurrun	218aefce44	dsputil: Move LOCAL_ALIGNED macros to libavutil	2013-02-08 23:13:37 +01:00
Diego Biurrun	79dad2a932	dsputil: Separate h264chroma	2013-02-06 11:30:53 +01:00
Diego Biurrun	c9f933b5b6	Add av_cold attributes to arch-specific init functions	2013-02-05 17:01:05 +01:00
Diego Biurrun	25841dfe80	Use ptrdiff_t instead of int for {avg, put}_pixels line_size parameter. This avoids SIMD-optimized functions having to sign-extend their line size argument manually to be able to do pointer arithmetic.	2013-02-05 12:59:12 +01:00
Diego Biurrun	4eef2ed707	ppc: fmtconvert: Drop two unused variables.	2013-02-01 12:51:13 +01:00
Mans Rullgard	e9d817351b	dsputil: Separate h264 qpel The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit. Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2013-01-24 10:44:43 +01:00
Diego Biurrun	88bd7fdc82	Drop DCTELEM typedef It does not help as an abstraction and adds dsputil dependencies. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2013-01-22 18:32:56 -08:00
Ronald S. Bultje	42d3246948	floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.	2013-01-22 11:55:42 -08:00
Ronald S. Bultje	55aa03b9f8	floatdsp: move vector_fmul_add from dsputil to avfloatdsp.	2013-01-22 11:55:42 -08:00
Ronald S. Bultje	1768e43ceb	vorbisdsp: change block_size type from int to intptr_t. This saves one instruction in the x86-64 assembly.	2013-01-20 22:26:42 -08:00
Diego Biurrun	d9bf716945	ppc: vorbisdsp: Drop some unnecessary #includes Also fixes compilation with AltiVec disabled.	2013-01-20 17:38:11 +01:00
Martin Storsjö	d160a2fb4c	ppc: Include string.h for memset This fixes build failures on ppc machines with a compiler that supports -Werror=implicit-function-declaration. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-01-20 18:10:21 +02:00
Ronald S. Bultje	fef906c77c	Move vorbis_inverse_coupling from dsputil to vorbisdspcontext. Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.	2013-01-19 22:21:10 -08:00
Ronald S. Bultje	aeaf268e52	vp3: integrate clear_blocks with idct of previous block. This is identical to what e.g. vp8 does, and prevents the function call overhead (plus dependency on dsputil for this particular function). Arm asm updated by Janne Grunau <janne-libav@jannau.net>. Signed-off-by: Janne Grunau <janne-libav@jannau.net>	2013-01-19 22:04:55 -08:00
Justin Ruggles	e034cc6c60	lavc: Move vector_fmul_window to AVFloatDSPContext Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2013-01-16 10:45:45 +01:00
Ronald S. Bultje	8c53d39e7f	lavc: introduce VideoDSPContext Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2012-12-20 13:40:45 +01:00
Mans Rullgard	a384f6a7f7	ppc: replace pointer casting with AV_COPY32 This removes warnings about strict aliasing violations. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-11-12 10:31:31 +00:00
Mans Rullgard	031aac9861	ppc: fix some unused variable warnings The third argument of OP_U8_ALTIVEC is evaluated at most once so there is no need for a potentially unused temporary variable. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-11-12 10:31:31 +00:00
Diego Biurrun	ac56ff9cc9	build: non-x86: Only compile mpegvideo optimizations when necessary	2012-10-09 14:45:59 +02:00
Mans Rullgard	f79364b2c3	ppc: fix Altivec build with old compilers The vec_splat() intrinsic requires a constant argument for the element number, and the code relies on the compiler unrolling the loop to provide this. Manually unrolling the loop avoids this reliance and works with all compilers. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-10-08 23:14:51 +01:00
Mans Rullgard	642b4efaf7	ppc: fmtconvert: kill VLA in float_to_int16_interleave_altivec() Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-10-05 22:33:32 +01:00
Martin Storsjö	33e112847d	Add more missing includes after removing the implicit common.h Signed-off-by: Martin Storsjö <martin@martin.st>	2012-08-16 10:49:54 +03:00
Martin Storsjö	70766c2182	Add some more missing includes after removing the implicit common.h Signed-off-by: Martin Storsjö <martin@martin.st>	2012-08-15 23:48:48 +03:00
Martin Storsjö	1d9c2dc89a	Don't include common.h from avutil.h Signed-off-by: Martin Storsjö <martin@martin.st>	2012-08-15 22:32:06 +03:00
Justin Ruggles	a35738f424	dsputil: ppc: cosmetics: pretty-print	2012-07-22 17:38:55 -04:00
Mans Rullgard	ffdd93a25e	ppc: fix build with altivec disabled Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-07-18 13:34:42 +01:00
Mans Rullgard	28f9ab7029	vp3: move idct and loop filter pointers to new vp3dsp context This moves all VP3-specific function pointers from dsputil to a new vp3dsp context. There is no reason to ever use the VP3 IDCT where an MPEG2 IDCT is expected or vice versa. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-07-18 10:32:19 +01:00
Diego Biurrun	af10feadc2	ppc: Rename H.264 optimization template file for consistency.	2012-06-12 23:20:05 +02:00
Justin Ruggles	d5a7229ba4	Add a float DSP framework to libavutil Move vector_fmul() from DSPContext to AVFloatDSPContext.	2012-06-08 13:14:38 -04:00
Justin Ruggles	98db4e2a4e	PPC: Move types_altivec.h and util_altivec.h from libavcodec to libavutil This will allow for easier implementation of Altivec functions in libraries other than libavcodec.	2012-06-08 13:14:38 -04:00
Diego Biurrun	3ea5429489	ppc: Drop unused header regs.h	2012-05-22 11:54:53 +02:00
Mans Rullgard	c81d1e2390	ppc: add const where needed in scalarproduct_int16_altivec() Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-05-01 00:21:30 +01:00
Mans Rullgard	ce82dad7eb	ppc: remove shift parameter from scalarproduct_int16_altivec() The shift parameter was removed from this interface in `7e1ce6a`. This updates the Altivec implementation to match. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-05-01 00:21:30 +01:00
Mans Rullgard	4c387c7070	ppc: dsputil: do unaligned block accesses correctly To load unaligned vector data in the usual way, explicit vec_ld() should be used rather than dereferencing a pointer to a vector type. When the VSX extension is enabled, gcc may compile vector pointer dereferences using the VSX lxvw4x instruction instead of the lvx instruction typically used with Altivec/VMX. As the behaviour of these instructions with unaligned addresses differs, it is important that only lvx is used here. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-05-01 00:21:30 +01:00
Mans Rullgard	2bcbd98459	Remove lowres video decoding This feature is complex, of questionable utility, and slows down normal decoding. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-04-21 18:56:19 +01:00
Diego Biurrun	0f53601ac6	ppc: drop unused function dct_quantize_altivec() This also allows dropping some PPC-specific ugliness from dsputil.[ch].	2012-04-18 18:53:54 +02:00
Diego Biurrun	7bb3a302fe	build: Consistently handle conditional compilation for all optimization OBJS.	2012-04-12 09:00:49 +02:00
Diego Biurrun	02c39f056a	ppc: Add/remove a number of const qualifiers to fix related warnings.	2012-04-09 20:39:33 +02:00
Diego Biurrun	72ccfb3cb7	build: ppc: drop stray leftover backslash	2012-03-26 16:37:57 +02:00
Diego Biurrun	ad0e31f134	build: prettyprinting cosmetics	2012-03-26 13:00:10 +02:00
Ronald S. Bultje	bd66f073fe	vp8: change int stride to ptrdiff_t stride. On 64bit platforms with 32bit int, this means we won't have to sign- extend the integer anymore.	2012-03-02 10:31:50 -08:00
Martin Storsjö	210f72845c	ppc: Add ff_ prefix to nonstatic symbols Signed-off-by: Martin Storsjö <martin@martin.st>	2012-02-15 22:07:29 +02:00
Martin Storsjö	efd29844eb	mpegvideo: Add ff_ prefix to nonstatic functions Signed-off-by: Martin Storsjö <martin@martin.st>	2012-02-15 22:07:23 +02:00
Martin Storsjö	9cf0841ef3	dsputil: Add ff_ prefix to the dsputil_init functions Signed-off-by: Martin Storsjö <martin@martin.st>	2012-02-15 22:06:34 +02:00
Diego Biurrun	0bba26466f	cosmetics: Delete empty lines at end of file.	2012-02-09 12:26:45 +01:00
Diego Biurrun	32f3c541bc	doxygen: Do not include license boilerplates in Doxygen comment blocks.	2012-02-06 19:39:24 +01:00
Diego Biurrun	3dc99a18d4	cosmetics: drop some pointless parentheses	2012-01-07 22:13:07 +01:00
Diego Biurrun	ff159e7816	doxygen: Replace '\' by '@' in Doxygen markup tags.	2011-12-07 15:29:14 +01:00
Mans Rullgard	b034c95cc1	h264: fix ppc/altivec build Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-10-21 12:49:01 +01:00
Ronald S. Bultje	c2d337429c	H264: change weight/biweight functions to take a height argument. Neon parts by Mans Rullgard <mans@mansr.com>.	2011-10-21 01:00:45 -07:00
Baptiste Coudurier	76741b0e56	h264: 4:2:2 intra decoding support Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2011-10-21 01:00:41 -07:00
Mans Rullgard	6e4a35ced9	ppc: fix 32-bit PIC build On 32-bit ppc, the GOT pointer must be loaded manually. This adds a "get_got" assembler macro to compute the GOT address. The "movrel" macro is updated to take an additional parameter containing the GOT address since no register is reserved for this purpose on ppc32. These changes have no effect on ppc64 builds. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-09-25 17:27:48 +01:00
Mans Rullgard	ca6a904656	ppc: remove redundant setting of Altivec IDCT This is already set by dsputil_init_ppc() and is best done in only one place. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-07-27 20:14:12 +01:00
Mans Rullgard	a617c6aaa3	dsputil: update per-arch init funcs for non-h264 high bit depth Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-07-21 18:10:58 +01:00
Mans Rullgard	874f1a901d	dsputil: template get_pixels() for different bit depths Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-07-21 18:10:58 +01:00
Mans Rullgard	0a72533e98	jfdctint: add 10-bit version Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-07-21 18:10:58 +01:00
Mans Rullgard	e7a972e113	simple_idct: add 10-bit version Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-07-20 17:49:48 +01:00
Diego Biurrun	21aed0ed92	ppc: remove disabled code	2011-07-16 02:56:52 +02:00
Mans Rullgard	6cbf2420b9	PPC: use Altivec IMDCT only for supported sizes The Altivec IMDCT works with size 32 and higher only. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-07-05 16:01:56 +01:00
Jason Garrett-Glaser	c90b94424c	4:4:4 H.264 decoding support Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.	2011-06-13 21:16:30 -07:00
Diego Biurrun	1f6b9cc31d	Replace some nonstandard DEBUG_* preprocessor directives by plain DEBUG.	2011-06-07 13:20:58 +02:00
Mans Rullgard	0b5e44ed29	mpegaudiodsp: fix x86 and ppc makefiles Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-05-19 16:32:24 +01:00
Mans Rullgard	c4f5c2d6f4	Move some mpegaudio functions to new mpegaudiodsp subsystem This separation allows these functions to be used in a cleaner fashion from other codecs (e.g. qdm2) and simplifies creating optimised versions of them. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-05-19 12:25:34 +01:00
Oskar Arvidsson	19a0729b4c	Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 decoder. This patch lets e.g. dsputil_init chose dsp functions with respect to the bit depth to decode. The naming scheme of bit depth dependent functions is <base name>_<bit depth>[_<prefix>] (i.e. the old clear_blocks_c is now named clear_blocks_8_c). Note: Some of the functions for high bit depth is not dependent on the bit depth, but only on the pixel size. This leaves some room for optimizing binary size. Preparatory patch for high bit depth h264 decoding support. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2011-05-10 07:24:36 -04:00
Ronald S. Bultje	18b6a69ce9	Revert "VC1: merge idct8x8, coeff adjustments and put_pixels." This reverts commit `f8bed30d8b`. The reason for this is that the overlap filter, which runs after IDCT, should run on unclamped values, and thus IDCT and put_pixels() cannot be merged if we want to attempt to be bitexact.	2011-05-04 07:40:01 -04:00
Alex Converse	187a537904	Convert some undefined 1<<31 shifts into 1U<<31. According to ISO 9899:1999 S 6.5.7/4: The result of E1 << E2 is E1 left-shifted E2 bit positions; vacated bits are filled with zeros. If E1 has an unsigned type, the value of the result is E1× 2^E2, reduced modulo one more than the maximum value representable in the result type. If E1 has a signed type and nonnegative value, and E1× 2^E2 is representable in the result type, then that is the resulting value; otherwise, the behavior is undefined.	2011-04-11 21:47:42 -07:00
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-03-19 13:33:20 +00:00
Justin Ruggles	d21be5f15b	cosmetics: rename ff_fmt_convert_init_ppc() to ff_fmt_convert_init_altivec(). It only has Altivec functions and is not compiled if Altivec is disabled.	2011-03-07 11:15:29 -05:00
Mans Rullgard	e0e46cae37	vp8: ppc: fix invalid reads in altivec epel mc The 4-tap filters should only access one row/column before the reference block. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-02-21 20:28:41 +00:00
Mans Rullgard	381efba0ec	ppc: fix vc1 inverse transform, unbreak build GCC 4.3 and later are more particular about signedness matching in vector operations. The operations under if(rangered) were missing assignments and thus had no effect. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-02-21 20:28:37 +00:00
Ronald S. Bultje	f8bed30d8b	VC1: merge idct8x8, coeff adjustments and put_pixels. Merging these functions allows merging some loops, which makes the results (particularly after SIMD optimizations) much faster.	2011-02-21 10:23:44 -05:00
Ronald S. Bultje	ed040f35f2	Fix PPC build.	2011-02-17 20:22:39 -05:00
Ronald S. Bultje	12802ec060	dsputil: move VC1-specific stuff into VC1DSPContext.	2011-02-17 17:35:35 -05:00
Ronald S. Bultje	1da6ea3954	VC1: transpose IDCT 8x8 coeffs while reading.	2011-02-17 17:35:35 -05:00
Justin Ruggles	c73d99e672	Separate format conversion DSP functions from DSPContext. This will be beneficial for use with the audio conversion API without requiring it to depend on all of dsputil. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-02-02 02:44:53 +00:00
Justin Ruggles	80ba1ddb58	Remove unneeded add bias from 3 functions. DSPContext.vector_fmul_window() DCADSPContext.lfe_fir() SynthFilterContext.synth_filter_float() Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-01-31 20:28:42 +00:00
Vitor Sessak	3af1fe829e	Fix overread in altivec DSP function sad16 Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-01-29 15:32:14 +00:00
Justin Ruggles	6eabb0d3ad	Change DSPContext.vector_fmul() from dst=dstsrc to dest=src0src1. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-01-22 17:53:27 +00:00
Janne Grunau	2c3589bfda	consolidate .gitignore patters into a single file Signed-off-by: Janne Grunau <janne-ffmpeg@jannau.net>	2011-01-18 21:32:05 +01:00
Janne Grunau	348b8218f7	convert svn:ignore properties to .gitignore files Signed-off-by: Janne Grunau <janne-ffmpeg@jannau.net>	2011-01-17 15:50:14 +01:00
Stefano Sabatini	c6c98d0897	Move mm_support() from libavcodec to libavutil, make it a public function and rename it to av_get_cpu_flags(). Originally committed as revision 25076 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-09-08 15:07:14 +00:00
Stefano Sabatini	ccf22d3ed1	Merge has_altivec() function into mm_support(), remove it and use mm_support() instead. Reduce complexity and simplify pending move to libavutil. Originally committed as revision 25074 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-09-08 10:02:40 +00:00
Stefano Sabatini	7160bb716b	Rename FF_MM_ symbols related to CPU features flags as AV_CPU_FLAG_ symbols, and move them from libavcodec/avcodec.h to libavutil/cpu.h. Originally committed as revision 25040 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-09-04 09:59:08 +00:00
Måns Rullgård	c0ec9918b0	Remove global mm_flags variable Originally committed as revision 24909 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-08-24 17:47:05 +00:00

1 2 3 4 5 ...

442 Commits