ffmpeg/libavcodec/x86
Loren Merritt 11ab1e409f FFT: factor a shuffle out of the inner loop and merge it into fft_permute.
6% faster SSE FFT on Conroe, 2.5% on Penryn.

Signed-off-by: Janne Grunau <janne-ffmpeg@jannau.net>
(cherry picked from commit e6b1ed693a)
2011-02-14 23:58:19 +01:00
ac3dsp_mmx.c Add x86-optimized versions of exponent_min(). 2011-02-11 02:54:09 +01:00
ac3dsp.asm Add x86-optimized versions of exponent_min(). 2011-02-11 02:54:09 +01:00
cavsdsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dct32_sse.c xmm_clobbers: list xmm registers first in clobber list 2010-10-31 18:14:48 +00:00
deinterlace.asm Use SECTION .text for yasm code. 2010-12-01 13:12:39 +00:00
dnxhd_mmx.c dnxhd_mmx: prefer xmm registers below xmm6 when they are available 2010-11-02 03:09:16 +00:00
dsputil_mmx_avg_template.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
dsputil_mmx_qns_template.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
dsputil_mmx_rnd_template.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
dsputil_mmx.c Separate format conversion DSP functions from DSPContext. 2011-02-04 03:08:09 +01:00
dsputil_mmx.h Move lpc_compute_autocorr() from DSPContext to a new struct LPCContext. 2011-01-23 19:32:06 +01:00
dsputil_yasm.asm Fix ff_emu_edge_core_sse() on Win64. 2011-02-09 03:33:55 +01:00
dsputilenc_mmx.c Move lpc_compute_autocorr() from DSPContext to a new struct LPCContext. 2011-01-23 19:32:06 +01:00
dsputilenc_yasm.asm Don't access upper 32 bits of a 32-bit int on 64-bit systems. 2010-09-17 12:24:22 +00:00
fdct_mmx.c cosmetics: split long line 2010-10-31 13:46:17 +00:00
fft_3dn2.c imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 2010-08-23 15:51:09 +00:00
fft_3dn.c
fft_mmx.asm FFT: factor a shuffle out of the inner loop and merge it into fft_permute. 2011-02-14 23:58:19 +01:00
fft_sse.c Fix ff_imdct_calc_sse() on gcc-4.6 2011-02-04 03:08:09 +01:00
fft.c FFT: factor a shuffle out of the inner loop and merge it into fft_permute. 2011-02-14 23:58:19 +01:00
fft.h SSE optimized 32-point DCT 2010-07-06 16:58:54 +00:00
fmtconvert_mmx.c Separate format conversion DSP functions from DSPContext. 2011-02-04 03:08:09 +01:00
fmtconvert.asm Separate format conversion DSP functions from DSPContext. 2011-02-04 03:08:09 +01:00
h264_chromamc.asm For rounding in chroma MC SSSE3, use 16-byte pw_3/4 instead of reading 8 bytes 2010-12-24 17:23:22 +00:00
h264_deblock.asm Port latest x264 deblock asm (before they moved to using NV12 as internal 2010-09-03 16:52:46 +00:00
h264_i386.h
h264_idct.asm H.264: split luma dc idct out and implement MMX/SSE2 versions 2011-01-14 21:34:25 +00:00
h264_intrapred_init.c Port pred8x8l_down_left_mmxext (H.264 intra prediction) from x264 (authors: 2010-12-29 23:48:44 +00:00
h264_intrapred.asm x86: fix overflow in h264 8x8 planar prediction 2011-01-26 03:43:29 +01:00
h264_qpel_mmx.c xmm_clobbers: list xmm registers first in clobber list 2010-10-31 18:14:48 +00:00
h264_weight.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
h264dsp_mmx.c H.264: split luma dc idct out and implement MMX/SSE2 versions 2011-01-14 21:34:25 +00:00
idct_mmx_xvid.c
idct_mmx.c Fix compilation in x86_64. I broke it with r24580. 2010-07-29 22:45:21 +00:00
idct_sse2_xvid.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
idct_xvid.h
lpc_mmx.c cosmetics related to LPC changes. 2011-01-23 19:32:06 +01:00
Makefile Add x86-optimized versions of exponent_min(). 2011-02-11 02:54:09 +01:00
mathops.h
mlpdsp.c
motion_est_mmx.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
mpegaudiodec_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
mpegvideo_mmx_template.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
mpegvideo_mmx.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
simple_idct_mmx.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
snowdsp_mmx.c snowdsp: Explicitly state the operand sizes 2010-10-04 13:08:13 +00:00
vc1dsp_mmx.c Replace ASMALIGN() with .p2align 2011-01-18 20:48:24 +00:00
vc1dsp_yasm.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp3dsp.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp8dsp-init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vp8dsp.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp56_arith.h VP5/6/8: ~7% faster arithmetic decoding 2010-08-12 01:11:32 +00:00
vp56dsp_init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vp56dsp.asm Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. 2010-08-26 14:33:39 +00:00
x86inc.asm sync yasm macros from x264 2010-07-21 22:45:16 +00:00
x86util.asm Add x86-optimized versions of exponent_min(). 2011-02-11 02:54:09 +01:00