419 Commits

Author SHA1 Message Date
Michael Niedermayer
30cd3e66bd remove code which became unused by the previous changes
Originally committed as revision 10166 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:05:30 +00:00
Michael Niedermayer
72dee89b5d Simplify and optimize the 4th vertical lifting step of the SSE2 code (untested)
This also reduces the needed headroom in that step by 1 bit

Originally committed as revision 10165 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:03:18 +00:00
Michael Niedermayer
d0dae46a90 Simplify and optimize the 4th vertical lifting step of the MMX code
This also reduces the needed headroom in that step by 1 bit

Originally committed as revision 10164 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:02:19 +00:00
Michael Niedermayer
1ffbbef217 Simplify and speedup code, reduce needed headroom by 2 bits in the 3rd
vertical lifting step of the SSE2 code (untested)

Originally committed as revision 10163 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:59:20 +00:00
Michael Niedermayer
4bf1790421 simplify, speedup and reduce needed headroom by 2 bits in the 3rd
vertical lifting step

Originally committed as revision 10162 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:54:49 +00:00
Michael Niedermayer
dd30437bbe replace <<1 by add for SSE2 (untested)
Originally committed as revision 10161 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:11:05 +00:00
Michael Niedermayer
7e665a3943 replace <<1 by add
Originally committed as revision 10160 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:09:39 +00:00
Michael Niedermayer
eee649d3b5 slightly change horizontal lift3 so it needs 1 bit less headroom
Originally committed as revision 10159 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:02:59 +00:00
Michael Niedermayer
be3b22f9ca remove idiotic double subtraction from the sse2 code (untested, no sse2 here)
Originally committed as revision 10158 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 22:41:47 +00:00
Michael Niedermayer
8510c519e7 fixme note
Originally committed as revision 10157 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 22:34:42 +00:00
Michael Niedermayer
3a9f44d5d5 and of course the unneeded double subtractions were blindly put in the
mmx code
this also makes the affected code 4% faster

Originally committed as revision 10156 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 22:29:21 +00:00
Aurelien Jacobs
73f51a4d3e help some gcc version to optimize out those functions
Originally committed as revision 9785 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-24 08:54:56 +00:00
Aurelien Jacobs
674eeb5f57 cosmetics: indentation
Originally committed as revision 9582 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-10 20:27:50 +00:00
Aurelien Jacobs
eb75a69818 Avoid linking with h263.c functions when the relevant codecs
are not compiled in.

Originally committed as revision 9581 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-10 20:23:08 +00:00
Måns Rullgård
a00177a952 make arguments to ssd_int8_vs_int16() const
Originally committed as revision 9548 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-08 23:15:00 +00:00
Diego Biurrun
09a6682648 typos
Originally committed as revision 9486 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-05 12:06:02 +00:00
Diego Biurrun
e5a389a1b7 license header consistency cosmetics
Originally committed as revision 9484 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-05 10:40:25 +00:00
Diego Biurrun
7b94177e37 Group all copyright and author notices together.
Originally committed as revision 9483 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-07-05 10:37:29 +00:00
Guillaume Poirier
efb775777f add a comment to indicate which #endif belongs to which #define
Originally committed as revision 9356 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-17 18:59:28 +00:00
Måns Rullgård
699b3f99d0 add multiple inclusion guards to headers
Originally committed as revision 9345 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-17 00:01:30 +00:00
Måns Rullgård
8657d5e549 correct type for {put,avg}_h264_chroma_mc2_mmx2
Originally committed as revision 9333 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-16 14:51:57 +00:00
Diego Biurrun
90b5b51eab misc typo fixes
Originally committed as revision 9291 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-12 18:50:50 +00:00
Diego Biurrun
755bfeabcc misc spelling fixes
Originally committed as revision 9289 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-12 09:29:25 +00:00
Graham Booker
ee387b57fd work around issues with the old version of GNU Assembler shipped on
Intel / OSX 10.4.9
patch by Graham Booker % gbooker A cod3r P com %
Original thread:
Date: May 25, 2007 11:51 PM
Subject: [FFmpeg-devel] r9017 breaks WMA decoding on Intel Macs
The patch:
Date: Jun 3, 2007 4:37 PM
Subject: Re: [FFmpeg-devel] r9017 breaks WMA decoding on Intel Macs

Originally committed as revision 9260 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-08 20:35:30 +00:00
Diego Biurrun
34933442ac Preprocessor #defines starting with __ are reserved for the system.
Originally committed as revision 9251 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-07 17:19:37 +00:00
Ronald S. Bultje
3d4e3e560f Use FFmpeg-specific CPU feature definitions.
taken from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net
Date: Thu, 07 Jun 2007 08:57:46 -0400
Subject: Re: [FFmpeg-devel] [PATCH] Coremake support - ffmpeg_nommx.patch (1/1)
 - ffmpeg-nommx.patch (1/1) - ff-cputest-noavconfig.patch (1/1)

Originally committed as revision 9250 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-06-07 17:18:01 +00:00
Carl Eugen Hoyos
154e30f6c2 rename attribute_unused to av_unused and move its declaration to common.h
patch by Carl Eugen Hoyos cehoyos chez ag or at
original thread: [FFmpeg-devel] [PATCH] attribute_unused -> av_unused
date: 05/29/2007 01:23 PM

Originally committed as revision 9155 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-30 09:32:25 +00:00
Zuxy Meng
663deb54af Remove incorrect comment; MMX2 is preferred over 3DNow! on Athlon
Originally committed as revision 9079 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-20 05:07:44 +00:00
Zuxy Meng
038bfcf9d6 3DNow! and SSSE3 optimization to QNS DSP functions; use pmulhrw/pmulhrsw instead of pmulhw
Originally committed as revision 9053 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-18 08:18:56 +00:00
Aurelien Jacobs
5b0b7054b4 better separation of vp3dsp functions from dsputil_mmx.c
Originally committed as revision 9039 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-16 23:23:45 +00:00
Ronald S. Bultje
b550bfaa61 Add libavcodec to compiler include flags in order to simplify header
include paths in the source files.
mostly from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net

Originally committed as revision 9034 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-16 09:51:45 +00:00
Panagiotis Issaris
9b5dc86746 Make vp3dsp*.c compilation optional.
Originally committed as revision 9025 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-14 14:28:13 +00:00
Reimar Döffinger
e36d79c837 Change some leftover __attribute__((unused)) and __attribute__((used)) to
attribute_unused and attribute_used respectively to ease compiling on non-gcc.

Originally committed as revision 9024 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-14 14:07:50 +00:00
Zuxy Meng
25e4f8aaee Faster SSE FFT/MDCT, patch by Zuxy Meng %zuxy P meng A gmail P com%
unrolls some loops, utilizing all 8 xmm registers. fft-test
shows ~10% speed up in (I)FFT and ~8% speed up in (I)MDCT on Dothan

Originally committed as revision 9017 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-13 16:32:32 +00:00
Loren Merritt
ff506a906e sse2 & ssse3 versions of dct_quantize.
core2: mmx2=154 sse2=73 ssse3=66 (cycles)
k8: mmx2=179 sse2=149
p4: mmx2=284 sse2=194

Originally committed as revision 9003 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-12 05:55:09 +00:00
Loren Merritt
1edbfe1994 factor sum_abs_dctelem out of dct_sad, and simd it.
sum_abs_dctelem_* alone:
core2: c=186 mmx2=39 sse2=21 ssse3=13 (cycles)
k8: c=163 mmx2=33 sse2=31
p4: c=370 mmx2=60 sse2=60
 dct_sad including sum_abs_dctelem_*:
core2: c=405 mmx2=258 sse2=240 ssse3=232
k8: c=624 mmx2=394 sse2=392
p4: c=849 mmx2=556 sse2=556

Originally committed as revision 9001 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-12 02:41:25 +00:00
Loren Merritt
561f940c03 sse2 & ssse3 versions of hadamard. unroll and inline diff_pixels.
core2: before mmx2=193 cycles. after mmx2=174 sse2=122 ssse3=115 (cycles).
k8: before mmx2=205. after mmx2=184 sse2=180.
p4: before mmx2=342. after mmx2=314 sse2=309.

Originally committed as revision 9000 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-12 01:16:06 +00:00
Loren Merritt
ba53071acb 10l, r8991 broke mmx1 sad
Originally committed as revision 8993 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-11 03:29:06 +00:00
Loren Merritt
72946825fa sse2 version of fullpel sad.
16% faster on core2, 5% faster on p4. 10% slower (and thus disabled) on k8.

Originally committed as revision 8992 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-11 01:11:45 +00:00
Loren Merritt
164d75ebf3 tweak mmx2 sad.
40% faster on core2, 18% faster on k8, 5% faster on p4.

Originally committed as revision 8991 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-11 00:45:07 +00:00
Loren Merritt
eca3810e31 tweak mmx2 sad.
6% faster on core2 and k8, no change on p4.

Originally committed as revision 8984 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-10 22:24:19 +00:00
Loren Merritt
7c3a9fe2a3 sse2 version of fdct_col.
k8: 72->61 cycles, core2: 51->26 cycles.

Originally committed as revision 8966 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-10 03:13:41 +00:00
Loren Merritt
5adf43e47e cosmetics: remove code duplication in hadamard8_diff_mmx
Originally committed as revision 8946 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-09 01:46:33 +00:00
Loren Merritt
bba5293bb7 cosmetics: remove duplicate transpose macro
Originally committed as revision 8939 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-08 17:55:56 +00:00
Reimar Döffinger
a1ce61108b Fix parts missed in clip -> av_clip rename
Originally committed as revision 8760 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-04-19 16:12:06 +00:00
Diego Biurrun
fe0372296a typos
Originally committed as revision 8642 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-04-07 14:10:02 +00:00
Loren Merritt
5900637219 mmx 16-bit ssd. 2.3x faster svq1 encoding.
Originally committed as revision 8559 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-03-30 19:15:31 +00:00
Diego Biurrun
d42f88025a Fix wrong conditional, Snow decoding, not encoding, was SIMD-accelerated.
Originally committed as revision 8116 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-02-24 11:58:52 +00:00
Michael Niedermayer
58e31fb1d5 reorder a few more paddws to reduce dependency chains
chroma mc4 put 2480 -> 2460 dezicycles on duron

Originally committed as revision 8098 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-02-23 15:44:56 +00:00
Michael Niedermayer
b4fe97696c reorder paddws to reduce dependency chain
put_h264_chroma_mc2_mmx2() 927 -> 902 dezicycles on duron

Originally committed as revision 8097 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-02-23 15:28:35 +00:00