Commit Graph

513 Commits

Author SHA1 Message Date
Diego Biurrun
a12b44d7fb Add missing required header directly.
Originally committed as revision 13103 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-05-09 14:34:52 +00:00
Diego Biurrun
20cd685ae8 Add missing path to #include.
Originally committed as revision 13102 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-05-09 14:33:55 +00:00
Diego Biurrun
245976da2a Use full path for #includes from another directory.
Originally committed as revision 13098 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-05-09 11:56:36 +00:00
Ramiro Polla
40d0e665d0 Do not misuse long as the size of a register in x86.
typedef x86_reg as the appropriate size and use it instead.

Originally committed as revision 13081 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-05-08 21:11:24 +00:00
Diego Biurrun
57105ddd03 Rename i386/cputest.c --> i386/cpuid.c.
Originally committed as revision 13002 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-26 16:02:22 +00:00
Diego Biurrun
c88c253d8b cosmetics: __asm__ __volatile__ --> asm volatile
Originally committed as revision 12885 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-17 21:57:52 +00:00
Diego Biurrun
80465c7eed cosmetics: Fix nonstandard indentation.
Originally committed as revision 12863 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-16 20:51:39 +00:00
Jeff Downs
591d87babe Cosmetics:
Break long lines.
Correct spelling in comment (duplicatin -> duplicating)

Originally committed as revision 12862 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-16 20:43:37 +00:00
Jeff Downs
52cb7981e2 Redo r12838, this time using svn copy to create h264_i386.h from cabac.h.
Move decode_significance_x86() and decode_significance_8x8_x86() to
i386-specific file from cabac.h.
New file is h264-oriented and only included from h264.c
Resolves compilation when configured with --disable-optimizations due to
decode_significance_8x8_x86 using last_coeff_flag_offset_8x8, which is
only defined in h264.c

Originally committed as revision 12846 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-16 04:40:21 +00:00
Jeff Downs
3aa9ede400 Revert 12838 to redo it the right way (use svn copy to create new
file based on old).

Originally committed as revision 12845 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-16 04:26:52 +00:00
Alexander Strange
f73a6393e7 Add a new xvid-style IDCT using SSE2.
Originally committed as revision 12843 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-16 01:36:14 +00:00
Jeff Downs
e6cfd8fffb Move decode_significance_x86() and decode_significance_8x8_x86() to
i386-specific file from cabac.h.
New file is h264-oriented and only included from h264.c
Resolves compilation when configured with --disable-optimizations due to
decode_significance_8x8_x86 using last_coeff_flag_offset_8x8, which is
only defined in h264.c

Originally committed as revision 12838 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-15 13:51:41 +00:00
Luca Barbato
3fbe711832 Eliminate movdqu in vp3dsp_sse2, patch from Alexander Strange astrangeAtithinkswDoTcom
Originally committed as revision 12824 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-14 20:54:23 +00:00
Alexander Strange
54a0b6e590 Add a header file to declare Xvid IDCT functions.
patch by Alexander Strange, astrange ithinksw com

Originally committed as revision 12794 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-12 16:54:36 +00:00
Loren Merritt
96275520a3 Fix H.264 interframe decoding when compiling with icc. Patch by Loren
Merritt:

"It seems that icc copies the constants from their global var onto the
stack, at which point they're not aligned, hence the crash.
[This change] really shouldn't mean anything different, but maybe it'll
confuse icc into not performing that 'optimization'."

Originally committed as revision 12772 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-08 23:49:34 +00:00
Loren Merritt
ce53144bac h264 chroma mc ssse3
width8: 180->92, width4: 78->63 cycles (core2)

Originally committed as revision 12661 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-04-01 04:51:28 +00:00
Diego Biurrun
04932b0d97 cosmetics: typo fixes
Originally committed as revision 12554 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-03-22 16:46:36 +00:00
Zuxy Meng
9e8e6d318c Add missed call to ff_cavsdsp_init_3dnow() in dsputil_init_mmx()
Originally committed as revision 12540 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-03-21 12:36:49 +00:00
Michael Niedermayer
943032b155 Hardcode register to prevent aparent miscompilation.
Fixes regression tests with gcc 2.95.

Originally committed as revision 12512 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-03-20 14:24:29 +00:00
Michael Niedermayer
dea00a4623 remove unused temp
Originally committed as revision 12511 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-03-20 14:09:31 +00:00
Måns Rullgård
b55aa9a904 get register names from x86_cpu.h
Originally committed as revision 12482 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-03-17 23:08:19 +00:00
Aurelien Jacobs
5a6a9e78ab move draw_edges() into dsputil
Originally committed as revision 12309 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-03-04 00:07:41 +00:00
Aurelien Jacobs
97d1d009e2 split encoding part of dsputil_mmx into its own file
Originally committed as revision 12223 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-25 23:14:22 +00:00
Reimar Döffinger
f2217d6f90 __asm __volatile -> asm volatile part 2
Originally committed as revision 12189 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-24 14:47:42 +00:00
Reimar Döffinger
78d3d94f14 __asm __volatile -> asm volatile, improves code consistency and works
(as far as that is possible) with the Sun C compiler.

Originally committed as revision 12188 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-24 14:46:22 +00:00
Loren Merritt
4a9ca0a279 simd and unroll png_filter_row
cycles per 1000 pixels on core2:
left: 9211->5170
top: 9283->2138
avg: 12215->7611
paeth: 64024->17360
overall rgb png decoding speed: +45%
overall greyscale png decoding speed: +6%

Originally committed as revision 12164 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-21 07:10:46 +00:00
Michael Niedermayer
1435e4ccde Disabling all SSE* code for old gcc to avoid alignment issues.
Originally committed as revision 12163 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-21 00:06:07 +00:00
Reimar Döffinger
754bf3d8a1 Fix warnings:
i386/vp3dsp_sse2.c:805: warning: cast discards qualifiers from pointer target type
i386/vp3dsp_sse2.c:806: warning: cast discards qualifiers from pointer target type

Originally committed as revision 12150 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-19 21:55:13 +00:00
Diego Biurrun
5edac5dc94 cosmetics: Replace // by /* */ comments.
sync with upstream libmpeg2 0.4.1

Originally committed as revision 11915 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-13 01:18:12 +00:00
Loren Merritt
ec199cc94c asm argument that might be in memory needs a size
Originally committed as revision 11890 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-10 01:45:42 +00:00
Loren Merritt
2c70770e33 use fewer registers in apply_welch_window_sse2
Originally committed as revision 11882 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-09 05:29:47 +00:00
Loren Merritt
1d67b037f7 sse2 h264 motion compensation. not new code, just separate out the cases that didn't need ssse3.
Originally committed as revision 11877 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 12:32:31 +00:00
Loren Merritt
20d565be6d put loop counter in a register if possible. makes some of the qpel functions 3% faster.
Originally committed as revision 11876 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 04:44:21 +00:00
Loren Merritt
7080ec2937 fix aliasing warnings. simpler too.
Originally committed as revision 11875 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 04:14:07 +00:00
Loren Merritt
a2b7bc8e71 constant was excessively aligned
Originally committed as revision 11874 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-06 03:51:53 +00:00
Loren Merritt
ddf969705f ssse3 h264 motion compensation.
25% faster tham mmx on core2, 35% if you discount fullpel, 4% overall decoding.

Originally committed as revision 11871 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-05 11:22:55 +00:00
Loren Merritt
b64dfbb8d2 add qpel rounder once during hv rather than twice during hv and whatever it's averaged with
Originally committed as revision 11870 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-05 03:58:13 +00:00
Loren Merritt
fa9b873e08 clean up an ugliness introduced in r11826. this syntax will require fewer changes when adding future sse2 code.
Originally committed as revision 11868 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-05 01:16:48 +00:00
Michael Niedermayer
9a7871f77d Deprecate old and inefficient per instruction asm().
Originally committed as revision 11865 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-04 20:03:08 +00:00
Loren Merritt
b2f775860b reduce code duplication
Originally committed as revision 11863 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-04 16:20:46 +00:00
Loren Merritt
b313e8159c avg_pixels4_mmx2
Originally committed as revision 11829 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 17:04:33 +00:00
Loren Merritt
6c01d0069d use mmx2/3dnow avg functions in avg_qpel*_mc00
Originally committed as revision 11828 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 16:19:37 +00:00
Loren Merritt
ed5d7a531c ff_h264_idct8_add_sse2.
compared to mmx, 217->126 cycles on core2, 262->220 on k8.

Originally committed as revision 11826 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 07:05:11 +00:00
Loren Merritt
51f0ac65df remove some movq in ff_h264_idct8_add_mmx. 225->217 cycles on core2.
Originally committed as revision 11825 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-02-03 03:21:47 +00:00
Baptiste Coudurier
066e0cc50d add parenthesis, fix warning: i386/dsputil_mmx.c:2618: warning: suggest parentheses around arithmetic in operand of |
Originally committed as revision 11673 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-30 23:54:59 +00:00
Baptiste Coudurier
afa4778989 fix prototypes, remove warning: i386/dsputil_mmx.c:3594: warning: assignment from incompatible pointer type
Originally committed as revision 11672 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-30 23:52:42 +00:00
Reimar Döffinger
766324fc49 Add and use DECLARE_ASM_CONST for constants used in assembler code.
Should make it easier to work around compilation problems with e.g. ICC.

Originally committed as revision 11641 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 20:50:16 +00:00
Reimar Döffinger
038f0f9b34 Use DECLARE_ALIGNED in yet another place
Originally committed as revision 11640 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 20:45:49 +00:00
Reimar Döffinger
6a1a2fa0fe Use DECLARE_ALIGNED and remove unneeded attribute_used
Originally committed as revision 11639 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 19:59:11 +00:00
Reimar Döffinger
27215c6bf4 Use DECLARE_ALIGNED
Originally committed as revision 11630 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-27 14:46:56 +00:00
Diego Biurrun
426d18b85b Rename illegal identifiers, _ followed by capital is reserved for the system.
Originally committed as revision 11541 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-16 21:21:30 +00:00
Christophe Gisquet
28748a9128 Factorize some duplicated code from CAVS and H.264 into a common file.
patch by Christophe Gisquet, christophe.gisquet free fr

Originally committed as revision 11504 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-11 08:29:58 +00:00
Christophe Gisquet
ae904fd028 Fix issue #301:
summary of changes:
- Use MANGLE when loading some constants into MMX registers.
- Convert those constants to non-static and thus add ff_ prefix.
- Remove last parameter of MSPEL_FILTER13_CORE (was constant).
- Use of "+r" instead of stricter but unnecessary "+g".
- Use of REG_c and direct loading of some of the above.
patch by Christophe GISQUET, christophe.gisquet free fr
Subject: [FFmpeg-devel] [PATCH] Roundup issue #301
Date: Fri, 28 Dec 2007 19:22:18 +0100

Originally committed as revision 11376 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-01-02 19:24:42 +00:00
Christophe Gisquet
9fa3572903 add MMX version for put_no_rnd_h264_chroma_mc8_c, used in VC-1 decoding.
patch by Christophe GISQUET %christophe P gisquet A free P fr%
original thread:
date: Nov 25, 2007 12:35 AM
subject: Re: [FFmpeg-devel] MMX version for put_no_rnd_h264_chroma_mc8_c

Originally committed as revision 11298 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-21 23:11:22 +00:00
Diego Biurrun
9fbd14acb8 Fix typo in macro name: WARPER8_16_SQ --> WRAPPER8_16_SQ.
Originally committed as revision 11296 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-21 12:38:41 +00:00
Aurelien Jacobs
407c50a024 move FLAC mmx dsp to its own file
Originally committed as revision 11244 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-16 22:20:47 +00:00
Reimar Döffinger
15c57ced2f Add 'l' suffix where it is necessary because type can not always be
inferred from arguments. Fixes compilation with Intel compiler

Originally committed as revision 11227 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-15 23:08:15 +00:00
Aurelien Jacobs
1b77e87756 add required include to make this file self-contained
Originally committed as revision 11211 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-12 22:45:03 +00:00
Diego Biurrun
571bf37f6d typo/clarification
Originally committed as revision 11201 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-11 18:47:04 +00:00
Diego Biurrun
56cc85a068 Misc spelling fixes, prefer American over British English.
Originally committed as revision 11126 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-02 15:43:08 +00:00
Vitor Sessak
52b541ad79 spelling
Originally committed as revision 11122 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-12-01 22:21:04 +00:00
Aurelien Jacobs
bb6cc730e5 remove some unused ff_p* vars from dsputil
Originally committed as revision 11106 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:57:26 +00:00
Aurelien Jacobs
dbb5fdbdc8 remove useless #ifdef around extern declaration
Originally committed as revision 11105 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:56:15 +00:00
Aurelien Jacobs
7c35b55111 cosmetics: indentation
Originally committed as revision 11104 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:54:46 +00:00
Aurelien Jacobs
51ac88228a convert some #ifdef CONFIG_ to if(ENABLE_
Originally committed as revision 11103 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:54:18 +00:00
Aurelien Jacobs
5b67ce2afa build vc1dsp_mmx.c in its own compilation unit
Originally committed as revision 11102 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:42:55 +00:00
Aurelien Jacobs
43de50659b use ff_ prefix for extern vars
Originally committed as revision 11101 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:36:15 +00:00
Aurelien Jacobs
182f56cb0c make ff_p* vars extern so that they can be used in various *_mmx.c files
Originally committed as revision 11100 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-27 22:23:34 +00:00
Christophe Gisquet
ac40ce4244 Typo fix. Previous version had some picture error building up until next keyframe.
Now MMX version decodes 1:1 what the C version does
patch by Christophe GISQUET %christophe P gisquet A free P fr%

Originally committed as revision 11090 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-25 09:43:35 +00:00
Christophe Gisquet
d3a9c44e1d Strip debug stuff from vc1dsp_mmx.c, patch by Christophe GISQUET %hristophe P gisquet A free P fr%
Original thread:
date: Nov 24, 2007 3:09 PM
subject: [FFmpeg-devel] [PATCH] Strip debug stuff from vc1dsp_mmx.c

Originally committed as revision 11088 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-24 14:34:25 +00:00
Christophe Gisquet
82821c913b add VC-1 MMX DSP functions, under MIT license.
patch by Christophe GISQUET %christophe P gisquet A free P fr%
original thread:
date: Jul 7, 2007 12:52 PM
subject: [FFmpeg-devel] [PATCH] VC-1 MMX DSP functions

Originally committed as revision 11074 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-21 22:41:31 +00:00
Michael Niedermayer
02d361919a tring to workaround gcc 2.95 bug which causes random failures
Originally committed as revision 11003 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-11-12 02:04:01 +00:00
Diego Biurrun
deb43f0bd8 Explain why there are no multiple inclusion guards in these header files.
Originally committed as revision 10771 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 22:29:13 +00:00
Aurelien Jacobs
ab54bff20a Remove wrong multiple inclusion guards.
Those files are really meant to be included several times.

Originally committed as revision 10766 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 11:19:42 +00:00
Diego Biurrun
5b21bdabe4 Add FFMPEG_ prefix to all multiple inclusion guards.
Originally committed as revision 10765 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 09:37:46 +00:00
Diego Biurrun
31b2c1446f Add missing multiple inclusion guards.
Originally committed as revision 10763 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-17 09:31:06 +00:00
Shane
bdb273564c Fix intended order of operations for 4 assert() checks.
Patch by Shane, gnome42 T gmail O com

Originally committed as revision 10711 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-10-11 22:18:38 +00:00
Loren Merritt
6810b93a81 sse2 version of compute_autocorr().
4x faster than c (somehow, even though doubles only allow 2x simd).
overal flac encoding: 15-50% faster on core2, 4-11% on k8, 3-13% on p4.

Originally committed as revision 10621 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-09-29 22:31:18 +00:00
Reimar Döffinger
eafa1c90e5 Replace complicated and currently broken manual alignment code by
DECLARE_ALIGNED_16. Fixes crash in ff_snow_horizontal_compose97i_sse2

Originally committed as revision 10261 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-30 11:41:58 +00:00
Michael Niedermayer
267b9479ac typo
Originally committed as revision 10250 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-27 10:39:31 +00:00
Ramiro Polla
7bcc1d5b66 CONFIG_7REGS has been renamed to HAVE_7REGS
Originally committed as revision 10237 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 16:10:53 +00:00
Michael Niedermayer
90e9e94d0f workaround gcc bug, untested as my gcc is not complaining
Originally committed as revision 10236 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 12:34:22 +00:00
Michael Niedermayer
cefa599953 optimize the first vertical lifting step, this also prevents another
overflow, the last known possible overflow

Originally committed as revision 10234 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 11:16:23 +00:00
Michael Niedermayer
c90762766a optimize 1st horizontal lifting step
Originally committed as revision 10231 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 08:31:55 +00:00
Michael Niedermayer
1104bf2b20 typo
Originally committed as revision 10230 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 08:03:23 +00:00
Michael Niedermayer
8b5029295d get rid of totally senseless "m" + read in register we have enough
registers to keep everything in registers

Originally committed as revision 10229 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 06:51:51 +00:00
Michael Niedermayer
bc1e78d896 simplify senselessly complex addressing
Originally committed as revision 10228 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 02:02:14 +00:00
Michael Niedermayer
25bb359f73 cosmetics
remove brain amputated mmx wrappers around sse2 macros
fix name of ..._sub macro to match ..._add naming

Originally committed as revision 10227 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 01:20:09 +00:00
Michael Niedermayer
629750290f avoid overflow in the 3rd lifting step, this now needs mmx2 at minimum
(patch for plain mmx support is welcome ...)

Originally committed as revision 10226 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 01:11:02 +00:00
Michael Niedermayer
b696a4c91d avoid an overflow in the 1 horizontal lifting step
Originally committed as revision 10225 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 19:04:33 +00:00
Michael Niedermayer
9caa1cccb9 prevent one overflow in the first vertical lifting step
Originally committed as revision 10224 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 16:28:45 +00:00
Michael Niedermayer
3e0f7126b5 update mmx code to latest snow changes
note, the code likely can overflow and thus needs some more changes
sse2 updated too but disabled as it is untested

Originally committed as revision 10223 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 15:20:56 +00:00
Michael Niedermayer
d593e32983 use 16bit IDWT (a SIMD implementation of it should be >2x faster then with
the old 32bit code)
disable mmx/sse2 optimizations as they need a rewrite now

Originally committed as revision 10218 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 03:00:51 +00:00
Michael Niedermayer
ce611a27be Change rounding of the horizontal DWT to match the vertical one.
This allows some simplifications and optimizations and should
not have any effect on quality.

Originally committed as revision 10172 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 16:29:40 +00:00
Michael Niedermayer
30cd3e66bd remove code which become unused by the previous changes
Originally committed as revision 10166 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:05:30 +00:00
Michael Niedermayer
72dee89b5d Simplify and optimize the 4th vertical lifting step of the SSE2 code (untested)
This also reduces the needed headroom in that step by 1 bit

Originally committed as revision 10165 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:03:18 +00:00
Michael Niedermayer
d0dae46a90 Simplify and optimize the 4th vertical lifting step of the MMX code
This also reduces the needed headroom in that step by 1 bit

Originally committed as revision 10164 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:02:19 +00:00
Michael Niedermayer
1ffbbef217 Simplify and speedup code, reduce needed headroom by 2 bits in the 3rd
vertical lifting step of the SSE2 code (untested)

Originally committed as revision 10163 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:59:20 +00:00
Michael Niedermayer
4bf1790421 simplify, speedup and reduce needed headroom by 2 bits in the 3rd
vertical lifting step

Originally committed as revision 10162 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:54:49 +00:00
Michael Niedermayer
dd30437bbe replace <<1 by add for SSE2 (untested)
Originally committed as revision 10161 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:11:05 +00:00