Commit Graph

8273 Commits

Author SHA1 Message Date
Jason Garrett-Glaser
b9fe706305 Simplify chroma AC in CABAC residual decoding.
Originally committed as revision 16227 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-19 00:43:30 +00:00
Michael Niedermayer
8955b66950 Optimize ctx calculation in decode_cabac_mb_mvd(), code by dark shikari.
The case for 16x16 blocks becomes 10 cpu cycles faster on pentium dual,
i could not find a speed difference in the case of subblocks though.

Originally committed as revision 16226 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-19 00:05:39 +00:00
Michael Niedermayer
17779f39b6 Remove unacceptable NULL pointer hack from mc code.
Originally committed as revision 16225 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 23:52:32 +00:00
Michael Niedermayer
04618b98e3 Check ref values in CABAC H.264 for validity.
Originally committed as revision 16224 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 23:31:10 +00:00
Michael Niedermayer
c25ac15a07 Move idct_(dc)add closer to where it is needed.
Originally committed as revision 16223 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 18:25:11 +00:00
Michael Niedermayer
aebb5d6d96 indent
Originally committed as revision 16222 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 18:14:38 +00:00
Michael Niedermayer
96465b90a1 Reorder ifs in chroma hl_decode_mb to avoid a duplicate transform_bypass
check.
14 cpu cycles speedup on Pentium Dual

Originally committed as revision 16221 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 18:12:59 +00:00
Michael Niedermayer
6456d6d87c s/h->cbp_table[mb_xy]/h->cbp/
Originally committed as revision 16220 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 17:42:44 +00:00
Michael Niedermayer
04824298a9 Faster CAVLC decoding of trailing_ones. Based on a patch by dark shikari.
decode_residual is about 3.3% faster.

Originally committed as revision 16219 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 16:11:19 +00:00
Jason Garrett-Glaser
93445d1617 Replace i by trailing_ones, part of a patch by dark shikari.
No speed change meassureable with START/STOP_TIMER, but this is needed
for future optimizations.

Originally committed as revision 16218 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 15:51:32 +00:00
Michael Niedermayer
c375d87f6f Remove if() surrounding decode_cabac_mb_type() that can never be true.
Originally committed as revision 16217 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 15:32:07 +00:00
Michael Niedermayer
c325b5054f Remove unreachable else clause, found by dark shikari.
Originally committed as revision 16216 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 15:21:15 +00:00
Michael Niedermayer
dae006d7d7 Remove useless IS_8x8DCT check i forgot, spotted by dark shikari.
Originally committed as revision 16215 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 11:17:12 +00:00
Michael Niedermayer
2a5a9c2815 10l, svq3 didnt set h->cbp, this broke decoding a little.
Originally committed as revision 16214 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 11:01:38 +00:00
Kostya Shishkov
efeb298ea0 cosmetics: remove one unneeded space
Originally committed as revision 16211 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 06:00:12 +00:00
Michael Niedermayer
1eb960352b Do not calculate idct_dc_add/idct_add when the variables are unused.
Originally committed as revision 16210 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 04:13:02 +00:00
Michael Niedermayer
62bc966f8f Remove redundant nnz variable.
Originally committed as revision 16209 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 03:04:53 +00:00
Michael Niedermayer
0a8ca22f4e indent
Originally committed as revision 16208 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 02:56:44 +00:00
Michael Niedermayer
2fd1f0e026 Use the new idct functions (except chroma as it was slower in benchmarks)
cathedral +0.5% speed
aladin +0.6% speed [note aladin has been cat-ed 10 times to reduce the influence
of init time]
Speedup also verified via START/STOP_TIMER (difference was very significant
for the changed parts)

Originally committed as revision 16207 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 02:53:18 +00:00
Michael Niedermayer
ac22385931 H.264 idct functions that include the chroma, inter luma and intra16 luma loops
thus avoiding the calling overhead.
New functions are not yet used.

Originally committed as revision 16206 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 02:36:48 +00:00
Michael Niedermayer
49c084a745 Skip non intra luma code when there is no coded luma.
0.7% speedup for the cathedral sample.

Originally committed as revision 16203 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 01:10:51 +00:00
Loren Merritt
00794ed0d9 1.6x faster pred*_vertical_add_c
Originally committed as revision 16202 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 01:05:06 +00:00
Michael Niedermayer
621561cdf3 Skip chroma handling when there is no coded chroma.
0.5% overall speedup for the cathedral sample.

Originally committed as revision 16201 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-18 00:46:54 +00:00
Kostya Shishkov
17b17c53f4 add const qualifier to some pointers for input data
Originally committed as revision 16195 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 20:17:07 +00:00
Kostya Shishkov
bb39171beb use precalculated mb_pos where available
Originally committed as revision 16194 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 20:10:15 +00:00
Kostya Shishkov
6c3fca6479 Merge deblocking pattern with CBP for RV30/40 loop filtering
Originally committed as revision 16192 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 20:00:48 +00:00
Kostya Shishkov
26bc319c1d Generate correct deblock pattern for RV30
Originally committed as revision 16191 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 19:39:56 +00:00
Kostya Shishkov
f610fc81c1 Unfortunately RV30 luma thirdpel MC functions cannot be substituted with
applying two separate filtering passes, so implement them explicitly.

Originally committed as revision 16190 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 19:37:45 +00:00
Kostya Shishkov
6a4583e925 4l: TIFF stores short strings inside tag, do not interpret it is as an offset
Fixes issue 753

Originally committed as revision 16189 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 17:39:35 +00:00
Guillaume Poirier
5f51afd602 add AltiVec implementation of int32_to_float_fmul_scalar
Originally committed as revision 16186 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 09:47:06 +00:00
Michael Niedermayer
54dab66174 Fix svq3 decoding, is_complex was not initialized.
Originally committed as revision 16184 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 03:41:48 +00:00
Michael Niedermayer
4080e67c8e Replace != 0 || check by |
3 cpu cycles faster

Originally committed as revision 16183 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 02:53:03 +00:00
Michael Niedermayer
ad9ca7e720 Split filter_mb_dir() out of filter_mb().
1% overall decoding speed up for cathedral-beta2-400extra-crop-avc.mp4
no speed change for Aladin.mpg
Benchmarks done on Pentium dual

Originally committed as revision 16182 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 02:35:14 +00:00
Måns Rullgård
6c414bb6d4 AC3: fix strict aliasing violation in parser
Originally committed as revision 16181 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 02:30:26 +00:00
Måns Rullgård
cb56b44086 AAC: fix strict aliasing violation in parser
Originally committed as revision 16180 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 02:30:22 +00:00
Måns Rullgård
a2fc0f6a6d ARM: replace "armv4l" with "arm"
Originally committed as revision 16179 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 00:54:54 +00:00
Diego Biurrun
2600f8c86d Remove obsolete and misleading comments.
Originally committed as revision 16178 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-17 00:39:45 +00:00
Michael Niedermayer
ac0623b23c Fix indention, also do a little vertical alignment of changed lines.
Originally committed as revision 16176 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 22:16:11 +00:00
Stefano Sabatini
7b1454c439 Implement av_set_string3().
Originally committed as revision 16175 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 21:35:35 +00:00
Michael Niedermayer
6120a343aa Factorize 3 multiplications out, code becomes 3 cpu cycles faster.
(not significant as thats just per MB)

Originally committed as revision 16174 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 21:08:16 +00:00
Michael Niedermayer
1dd488e955 Move ENABLE_SMALL back to the per MB check, as otherwise gcc wont remove
the code.

Originally committed as revision 16173 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 20:43:39 +00:00
Michael Niedermayer
89db0bae26 Do not recalculate constant part of is_complex for every MB.
Originally committed as revision 16172 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 20:26:22 +00:00
Michael Niedermayer
41e4055b63 Support the new lossless mode.
Originally committed as revision 16171 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 20:09:09 +00:00
Måns Rullgård
479044cea5 dct-test: add ARM IDCTs
Originally committed as revision 16170 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 20:00:07 +00:00
Måns Rullgård
875f31259b dct-test: add PARTTRANS permutation
Originally committed as revision 16169 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 19:59:50 +00:00
Michael Niedermayer
d358caed7e Horizontal and vertical prediction functions needed for new lossless mode
of H.264.

Originally committed as revision 16168 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 19:39:01 +00:00
Benoit Fouet
f683ea1cab Add const qualifiers to decode functions.
Originally committed as revision 16165 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 08:02:05 +00:00
Benoit Fouet
7d2cc552ed Fix warning "format not a string literal and no format arguments".
Originally committed as revision 16164 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 07:56:33 +00:00
David Conrad
2169b79c2a Workaround for shared libs on OS X with Apple gcc 4.0.1
Originally committed as revision 16158 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-16 00:47:59 +00:00
Laurent Aimar
ac52613c16 Check extradata is large enough.
Patch by Laurent Aimar fenrir \:/ via ecp fr

Originally committed as revision 16157 to svn://svn.ffmpeg.org/ffmpeg/trunk
2008-12-15 23:01:54 +00:00