ffmpeg

Author	SHA1	Message	Date
Anton Khirnov	759001c534	lavc decoders: work with refcounted frames.	2013-03-08 07:38:30 +01:00
Ronald S. Bultje	7ebfb466ae	h264: Don't store intra pcm samples in h->mb Instead, keep them in the bitstream buffer until we read them verbatim, this saves a memcpy() and a subsequent clearing of the target buffer. decode_cabac+decode_mb for a sample file (CAPM3_Sony_D.jsv) goes from 6121.4 to 6095.5 cycles, i.e. 26 cycles faster. Signed-off-by: Martin Storsjö <martin@martin.st>	2013-02-19 22:34:14 +02:00
Anton Khirnov	2c54155407	h264: deMpegEncContextize Most of the changes are just trivial replacements of fields from MpegEncContext with equivalent fields in H264Context. Everything in h264* other than h264.c are those trivial changes. The nontrivial parts are: 1) extracting a simplified version of the frame management code from mpegvideo.c. We don't need last/next_picture anymore, since h264 uses its own more complex system already and those were set only to appease the mpegvideo parts. 2) some tables that need to be allocated/freed in appropriate places. 3) hwaccels -- mostly trivial replacements. for dxva, the draw_horiz_band() call is moved from ff_dxva2_common_end_frame() to per-codec end_frame() callbacks, because it's now different for h264 and MpegEncContext-based decoders. 4) svq3 -- it does not use h264 complex reference system, so I just added some very simplistic frame management instead and dropped the use of ff_h264_frame_start(). Because of this I also had to move some initialization code to svq3. Additional fixes for chroma format and bit depth changes by Janne Grunau <janne-libav@jannau.net> Signed-off-by: Anton Khirnov <anton@khirnov.net>	2013-02-15 16:35:16 +01:00
Diego Biurrun	88bd7fdc82	Drop DCTELEM typedef It does not help as an abstraction and adds dsputil dependencies. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2013-01-22 18:32:56 -08:00
Ronald S. Bultje	ddd7559ad9	h264: check for invalid zeros_left before writing Prevent an invalid write into coeffs[scantable[-1]] if zeros_left itself was an invalid VLC code (and thus -1). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2012-12-08 17:04:22 +01:00
Mans Rullgard	c4cccc8d3f	h264: fix invalid pointer arithmetic Subtracting a (positive) value from the address of an array violates C99 section 6.5.6: If both the pointer operand and the result point to elements of the same array object, or one past the last element of the array object, the evaluation shall not produce an overflow; otherwise, the behavior is undefined. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-10-27 17:02:46 +01:00
Ronald S. Bultje	f6f7d15041	h264: don't touch H264Context->ref_count[] during MB decoding The variable is copied to subsequent threads at the same time, so this may cause wrong ref_count[] values to be copied to subsequent threads. This bug was found using TSAN. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2012-10-05 02:49:45 +02:00
Diego Biurrun	0becb07842	h264: Factorize declaration of mb_sizes array.	2012-04-05 17:17:22 +02:00
Ronald S. Bultje	45b7bd7c53	h264: disallow constrained intra prediction modes for luma. Conversion of the luma intra prediction mode to one of the constrained ("alzheimer") ones can happen by crafting special bitstreams, causing a crash because we'll call a NULL function pointer for 16x16 block intra prediction, since constrained intra prediction functions are only implemented for chroma (8x8 blocks). Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind CC: libav-stable@libav.org	2012-02-09 22:57:01 -08:00
Alex Converse	7181c4edee	cosmetics: Remove extra newlines at EOF	2012-01-27 17:19:09 -08:00
Diego Biurrun	58c42af722	doxygen: misc consistency, spelling and wording fixes	2011-12-12 23:06:23 +01:00
Baptiste Coudurier	76741b0e56	h264: 4:2:2 intra decoding support Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2011-10-21 01:00:41 -07:00
Mans Rullgard	8babfc033e	h264: fix invalid shifts in init_cavlc_level_tab() The level_code expression includes a shift which is invalid in those cases where the value is not used. Moving the calculation to the branch where the result is used avoids these. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-10-11 15:00:56 +01:00
Diego Biurrun	657ccb5ac7	Eliminate FF_COMMON_FRAME macro. FF_COMMON_FRAME holds the contents of the AVFrame structure and is also copied to struct Picture. Replace by an embedded AVFrame structure in struct Picture.	2011-07-11 00:19:00 +02:00
Jason Garrett-Glaser	3b7ebeb4d5	H.264: faster write_back_* Avoid aliasing, unroll loops, and inline more functions.	2011-07-03 15:05:55 -07:00
Jason Garrett-Glaser	c90b94424c	4:4:4 H.264 decoding support Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.	2011-06-13 21:16:30 -07:00
Jason Garrett-Glaser	504811baea	Roll back 4:4:4 H.264 for now Needs some ARM/PPC asm modifications.	2011-06-13 13:38:46 -07:00
Jason Garrett-Glaser	c9c493872c	4:4:4 H.264 decoding support Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.	2011-06-13 12:21:39 -07:00
Oskar Arvidsson	fcc0224e4f	Add support for higher QP values in h264. In high bit depth, the QP values may now be up to (51 + 6*(bit_depth-8)). Preparatory patch for high bit depth h264 decoding support. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2011-05-10 07:24:35 -04:00
Oskar Arvidsson	6e3ef511d7	Add the notion of pixel size in h264 related functions. In high bit depth the pixels will not be stored in uint8_t like in the normal case, but in uint16_t. The pixel size is thus 1 in normal bit depth and 2 in high bit depth. Preparatory patch for high bit depth h264 decoding support. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2011-05-10 07:24:33 -04:00
Stefano Sabatini	975a1447f7	Replace deprecated FF_*_TYPE symbols with AV_PICTURE_TYPE_*. Signed-off-by: Diego Biurrun <diego@biurrun.de>	2011-05-02 12:18:44 +02:00
Stefano Sabatini	6209669de4	Replace deprecated av_get_pict_type_char() with av_get_picture_type_char(). Signed-off-by: Diego Biurrun <diego@biurrun.de>	2011-05-02 11:24:45 +02:00
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-03-19 13:33:20 +00:00
Ronald S. Bultje	66c6b5e2a5	Revert `2a1f431d38`, it broke H264 lossless.	2011-01-20 17:24:44 -05:00
Ronald S. Bultje	8bcfe7f7fd	Set gray (128) U/V planes for chroma-less samples. Fixes two fate samples when played with -flags emu_edge.	2011-01-20 17:24:44 -05:00
Jason Garrett-Glaser	2a1f431d38	H.264/SVQ3: make chroma DC work the same way as luma DC No speed improvement, but necessary for some future stuff. Also opens up the possibility of asm chroma dc idct/dequant. Originally committed as revision 26349 to svn://svn.ffmpeg.org/ffmpeg/trunk	2011-01-15 01:10:46 +00:00
Jason Garrett-Glaser	5657d14094	H.264: switch to x264-style tracking of luma/chroma DC NNZ Useful so that we don't have to run the hierarchical DC iDCT if there aren't any coefficients. Opens up some future opportunities for optimization as well. Originally committed as revision 26337 to svn://svn.ffmpeg.org/ffmpeg/trunk	2011-01-14 21:36:16 +00:00
Jason Garrett-Glaser	19fb234e4a	H.264: split luma dc idct out and implement MMX/SSE2 versions About 2.5x the speed. NOTE: the way that the asm code handles large qmuls is a bit suboptimal. If x264-style dequant was used (separate shift and qmul values), it might be possible to get some extra speed. Originally committed as revision 26336 to svn://svn.ffmpeg.org/ffmpeg/trunk	2011-01-14 21:34:25 +00:00
Jason Garrett-Glaser	b70c95e05a	H.264: 8% faster CAVLC zero-run decoding Originally committed as revision 24736 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-08-07 12:30:44 +00:00
Diego Biurrun	ba87f0801d	Remove explicit filename from Doxygen @file commands. Passing an explicit filename to this command is only necessary if the documentation in the @file block refers to a file different from the one the block resides in. Originally committed as revision 22921 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-04-20 14:45:34 +00:00
Michael Niedermayer	9885284c22	Check level_prefix a bit (this just checks the max our bitreader can handle, as I didn't find a limit in the spec) This should stop cavlc_decode_residual() on a zero bitstream Originally committed as revision 22429 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-03-10 09:55:03 +00:00
Michael Niedermayer	8897b247a5	Remove some unneeded fill_rectangle() for 16x16 blocks. Originally committed as revision 22124 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-02-28 23:54:24 +00:00
Michael Niedermayer	69cc31832f	Move check for and call of predict_field_decoding_flag() from the mb code to the row code. This function would only be needed on a MB basis for MBAFF+FMO Originally committed as revision 21860 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-02-17 02:14:02 +00:00
Michael Niedermayer	c1bb66ac19	Split setting neighboring MBs from fill_decode_caches() no speed change. Originally committed as revision 21842 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-02-15 22:07:02 +00:00
Michael Niedermayer	996b099a0f	Branchless setting of MB_TYPE_8x8DCT. Not benchmarked as i failed to find a sample that uses this one. But it should be faster. Originally committed as revision 21435 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 20:54:09 +00:00
Michael Niedermayer	81afcf1fae	Remove cruft. Originally committed as revision 21434 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 20:52:49 +00:00
Michael Niedermayer	449d1442a6	a[b-1] -> (a-1)[b]. Helps gcc not to add separate -1 instructions. Originally committed as revision 21432 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 18:42:22 +00:00
Michael Niedermayer	7abc860323	Optimize suffix_length computation, 1 cpu cycle speedup. Originally committed as revision 21431 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 18:23:46 +00:00
Michael Niedermayer	eeb1e92feb	Simplify suffix_length computation, same speed. Originally committed as revision 21430 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 18:18:08 +00:00
Michael Niedermayer	c78295ad1b	Optimize level_code computation, 6cpu cycles speedup. Originally committed as revision 21428 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 18:17:01 +00:00
Michael Niedermayer	8ba436171f	1 cpu cycle faster suffix_length calculation. Originally committed as revision 21425 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-24 18:05:02 +00:00
Michael Niedermayer	1f445f5473	Move dquant check into qscale overflow check. This should be faster (couldn't measure a difference), and it's less picky on slightly out of spec dquant. Originally committed as revision 21373 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-21 21:01:26 +00:00
Michael Niedermayer	87df989ee3	Merge multiple IS_* macro uses where possible. Originally committed as revision 21340 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-20 01:15:30 +00:00
Michael Niedermayer	2b3649f656	Fix compilation with -O0. Originally committed as revision 21308 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-18 23:41:12 +00:00
Michael Niedermayer	439d6b1dcf	filter_mb_fast needs cbp_table to be set. Originally committed as revision 21290 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-18 19:45:02 +00:00
Michael Niedermayer	f432b43b08	Split fill_caches() between filter and decoder. Originally committed as revision 21271 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-17 21:43:08 +00:00
Michael Niedermayer	c988f97566	Rearchitecturing the stitched up goose part 1 Run loop filter per row instead of per MB, this also should make it much easier to switch to per frame filtering and also doing so in a separate thread in the future if some volunteer wants to try. Overall decoding speedup of 1.7% (single thread on pentium dual / cathedral sample) This change also allows some optimizations to be tried that would not have been possible before. Originally committed as revision 21270 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-17 20:35:55 +00:00
Michael Niedermayer	ddd60f28d8	Replace cabac checks in inline functions from h264.h with constants. No benchmark because it's just replacing variables with literal constants (so no risk for slowdown outside gcc silliness) and I need sleep. Originally committed as revision 21237 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-16 05:41:33 +00:00
Michael Niedermayer	8e71d89a7b	Move golomb_to_int*cbp tables back to h264_data.h as svq3.c used them. Yes i did compile&test, no svq3.c was not recompiled. Originally committed as revision 21180 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-13 02:17:16 +00:00
Michael Niedermayer	e1e949026e	Split cavlc out of h264.c. Seems to speed the code up a little... The placement of many generic functions between h264.c and h264.h is still open Currently they are a little randomly placed between them. Originally committed as revision 21178 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-01-13 01:59:19 +00:00

50 Commits
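
Two of the commits listed above (fcc0224e4f and 6e3ef511d7) describe simple arithmetic rules for high bit depth H.264: the maximum QP grows as 51 + 6*(bit_depth-8), and the pixel size is 1 byte in normal bit depth and 2 bytes in high bit depth. The following is a minimal sketch of those two rules only, not the code from the commits. The function names h264_max_qp, h264_pixel_size and sample_offset are hypothetical and do not come from the ffmpeg source.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: largest QP for a given bit depth, using the
 * formula quoted in commit fcc0224e4f: 51 + 6 * (bit_depth - 8). */
static int h264_max_qp(int bit_depth)
{
    assert(bit_depth >= 8); /* the formula is only stated for 8 bits and up */
    return 51 + 6 * (bit_depth - 8);
}

/* Hypothetical helper: bytes per sample as described in commit 6e3ef511d7,
 * 1 in normal bit depth (uint8_t) and 2 in high bit depth (uint16_t). */
static int h264_pixel_size(int bit_depth)
{
    return bit_depth > 8 ? (int)sizeof(uint16_t) : (int)sizeof(uint8_t);
}

/* Hypothetical helper: byte offset of sample (x, y) in a plane whose
 * stride is given in bytes (an assumption of this sketch). */
static size_t sample_offset(int x, int y, size_t stride_bytes, int bit_depth)
{
    return (size_t)y * stride_bytes
         + (size_t)x * (size_t)h264_pixel_size(bit_depth);
}
```

With these definitions, an 8-bit stream has a maximum QP of 51 and a 10-bit stream has a maximum QP of 63. Scaling the horizontal coordinate by the pixel size is what lets the same addressing expression serve both sample types.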