openh264

Author	SHA1	Message	Date
Sindre Aamås	1e83bec860	[UT] Add some missing quantization tests	2016-04-13 11:56:44 +02:00
Sindre Aamås	abaf3a4104	[UT] Reduce duplication in quantization tests	2016-04-13 08:59:16 +02:00
Sindre Aamås	48a520915a	[Encoder/x86] Add AVX2 SATD routines WelsSampleSatd16x16_avx2 (~2.31x speedup over SSE4.1 on Haswell). WelsSampleSatd16x8_avx2 (~2.19x speedup over SSE4.1 on Haswell). WelsSampleSatd8x16_avx2 (~1.68x speedup over SSE4.1 on Haswell). WelsSampleSatd8x8_avx2 (~1.53x speedup over SSE4.1 on Haswell).	2016-03-08 11:31:17 +01:00
Gregory J. Wolfe	03890fe86f	Added support for "video signal type present" information. The "Video signal type present" information is written to the output video file when it is created, and later is used by the decoder to properly decode the compressed video data. The saved attributes are: - format type (PAL, NTSC, etc.) - color primaries (BT709, SMPTE170M, etc.) - transfer characteristics (BT709, SMPTE170M, etc.) - color matrix ((BT709, SMPTE170M, etc.) These modifications allow the client to specify these attributes and, if specified, makes sure they are written to the output file.	2016-02-24 10:33:18 -05:00
Gregory J. Wolfe	c7fcba06c7	Added support for "video signal type present" information. The "Video signal type present" information is written to the output video file when it is created, and later is used by the decoder to properly decode the compressed video data. The saved attributes are: - format type (PAL, NTSC, etc.) - color primaries (BT709, SMPTE170M, etc.) - transfer characteristics (BT709, SMPTE170M, etc.) - color matrix ((BT709, SMPTE170M, etc.) These modifications allow the client to specify these attributes and, if specified, makes sure they are written to the output file.	2016-02-23 13:21:06 -05:00
sijchen	aaa25160ec	Merge pull request #2353 from saamas/encoder-x86-dct-opt2 [Encoder] x86 DCT optimizations	2016-02-08 15:00:12 -08:00
Sindre Aamås	c8c74903f8	[Encoder] Add single-block AVX2 4x4 DCT/IDCT routines We do four blocks at a time when possible, but need to handle single blocks at a time for intra prediction. ~3.15x speedup over MMX for the DCT on Haswell. ~2.94x speedup over MMX for the IDCT on Haswell. Returns diminish with increasing vector length because a larger proportion of the time is spent on load/store/shuffling.	2016-02-02 17:22:49 +01:00
Sindre Aamås	f90960983c	[Encoder] Add single-block SSE2 4x4 DCT/IDCT routines We do four blocks at a time when possible, but need to handle single blocks at a time for intra prediction. ~2.31x speedup over MMX for the DCT on Haswell. ~1.92x speedup over MMX for the IDCT on Haswell.	2016-02-02 17:22:48 +01:00
unknown	3873addc3d	fix frame size constraints for width and height	2016-02-01 15:55:53 +08:00
Sindre Aamås	cc8d541432	[UT] Utilize DCT function pointer typedefs	2016-01-19 22:00:24 +01:00
Sindre Aamås	a45c10cf91	[UT] Only run AVX2 tests if host supports AVX2	2016-01-19 14:27:46 +01:00
Sindre Aamås	3088d96978	[Encoder] Add an AVX2 4x4 IDCT implementation ~2.03x faster on Haswell as compared to the SSE2 version.	2016-01-19 13:12:28 +01:00
Sindre Aamås	b267163f10	[Encoder] Add an AVX2 4x4 DCT implementation ~2.52x faster on Haswell as compared to the SSE2 version.	2016-01-19 13:12:28 +01:00
Sindre Aamås	b9adbcf37c	[UT] Add missing SSE2 4x4 IDCT test IDCT input is defined in such a way that the intermediate values cannot legally overflow an int16_t. The use of random values as input causes such overflows. This results in implementation- dependent output depending on which type is used to hold intermediate results. Use a template for the test reference implementation to test implementations with different intermediate representation.	2016-01-19 13:12:28 +01:00
Sindre Aamås	8764231784	[UT] Improve DCT tests Initialize input arrays with different random values. Otherwise, the input to the DCT routines is effectively all zero values after taking the difference. Reduce duplication.	2016-01-19 13:12:28 +01:00
sijchen	aeb5ab4b99	[Encoder] put the logic related to multiple D layer into a class for better structure	2015-11-11 22:55:16 -08:00
sijchen	33c378f7b7	change API for slicing part for easier usage (the UseLoadBalancing flag is still under working)	2015-11-10 09:50:06 -08:00
Sijia Chen	819f6f5d93	[Encoder] add encoder tasks and task-management class https://rbcommons.com/s/OpenH264/r/1334/	2015-10-19 22:48:28 -07:00
karina li	2c830e64d7	exception case for width or height is less than 16	2015-09-08 17:21:56 +08:00
Guangwei Wang	e42ce60cc9	add UT for sub8x8 modes assembly functions	2015-07-30 10:02:32 +08:00
Martin Storsjö	78e0ec6130	Convert tabs to spaces before comments	2015-06-10 10:22:29 +03:00
Martin Storsjö	764793d74b	Remove tabs in struct and class definitions	2015-06-10 10:22:01 +03:00
Martin Storsjö	ca51ee0f44	Remove tabs where a simple space is just enough	2015-06-10 10:21:52 +03:00
Martin Storsjö	51efa57a3d	Convert tabs to spaces in vertically aligned code	2015-06-10 10:21:29 +03:00
Martin Storsjö	723044837a	Convert tabs to spaces in defines	2015-06-10 10:21:25 +03:00
Martin Storsjö	ebbcb67fb7	Convert tabs to spaces in assignment of SIMD function pointers	2015-06-03 15:39:30 +03:00
Martin Storsjö	0298b3f580	Initialize enough samples in the new 4x8 tests This fixes valgrind warnings about tests using uninitialized data.	2015-06-03 09:45:06 +03:00
huili2	f76325edc7	Merge pull request #1973 from huili2/sub8 modify some functions extending to sub8x8 usage, especially in ME part	2015-06-02 14:44:06 +08:00
huili2	c3cfce5223	modify some functions extending to sub8x8 usage, especially in ME part	2015-06-02 13:39:38 +08:00
sijchen	5588e82fce	Merge pull request #1961 from mstorsjo/fix-warnings Remove a redundant check of this!=NULL	2015-06-01 10:42:56 +08:00
Martin Storsjö	1239bb24ba	Remove a redundant check of this!=NULL 'this' can't be NULL in well-defined C++ code. This fixes a warning with clang 3.6 from Xcode 6.3.	2015-05-27 11:46:53 +03:00
Sijia Chen	9442a7a0b5	add parameter checking on resolution and related UT	2015-05-26 15:41:47 +08:00
Martin Storsjö	b90eca78cd	Avoid endian assumptions in FillQpelLocationByFeatureValue_c These values are read as two separate 16 bit integers from an array in the FeatureSearchOne function, therefore we should also store them in a well-defined order. This fixes encoding of screen content on big endian; now the full testsuite passes on big endian.	2015-05-15 13:11:23 +03:00
Martin Storsjö	7a80c21526	Reformat tables without tabs	2015-05-13 22:06:58 +03:00
Haibo Zhu	61b82d28c4	Add framerate & spatialbitrate comparison for encoder UT	2015-05-05 18:53:50 -07:00
Martin Storsjö	8d34c68ad6	Add a missing newline at the end of a file Some tools (like git) complain if a file lacks a newline at the end of a file, and some editors will automatically readd it when editing such files.	2015-05-04 12:46:48 +03:00
Sijia Chen	1922b533f6	change the range of frame rate from 30 to 60	2015-04-16 12:45:43 +08:00
ruil2	cce966fbba	update bGapsInFrameNumValueAllowedFlag according to parameters setting	2015-03-18 13:44:03 +08:00
ruil2	7d055cae94	Merge pull request #1786 from sijchen/fix_over improve error logging in UT	2015-02-06 12:17:56 +08:00
sijchen	5fdd01ec0c	Merge pull request #1787 from mstorsjo/remove-stray-semicolon Remove accidental double semicolons	2015-02-02 18:15:02 +08:00
sijchen	e7a7a35611	Merge pull request #1779 from mstorsjo/share-memalign Move the memory allocation/deallocation routines to the common library	2015-02-02 18:14:55 +08:00
Martin Storsjö	a3063531c4	Remove accidental double semicolons	2015-02-02 09:20:35 +02:00
Martin Storsjö	4394d22b70	Set m_iPicResSize in PrepareOneSrcFrame If the calling test hasn't set m_iPicResSize, it is set to the maximum frame size, which takes much longer to initialize than the current actual frame size. This reduces the runtime of EncoderInterfaceTest.SkipFrameCheck in valgrind from 229 seconds to 8 seconds, and the total runtime of all the test cases in EncoderInterfaceTest from 405 seconds to 89 seconds.	2015-01-30 16:02:40 +02:00
Martin Storsjö	2356eb1ff6	Move the memory allocation/deallocation routines to the common library They are still used slightly differently in the encoder and decoder; the decoder uses plain functions while the encoder uses one object keeping track of the number of allocated bytes, and keeping track of the requested alignment.	2015-01-30 11:30:59 +02:00
Sijia Chen	3c085aec79	improve error logging in UT	2015-01-30 15:43:20 +08:00
Martin Storsjö	76b0f8c17a	Avoid writing outside of the target rect in MCHalfPelFilterAnchor This simplifies comparison between the output of MCHalfPelFilterAnchor and the actual MC function implementations.	2015-01-29 08:51:02 +02:00
dongzha	4f4adcec49	Merge pull request #1772 from mstorsjo/mc-unify Move the MC implementation to the common library	2015-01-29 09:28:57 +08:00
Martin Storsjö	605f2fb858	Remove the now unnecessary mc_test_common.h	2015-01-28 23:59:41 +02:00
Martin Storsjö	f03edd79c2	Merge all MC unit tests into one file Since both encoder and decoder use the same MC implementation now, one set of tests is enough.	2015-01-28 23:59:37 +02:00
Martin Storsjö	9a0663620a	Move the MC routines to the common library Use the decoder versions of the functions (which are capable of handling widths 4/8/16 for luma, not only 16 as in the encoder). By using the more generic versions, there may be a small performance loss since the functions need to check the width in every call. Actual measurements show that the actual change is very small (and the shared routines turn out to actually be faster than the existing ones in ARM NEON setups).	2015-01-28 15:32:46 +02:00

1 2 3 4

195 Commits