generic-library/vpx

Author	SHA1	Message	Date
Yaowu Xu	afffa3d9b0	cleanup cpplint warnings Suggested by James Zern to clear out cpplint warnings for all unit test code. Change-Id: I731a3fa4d2a257eb9ef733426ba84286fbd7ea34	2013-09-06 10:13:49 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of a pointer to a MODE_INFO struct and a "in the image" flag. The MODE_INFO structs are now stored as a stream, eliminating unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Jim Bankoski	e4e864586c	Merge "fix loop filter setup_mask could reach out of bounds issue"	2013-09-06 06:21:28 -07:00
hkuang	3476404912	Merge "Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf."	2013-09-05 17:37:13 -07:00
Jim Bankoski	736114f44b	fix loop filter setup_mask could reach out of bounds issue Change-Id: Ic8446c4f26b6782a6dc482c19ea73c77646df418	2013-09-05 15:53:31 -07:00
Jingning Han	170be56a74	Merge "Enable 32x32 Transform unit test"	2013-09-05 15:23:27 -07:00
Jingning Han	4ad52a8f18	Enable 32x32 Transform unit test This commit enabled a full functional test on 32x32 forward/inverse transform, including round-trip error and memory overflow check. It tests the prototype functions in C and all other implementations if applicable. Change-Id: I9cc50b05abdb4863e7abbcb29209a19b1fe90da7	2013-09-05 14:46:51 -07:00
Jingning Han	1c263d6918	Merge "Use saturated addition in SSSE3 of 32x32 quant"	2013-09-05 14:09:40 -07:00
Jim Bankoski	2156ccaa4a	Merge "resolve clang warnings : uninitialized vars in vp9_entropy.h"	2013-09-05 12:55:32 -07:00
Jingning Han	458c2833c0	Use saturated addition in SSSE3 of 32x32 quant The 32x32 forward transform can potentially reach peak coefficient value close to 32700, while the rounding factor can go upto 610. This could cause overflow issue in the SSSE3 implementation of 32x32 quantization process. This commit resolves this issue by replacing the addition operations with saturated addition operations in 32x32 block quantization. Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70	2013-09-05 12:49:12 -07:00
Jim Bankoski	9fc3d32a50	Merge "faster accounting of inc_mv"	2013-09-05 12:38:56 -07:00
Yaowu Xu	9158b8956f	Merge "make bsize requirement for SEG_LVL_SKIP explicit"	2013-09-05 08:15:03 -07:00
Yaowu Xu	7bc775d93d	Merge "Added ClearSystemState in a unit test"	2013-09-05 08:14:44 -07:00
Jim Bankoski	2e4ca9d1a5	resolve clang warnings : uninitialized vars in vp9_entropy.h This helps clear out some of the warnings Change-Id: Ie7ccaca8fd92542386a7f1b257398e1bdf2f55dc	2013-09-04 18:38:41 -07:00
Jim Bankoski	e8feb2932f	Merge "wrap non420 loop filter code in macro"	2013-09-04 17:20:53 -07:00
Paul Wilkins	e5deed06c0	Merge "Attempt to fix speed 4"	2013-09-04 17:19:22 -07:00
Yaowu Xu	1ee66933c1	make bsize requirement for SEG_LVL_SKIP explicit The segment feature SEG_LVL_SKIP requires the prediction unit size to be at least BLOCK_8X8. This commit makes the requirement to be explicit. This is to prevent future encoder implementations from making wrong choices. Change-Id: I0127f0bd4c66e130b81f0cb0a8d3dbfe3b2da5c2	2013-09-04 16:32:26 -07:00
hkuang	01c4e04424	Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf. Change-Id: I3e2cc818ec14b432204ff43732f39b6438db685d	2013-09-04 15:57:22 -07:00
Yaowu Xu	e494df1a37	Added ClearSystemState in a unit test There is another unit test that has been failing randomly on win32 build. Investigation has shown that the failure was caused by simd register state is not reset appropriately in the fdct8x8 test. This commit added ClearSystemState() in the teardown of this test, tests showed it resolved the random failure issue for win32 build. Related issue: https://code.google.com/p/webm/issues/detail?id=614 Change-Id: I9381d0c1a6f4b855ccaeef1aca8c417ac8c71ee2	2013-09-04 15:07:34 -07:00
Yaowu Xu	72872d3d8c	Merge "Fixing problem with invalid delta_q reading."	2013-09-04 14:21:30 -07:00
hkuang	3c05bda058	Merge "Add neon optimize vp9_short_iht4x4_add."	2013-09-04 13:35:09 -07:00
hkuang	3b8614a8f6	Add neon optimize vp9_short_iht4x4_add. Change-Id: I42c497b68ae1ee645b59c9968ad805db0a43e37e	2013-09-04 12:37:58 -07:00
Dmitry Kovalev	890eee3b47	Fixing problem with invalid delta_q reading. This is a bitstream change but no currently produces videos should be affected. https://code.google.com/p/webm/issues/detail?id=610 Change-Id: Ic85a6477df6c201cdf7f70f6bd84607b71f4593c	2013-09-04 11:25:43 -07:00
Yaowu Xu	76a437a31b	Merge "Replacing init_dequantizer() with setup_plane_dequants()."	2013-09-04 10:42:12 -07:00
Jim Bankoski	872c6d85c0	Merge "speed up inc_mv_component"	2013-09-04 10:35:51 -07:00
Jim Bankoski	bb2313db28	Merge "make vp9 postproc a config option"	2013-09-04 10:35:26 -07:00
Yunqing Wang	9fd2767200	Merge "Use correct bit cost while static-thresh is on"	2013-09-04 10:26:37 -07:00
Jim Bankoski	c3c21e3c14	wrap non420 loop filter code in macro Change-Id: I62bca0e7a4bffc1a78b750dbb9df9d2378e92423	2013-09-04 10:24:42 -07:00
Jim Bankoski	79401542f7	make vp9 postproc a config option Vp9 postproc is disabled for now as its not been shown to help and may be merged with vp8. Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057	2013-09-04 10:02:08 -07:00
Jim Bankoski	532179e845	faster accounting of inc_mv Moves counting of mv branches to where we have a new mv, instead of after the whole frame is summed. Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd	2013-09-04 09:47:57 -07:00
Dmitry Kovalev	d6606d1ea7	Replacing init_dequantizer() with setup_plane_dequants(). Change-Id: Ib67e996b4a6dcb6f481889f5a0d84811a9e3c5d1	2013-09-04 09:22:59 -07:00
Jim Bankoski	5dda1d2394	speed up inc_mv_component Convert mv_class if statements to look up. re order to avoid ifs... Change-Id: I76966a21bf517bb1f9a7957c08c476c7bb3e9a63	2013-09-04 07:11:30 -07:00
James Zern	1cf2272347	Merge "Fix intermediate height in convolve_c"	2013-09-03 15:50:33 -07:00
Paul Wilkins	49317cddad	Attempt to fix speed 4 Speed 4 fixed partition size. Use fixed size unless it does not fit inside image, in which case use the largest size that does. Change-Id: I250f7a80506750dd82ab355721624a1344247223	2013-09-03 17:46:25 +01:00
Jingning Han	010c0ad0eb	Merge "Fix 32x32 forward transform SSE2 version"	2013-09-03 08:58:03 -07:00
Scott LaVarnway	948aaab4ca	Merge "Improved mb_lpf_horizontal_edge_w_sse2_8"	2013-09-03 05:44:01 -07:00
Jingning Han	3cf46fa591	Fix 32x32 forward transform SSE2 version This commit fixed the potential overflow issue in the SSE2 implementation of 32x32 forward DCT. It resolved the corrupted coded frames in the border of scenes. Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9	2013-08-31 18:47:08 -07:00
Yunqing Wang	0ca7855f67	Use correct bit cost while static-thresh is on While static-thresh is on, we only need to transmit skip flag if skip = 1. The cost of skip bit is added to the total rate cost. Change-Id: I64e73e482bc297eba22907026298a15fa8cc3920	2013-08-30 15:25:13 -07:00
Paul Wilkins	2b9baca4f0	Merge "Added per pixel inter rd hit count stats"	2013-08-30 08:56:01 -07:00
Jingning Han	e22bb0dc8e	Merge "Refactor 16x16 unit tests"	2013-08-30 08:53:19 -07:00
Tero Rintaluoma	e326cecf18	Fix intermediate height in convolve_c - Intermediate height was not correct i.e. when block size is 4 and y_step_q4 is 6. In this case intermediate height was (4*6) >> 4 = 1 and vertical interpolation needs two source pixels plus 7 extra pixels for taps. - Also if the current output block is 16x16 and we are using 4x upscaling we need only 12 rows after horizontal filtering instead of 16. Patch Set 2: Intermediate_height updated after CL 66723 "Fix bug in convolution functions (filter selection)" Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589	2013-08-30 10:31:21 +03:00
Jim Bankoski	1d44fc0c49	Merge "rework filter_block_plane"	2013-08-29 20:11:09 -07:00
Jim Bankoski	bc50961a74	rework filter_block_plane Change-Id: I55c3b60c4c0f4910d3dfb70e3edaae00cfa8dc4d	2013-08-29 17:00:05 -07:00
Jingning Han	ec4b2742e7	Refactor 16x16 unit tests Make the new test module comply to the unit test rules. Change-Id: Id79ff7f03f870973ffbc74f26d64edb418b75299	2013-08-29 16:49:11 -07:00
Jingning Han	c86c5443eb	Merge "Fix overflow issue in SSSE3 32x32 quantization"	2013-08-29 16:49:04 -07:00
Paul Wilkins	1f4bf79d65	Added per pixel inter rd hit count stats Added some code to output normalized rd hit count stats. In effect this approximates to the average number of rd operations/tests per pixel for the sequence. The results are not quite accurate and I have not bothered to account for partial SB64s at frame edges and for key frames However they do give some idea of the number of modes / prediction methods being tested for each pixel across the different partition sizes. This indicates how much scope their is for further gains either by reducing the number of partitions examined or the modes per partition through heuristics. Patch 3 moved place where count incremented so partial rd tests that are aborted with INT_MAX return are also counted. Example numbers for first 50 frames of Akiyo. Speed 0 ~84.4 rd operations / pixel Speed 1 ~28.8 Speed 2 ~11.9 Change-Id: Ib956e787e12f7fa8b12d3a1a2f6cda19a65a6cb8	2013-08-30 00:13:51 +01:00
Deb Mukherjee	b6dbf11ed5	Merge "Adds a speed feature for fast 1-loop forw updates"	2013-08-29 15:54:04 -07:00
James Zern	e83e8f0426	Merge changes Ib1e853f9,Ifd75c809,If3e83404 * changes: consistently name VP9_COMMON variables #3 consistently name VP9_COMMON variables #2 consistently name VP9_COMMON variables #1	2013-08-29 15:50:56 -07:00
Yaowu Xu	ee961599e1	Merge "Fixed potential overflows"	2013-08-29 15:43:26 -07:00
James Zern	d765df2796	consistently name VP9_COMMON variables #3 stragglers Change-Id: Ib1e853f9a331b7b66639dc34d79568d84d1930f1	2013-08-29 13:27:41 -07:00

... 2 3 4 5 6 ...

6456 Commits