generic-library/vpx

Author	SHA1	Message	Date
Hui Su	e7fb03c8ae	Merge "ext-intra: refactor rd loop in interframe" into nextgenv2	2016-06-15 02:46:00 +00:00
Jingning Han	1faf288798	Rework transform quantization pipeline This commit reworks the transform and quantization unit. It enables the use of adaptive quantization for intra modes. This further improves the compression performance: lowres 0.36% midres 0.79% hdres 0.73% The key frame coding performance is improved: lowres 1.7% midres 1.9% hdres 3.3% The overall coding gains are: lowres 1.1% midres 1.8% hdres 2.3% Change-Id: Iaec1a3a4c1d5eac883ab526ed076d957060479dd	2016-06-14 16:32:04 -07:00
Hui Su	69f6fd2134	Merge "Fix rate cost calculation for ext-intra" into nextgenv2	2016-06-14 23:11:25 +00:00
hui su	8c3b3d3686	Handle intra modes when tx type speed feature is enabled Change-Id: I9dc156214f3b3ded33ab30d558124b3151548161	2016-06-14 13:46:53 -07:00
hui su	8f9c9b28a8	Speed up ext-intra inter frame encoding Skip filter intra mode search when regular intra modes have large rd cost. Encoding speed improvement: 8%. Compression performance drop: 0.02% / 0.09% / 0.03% on lowres / midres / hdres Change-Id: I94d3e48781bff6ae6895a54f271dd65c959bb976	2016-06-14 13:46:17 -07:00
hui su	70566f0563	ext-intra: refactor rd loop in interframe Move filter intra modes search to the end, after regular mode search. On average no performance changes. Change-Id: I9293c8fdf706ebf831fbd61c6bb81959790f4848	2016-06-14 13:46:17 -07:00
hui su	7fa61d7d51	Fix rate cost calculation for ext-intra It was broken by commit 8ee640f979. Change-Id: I26b9eba810c74849b0805e64da2d269ab0685cb9	2016-06-14 13:46:17 -07:00
Jingning Han	a116ab7092	Merge "Make tx_type speed feature default" into nextgenv2	2016-06-14 20:45:03 +00:00
Geza Lore	3cf3ce949f	Disable loop restoration when LPF_PICK_MINIMAL_LPF. The speed feature sf->lpf_picl == LPF_PICK_MINIMAL_LPF is used to disable loop filtering. This did not work with the loop-restoration experiment, but now it is respected. Note that this speed feature is only used in real-time cpu-used >= 8 settings. Change-Id: I193723c9ac5f802ec31d8c8b4d37650796e065fd	2016-06-14 16:07:51 +01:00
Geza Lore	58168f5bf4	Remove magic number from traversal (CYCLIC_REFRESH_AQ). mi->stride now depends on the maximum superblock size, and hence the constant 8 padding is no longer appropriate. Traverse the array using mi->stride instead. Change-Id: I8e84b9fe1728f6663f8c10765fe32206375f1e71	2016-06-14 16:07:51 +01:00
Geza Lore	44b91a0e76	Select segment based loopfilter strength for supertx blocks. Segment based loopfilter strength for supertx coded blocks is now selected based on the minimum of all segment IDs within a supertx coded block (same as the quantiser settings). Change-Id: Ib056bd0d05f6a1d3b512a76deb4e2ad4db0f7dc4	2016-06-14 16:07:51 +01:00
Geza Lore	7faae780a5	Remove now superfluous argument from predict_b_extend. Change-Id: I7a76756842af9ce806c6e0e1f98f294af748e8bd	2016-06-14 16:07:50 +01:00
Geza Lore	7dd90c9d22	Rework supertx segment handling and adaptive quantization. Segment level quantizer settings for supertx coded blocks are now selected based on the minimum of all segment IDs within a supertx coded block. This also fixes the 3 adaptive quantization modes with supertx. Change-Id: Ib5db099539d4f82f240e1d745d6e5264f8b34cde	2016-06-14 16:07:50 +01:00
Geza Lore	9e95919414	Re-initialise quantiser after changing segment. When using VARIANCE_AQ, we can change the segment assignment after initialising the quantiser in set_offsets, so re-initialise it when we do so. Change-Id: I1f168553aaf0ade419f0d4bf05820cd591b87659	2016-06-14 16:07:50 +01:00
Geza Lore	d60523bc28	Refactor variance aq. Explicitly signal when the segment map is being refreshed when using VARIANE_AQ. This simplifies decisions about when the segment id needs to be set from the previous segment map vs based on the current variance. Change-Id: Ieb12c950e9cfbc3f53f4d184880071dea805563c	2016-06-14 16:07:50 +01:00
Geza Lore	2a588555bb	Pass segment id explicitly to quantizer init. This is purely refactoring in preparation of fixing supertx segment handling Change-Id: I74bcae34241fdf2b592e1cd45b67af77b9e16c9a	2016-06-14 16:07:37 +01:00
Debargha Mukherjee	902ee5060c	A crash fix for supertx / ext-inter combination. Change-Id: I9860376c98aa3b25f5bf86ed13d4a7631fa6b153	2016-06-13 13:57:30 -07:00
Jingning Han	a9a8c5993b	Refactor the trellis optimization process Speed up the trellis optimization unit by 10%. Change-Id: If055f6c0589a405c008d2900bb8fbc11b1246f66	2016-06-13 12:19:57 -07:00
Jingning Han	04f26783c4	Make tx_type speed feature default Revisit the compression performance and complexity trade-off after making the SIMD version of trellis optimizations. Before that, reduce the transform-quantization function calls temporarily. This would cause about 0.3% performance drop for lowres set. Change-Id: I16917a6bd5c44ec6cd8cd0b59f3c336c4fd96dd2	2016-06-13 12:19:54 -07:00
Jingning Han	4588676cfb	Merge "Trellis based adaptive quantization" into nextgenv2	2016-06-13 17:36:19 +00:00
Debargha Mukherjee	81f8b3f31c	Merge "Some refactoring to support warped motion mode" into nextgenv2	2016-06-10 23:18:39 +00:00
Jingning Han	25ca322957	Trellis based adaptive quantization This commit combines uniform quantizer with trellis based coefficient level optimization. It improves the codebase compression performance: lowres 0.8% midres 1.0% hdres 1.6% Note that the current trellis optimization unit is using C code. This will make the cost of the overall quantization process slower. A number of optimizations will come up next. Change-Id: Id441dd238e4844409d0f08f82604be777f3f5282	2016-06-10 12:56:14 -07:00
Debargha Mukherjee	03be30ba3e	Some refactoring to support warped motion mode Change-Id: I15d54a3ae48b2b33082668116792c6595bdb3ddb	2016-06-10 12:04:18 -07:00
Sarah Parker	a21afd421b	Move new quant experiment from nextgen This experiment implements non-uniform quantization where the width of the bins increases gradually to more closely match a laplacian distribution of the coeficcients. Performance Gain: derflr: 0.15% hevcmr: 0.675% Change-Id: I25234244e3bcd94b87c1f77cf682190b61c8ef94	2016-06-10 08:06:22 -07:00
Angie Chiang	95340fccb3	Revert "Optimize wedge partition selection." This reverts commit efda2831e5f758b4f350679b5c55c0b9282449b0. This commit causes segmentation fault at SSE2/SumSquares2DTest.RandomValues/0 Change-Id: I171937e4daf6f15323e8206418773deb03bd8c53	2016-06-09 19:17:37 -07:00
Aamir Anis	de2a20b411	Merge "Updated loop restoration" into nextgenv2	2016-06-09 20:57:09 +00:00
Alex Converse	d279cadbe0	Port active map / cyclic refresh fixes to VP10. Bring commits 575e81f and 3d6b8a6 to VP10. These changes predate the creation of the active map cyclic refresh test. BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1224 Change-Id: I3559b6933ffa5649926a4b214e45ed0fae523a25	2016-06-09 16:52:43 +00:00
Jingning Han	f59bf76eef	Merge "Take out skip_recode speed feature" into nextgenv2	2016-06-08 21:46:55 +00:00
Jingning Han	cedf90a9d6	Merge "Remove swap buffer speed feature" into nextgenv2	2016-06-08 19:45:54 +00:00
Jingning Han	025fa11c75	Take out skip_recode speed feature The assumption doesn't hold true in the current codebase. Remove this speed feature to simplify the codebase. Change-Id: I9b69f484c9b7cd612b825047cc5b2fce63ee0af7	2016-06-08 18:27:36 +00:00
Jingning Han	0d6980d7a1	Remove swap buffer speed feature The inter prediction residual can undergo different transform types during the rate-distortion optimization search. The assumption used in this speed feature no longer holds true. This commit removes the related code to clean up the codebase and clear out unit test failure in higher speed setting. Change-Id: I7f7cd4df2345ed3e607c9fae75b38cd2dbde0cac	2016-06-08 11:27:00 -07:00
Jingning Han	b48eb90023	Merge "Add tx type speed feature to recursive transform block partitioning" into nextgenv2	2016-06-07 23:44:01 +00:00
Jingning Han	0b7f864213	Merge "Rework the tx type speed feature" into nextgenv2	2016-06-07 23:42:43 +00:00
Zoe Liu	ba61de387b	Merge "Fix a RD performance bug in bipredictive frames" into nextgenv2	2016-06-07 21:34:56 +00:00
Jingning Han	33dafdb58b	Add tx type speed feature to recursive transform block partitioning Change-Id: I45440a72b4287d98cbe21b72defc67138a8eb953	2016-06-07 11:34:30 -07:00
Jingning Han	9a858e868c	Rework the tx type speed feature This commit re-works the transform type speed feature. It moves the transform type selection outside of the coding mode loop. This avoids repeated motion search if the best prediction mode is chosen as NEWMV. It improves the speed performance for clips that contain more motion activities. For mobile_cif at 1000 kbps, this makes the baseline encoding 7% faster and makes the encoding with dynamic motion vector referencing scheme enabled 10% faster. Change-Id: I93e2714b3e461303372c4b66a4134ee212faffd1	2016-06-07 11:32:27 -07:00
Zoe Liu	5414abb4a0	Fix a RD performance bug in bipredictive frames This patch will make sure the use of the BWDREF_FRAME for the encoding of both the two types of bipredictive frames, namely LAST_BIPRED_UPDATE and BIPRED_UPDATE. To realize it, the updates on the cpi->ref_frame_flags have been moved to before the encoding of one frame, instread of originally handled after the encoding of one frame. RD performance has been improved slightly, approximately by 0.17% compared to before the applying of this patch: lowres: Avg -3.474; BDRate -3.324 derflr: Avg -2.097; BDRate -1.353 Change-Id: I0aa19afd752293e345489fbff104c4351ca5498c	2016-06-07 09:45:10 -07:00
Geza Lore	f304d5c8e7	Zero segment counter before accumulating. The segment counts are computed as part of packing the bitstream, so they might have been computed already in the recode loop. Zero the accumulator to avoid double counting. This fixes some encoder/decoder mismatches. Change-Id: Ib7816034cbbb1db41101116b706302b02fad3a2c	2016-06-07 17:02:03 +01:00
Debargha Mukherjee	13155e7725	Merge "Optimize wedge partition selection." into nextgenv2	2016-06-07 09:50:13 +00:00
Aamir Anis	99d9a8fe30	Updated loop restoration 1. Wiener restoration filter now has normalization and evaluation of quantization procedure. 2. Corrected scaling of bits in RD cost computation. 3. Changed dynamic range and number of bits for Wiener filter. Observed gains: Overall 0.58% for low_res, 0.7% for mid_res sequences. Change-Id: I8928b3ea493bfe1790926b00388d6c4bafc08e19	2016-06-06 15:49:52 -07:00
Jingning Han	3713949b6d	Merge "Make ref-mv experiment support ActiveMap" into nextgenv2	2016-06-06 16:06:41 +00:00
Geza Lore	efda2831e5	Optimize wedge partition selection. We can optimize wedge partition selection by pre-computing the residuals of the 2 underlying predictors, and then blend these to compute the sse of the compound predictor, without actually having to compute and subtract the compound predictor. Similarly we can pre-compute a proxy array which we can use to cheaply check which mask sign would have lower sse. Details are in wedge_utils.c. Mathematically these are equivalence transformations, but due to the finite precision the encoder output will be perturbed, though on average this should make 0% difference. ext-inter gains about ~4.5% speedup. Change-Id: Ib2657c3209ae161b4090b58b4b6c392641bf2792	2016-06-06 14:43:10 +01:00
Debargha Mukherjee	b85d0adadf	Merge "Always include the cost of tx size in rate for Y." into nextgenv2	2016-06-03 22:57:17 +00:00
Debargha Mukherjee	33c57e6223	Merge "Check if sub8x8 rd stats are valid before reusing them." into nextgenv2	2016-06-03 22:38:56 +00:00
Debargha Mukherjee	fc61d92bf8	Merge "Compute rate of partition type accurately for edge blocks." into nextgenv2	2016-06-03 22:37:33 +00:00
Jingning Han	27d8a948c1	Make ref-mv experiment support ActiveMap Reset the ref_mv_idx and predicted motion vector when the coding block belongs to skip segment. Change-Id: I5746ab315a436b829b64a1a25121989d3c11c995	2016-06-03 15:04:18 -07:00
Geza Lore	b87078d51e	Always include the cost of tx size in rate for Y. The transform can only be skipped if both Y and U/V can be skipped, so we always include the cost of tx size in the rate for Y. This will get later subtracted if the transform is actually skipped. Change-Id: I136a223e5596f18b69bb9f743e7e08438183a215	2016-06-03 11:51:35 -07:00
Geza Lore	d9870c32a9	Check if sub8x8 rd stats are valid before reusing them. Change-Id: I5d49f15a07de58c226d4003b4691e001abf1f3f8	2016-06-03 11:47:34 -07:00
Geza Lore	8ee640f979	Compute cost of UV mode accurately for intra blocks. We used to cache the cost of the UV mode from the search with a different previously tried Y mode, but the UV mode is contexted on the Y mode, so caching the cost is inaccurate. Change-Id: Ib003510afb6fc9befb7808b67b0be64f1c0a0804	2016-06-03 11:13:51 -07:00
Geza Lore	1354c6942c	Compute rate of partition type accurately for edge blocks. This patch factors in the different partition coding syntax used for right and bottom edge blocks when doing RD search. Change-Id: I2f31650512b6a4a7a2c03352414693aff6fbf87b	2016-06-03 06:43:34 -07:00

... 2 3 4 5 6 ...

1015 Commits