generic-library/vpx

Author	SHA1	Message	Date
Marco	7e79831016	vp9: aq-mode=3: On key frame reset cr->reduce_refresh to 0. This prevent possible reduction of cyclic refresh after key frame. Change-Id: Idd4e49b69cd95476e7eccfa31b2bd8669569e9e8	2017-02-22 10:50:08 -08:00
Jerome Jiang	3d1fa00fce	vp9: Only compute y_sad for golden in variance partition for speed < 8. Only affects speed 8. No obvious quality regression. Systematic speed ups by ~1% on Nexus 6. Change-Id: Ia904ca28ea041c3281c532911ec38fb7d7f46a17	2017-02-22 10:19:09 -08:00
Yunqing Wang	66f36f4735	Merge "Refactored the row based multi-threading code"	2017-02-22 16:55:04 +00:00
Jerome Jiang	b1dcaf7f1e	Merge "Fix segmentation fault caused by denoiser working with spatial SVC."	2017-02-22 04:44:55 +00:00
Marco	7f2daa74a0	vp9: Incorporate source sum_diff into non-rd partition thresholds. Increase the variance partition thresholds for superblocks that have low sum-diff (from source analysis prior to encoding frame). Use it for now only for speed >= 7 or for denoising on. Small change on metrics for rtc set: less than ~0.1 avgPNSR decrease on RTC set, for both speed 7 and 8. Change-Id: I38325046ebd5f371f51d6e91233d68ff73561af1	2017-02-21 17:22:11 -08:00
Johann Koenig	1e224dcb83	Merge "Drop zbin_ptr and quant_shift_ptr"	2017-02-21 18:16:38 +00:00
Jerome Jiang	0d1e5a21c4	Fix segmentation fault caused by denoiser working with spatial SVC. Re-enable the affected test. BUG=webm:1374 Change-Id: I98cd49403927123546d1d0056660b98c9cb8babb	2017-02-21 09:38:28 -08:00
Paul Wilkins	4d4231352c	Merge "Change to prediction decay calculation."	2017-02-21 09:42:38 +00:00
Marco	4e1ba35458	vp9: Fix for non-rd pickmode for high-bitdepth build. Use the simple block_yrd under certain conditions. The optimization code is completed but the speed is still slower (~6% on 720p) than the low-bitdepth build. For now, use the more complex block_yrd under certain conditions (always use it for speed <= 5, otherwise use it on key frames and for bsize >= 32x32). This gives about ~2-3% gain in quality for speed 7 on RTC set (over high bitdepth build), with about the same encoder fps as the low bitdepth build. Change-Id: Ibe92a1945d0bd635f880befb4c815727df62d754	2017-02-20 20:25:36 -08:00
Ranjit Kumar Tulabandu	97d6a4cbd1	Refactored the row based multi-threading code Modified the code to facilitate bit-match tests in first pass Added unit-tests to test the row based multi-threading behavior for bit-exactness Change-Id: Ieaf6a8f935bb1075597e0a3b52d9989c8546d7df	2017-02-20 16:13:45 +05:30
paulwilkins	a63adac604	Change to prediction decay calculation. This change subtracts out low complexity intra regions that are also low error in the inter domain, in the calculation of the frame prediction decay. The rationale here his that low complexity regions (such as sky) do not imply high prediction decay in the same way as high error intra or neutral blocks. The effect of this is small in most clips but in a few clips it can be > 10%. (E.g. In to tree) Change-Id: If67ac23d17fca14285cad2defa464c61c9ea861c	2017-02-17 09:29:24 +00:00
James Zern	b5bc9ee02d	Merge "cosmetics: Fix spelling mistake in compile flag name."	2017-02-17 00:04:42 +00:00
Johann Koenig	a9b81da575	Merge "block error avx2: use tran_low_t"	2017-02-16 23:51:14 +00:00
paulwilkins	d218b0914e	cosmetics: Fix spelling mistake in compile flag name. agressive -> aggressive after: ce7b38459 Aggressive VBR method. Change-Id: Ie0f30b1bbc77ed9f32bec047b4a9b3d0cf4853f5	2017-02-16 14:51:31 -08:00
Johann	ca4e27f5da	Drop zbin_ptr and quant_shift_ptr vp9[_highbd]_quantize]_fp[_32x32] and vp9_fdct8x8_quant do not make use of these parameters. scan is used for C code and iscan is used for SIMD implementations. Change-Id: I908a0ff7d3febac33da97e0596e040ec7bc18ca5	2017-02-16 13:20:32 -08:00
Johann	2104454607	block error avx2: use tran_low_t Change-Id: Ic5f3a1f569d6f82afeaf4fcd7235374bb460db3c	2017-02-16 12:39:02 -08:00
Johann Koenig	cc43012674	Merge changes I267050a5,Iebade0ef,Id96a8df3 * changes: quantize_fp_32x32 highbd ssse3: enable existing function quantize_fp highbd ssse3: use tran_low_t for coeff quantize_fp highbd sse2: use tran_low_t for coeff	2017-02-16 20:34:48 +00:00
Yunqing Wang	0bf6b51572	Merge "Structured the mode ordering code to avoid redundant memcpy"	2017-02-16 16:22:54 +00:00
Johann	ff37a911ce	quantize_fp_32x32 highbd ssse3: enable existing function This was created as part of the quantize_fp_ssse3 change. Both functions use the same source file with different macro parameters. Change-Id: I267050a559426a85955d215aa0aaca270439c5ab	2017-02-16 07:40:56 -08:00
Johann	4682130b60	quantize_fp highbd ssse3: use tran_low_t for coeff Change-Id: Iebade0efc0efbb0a80a0f3adbef4962e3a2f25e8	2017-02-16 07:40:56 -08:00
Johann	ac3996a6d1	quantize_fp highbd sse2: use tran_low_t for coeff Change-Id: Id96a8df33354a7987ce890a3d6798c7375ffa4aa	2017-02-16 07:40:55 -08:00
Johann	44600442dc	bitdepth conversion: really use num elements The previous implementation confused bit/bytes/elements. It was using '32' as the multiplier but that was mistakenly adopted because a 32x32 transform embedded the stride. Change-Id: Ieeb867a332416b9a40580b5e7c9b20088e9e691a	2017-02-16 15:02:48 +00:00
Ranjit Kumar Tulabandu	5127e58dab	Structured the mode ordering code to avoid redundant memcpy Change-Id: I4f5d6b54018bd1928cd9e5e42619e6f55b334803	2017-02-16 14:12:33 +00:00
Paul Wilkins	60a10116d1	Merge "Disconnect ARF breakout from frame boost."	2017-02-16 10:02:09 +00:00
Paul Wilkins	543ebc900f	Merge "Remove unnecessary factor."	2017-02-16 10:01:58 +00:00
Paul Wilkins	9216ba58d8	Merge "Bug in scale_sse_threshold()"	2017-02-16 10:01:46 +00:00
Paul Wilkins	e6c1993f1b	Merge "Additional first pass stats."	2017-02-16 09:39:29 +00:00
Marco	158b300952	vp9: Some code cleanup for aq-mode = 3. The weight segment needs to only be computed once per frame, so remove it from the funciton vp9_cyclic_refresh_rc_bits_per_mb(), which is called within a loop inside vp9_rc_regulate_q. Change-Id: Ia0e18b89abb97e42c466d4dbc47700d7f76555db	2017-02-15 14:07:04 -08:00
Marco	f82280820a	vp9. Use same source_sad threshold for all speeds. Only affects real-time mode. Change-Id: Iba836f110c4da936f5173cc0f54424d5b6121bff	2017-02-15 11:28:26 -08:00
Marco	716c1d5ff5	Vp9: Speed 8 aq-mode=3: Reduce computation in estimating bits per mb. vp9_compute_qdelta_by_rate has almost 2% overhead in profiling on Nexus 6. Reduce the calling of that function in speed 8 by estimating the delta-q. Both rtc and rtc_derf show little/no change in avg psnr/ssim. Encoding speed is 2~3% faster on Nexus 6. Change-Id: If25933715783f31104a18a5092ea347b1221b5f5	2017-02-15 09:28:16 -08:00
Linfeng Zhang	ccada0636b	Merge "Add vpx_highbd_idct16x16_38_add_c()"	2017-02-15 17:06:17 +00:00
paulwilkins	cfc79a357a	Disconnect ARF breakout from frame boost. This small change replaces the frame boost check in the arf group length break out clause with a test against a prediction decay value. The boost value is in fact partly dependent on the decay value but this change means that the per frame boost calculation can be adjusted without influencing the group length calculation. The value chosen gives a close match on all the test sets with the previous code (on average) but it was noted that a lower threshold was slightly better for 1080P and up and a slightly higher value for small image sizes. Change-Id: I4d5b9f67d5b17b0d99ea3f796d3d6202fd61ee0c	2017-02-15 10:46:14 +00:00
paulwilkins	b89ba05ab4	Remove unnecessary factor. Removed unnecessary scaling factor to simplify. Change-Id: I3fc9c5975a2597e72f1324e09dd586dea1facfa7	2017-02-15 10:45:43 +00:00
paulwilkins	76550dfdc0	Bug in scale_sse_threshold() The function scale_sse_threshold() returns a threshold scaled if necessary for use with 10 and 12 bit from an 8 bit baseline. SSE error values would be expected to rise for the 10 and 12 bit cases where there are more bits of precision. Hence the threshold used for the test should also be scaled up. Change-Id: I4009c98b6eecd1bf64c3c38aaa56598e0136b03d	2017-02-15 10:45:03 +00:00
paulwilkins	945ccfee59	Additional first pass stats. Added counts that split the intra coded blocks into low and high variance. Change-Id: Ic540144b34d5141659081bb22f7ee16fd6861f14	2017-02-15 10:44:37 +00:00
Paul Wilkins	7635ee0f37	Merge "Aggressive VBR method."	2017-02-15 10:37:02 +00:00
Johann Koenig	61927ba4ac	Merge "vp9 fdct higbd neon: connect existing highbd calls"	2017-02-15 01:33:00 +00:00
Linfeng Zhang	e07e74fb0f	Add vpx_highbd_idct16x16_38_add_c() When eob is less than or equal to 38 for high-bitdepth 16x16 idct, call this function. BUG=webm:1301 Change-Id: I09167f89d29c401f9c36710b0fd2d02644052060	2017-02-14 17:25:52 -08:00
Yunqing Wang	f2c1aea118	Merge "Row based multi-threading of encoding stage"	2017-02-15 00:54:10 +00:00
Ranjit Kumar Tulabandu	71061e9332	Row based multi-threading of encoding stage (Yunqing Wang) This patch implements the row-based multi-threading within tiles in the encoding pass, and substantially speeds up the multi-threaded encoder in VP9. Speed tests at speed 1 on STDHD(using 4 tiles) set show that the average speedups of the encoding pass(second pass in the 2-pass encoding) is 7% while using 2 threads, 16% while using 4 threads, 85% while using 8 threads, and 116% while using 16 threads. Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de	2017-02-15 00:49:34 +00:00
Johann	3e7aa8fda9	vp9 fdct higbd neon: connect existing highbd calls Change-Id: Ia8f822bd6e70b3911bc433a5a750bfb6f9a3a75c	2017-02-14 22:11:49 +00:00
Johann Koenig	9c2bb7f342	Merge "quantize_fp highbd neon: use tran_low_t for coeff"	2017-02-14 21:28:23 +00:00
clang-format	4b402746ca	apply clang-format Change-Id: I75e4a9e0b37bd4586f26c8d6c1fa27f3f6ff1bce	2017-02-14 12:45:52 -08:00
Johann	2b24aa87d9	quantize_fp highbd neon: use tran_low_t for coeff Change-Id: I90fd815f15884490ad138f35df575a00d31e8c95	2017-02-14 10:26:10 -08:00
Yunqing Wang	318ca07657	The bitstream bit match test in multi-threaded encoder While the new-mt mode is enabled(namely, allowing to use row-based multi-threading in encoder), several speed features that adaptively adjust encoding parameters during encoding would cause mismatch between single-thread encoded bitstream and multi-thread encoded bitstream. This patch provides a set_control API to disable these features, so that the bit match bitstream is obtained in the unit test. Change-Id: Ie9868bafdfe196296d1dd29e0dca517f6a9a4d60	2017-02-13 13:02:26 -08:00
James Zern	3c4ea94210	cosmetics,vp9_ratectrl: apply clang-format broken since: c3f095c8b Merge "Fix to avoid abrupt relaxation of max qindex in recode path" 5f21aba4b Fix to avoid abrupt relaxation of max qindex in recode path the original change pre-dated the addition of .clang-format Change-Id: If5e399d9a805bcad9147360b13b36fbc8c560a7c	2017-02-13 11:29:39 -08:00
paulwilkins	ce7b38459a	Aggressive VBR method. VBR method that allows a wider Q range for the first normal frame in each ARF group and then centers the min - max range for the rest of the arf group on the chosen Q value for that first frame. This allows for quite rapid adjustment of the active Q range even if the initial estimate is poor. In some cases where the ARF frames themselves are tending to undershoot but the normal frames are overshooting this can still give net undershoot. This can be corrected by allowing a larger Q delta for arf frames but is usually is a sign that the allocation to the arfs was to high. Change-Id: Icec87758925d8f7aeb2dca29aac0ff9496237469	2017-02-13 15:42:11 +00:00
Marco	22dcfa80aa	vp9: Non-rd mode: use simple block_yrd for 8 bit high bitdepth builds Temporary fix until optimization work for block_yrd is completed. This essentially reverts back to the state before the change: https://chromium-review.googlesource.com/c/433821/ Compression loss is about ~5-6% on RTC set. Speed-up (from using this simple/model-based block_yrd) over the low bitdepth builds (which uses more complex block_yrd) is ~5% on 720p. Change-Id: Ie0af9eb0d111e5595f587870c44f08317403b8d8	2017-02-10 10:15:35 -08:00
Paul Wilkins	c3f095c8b3	Merge "Fix to avoid abrupt relaxation of max qindex in recode path"	2017-02-09 17:17:55 +00:00
Paul Wilkins	82b88a7fd0	Merge "Fix for max qindex calculation of a gf interval"	2017-02-09 17:17:44 +00:00

... 2 3 4 5 6 ...

9511 Commits