generic-library/vpx

Author	SHA1	Message	Date
Marco	3b2d08a93b	vp9-denoiser: Modify skip denoising condition for small blocks. Skip denoising for blocks < 16x16, and for block = 16x16 skip denoising for low noise levels and width > 480 for now. Allow for some speed-up in denoiser. Change-Id: Ib46cefe4741962d145fa08775defea3a9c928567	2017-01-25 11:48:09 -08:00
hui su	519b2e48a8	Fix an overflow warning in optimize_b() BUG=webm:1361 Change-Id: Ib840bf3b39f7b3c8c017d3488a83434e9a0f45f5	2017-01-25 10:54:39 -08:00
Jerome Jiang	70a3652693	Merge "vp9: Adjust threshold for y sad used in copying partition."	2017-01-25 17:54:15 +00:00
Yunqing Wang	a762cef917	Merge "Initialize errorperbit and sabperbit in ARNR filtering"	2017-01-25 16:43:02 +00:00
Yunqing Wang	633dbcb458	Merge "Multi-threading of first pass stats collection"	2017-01-25 16:40:32 +00:00
Jerome Jiang	3a7ad43fb8	vp9: Adjust threshold for y sad used in copying partition. Visual quality improvement is observed for noisy clips. Little effects on speed tests on Nexus 6. Change-Id: Ib38e04002220708c34102de7b5c36e9940775d89	2017-01-24 17:20:05 -08:00
Ranjit Kumar Tulabandu	8b0c11c358	Multi-threading of first pass stats collection (yunqingwang) 1. Rebased the patch. Incorporated recent first pass changes. 2. Turned on the first pass unit test. Change-Id: Ia2f7ba8152d0b6dd6bf8efb9dfaf505ba7d8edee	2017-01-24 15:48:02 -08:00
Marco	8d0c8c5e6b	vp9: Adjust some parameters in aq-mode=3 mode. Increase the qp-delta, mainly for low resolutions, excluding case of very low bitrates. avgPSNR/SSSIM gain of ~3-5% on rtc_derf set. Small change on rtc set. Change-Id: Ice03d04bd0340404d1957666ef154fd64fed0606	2017-01-24 14:18:02 -08:00
Jerome Jiang	ac1358cd56	vp9: Copy partition using avg_source_sad. Affecting only speed 8. Speed tests on Nexus 6 show 4% faster for QVGA and 2.4% faster for VGA. Little/negligible quality regression observed on both rtc and rtc_derf sets. Change-Id: I337f301a2db49a568d18ba7623160f7678399ae1	2017-01-24 10:31:22 -08:00
Ranjit Kumar Tulabandu	75d2443bf0	Initialize errorperbit and sabperbit in ARNR filtering (Yunqing) This patch added the missing initialization in temporal filter. Borg test BDRate results: PSNR: -0.019%(lowres); -0.013%(hdres); SSIM: -0.001%(lowres); -0.010%(hdres). Other q values gave comparable but no better results. Change-Id: I7ad0c18b39e6f558342688e2fe1e12fdb133ce9b	2017-01-24 08:58:17 -08:00
Jerome Jiang	d82b9f62a9	Merge "vp9: Adjust the threshold to set avg_source_sad_sb flag."	2017-01-24 03:43:12 +00:00
Yunqing Wang	b987bc36af	Remove marco MVC in mcomp.c Removed MVC so that mv_err_cost() is always called while calculating the mv cost. Change-Id: I28123e05fbfc2352128e266c985d2ab093940071	2017-01-23 17:03:12 -08:00
Jerome Jiang	40ffa2839f	vp9: Adjust the threshold to set avg_source_sad_sb flag. Affect only speed 8. Small/Negligible regression on rtc set. Change-Id: I67a6b6b4008a22ed798bd980336d95bb799f64b4	2017-01-23 16:11:28 -08:00
Marco	f38ed0c560	vp9: Non-rd pickmode: fix to add ARF mode entries to THR_MODES. BUG=webm:1359 Change-Id: Ie0c66efa2e19d1ec9c744d14e3fa8f1e6214cdd6	2017-01-23 10:56:29 -08:00
Marco	219cdab676	vp9: Add feature to use block source_sad for realtime mode. Only for speed >= 7, and affects skipping of intra modes. Threshold is set low for now, needs to be tuned. Small/no difference in metrics on rtc clips. Change-Id: If9bdbd43f08d1f80407cdd2e9e5e96780dcd2424	2017-01-20 11:57:02 -08:00
Marco	0f9760ab6f	vp9: Modify usage of force_skip under low temporal variance in non-rd pickmode. For short_circuit set to level 1, skip newmv for 64x64 blocks if the low temporal variance flag is set. Also modify threshold for 64x64 split in variance partitioning. Overall speed-up on noisy clips of 2-4%. Only affect speed >= 7. Change-Id: I384b3772007e84de6f8707e480d2ddf1fe1f907d	2017-01-19 11:21:15 -08:00
Jerome Jiang	ee5b29ae30	vp9: Stop copying partition every a fixed number of frames. Avoid quality loss when copying partition of superblock with large motions. Maximum consecutively copied frames can be set (currently 5). Change-Id: I11c30575514f02194c0f001444cf4021609e5049	2017-01-18 11:23:59 -08:00
Jerome Jiang	9152d434dc	vp9: Disable partition copy when resizing is enabled. Change-Id: I4fa3262e0f1c4018604c954b020ec5d1e3d1465c	2017-01-17 18:21:31 -08:00
Jerome Jiang	255866419d	Merge "vp9: Set low variance flag when partition is copied."	2017-01-17 21:02:52 +00:00
Jerome Jiang	0c65aed099	vp9: Set low variance flag when partition is copied. Also set the flag to 1 when exit early choosing 64x64 block such that skipping new mv for golden works in these scenerios. Change the size of prev_segment_id to the number of superblocks to save memory. Borg test shows quality regression of 0.012% on average PSNR and 0.035% on SSIM. Change-Id: I5014224c8617d439d35c66ece3fed9ae30b31d23	2017-01-17 11:14:50 -08:00
Ranjit Kumar Tulabandu	5f21aba4b0	Fix to avoid abrupt relaxation of max qindex in recode path The fix relaxes the max qindex based on the data from previous loop of coding if output frame size is greater than maximum frame size allowed Change-Id: Iac1f63ec67559d68766e090a7cbb80b812b2560f	2017-01-16 18:03:27 +05:30
Marco	159cc3b33c	vp9: Add speed feature flag for computing average source sad. If enabled will compute source_sad for every superblock on every frame, prior to encoding. Off by default, only on for speed=8 when copy_partition is set. Change-Id: Iab7903180a23dad369135e8234b7f896f20e1231	2017-01-13 11:52:12 -08:00
Marco	47270b6858	vp9: Adjust threshold for copy partiton, for speed=8. Change-Id: I4799cb2b67d911ee385e6d6992c61633ca77e69d	2017-01-13 10:29:31 -08:00
Marco Paniconi	888bb6c133	Merge "vp9: Update threshold for partition copy."	2017-01-13 06:22:53 +00:00
Jerome Jiang	2ff2376fbc	vp9: Update threshold for partition copy. Avoid many visual artifacts. Compression quality is improved by more than 1%. Encode speed is about 4% for QVGA and 6% for VGA faster on android. Change-Id: I4dd0a81429ddf7efdef1e80a191da5fb8de8e8af	2017-01-12 18:48:38 -08:00
Marco Paniconi	baa4a290eb	Merge "vp9: Make the denoiser work with spatial SVC."	2017-01-12 17:54:41 +00:00
Jerome Jiang	f129e09529	vp9: Turn on the partition copy for speed 8. Tune threshold. For speed 8, it speeds up the encoding on android by 6% for QVGA and 7.4% for VGA with the new threshold. Overall PSNR is improved by 0.667 for rtc. Change-Id: I4a644560b32c0b5b4e9f49ffb953d000413a3732	2017-01-11 10:48:16 -08:00
Marco	7e3a82c384	vp9: Make the denoiser work with spatial SVC. If enabled denoiser will only denoise the top spatial layer for now. Added unittest for SVC with denoising. Change-Id: Ifa373771c4ecfa208615eb163cc38f1c22c6664b	2017-01-10 17:23:58 -08:00
Marco	91fc730d83	vp9: 1 pass cbr: Adjustments to usage of gf_cbr_boost and aq=3 mode. When aq=3 mode is on and the gf_cbr_boost is set: make sure golden frame is always refreshed, and don't incorporate segement cost in qp setting on the boosted golden frame. Better performance on RTC set with gf_cbr_boost on, for example with gf_cbr_boost=50, gains from ~0.5-3%. Change-Id: Ie811f5e4d444ff3320bd6e2c1745b2c4c09a8460	2017-01-10 09:42:06 -08:00
Jerome Jiang	299ef2f8eb	Merge "vp9: Set less aggresive short_circuit_low_temp_var for HD at speed 8."	2017-01-10 00:51:09 +00:00
Jerome Jiang	198b834c97	vp9: Set less aggresive short_circuit_low_temp_var for HD at speed 8. Quality improved by 1.866 and 0.386 for two noisy clips (dark720p and marcooffice720p), respectively. Change-Id: Ib33a7672ae9ca53da156208f7cd13f07b5543e44	2017-01-09 16:44:07 -08:00
Marco Paniconi	62cce50d55	Merge "vp9: 1 pass cbr: Fix to qp clamping when gf_cbr_boost_pct is used."	2017-01-09 23:30:32 +00:00
Marco	35c4a13eb7	vp9: Fix comment in speed features. Change-Id: I65d79c06b152922d725bf559adaa508f91cd5766	2017-01-09 13:05:31 -08:00
Marco	bea22782e9	vp9: 1 pass cbr: Fix to qp clamping when gf_cbr_boost_pct is used. Avoid the qp-clamping on gf/alt frame if gf_cbr_boost_pct is set. Change only affect CBR mode when gf_cbr_boost_pct is set. Change-Id: I0655ed4f2b047c8ed1ed33a070c17960ad776704	2017-01-09 12:52:50 -08:00
Marco Paniconi	ebe0b57c91	Merge "vp9: 1 pass cbr mode: increase threshold for gf_cbr_boost_pct usage."	2017-01-09 17:23:12 +00:00
Hui Su	c7e2bd6298	Merge "Add support for VP9 level targeting"	2017-01-07 00:55:41 +00:00
Marco	f1909d26f8	vp9: 1 pass cbr mode: increase threshold for gf_cbr_boost_pct usage. Increase the boost threshold below which GOLDEN update will use same rate correction factor as INTER_NORMAL. Improves performance when gf_cbr_boost_pct is set (between 0 and 100) in CBR mode. Change-Id: I9f54cc18664786a100b13a416b7137ae03bd0cab	2017-01-06 15:37:10 -08:00
Jerome Jiang	316071d79c	Merge "vp9: Enable more aggresive short circuit for speed 8."	2017-01-06 22:38:40 +00:00
Jerome Jiang	267e73446c	vp9: Enable more aggresive short circuit for speed 8. Set short_circuit_low_temp_var to 3 for speed 8 for all res. No strong visual difference on all clips. Change-Id: Ia6d9a314291ab1c14d5421bbdd769974083aeb2a	2017-01-06 10:23:34 -08:00
hui su	337ad83e58	Add support for VP9 level targeting Constraints on encoder config: -target_bandwidth is no larger than 80% of level bitrate limit -target_bandwidth * (1 + max_over_shoot_pct) is no larger than 88% of level bitrate limit -min_gf_interval is no smaller than level limit -tile_columns is no larger than level limit Constraints on rate control: -current frame size plus previous three frames' size is no larger than the CPB level limit -current frame size is no larger than 50%/40%/20% of the CPB level limit if it's a key/alt-ref/other frame. Change-Id: I84d1a2d6d6e3c82bfd533b3309ce999cfaba2c8b	2017-01-06 10:07:31 -08:00
Jerome Jiang	afc8c4836f	vp9: Compute source sad for every superblock when partition copy is on. The source sad could be used to copy the partition without going into choose_partitioning function to speed up vp9 encoding. Computing source sad takes little time. Speed test on Android and Linux shows little encoding time gain (less than 1.4%). Turned off for now since partition copy is turned off. Change-Id: I61c9d5b8f22329760cb29a4ee30a7f9c232ce8d3	2017-01-06 17:59:02 +00:00
Jerome Jiang	72746c079d	vp9: Set short circuit to level 3 for VGA for speed 8. vp9: Set short circuit to level 3 for VGA for speed 8. Also change the threshold_32x32 to 5/8*thresholds[1] to improve quality regression caused to VGA clips. Change-Id: Ia1590e91e7cb22be78d5b85013387bb1be4272e3	2017-01-04 11:28:31 -08:00
Marco	768b1f7281	vp9: 1 pass cbr: allow noise estimation down to 360p. Also adjust some thresholds for noise level setting. Change-Id: I7e03d7057ef2061c9447728deb9c6aff5d3da4b7	2017-01-03 16:26:22 -08:00
Ranjit Kumar Tulabandu	d3db846cc5	Fix for max qindex calculation of a gf interval Calculation of active_worst_quality of a gf interval is modified for coherency BUG=webm:1355 Change-Id: I84cc2b47a8713f102a69419fb33ab020cffa3e71	2017-01-03 10:24:02 -08:00
Yunqing Wang	99c573f018	Merge "Fix for out of range motion vector bug in joint motion search"	2017-01-03 17:46:15 +00:00
Ranjit Kumar Tulabandu	b67e1f701f	Fix for out of range motion vector bug in joint motion search Clamped the initial mv in vp9_refining_search_8p_c. BUG=webm:1354 Change-Id: I47d302b350937e3e6e52e95c983b5fb0b4c64fba	2017-01-03 09:12:32 -08:00
Yunqing Wang	ecdb6a00c2	Merge "Make sub-pixel mv search's return value consistent with the return type"	2016-12-29 19:16:01 +00:00
Yunqing Wang	c96a8dcb5b	Merge "Bug fix to avoid random crashes during ARNR filtering"	2016-12-29 17:24:24 +00:00
Gabriel Marin	e6b9609fc0	Merge "Remove superfluous conditional on 'shortcut'"	2016-12-29 06:03:43 +00:00
Yunqing Wang	1d12559b09	Make sub-pixel mv search's return value consistent with the return type For out-of-range cases, returned UINT_MAX instead of INT_MAX in the sub-pixel mv search to be consistent with the "uint32_t" return type. Change-Id: I8e206d771228c13d89bafbbe9f14722c8ecc6a7a	2016-12-27 12:08:38 -08:00
Ranjit Kumar Tulabandu	7cf13826b7	Bug fix to avoid random crashes during ARNR filtering The function 'vp9_find_best_sub_pixel_tree_pruned_more' is modified to return INT_MAX for handling invalid MV cases from UINT32_MAX. yunqingwang: patch 3: rebased on top of the tree. patch 4: The return type of vp9_find_best_sub_pixel_tree* was changed to uint32_t to fix ubsan warnings. Changing UINT_MAX back to INT_MAX was not quite right. Patch 4 modified vp9_temporal_filter.c to accept uint32_t. (Note: Inconsistency exists in vp9_find_best_sub_pixel_tree*, which will be fixed in a separate CL.) Change-Id: Ib1a79dc2aa41ea6335c21669c76883cdbb7e0535	2016-12-27 11:20:08 -08:00
Marco	e7c453b613	vp9: 1 pass vbr: Skip find_predictors in pickmode when source is altref. When source frame is altref, we only do zero-mv mode, so we can skip the find_predictors(). No change in compression. Small speed gain, ~1%. Only affects 1 pass vbr with lookhead altref, for ytlive with the macro flag USE_ALTREF_FOR_ONE_PASS on. Change-Id: I9318c5da8521f017bf54919cd652438b3a6313d1	2016-12-21 12:12:55 -08:00
Jerome Jiang	f27276f44f	Merge "vp9: Add feature to copy partition from the last frame."	2016-12-20 21:46:44 +00:00
Gabriel Marin	fce163cd54	Remove superfluous conditional on 'shortcut' Remove superfluous test. Produces a small improvement in instruction scheduling. Measured a 1% to 1.5% reduction in execution time for routine vp9_optimize_b with different compilers. No change in behavior. TEST=Verified that encoded files match bit for bit, with and without this change. BUG=b/33678225 Change-Id: I2bf248d4c25fc0256147d7a8766ff9108ae9cba3	2016-12-20 12:20:21 -08:00
Jerome Jiang	1d5ca84df6	vp9: Add feature to copy partition from the last frame. Add feature to copy partition from the last frame. The copy is only done under certain conditions that SAD is below threshold. Feature is currently disabled, until threshold is tuned. Feature will be initially used for Speed 8 (ARM). Under extreme case of always copying partition for speed 8: Encode time is reduced by 5.4% on rtc_derf and 7.8% on rtc. Overall PSNR reduced by 2.1 on rtc_derf and 0.968 on rtc. Change-Id: I1bcab515af3088e4d60675758f72613c2d3dc7a5	2016-12-19 16:24:03 -08:00
Gabriel Marin	85aead1790	Merge "Simplify address arithmetic in vp9_optimize_b"	2016-12-19 23:25:39 +00:00
Marco Paniconi	c1f5194842	Merge "vp9 denoiser: Fix the logic for re-evaluating zeromv after denoising."	2016-12-19 21:15:37 +00:00
Gabriel Marin	0549f5aae9	Simplify address arithmetic in vp9_optimize_b Simplify address arithmetic on token_costs to reduce the number of generated instructions that are used for address arithmetic inside routine vp9_optimize_b. It also helps improve instruction scheduling depending on compiler and optimization level. Measured a 9.3% reduction in retired instructions and 5.3% reduction in execution time for this routine with GCC v4.8.4 and optimization flags -O3, and a reduction of up to 11.6% in execution time with other compilers. No change in behavior. TEST=Verified that encoded files match bit for bit, with and without this change. BUG=b/33678225 Change-Id: I6098650fb5cd2aa04e014fe6e68ca20761f3a21f	2016-12-19 13:10:04 -08:00
Marco	6e8dbc76ad	vp9: With denoising on, only estimate noise level for higher resolns. Allow it for resolns above 640x360 for now. Change-Id: I087d0d8173f96b316164fdd4a499110ce2e7a233	2016-12-19 10:05:54 -08:00
Marco	61b569b461	vp9 denoiser: Fix the logic for re-evaluating zeromv after denoising. Correctly set interp_filter to SWITCHABLE for INTRA mode. Also reduce threshold on noise level for re-evaluating zeromv. Change-Id: Id32c01e193209fb380aa07204f0be3babf29f70a	2016-12-19 09:30:16 -08:00
Marco	4260a7f2b3	vp9: Change condition to enable recheck_zeromv_after_denoising. For when denoising enabled: change condition to enable the recheck_zeromv_after_denoising for only very high noise level. This is causing an issue, so enabling it for very high noise to effectively shut it off. Change-Id: Ic40d6025f3f398338cedd270d17c0ccd9a3daa84	2016-12-16 15:00:21 -08:00
Marco	5de798f2b2	vp9: Fix to usage of flag USE_ALTREF_FOR_ONE_PASS The flag USE_ALTREF_FOR_ONE_PASS allows for alt-ref lookahead in 1 pass vbr (from https://chromium-review.googlesource.com/#/c/365498). This change is to make sure this macro flag only has effect if the config flag cpi->oxcf.enable_auto_altef is also on. No change in ytlive encoding, as USE_ALTREF_FOR_ONE_PASS is not yet enabled. Change-Id: I1a69681e4a15c5244581a3dab4587fca08f02e0f	2016-12-14 15:07:38 -08:00
Marco	076d4bd91a	vp9: Fix to crash in svc code. use_base_mv assumes 2x2 scaling, so fix is to shutoff this feature unless spatial scale factors are 2. Added svc unittest for 2 spatial layers with 5x5 scaling, which generates the issue without this fix. Also fix some settings in svc unittest: let the speed setting vary (from 5 to 8), and enable static threshold. BUG=webm:1344 Change-Id: Idfd0a6c633c21b49a0479601506302cfe974e30e	2016-12-09 08:57:09 -08:00
Yunqing Wang	880adc3355	Merge "Remove an unused first pass statistic"	2016-12-08 22:46:44 +00:00
Yunqing Wang	394020383d	Remove an unused first pass statistic One of the first pass stats "new_mv_count" is no longer used in VP9, and is removed. This also makes it easy to implement a multi-threaded first pass. This change doesn't affect the coding performance, which has been verified by borg tests. Change-Id: I4c7c7bf9465fda838eb230814ef0c631c068c903	2016-12-07 15:32:25 -08:00
Marco	360ac89885	vp9: Adjust the weight factor for segment rate cost for aq-mode=3. Use the segment weight factor based on the target (cr->percent_refresh) if it less than the current estimate (avergae of past usage and target). Small improvement at low bitrates. Change-Id: Iba8fd909e203f94458901366d3a991f7ea854d49	2016-12-05 12:42:56 -08:00
Marco	d793950ec8	vp9: Adjust cyclic refresh parameters for low bitrates. Increase the motion threshold and qp-delta for segment#2 boost. This can increase the frame-drop at low bitrates, but generally better spatial quality. Only affects real-time mode with aq-mode=3, at very low bitrates. Change-Id: I5ccb784667f70d0c27d369806b93b1f93d5605d1	2016-11-23 12:14:28 -08:00
Marco	b6597745f9	vp9: Use more aggressive skip when short_circuit_low_temp_var = 1. Use the same feature as https://chromium-review.googlesource.com/#/c/411327/, but allow it to be used for speed = 6 and 7, where short_circuit_low_temp_var = 1. Speed up of ~2-3% for speed 7, with little/no loss in compression. Change-Id: I263a0f261ad9929034392d68f0153dc6376fdb5f	2016-11-22 14:54:28 -08:00
Jingning Han	f473e892f7	Merge "Enable asymptotic closed-loop encoding decision"	2016-11-19 04:12:55 +00:00
Jerome Jiang	4ddae8f524	Merge "vp9: Speed 8: More aggresive golden skip for low res."	2016-11-15 22:50:58 +00:00
Jerome Jiang	360217a233	vp9: Speed 8: More aggresive golden skip for low res. Add a new, more aggresive short circuit: short_circuit_low_temp_var = 3 to skip golden of any mode when variance is lower than threshold for low res. This change only affects speed = 8, low resolution. Metrics for avgPSNR/SSIM on rtc_derf (low resolution) show loss of 0.27/0.31%. On Nexus 6, the encoding time is reduced by ~2.3% on average across all low-res clips. Visually little change on rtc_derf clips. Change-Id: Ia8f7366fc2d49181a96733a380b4dbd7390246ec	2016-11-15 13:56:27 -08:00
Jerome Jiang	eff68a3a4d	vp9: Speed 8: Turn off 4x4avg for low-res non-key frames. Changes only affects speed = 8 for low resolutions. Metrics for avgPSNR/SSIM on rtc_derf (low resolutions) show loss of 0.5/0.6%. On Nexus 6, the encoding time is reduced by ~5.9% on average across all low-res clips. Visually little/no change on rtc_derf clips. Change-Id: I68dd50e558d72dcc1af8317d224bfae5e3bd872d	2016-11-14 11:17:14 -08:00
Jingning Han	44f8ee7258	Enable asymptotic closed-loop encoding decision This commit enables asymptotic closed-loop encoding decision for the key frame and alternate reference frame. It follows the regular rate control scheme, but leaves out additional iteration on the updated frame level probability model. It is enabled for speed 0. The compression performance is improved: lowres 0.2% midres 0.35% hdres 0.4% Change-Id: I905ffa057c9a1ef2e90ef87c9723a6cf7dbe67cb	2016-11-14 09:22:55 -08:00
Marco	18794d8ddc	vp9: Adjust thresholds for limiting cyclic refresh for noisy content. For noisy content, be more aggressive in skippping some blocks for delta-qp to reduce noise pulsing artifact. Also treat frame boundary case when dimension is not multiple of superblock size/64. Only affects non-screen content case, and when source noise is measured to be high (at least level kMedium). Change-Id: Ib13a2a20ed1ce37ff3c44d95c3ef2635fd695222	2016-11-08 15:50:46 -08:00
Johann	e10c95dc83	Update vp9_fdct8x8_quant_ssse3 for highbitdepth Borrow transition functions from fdct.h nee vpx_quantize_b_sse2 BUG=webm:1304 Change-Id: I9c88c3eec3ff8bb461411d98c26c3c236ea28ef1	2016-11-05 01:23:07 +00:00
Marco Paniconi	cca774c7df	Merge "vp9: Non-rd pickmode: fix logic in reference masking."	2016-11-03 23:12:05 +00:00
Marco	da9f762e24	vp9: Non-rd pickmode: fix logic in reference masking. Add condition that usable_ref_frame > LAST. This is to avoid potentially skipping all last-nonzero mv modes, if golden is used as a reference but skipped completely for the current block. This has no effect currenty, as we always consider testing golden mode for each block. Change-Id: I3182cf44664081935a90ed43aa7b32e710e60e22	2016-11-03 10:32:57 -07:00
Debargha Mukherjee	f93305aa07	Merge "Speed-up recode loop for extreme bitrate diffs"	2016-11-03 17:04:17 +00:00
Paul Wilkins	295cd3b493	Merge "Fixed bug in formatting of debug stats."	2016-11-02 17:10:07 +00:00
paulwilkins	de76d2e315	Fixed bug in formatting of debug stats. Fixed formatting bug introduced by the fix to BUG=webm:1322 ( Iedc4477aef1746aa0a4f84d88a1156296fd3ba87) Change-Id: I715ee446c0e8584967ab87ba4e355759dd394187	2016-11-02 09:38:18 +00:00
Paul Wilkins	84dcfced5b	Merge "Change to KF boost calculation."	2016-11-01 09:29:30 +00:00
Paul Wilkins	715c65914b	Change to KF boost calculation. This change is a step in a larger change to the way boost and interval are determined for ARF and Key frames. This patch contains some pluming for the general case but focuses on the key frame boost calculation. This now relies more heavily on the rate at which the error score increases between the primary and secondary reference frame. This seems to be less fragile when dealing with different frame sizes. For example larger image formats tend in the first pass to see a higher % of intra coded blocks and the use of this number in calculating the frame decay factor was leading to much lower boost numbers for 4K, for example, than the same clip coded at 2K. This change does give overall gains but they are MUCH larger for the 4K Netflix set. For the 4K Netflix set the average gain is around 3% with some clips > 20% whereas for the same set at 2K the average gain is 0.5-1%. In general for small image formats the boost is most often reduced a little whereas 4K clips the boost is increased. There are some -ve cases such as Akiyo at 352x288 where the reduced boost hurts the metrics, especially for SSIM, even while the set as a whole improves. This is most notable at very low Q and may be the subject of a future patch. Some common code for KF and ARF was separated in this patch for the purposes of tuning but may later be re-merged if appropriate. Change-Id: Iaa15ac5a58d2be89181100d95cef6a8dc4b12d0d	2016-10-28 15:35:59 +01:00
Debargha Mukherjee	4f7a59c802	Merge "Force recode if framesize exceeds max allowed size"	2016-10-28 04:21:44 +00:00
Debargha Mukherjee	1cd987d922	Speed-up recode loop for extreme bitrate diffs Adjusts the q adjustement step depending on how far the projected and target rates differ. Change-Id: I498d03523ca233a270512ca3972c372daa4ca2a8	2016-10-27 11:08:44 -07:00
Debargha Mukherjee	54e03017b6	Force recode if framesize exceeds max allowed size Fixes a case where recode is not triggered based on the value of maxq passed into the recode loop test function. BUG=b/32375284 Change-Id: I15ad985d0525c68e0443cfaf842440d2754b2266	2016-10-27 09:52:51 -07:00
Paul Wilkins	de859676dd	Changes to KF boost calculation. Remove double counting of decay. Limit maximum KF boost. Change-Id: I0fb2344d0f78b5e95bb899dfad12b0ca84034b2c	2016-10-26 17:53:29 +01:00
paulwilkins	ccd6a8e2fa	Removal of a couple of two pass adjustments. Removed a couple of adjustments that no longer move the needle much but complicate the process of tuning. Change-Id: Ie320f5cf155e6aac14a4757ea9ada2cd59f27590	2016-10-26 17:52:37 +01:00
Yunqing Wang	c192def8f3	Change 2 motion search counts to be tile data This patch modified the motion search counts used in: https://chromium-review.googlesource.com/#/c/305640/ These 2 counts were originally added as thread data, and used to make decisions in motion search. The tile encoding order can be inconsistent while using different number of threads, which can cause bitstream mismatch. Here moved them to tile data to solve the issue. BUG=webm:1322 Change-Id: Iedc4477aef1746aa0a4f84d88a1156296fd3ba87	2016-10-25 10:12:41 -07:00
Vignesh Venkatasubramanian	9a032fa262	Merge "vp9_bitstream: Encode tiles in parallel"	2016-10-22 02:23:06 +00:00
Vignesh Venkatasubramanian	5deffa1175	vp9_bitstream: Encode tiles in parallel Re-use the tile worker threads to pack the bitstream in parallel on a per-tile basis. Restricting this to real-time only for now (further testing is needed to ensure this does not make 2-pass worse in any case). BUG=webm:1309 Change-Id: I8a80da7c5089b837d0df79a5c49d5e3022dfc8ec	2016-10-21 17:35:03 -07:00
Marco	ee1b3f34c0	vp9: Nonrd variance partition: increase threshold for using 4x4 avg. In variance partition low resolutions may use varianace based on 4x4 average for better partitioning. Increase the threshold for doing this at speed = 8. Improves speed by ~5%, with little loss, < 1%, on RTC_derf set. Change-Id: Ib5ec420832ccff887a06cb5e1d2c73199b093941	2016-10-21 11:51:06 -07:00
Marco	a7d116aa67	vp9: Speed=8 real-time: Keep the bias_golden feature on. Small/no change in metrics on RTC set, speed increase by 2-3%. Change-Id: Iee997bd7433e8e508216e9267b1c31c5a9aa5121	2016-10-20 17:03:51 -07:00
James Zern	7f31bfeddb	Revert "vp9_bitstream: Encode tiles in parallel" This reverts commit `9e8efa5b18`. this change causes ubsan warnings, failures in vpxenc_vp9_webm_rt_multithread_tiled BUG=webm:1309 Change-Id: I020c7be985c771bfff4b3de1afe51cc8edb980da	2016-10-18 22:47:48 -07:00
Marco Paniconi	f6980ca68e	Merge "vp9: Non-rd variance partition: add condition for 64x64 split."	2016-10-18 00:03:17 +00:00
Marco	55a2b67368	vp9: Non-rd variance partition: add condition for 64x64 split. Add stronger condition for splitting 64x64, for low noise content. This reduces dragging artifact near moving head. Little/no change in metrics on RTC set. Change-Id: I39b38cfd20f2ece53ff49c2aaf76ba9f82761be1	2016-10-17 12:54:27 -07:00
Vignesh Venkatasubramanian	9e8efa5b18	vp9_bitstream: Encode tiles in parallel Re-use the tile worker threads to pack the bitstream in parallel on a per-tile basis. Restricting this to real-time only for now (further testing is needed to ensure this does not make 2-pass worse in any case). BUG=webm:1309 Change-Id: Ia2c982da56697756e12f02643f589189b3271d98	2016-10-17 10:42:03 -07:00
Vignesh Venkatasubramanian	769292017b	vp9_bitstream: Parameterize interp_filter_selected Facilitates encoding tiles in parallel. BUG=webm:1309 Change-Id: I37aa336d47babffc8352188dc767eebdb8a99474	2016-10-12 20:22:03 -07:00
Vignesh Venkatasubramanian	04a6010742	Merge "vp9_bitstream: Parameterize max_mv_magnitude"	2016-10-12 21:52:42 +00:00
Vignesh Venkatasubramanian	d03d1c8cd3	vp9_bitstream: Parameterize max_mv_magnitude Facilitates encoding tiles in parallel. BUG=webm:1309 Change-Id: I614a5a492c30b6773c30e7294cd6a6f456e02ab4	2016-10-12 12:50:17 -07:00
Marco	57c6bf291e	1 pass vbr: Allow for lookahead alt-ref in real-time mode. For 1 pass vbr real-time mode: Allow for the usage of alt-ref frame when non-zero lag-in-frames is used. Use non-filtered alt-ref, and select usage based on fast scene/content analysis/detection within the lag of frames. Positive gains on ytlive set: overall avgPSNR ~3-4%. Several clips are up between 5-14%, a few clips are neutral/small change. Current speed decrease is about ~5-10%. Use the flag USE_ALTREF_FOR_ONE_PASS to enable this feature (off by default for now). Change-Id: I802d2bf3d44f9cf01f6d15c76be9c90192314769	2016-10-11 10:13:17 -07:00
Marco	cdbd89197e	vp9: 1 pass vbr: some adjustments to gf interval. Put limit on gf interval based on lag, and allow for the adjustment on next gf group also on key frame. Small/neutral change on ytlive metrics. Change only affects 1 pass vbr real-time mode. Change-Id: I339c8f4398848698b6e10fe9482c52ca661b94a5	2016-10-11 08:34:12 -07:00
Vignesh Venkatasubramanian	ed50e7710c	write_modes: add MACROBLOCKD as a parameter This will enable bit stream packing of each tile column in parallel. BUG=webm:1309 Change-Id: Ie349d8cc5825326218ffda893a50730b2e68ed34	2016-10-07 10:25:02 -07:00
Geza Lore	0dc12b4a1c	Fix warning when building with GCC 5. These caused the following warning with GCC 5: warning: logical not is only applied to the left hand side of comparison [-Wlogical-not-parentheses] assert(!is_compound == (cm->reference_mode == SINGLE_REFERENCE)); Change-Id: If296aabb2311ceb7d903b395c1549ef81c2cbf9b (cherry picked from commit `c6cf7a6111`)	2016-10-01 12:23:15 -07:00
Marco Paniconi	0a9f56f146	Merge "vp9: On change_config() only call update_frame_size if needed."	2016-09-30 21:43:33 +00:00
Marco Paniconi	5e908aff34	Merge "vp9 real-time mode: Change loopfilter speed feature at speed 8."	2016-09-30 21:42:05 +00:00
Yunqing Wang	9afe2cf599	Merge "Fix an issue in vp9_first_pass for non-mulitple of 16 resolutions"	2016-09-30 00:49:06 +00:00
Deepa K G	2745f94deb	Fix an issue in vp9_first_pass for non-mulitple of 16 resolutions This patch sets the 16x16 src_diff to zero and ensures correct calculation of this_error for block sizes smaller than 16x16. Change-Id: I7b7c02d267433c9f22c8ac9b8d5df2f499175172	2016-09-29 16:19:23 -07:00
Marco	e765435293	vp9: On change_config() only call update_frame_size if needed. change_config() may be called often in real-time application, to update bitrate/framerate or qp-max/min. No need to do update_frame_size() unless frame size has changed. Change-Id: I23a51deade1e03adc91c468f9ffde3235298770c	2016-09-29 13:03:26 -07:00
Marco	d017548be6	vp9 real-time mode: Change loopfilter speed feature at speed 8. For real-time mode at speed 8: turn off MINIMAL_LF at speed 8, for non-screen content mode. Visually better, avgPSNR/SSIM on rtc set go up by ~4-5%. Speed decrease of about ~3%. Change-Id: I8eb69330f02e0ceece1507d43cfc8a049a1d8291	2016-09-29 12:59:01 -07:00
Paul Wilkins	b3ebea5e8a	Merge "Limit max arf boost and scale motion breakout for image size."	2016-09-27 14:08:29 +00:00
Marco	d9fc28c0a1	vp9: Reduce frame loopfilter-level for 1 pass cbr. Reduce the filt_guess for 1 pass cbr on inter-frames. This reduces visual artifact seen in rtc clip (jimred.vga), and improves metrics on rtc set. Metrics on rtc set for cbr mode overall positive, most clips are up: Speed 7 rtc: avgPSNR/SSIM up by: ~2.6/3.9% Speed 8 rtc: avgPSNR/SSIM up by: ~1.3/2.5% Change-Id: Ia4eccea1c19d65b583516df28823cd756c49464f	2016-09-26 10:12:43 -07:00
paulwilkins	0421d8e318	Limit max arf boost and scale motion breakout for image size. Added a cap on the maximum boost for an arf based on interval length. Fixed bug where by the image size was not accounted for in determining two of the motion breakout thresholds. Overall small gains of 0.2-0.4% psnr but on large image format clips with slow zooms the gain may be as much as 20% or more (e.g. in_to_tree at 1080P) Change-Id: Id0a47391203026742daa9c97afac5705fd8c4dfb	2016-09-26 15:38:29 +01:00
Nathan E. Egge	de7f5ce9e5	Code class0 using vpx_read() / vpx_write(). The vp9_mv_class0_tree is a balanced tree with two leafs and can simply be coded as a boolean with probability class0[0]. Change-Id: If294dac825a5f945371092c74aa8e3f84cd962b6 (cherry picked from commit be8a8ab62ebdd111c6f2e9a33b15630570671eba)	2016-09-19 10:50:39 -07:00
Alex Converse	01e2902521	Zero the whole rd_counts struct rather than the each member Change-Id: I495aa9cec2b2b8f1ae69bdab8b3feeca76358472	2016-09-19 10:04:47 -07:00
James Zern	7a9e476072	Merge changes from topic 'clang-format' * changes: apply clang-format .clang-format: update to 3.8.1	2016-09-16 07:11:33 +00:00
Marco	4c1a9fb8db	vp9: Small code cleanup. Remove the experiment LIMIT_QP_ONEPASS_VBR_LAG, as its not currently used and no plan to use in near future. Change-Id: Ib069f8d7225195be04b765d0ab477510dfba6a3b	2016-09-15 15:17:17 -07:00
clang-format	5f6d143b41	apply clang-format Change-Id: I501597b7c1e0f0c7ae2aea3ee8073f0a641b3487	2016-09-15 15:07:53 -07:00
Sarah Parker	c892521b1d	Fix missing write to opsnr in internal stats Change-Id: I21c8ad0b5ed7f8d843cae45c18f5727bceb8f859	2016-09-03 12:15:32 -07:00
Yaowu Xu	594e53514b	Merge "Fix formatting in internal stats for vp8 and vp9"	2016-09-01 23:55:23 +00:00
Yaowu Xu	454139ae13	Merge "Casts to remove some warnings."	2016-09-01 23:37:04 +00:00
Debargha Mukherjee	a6bc3dfb0f	Merge "Refactor uv tx size with lookup arrays"	2016-09-01 16:46:32 +00:00
paulwilkins	3e9e77008c	Casts to remove some warnings. Added casts to remove warnings: BUG=webm:1274 In regards to the safety of these casts they are of two types:- - Normalized bits per (16x16) MB stored in a 32 bit int (This is safe as bits per MB even with << 9 normalization cant overflow 32 bits. Even raw 12 bits hdr source even would only be 29 bits :- (4+4+12+9) and the encoder imposes much stricter limits than this on max bit rate. - Cast as part of variance calculations. There is an internal cast up to 64 bit for the Sum X Sum calculation, but after normalization dividing by the number of points the result will always be <= the SSE value. Change-Id: I4e700236ed83d6b2b1955e92e84c3b1978b9eaa0	2016-09-01 16:10:12 +01:00
Debargha Mukherjee	e6446b4b60	Refactor uv tx size with lookup arrays Change-Id: Ife6a3d301c5faaba89d16d188d638631083511f7	2016-08-31 13:15:38 -07:00
paulwilkins	6fc07a217d	Modified resize loop constraints. Using a tighter resize constraint on undershoot seems to help results (especially SSIM) as significant undershoot on a frame seems to have more of a damaging impact than overshoot. This patch has been tuned so that in local testing using the derf set it is encode speed neutral for speed setting 2. Average quality result for speed 2 (psnr,ssim) were as follows:- lowres 0.039, 0.453 midres 0.249, 0.853 hdres 0.159, 0.659 NetFlix -0.241, 0.360 Change-Id: Ie8d3a0d7d6f7ea89d9965d1821be17f8bda85062	2016-08-31 12:45:49 +01:00
Paul Wilkins	129814fcb4	Merge "Adjust coefficient optimization and tx_domain rd speed features."	2016-08-30 16:54:40 +00:00
James Zern	2917737879	vp9_alt_ref_aq_set_nsegments: harmonize fn signature Change-Id: I5f232664652a8dc3a71e43b8b1fa05ddb4a84ecc	2016-08-27 11:16:03 -07:00
Yury Gitman	507d272265	Move vp9_alt_ref_aq_private.h to vp9_alt_ref_aq.c + add a temporary dummy element to ALT_REF_AQ to avoid a warning about an empty struct Change-Id: Ib6e5c39ff62ad96eb4e3686d4882228a42b3843f	2016-08-27 10:53:41 -07:00
Jingning Han	dd2a475e43	Merge "Fix VS build warnings in vp9_alt_ref_aq files"	2016-08-26 17:19:12 +00:00
Paul Wilkins	badd32d914	Merge "Add ALLOW_RECODE_FIRST speed mode."	2016-08-26 15:46:45 +00:00
Jingning Han	84fccfe475	Fix VS build warnings in vp9_alt_ref_aq files Change-Id: I5b19ec00a1eb8b148026f665d217c12eb50b614a	2016-08-26 08:43:36 -07:00
paulwilkins	dc42f343ae	Add ALLOW_RECODE_FIRST speed mode. This patch is to address concerns that changes to allow recodes on the first frame in each ARF group do not give a good enough speed quality trade off for speed 2. Though the average impact on encode speed is 1-2%, for some hard clips it is > 5% rise. For speed 1 this is less an issue and for Speed 0 the previous patch actually improves speed. Change-Id: Ie1bcefdbfdf846d3f4428590173f621465dffe3a	2016-08-26 11:43:47 +01:00
Sarah Parker	37e83789f1	Fix formatting in internal stats for vp8 and vp9 This corrects a formatting error introduced in: I1e9d548ce445d29002f0c59ebfd3957a6f15e702 where spaces were used as delimiters instead of tabs. The corresponding fix for vp10 is in Ica3d625d6672b3c47e0e208b45eede29b9004030. Change-Id: Ibc4eb8fd82e6b926ba259a679dc98557cadba9b1	2016-08-25 17:46:18 -07:00
Yury Gitman	292d221fed	Create interface for the ALT_REF_AQ class Current commit is just an API template for the rest of the code, and I will add inner logic later. Altref frames generate a lot of bitrate and at the same time other frames refer to them a lot, so it makes sense to apply special compensation-based adaptive quantization scheme for altref frames. E.g., for blocks that are good predictors for the future apply rate-control chosen quantizer while for bad predictors apply worse one. Change-Id: Iba3f8ec349470673b7249f6a125f6859336a47c8	2016-08-25 10:55:14 -07:00
paulwilkins	635ae8bdc1	Adjust coefficient optimization and tx_domain rd speed features. Previously Tx domain rd was used in all cases above speed 0. Coefficient optimization was only enabled for best and speed 0. This patch selectively sets these features at other speed settings based on block complexity. For the Netflix and HD sets in particular the quality gains are large compared to the speed hit. At speed 1 the average psnr gain in the NF set is > 2.5% with one clip coming in at 18% and some points almost 30%. Average gains for the lower resolution test sets are around 1%. The gains are biggest at low Q so some further optimization may be possible. Change-Id: I340376c7b2a78e5389a34b7ebdc41072808d0576	2016-08-25 15:36:16 +01:00
Yury Gitman	d7c20079a6	Add --alt-ref-aq=<int> option In the future this option will activate adaptive quantization special for altref frames. Encoder will create the adaptive quantization map on the basis of lookahead buffers similarity which is the estimate of the future motion compensation performance. Change-Id: Ia0088b3babb0f9a4899c79d8d819947ba5a03df2	2016-08-24 15:49:25 -07:00
jackychen	8d4c0ec1f1	vp9: Refactor set_low_temp_var_flag. No need to pass in force_split, since we should use sb_type in the condition. Change-Id: Ide27243ef46e017bbb98d676347fc566a6c828f7	2016-08-23 15:11:40 -07:00
Yunqing Wang	ef98f49cb0	Disable split mode in 4k video encoding Disabled the split mode while encoding 4k video to speed up the encoder. Borg test result on 4k set: Overall PSNR: +0.029%; SSIM: +0.009%. Average encoder speedup at speed 2 is 2.5%. Change-Id: I1519c658f07c3ac838affbe5aff0ed9b94f3f8f4	2016-08-22 19:46:44 -07:00
Yunqing Wang	37169c0bd4	Merge "Adjust speed features for 4k video encoding"	2016-08-19 23:11:05 +00:00
Yunqing Wang	fe488cceff	Adjust speed features for 4k video encoding Adjusted speed 2 features to speed up 4k video encoding. BDBR results from borg test: PSNR: +0.313%; SSIM: +0.268%. Average speedup: 8.5% Change-Id: I1e2695a01fb3f3817c1df4480e184c2aed8f2eba	2016-08-19 09:30:32 -07:00
James Zern	149d082377	vp9_pickmode: quiet float conversion warnings Change-Id: I591e4f958955b3f2edb2f95a83c54cd83c8ef075	2016-08-19 01:28:01 -07:00
JackyChen	8be7e572a7	vp9 svc: SVC encoder speed up. Bias towards base_mv and skip 1/4 pixel motion search when using base mv. 2~3% speed up for 2 spatial layers, 3~5% speed up for 3 spatial layers. PSNR loss: (2 layers) 0.07dB for gips_stationary, 0.04dB for gips_motion; (3 layers) 0.07dB for gips_stationary, 0.06dB for gips_motion. Change-Id: I773acbda080c301cabe8cd259f842bcc5b8bc999	2016-08-18 11:25:45 -07:00
Marco	7eb7d6b227	vp9 non-rd pickmode: Add limit on newmv-last and golden bias. Add option, for newmv-last, to limit the rd-threshold update for early exit, under a source varianace condition. This can improve visual quality in low texture moving areas, like forehead/faces. Also add bias against golden to improve the speed/fps, will little/negligible loss in quality. Only affects CBR mode, non-svc, non-screen-content. Change-Id: I3a5229eee860c71499a6fd464c450b167b07534d	2016-08-17 14:33:44 -07:00
paulwilkins	af3b0de732	Add casting to fix warning. Frame bits can safely be stored int but group bits (kf or arf) use 64bit. Change-Id: I0800f2a28070f8749110a95721c116fc56987885	2016-08-17 11:18:07 +01:00
paulwilkins	ab7cd6d068	Add {} to try and keep Jenkins happy. Change-Id: If1ca3cf83e058317c9751d7da6caa7cd75eb6845	2016-08-17 11:17:36 +01:00
paulwilkins	5d881770e5	Change default recode rule for good speed 0 and best. Changes the default recode rule for Speed 0 and best quality from ALLOW_RECODE to ALLOW_RECODE_KFARFGF. Tested on the NF, hdres, midres and lowres test sets, this setting when combined with patch I40cb559... now performs "as well" in metrics terms (in fact it came out a tiny amount better overall) but encode time is 9.6% faster (measured as the average from 27 mid rate local encodes on clips in the derf/lowres set. Change-Id: I8c781c0cdfa3a9929cd9406d15582fce47d6ae3b	2016-08-15 10:52:54 +01:00
paulwilkins	de3b769524	Change to recode rules. Allow recodes for the first inter frame in each arf group even when the recode rule is set to ALLOW_RECODE_KFARFGF. Small gains of 0.05%. Change-Id: I40cb559d36a2bf0ebf5cf758c3f92e452b480577	2016-08-15 10:52:02 +01:00
Paul Wilkins	fe4dd4f43f	Merge "Modified ARF group allocation."	2016-08-15 09:42:30 +00:00
Yunqing Wang	a413dbe594	Fix another motion vector out of range bug This patch fixed a motion vector out of range bug: vpxenc: ../libvpx/vp9/encoder/vp9_mcomp.c:69: mv_cost: Assertion `mv->col >= -((1 << (11 + 1 + 2)) - 1) && mv->col < ((1 << (11 + 1 + 2)) - 1)' failed. For blocks that returned without having full-pixel search, the original MV limits were not restored, which caused the failure. Moved the set MV limit function down to fix the bug. Change-Id: Id7d798fc7214e95c6e4846c588f0233fcf1a4223	2016-08-12 09:27:58 -07:00
paulwilkins	656f4a88cf	Modified ARF group allocation. Small average gains in the range 0.05 - 0.1 Change-Id: I30e85c04be615cc84726427c5057388b20a6ff60	2016-08-10 14:22:01 -07:00
Alex Converse	941fe20336	Merge "Refactor mv limits."	2016-08-09 17:12:50 +00:00
Yury Gitman	c37d012ada	Merge "Add cpi parameter for forcing segmentation update"	2016-08-08 21:29:42 +00:00
Yury Gitman	7a730d5901	Add cpi parameter for forcing segmentation update Change-Id: I1b0bcb1ffe7604117bfaa0b9989d0e25ff04d28c	2016-08-08 13:20:42 -07:00
Alex Converse	6554333b59	Refactor mv limits. Change-Id: Ifebdc9ef37850508eb4b8e572fd0f6026ab04987	2016-08-08 11:54:00 -07:00
Yunqing Wang	6a8d4631a8	Merge "Fix a motion vector out of range bug"	2016-08-08 17:59:50 +00:00
James Zern	19d2e73dea	Merge changes Ice037acb,I806af11b,I344a7dd0,Ib7cb87fa * changes: vp9: normalize vpx_enc_frame_flags_t usage args.c: add some explicit casts webmdec: quiet -Wshorten-64-to-32 warning test/decode_test_driver: rm unused deadline member	2016-08-06 01:20:52 +00:00
Yunqing Wang	2fb826c4d5	Fix a motion vector out of range bug This patch fixed a motion vector(MV) out of range bug, which was caused by not restoring the original values of the MV min/max thresholds after the sub8x8 full pixel motion search. It occurred rarely and only was seen while encoding a 4k clip for 200 frames. BUG=webm:1271 Change-Id: Ibc4e0de80846f297431923cef8a0c80fe8dcc6a5	2016-08-05 15:23:05 -07:00
James Zern	7104833085	vp9: normalize vpx_enc_frame_flags_t usage quiets -Wshorten-64-to-32 warnings Change-Id: Ice037acb675d1d81bfedf2dfcfa91a8a29a19dfd	2016-08-04 23:37:49 -07:00
clang-format	3a4002b94d	vp9_ratectrl.c: apply clang-format after: `ff0a87c` vp9 1pass vbr: Adjustment to gf interval. Change-Id: I1296e53e601bf0c2b562e3a34082ac45c294a5f1	2016-08-04 11:57:00 -07:00
Marco Paniconi	9fdeeaf411	Merge "vp9 1pass vbr: Adjustment to gf interval."	2016-08-04 17:50:55 +00:00
Yaowu Xu	7a79fa1362	Fix msvc compiler warnings MSVC 2013 complained about using 32 shift where 64 bit shift should be used. Change-Id: I7a2b165d1a92d3c0a91dd4511b27aba7709b5e55	2016-08-03 18:33:06 -07:00
Marco	ff0a87ce38	vp9 1pass vbr: Adjustment to gf interval. Increase the minimum distance. Reduces the overshoot somewhat on some clips, small gain in avgPSNR (~0.1%) on ytlive set. Change-Id: Id5ddde20c2907dbdb536e79542eff775019c142b	2016-08-03 15:36:27 -07:00
clang-format	e0cc52db3f	vp9/encoder: apply clang-format Change-Id: I45d9fb4013f50766b24363a86365e8063e8954c2	2016-08-02 16:47:11 -07:00
Yaowu Xu	039f9e08f0	change HBD pixel value from uint8_t to uint16_t This fixes a regression in 10/12 bit encoding results. Change-Id: I438877352a41aae0a864a8d9979afe4aa2061d81	2016-08-02 11:01:39 -07:00
Yaowu Xu	dc5618f3bb	Add pointer conversion for HBD buffers This fixes a crash in HBD build. Change-Id: I7f688f50227323e69bba65df0d56f4360f01771b	2016-08-01 15:56:43 -07:00
Alex Converse	004eebed31	Merge "Unfork 8-bit in HBD path in vp9_model_rd_from_var_lapndz callers."	2016-08-01 16:42:39 +00:00
Alex Converse	2c3807b89f	Merge "Cache optimizations in optimize_b()."	2016-08-01 16:30:05 +00:00
Alex Converse	e446ffda45	Cache optimizations in optimize_b(). Move best index into the token state. Shrink it down to one byte. This is more cache friendly (access are group together) and uses less total memory. Results in 4% fewer cycles in optimize_b(). Change-Id: I75db484fb3dc82f59928d54b659d79c80ee40452	2016-07-29 12:06:49 -07:00
Jacky Chen	462a7c9f0a	Merge "vp9 svc: Enable different speed setting for each spatial layer."	2016-07-28 20:21:30 +00:00
Alex Converse	4508eb3123	Merge "Fix 64 to 32 narrowing warning."	2016-07-28 16:36:46 +00:00
Alex Converse	335cf67d8b	Fix 64 to 32 narrowing warning. - Solves potential integer overflow on 12-bit - Fixes Visual Studio build Change-Id: I26dd660451bbab23040e4123920d59e82585795c	2016-07-27 12:40:23 -07:00
JackyChen	47cc64cdf8	vp9 denoiser: Derefencing pointer should be after null check. BUG=webm:1267 Change-Id: I899fc9e8d784c6eefcbe27945c619845adb7b6f0	2016-07-26 17:31:17 -07:00
Alex Converse	34201e50c1	Unfork 8-bit in HBD path in vp9_model_rd_from_var_lapndz callers. BUG=b/29583530 Change-Id: Ia88a75f9572e08f228559ab84b8a77efb5aff0af	2016-07-26 21:57:58 +00:00
jackychen	8ce67d714a	vp9 svc: Enable different speed setting for each spatial layer. This change only affects 1 pass cbr svc mode. Change-Id: If0da87bb200f7e7762755340c40c8157cc7a16ca	2016-07-25 15:11:43 -07:00
Alex Converse	d6c5ef4557	Only consider visible 4x4s in pixel domain error. BDRATE change derf144: -0.327 lowres: -0.048 midres: -0.125 hdres: -0.238 Change-Id: I789aba9870b5c2952373a7dd4fc8ed45590c3c54	2016-07-25 21:44:06 +00:00
Alex Converse	511bf49b7e	Merge "Minor skip segment simplification."	2016-07-25 17:50:43 +00:00
Scott LaVarnway	ad5fea03e6	Merge "VP9: get_pred_context_switchable_interp() -- encoder side"	2016-07-25 11:58:24 +00:00
Alex Converse	9a62ecbd35	Minor skip segment simplification. Change-Id: I34863fce1abe94f9539e9a5a6149ae1efb6501bd	2016-07-22 15:31:18 -07:00
Marco Paniconi	53db633349	Merge "vp9 1pass-vbr: Adjust gf setting for nonzero-lag case."	2016-07-22 21:27:05 +00:00
Marco	c06a4b9df2	vp9 1pass-vbr: Adjust gf setting for nonzero-lag case. Change-Id: I230c586c6d5ae56ee9a6d37b7d9452351bb4bd80	2016-07-22 11:48:09 -07:00
Paul Wilkins	830fa866a5	Merge "Sample points to reduce encode overhead."	2016-07-22 09:27:34 +00:00
Paul Wilkins	063e4a2914	Merge "Noise energy Experiment in first pass."	2016-07-22 09:27:19 +00:00
Scott LaVarnway	c969b2b02b	VP9: get_pred_context_switchable_interp() -- encoder side Change-Id: I7217c90d5cf38c51b76759a2dc4f10070f3a40ac	2016-07-21 11:47:51 -07:00
jackychen	71f9cbcfc8	vp9: Fix the clang warning of unsigned int type. Change-Id: I6308db16bd626fa5943925471e9171f567669350	2016-07-20 15:58:35 -07:00
Yaowu Xu	690fcd793b	Change to call vp9_post_proc_frame() This commit changes the call in vp9 encoder from vp9_deblock() to vp9_post_proc_frame() to ensure the data structures used in the call are properly allocated. This fixes an encoder crash when configured with --enable-internal-stats. Change-Id: I2393b336c0f566665336df4f1ba91c405eb56764	2016-07-20 11:01:49 -07:00
James Zern	e3f7991f99	Merge changes Ia6004c08,I1954f9d6 * changes: cosmetics: Add a few explanatory comments cosmetics: Correct grammar/spelling in comments	2016-07-19 19:12:23 +00:00
Marco	05fe0f20a6	vp9: Allow usage of lookahead for real-time, 1 pass vbr. Allow usage of lookahead for VBR in real-time mode, for 1 pass vbr. Current usage is for fast checking of future scene cuts/changes, and adjusting rate control (gf interval and active_worst/target size). Added unittests (datarate) for 1 pass vbr mode, with non-zero lag. Added an experimental option to limit QP based on lookahead. Overall positive gain in metrics on ytlive set: avgPNSR/SSIM up on average ~1-3%; several clips up by 5, 7%. Change-Id: I960d57dfc89de121c4824b9a9bf88d2814e74b56	2016-07-18 15:20:17 -07:00
Yury Gitman	bdfdd7d993	cosmetics: Correct grammar/spelling in comments Change-Id: I1954f9d6e33abff9081fe7a5cf59d5497768e0df	2016-07-18 12:49:00 -07:00
hui su	248f6ad771	Revert "Eliminate isolated and small tail coefficients:" This reverts commit `ff19cdafdb`. Change-Id: I81f68870ca27a1ff683ee22090530b6997815fb2	2016-07-13 11:14:44 -07:00
Jingning Han	fed14a3e94	Merge "Disable trellis optimization when lossless is on"	2016-07-13 16:01:01 +00:00
Jacky Chen	19c157afe2	Merge "vp9 svc: Reuse scaled_temp in two stage downscaling."	2016-07-12 17:59:09 +00:00
JackyChen	110a2ddc9b	vp9 svc: Reuse scaled_temp in two stage downscaling. This change eliminates redundant computation in the two stage downscaling, which saves ~1% encoding time in 3-layer svc encoding. Change-Id: Ib4b218811b68499a740af1f9b7b5a5445e28d671	2016-07-12 10:09:55 -07:00
Jingning Han	efccbc9fb5	Disable trellis optimization when lossless is on Disable trellis coefficient optimization when the lossless mode is turned on. Change-Id: I9001bf626e86dc3c8c32331ede04fd39036e5f7c	2016-07-12 09:00:16 -07:00
Jim Bankoski	88e6951465	deblock filter : moved from vp8 code branch The deblocking filters used in vp8 have been moved to vpx_dsp for use by both vp8 and vp9. Change-Id: I5209d76edafc894b550f751fc76d3aa6799b392d	2016-07-12 05:53:00 -07:00
Scott LaVarnway	2e93fcf893	Merge "vp9_rd_pick_intra_mode_sb(): set interp_filter to"	2016-07-11 22:31:06 +00:00
paulwilkins	3a986eac57	Sample points to reduce encode overhead. Only noise filter sampled points in first pass to reduce any first pass speed overhead. Change-Id: Ic80d4400e59146d1c3332336c4350faf28ff8b17	2016-07-11 11:45:52 +01:00
Scott LaVarnway	ed7786869a	vp9_rd_pick_intra_mode_sb(): set interp_filter to SWITCHABLE_FILTERS. This is a partial fix for the build issues with Change 357240. Change-Id: I4e507c196175bae729a4f1397878ec8776b0146c	2016-07-09 09:47:34 -07:00
Yaowu Xu	5adb43b8be	Fix non-highbitdepth coding path for HBD build Change-Id: I38eb42b8d051924a7cd1ccc3421a4057cf6e170f	2016-07-08 11:26:34 -07:00
Marco Paniconi	20946cdd3b	Merge "vp9: Adjustment of gfu_boost and af_ratio for 1 pass vbr."	2016-07-08 16:26:06 +00:00
Yaowu Xu	dc008cc17d	Merge "Enable HBD support in real time encoding path"	2016-07-07 22:32:48 +00:00
Marco	cc431ad50a	vp9: Adjustment of gfu_boost and af_ratio for 1 pass vbr. Modify the gfu_boost and af_ratio setting based on the average frame motion level. Change only affects 1 pass vbr. Metrics overall positive on ytlive set. On average up by ~1%, several clips up by 2-4%. Change-Id: Ic18c49eb2df74cb4986b63cdb11be36d86ab5e8d	2016-07-07 15:18:14 -07:00
Marco Paniconi	a75965fa94	Merge "vp9: Adjustment to mv bias for non-rd pickmode."	2016-07-07 21:07:37 +00:00
Jingning Han	2f28f9072e	Enable coeff optimization for intra modes This further improves the coding performance by lowres 0.3% midres 0.5% hdres 0.6% Change-Id: I6a03b6da210b9cbc261474bad4a103e0ba021c68	2016-07-07 12:25:41 -07:00
Jingning Han	44354ee7bf	Use precise context to estimate coeff rate cost Use the precise context to estimate the zero token cost in trellis optimization process. This improves the speed 0 coding performance by 0.15% for lowres and 0.1% for midres. It improves the speed 1 coding performance by 0.2% for midres and hdres. Change-Id: I59c7c08702fc79dc4f8534b64ca594da909e2c91	2016-07-07 12:25:33 -07:00
Jingning Han	62aa642d71	Enable uniform quantization with trellis optimization in speed 0 This commit allows the inter prediction residual to use uniform quantization followed by trellis coefficient optimization in speed 0. It improves the coding performance by lowres 0.79% midres 1.07% hdres 1.44% Change-Id: I46ef8cfe042a4ccc7a0055515012cd6cbf5c9619	2016-07-07 12:25:33 -07:00
Jingning Han	541eb78994	Refactor coeff_cost() function Move the operations that update the context buffers outside this function. The coeff_cost() takes all input as const value and returns the coefficient cost. This makes preparation for the next coefficient optimization CLs. Change-Id: I850eec6e5470b91ea84646ff26b9231b09f70a0c	2016-07-07 18:09:39 +00:00
Jingning Han	7c1fdf02cd	Merge "Support measure distortion in the pixel domain"	2016-07-07 18:09:20 +00:00
Marco	f451b404ea	vp9: Adjustment to mv bias for non-rd pickmode. Replace the existing mv bias with a bias only for NEWMV, and based on the motion vector difference of its top/left neighbors. For cbr non-screen-content mode. Change-Id: I8a8cf56347cfa23e9ffd8ead69eec8746c8f9e09	2016-07-07 10:33:06 -07:00
paulwilkins	2580e7d63e	Noise energy Experiment in first pass. Use a measure of noise energy to adjust Q estimate and arf filter strength. Gains 0.3-0.5% on Lowres and \|Netflix sets. Hdres and Midres neutral. Change-Id: Ic0de552e7b6763e70eeeaa3651619831b423e151	2016-07-07 14:50:21 +01:00
Paul Wilkins	f037cf80c9	Merge "Add experimental spatial de-noise filter on key frames."	2016-07-07 13:30:07 +00:00
Jingning Han	e357b9efe0	Support measure distortion in the pixel domain Use pixel domain distortion metric in speed 0. This improves the compression performance by 0.3% for both low and high resolution test sets. Change-Id: I5b5b7115960de73f0b5e5d0c69db305e490e6f1d	2016-07-06 18:25:17 -07:00
Yaowu Xu	884c2ddc48	Enable HBD support in real time encoding path BUG=webm:1223 Change-Id: If83a613784e3b2a33c9c93f9ad0ba39dd4d23056	2016-07-06 14:18:37 -07:00
Jacky Chen	aa6108382e	Merge "vp9: Choose the scheme for modeling rd for 32x32 based on skin color."	2016-07-06 20:03:55 +00:00
JackyChen	2678aefc48	vp9: Choose the scheme for modeling rd for 32x32 based on skin color. For real time CBR mode, use model_rd_for_sb_y for 32x32 if the sb is a skin sb to avoid visual regression on the slowly moving face. Refer to the cl: https://chromium-review.googlesource.com/#/c/356020/ Change-Id: I42c36666b2b474ce5ee274239d52ae8ab400fd46	2016-07-06 11:12:03 -07:00
Min Ye	ff19cdafdb	Eliminate isolated and small tail coefficients: Improve hdres PSNR by 0.696% Improve midres PSNR by 0.313% Improve lowres PSNR by 0.142% Change-Id: Icabde78aa9689f539f6a03ec09f712c20758796c	2016-07-06 11:08:23 -07:00
Jingning Han	51aad61c8c	Merge "Remove txfrm_block_to_raster_xy() from vp9 encoder"	2016-07-06 16:00:18 +00:00
Jingning Han	14011f037d	Remove txfrm_block_to_raster_xy() from vp9 encoder The transform block row and column positions are always available outside the callees. There is no need to re-compute these values again. This approach has been used by the decoder. This commit removes txfrm_block_to_raster_xy() function. Change-Id: I5b90f91a0d8b7c35cfa7d171da9edf8202630108	2016-07-04 18:41:47 -07:00
Paul Wilkins	1d3f1983b2	Merge "Fix error in get_ul_intra_threshold() for 10/12 bit."	2016-06-30 16:26:14 +00:00
Paul Wilkins	f7c2d2a3de	Merge "Fix error in get_smooth_intra_threshold() for 10/12 bit."	2016-06-30 16:25:55 +00:00
paulwilkins	e25d6252a4	Fix error in get_ul_intra_threshold() for 10/12 bit. The scaling of the threshold for 10 and 12 bit here appears to be in the wrong direction. For 10 and 12 bit we expect sse values to be higher and hence the threshold used should be scaled up not down. Change-Id: I2678116652b539aef48100e0f22873edd4f5a786	2016-06-30 13:38:57 +01:00
paulwilkins	f9a3d08f1b	Fix error in get_smooth_intra_threshold() for 10/12 bit. This function seems to scale the threshold for testing an SSE value in the wrong direction for 10 and 12 bit inputs. Also for a true SSE the scalings should probably be << 4 and 8 Change-Id: Iba8047b3f70d04aa46d9688a824f3d49c1c58e90	2016-06-30 13:34:11 +01:00
Jacky Chen	e85607410e	Merge "vp9: Change the scheme for modeling rd for 32x32 on newmv_last mode."	2016-06-30 05:59:46 +00:00
JackyChen	5fc2d6cb9f	vp9: Change the scheme for modeling rd for 32x32 on newmv_last mode. For real time CBR mode, use model_rd_for_sb_y for 32x32 if the mode is newmv last, which is less aggressive in skipping transform and quantization, to avoid quality regression in some conditions. Change-Id: Ifa30be587f2a8a4a7f182a172de6ce277c0f8556	2016-06-29 16:28:15 -07:00
James Zern	3a6a81fc9a	Merge changes I9433d858,Iafd05637,If08ce6ca * changes: tests: remove redundant round() definition remove visual studio < 2010 workarounds configure: remove old visual studio support (<2010)	2016-06-29 23:07:16 +00:00
paulwilkins	be013eb396	Add experimental spatial de-noise filter on key frames. For forced key frames in particular this helps to make them blend better with the surrounding frames where noise tends to be suppressed by a combination of quantization and alt ref filtering. Currently disabled by default under and IFDEF flag pending wider testing. Change-Id: I971b5cc2b2a4b9e1f11fe06c67ef073f01b25056	2016-06-29 17:25:41 +01:00
Scott LaVarnway	74bb78df82	Merge "VP9: handle_inter_mode()... Use interp_filter"	2016-06-29 11:41:52 +00:00
James Zern	c125f4a594	remove visual studio < 2010 workarounds BUG=b/29583530 Change-Id: Iafd05637eb65f4da54a9c857e79204a77646858a	2016-06-28 20:58:49 -07:00
Scott LaVarnway	feb7e9a372	VP9: handle_inter_mode()... Use interp_filter only if above/left is inter. Change-Id: I0cc1f926425c021c84536df8271e9ee5f3f87caf	2016-06-28 14:09:59 -07:00
Jacky Chen	d004c64013	Merge "vp9: Increase thr_var for 32x32 blocks in var-based partitioning."	2016-06-28 20:54:06 +00:00
Jacky Chen	4736e5f9d1	Merge "vp9: Move chroma sensitivity check out from choose_partitioning."	2016-06-28 20:53:23 +00:00
jackychen	91038e0eb6	vp9: Move chroma sensitivity check out from choose_partitioning. Change-Id: Ie78185a30cac4d1841be3708bd23e6505d3733b6	2016-06-28 09:58:51 -07:00
jackychen	8cbd4f8701	vp9: Increase thr_var for 32x32 blocks in var-based partitioning. For real-time mode, increase variance threshold for 32x32 blocks in var-based partitioning for resolution >= 720p, so that it is more likely to stay at 32x32 for high resolution which accelerates the encoding speed with little/no PSNR drop. PSNR effect on different speed settings: speed 8 rtc: 0.02 overall PSNR drop, 0.285% SSIM drop speed 7 rtc: 0.196% overall PSNR increase, 0.066% SSIM increase speed 5 rtc_derf: no effect. Speed up: gips_motion_WHD, 1mbps: 2.5% faster on speed 7, 2.6% faster on speed8 gips_stat_WHD, 1mbps: 4.6% faster on speed 7, 5.6% faster on speed8 Change-Id: Ie7c33c4d2dd7d09294917e031357fc5476c3a4bb	2016-06-27 14:44:27 -07:00
Yaowu Xu	7676defca9	Merge "Port metric computation changes from nextgenv2"	2016-06-27 19:18:00 +00:00
Yaowu Xu	b9ec759bc2	Fix ubsan warnings: vp9/encoder/vp9_pickmode.c This commit fixes a number of integer out of range issue in HBD build. BUG=webm:1219 Change-Id: Ib4192dc74a500e1b86c37a399114c7f6d4ed5185	2016-06-27 05:53:46 +00:00
James Zern	913081ab02	Merge "s/UINT32_MAX/UINT_MAX/"	2016-06-25 21:09:55 +00:00
James Zern	ca88d22f39	s/UINT32_MAX/UINT_MAX/ provides better toolchain compatibility Change-Id: I8561a6de668a68ff54fe3886a4ee6300f0ae9c04	2016-06-25 12:15:51 -07:00
James Zern	1c0a9f36f1	vp9_pickmode: revert rd modeling change for hbd Avoids a segfault in high-bitdepth builds. This restores the condition to its state prior to: `7991241` vp9: Change the scheme for modeling rd for bsize 32x32. BUG=webm:1250 Change-Id: I6183d5b34cb89dfbf27b7bb589812148a72cd7de	2016-06-25 11:40:26 -07:00
Jacky Chen	168eea5d60	Merge "vp9: Change the scheme for modeling rd for bsize 32x32."	2016-06-25 00:43:40 +00:00
Jacky Chen	723e357ead	Merge "vp9: Code clean, move low temp var logic out of choose_partitioning."	2016-06-24 22:00:49 +00:00
James Zern	b34705f64f	Merge "cosmetics: Beautify whitespaces and line wrapping"	2016-06-24 21:51:01 +00:00
James Zern	efad6feb9a	Merge "cosmetics: Change few types to their posix version"	2016-06-24 21:50:45 +00:00
James Zern	9e5f355daf	Merge "cosmetics: Make few conditions clearer"	2016-06-24 21:50:32 +00:00
Yaowu Xu	003a9d20ad	Port metric computation changes from nextgenv2 Change-Id: I4aceffcdf7af59ffeb51984f0345c3a4c7e76a9f	2016-06-24 13:52:50 -07:00
jackychen	dd07443f72	vp9: Code clean, move low temp var logic out of choose_partitioning. Change-Id: I7093e74131e0964471c9993c1e972b4617c4731d	2016-06-24 13:38:22 -07:00
jackychen	7991241a50	vp9: Change the scheme for modeling rd for bsize 32x32. For real-time CBR mode, use model_rd_for_sb_y_large instead of model_rd_for_sb_y for 32x32 block. In the former model, transform might be skipped more aggressively in some condtions, which speeds up encoding time with only a little PSNR/SSIM drop on rtc test set. No obvious visual quality regression. PSNR effect on different speed settings: speed 8 rtc: 0.129% overall PSNR drop, 0.137% SSIM drop speed 7 rtc: 0.135% overall PSNR drop, 0.062% SSIM drop speed 5 rtc_derf: 0.105% overall PSNR drop, 0.095% SSIM drop Speed up: gips_motion_WHD, 1mbps: 3.29% faster on speed 7, 2.56% faster on speed8 gips_stat_WHD, 1mbps: 2.17% faster on speed 7, 1.62% faster on speed8 BUG=webm:1250 Change-Id: I818babce5b8549b4b1a7c3978df8591bffde7173	2016-06-24 12:09:13 -07:00
Yury Gitman	67611119b5	cosmetics: Beautify whitespaces and line wrapping Change-Id: I9afa02cae671bd3527cf344695e53d0cc767f549	2016-06-24 10:18:06 -07:00
Yury Gitman	3b2e2f2f77	cosmetics: Change few types to their posix version Change-Id: I6d7bc9ed7396e7b0d63ee97bfa473fdea002f9ee	2016-06-24 10:18:06 -07:00
Yury Gitman	79436fadfb	cosmetics: Make few conditions clearer Change-Id: Ib024b3e42efc7ce1af56824a4644fdefcd45b215	2016-06-24 10:17:51 -07:00
Yaowu Xu	7ed1d54ab4	Merge "Revert "vp9: Change the scheme for modeling rd for bsize 32x32.""	2016-06-24 16:05:55 +00:00
Yaowu Xu	26daa30da4	Merge "Rationalize type to avoid integer out of range"	2016-06-24 13:58:36 +00:00
Yaowu Xu	7738bcb350	Rationalize type to avoid integer out of range BUG=webm:1250 Change-Id: Id5bb2762ca1bf996ba4f9a60eec977a7994c1d94	2016-06-24 13:58:02 +00:00

... 3 4 5 6 7 ...

6548 Commits