generic-library/vpx

Author	SHA1	Message	Date
Ranjit Kumar Tulabandu	71061e9332	Row based multi-threading of encoding stage (Yunqing Wang) This patch implements the row-based multi-threading within tiles in the encoding pass, and substantially speeds up the multi-threaded encoder in VP9. Speed tests at speed 1 on STDHD(using 4 tiles) set show that the average speedups of the encoding pass(second pass in the 2-pass encoding) is 7% while using 2 threads, 16% while using 4 threads, 85% while using 8 threads, and 116% while using 16 threads. Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de	2017-02-15 00:49:34 +00:00
Marco	22dcfa80aa	vp9: Non-rd mode: use simple block_yrd for 8 bit high bitdepth builds Temporary fix until optimization work for block_yrd is completed. This essentially reverts back to the state before the change: https://chromium-review.googlesource.com/c/433821/ Compression loss is about ~5-6% on RTC set. Speed-up (from using this simple/model-based block_yrd) over the low bitdepth builds (which uses more complex block_yrd) is ~5% on 720p. Change-Id: Ie0af9eb0d111e5595f587870c44f08317403b8d8	2017-02-10 10:15:35 -08:00
Marco	1a5482d4d8	vp9: Denoiser speed-up: increase partition and ac skip thresholds. Add factor to increase varianace partition and ac skip thresholds, under certain conditions (noise level and sum_diff), to increase denoiser speed. Change-Id: I7671140ef3598bf5f114a72623d68792bcd7b77b	2017-02-07 10:33:13 -08:00
Jerome Jiang	aa327a1ed4	vp9: speed 8: Tune threshold of ac skip and partitioning. Threshold for partitioning only affects VGA and lower res. 0.07% quality regression is observed in borg tests on rtc_derf and 0.2% regression on rtc. 5.6% speed up for low res and 6.8% for VGA on Nexus 6. Change-Id: If85a2919b48c991de66059c90f32ed06980452be	2017-02-06 16:27:53 -08:00
Yunqing Wang	770c6663d6	Merge "Changes to facilitate row based multi-threading of ARNR filtering"	2017-02-01 22:04:15 +00:00
Ranjit Kumar Tulabandu	359a6796da	Changes to facilitate row based multi-threading of ARNR filtering Change-Id: I2fd72af00afbbeb903e4fe364611abcc148f2fbb	2017-02-01 13:03:52 -08:00
Jingning Han	969957f9f2	Fix real-time compression regression in hbd mode This commit resolves the compression performance regression in real-time encoding setting when high bit-depth mode is enabled. The current solution temporarily disables the SIMD implementations of vpx_satd, hadamard8x8, and hadamard16x16 in high bit-depth mode. The commit makes the coding results bit-wise identical between regular coding pipeline and high bit-depth at profile 0. BUG=webm:1365 Change-Id: Icfb900821733749685370460a1a5a7e07f76f4bf	2017-01-31 23:17:09 -08:00
Marco	d47f257484	vp9: Modify bsize condition for using model_rd_large for speed 7. In non-rd pickmode: Allow speed 7 to also use larger block size in model_rd. Small change in behavior for speed 7. Change-Id: I8c5523e424308e8f0bc71b3f6324dec42a464cc8	2017-01-30 11:16:51 -08:00
Marco	b16c77cdc4	vp9: Modify bsize condition for using model_rd_large. In non-rd pickmode: small change in behavior for speed 6 and 7. Remove condition on HIGHBITDEPTH flag. Change-Id: I360a13fcc313d72612fe9b918162ef4bb278cdea	2017-01-26 22:45:27 -08:00
Marco	f38ed0c560	vp9: Non-rd pickmode: fix to add ARF mode entries to THR_MODES. BUG=webm:1359 Change-Id: Ie0c66efa2e19d1ec9c744d14e3fa8f1e6214cdd6	2017-01-23 10:56:29 -08:00
Marco	219cdab676	vp9: Add feature to use block source_sad for realtime mode. Only for speed >= 7, and affects skipping of intra modes. Threshold is set low for now, needs to be tuned. Small/no difference in metrics on rtc clips. Change-Id: If9bdbd43f08d1f80407cdd2e9e5e96780dcd2424	2017-01-20 11:57:02 -08:00
Marco	0f9760ab6f	vp9: Modify usage of force_skip under low temporal variance in non-rd pickmode. For short_circuit set to level 1, skip newmv for 64x64 blocks if the low temporal variance flag is set. Also modify threshold for 64x64 split in variance partitioning. Overall speed-up on noisy clips of 2-4%. Only affect speed >= 7. Change-Id: I384b3772007e84de6f8707e480d2ddf1fe1f907d	2017-01-19 11:21:15 -08:00
Marco	7e3a82c384	vp9: Make the denoiser work with spatial SVC. If enabled denoiser will only denoise the top spatial layer for now. Added unittest for SVC with denoising. Change-Id: Ifa373771c4ecfa208615eb163cc38f1c22c6664b	2017-01-10 17:23:58 -08:00
Marco	e7c453b613	vp9: 1 pass vbr: Skip find_predictors in pickmode when source is altref. When source frame is altref, we only do zero-mv mode, so we can skip the find_predictors(). No change in compression. Small speed gain, ~1%. Only affects 1 pass vbr with lookhead altref, for ytlive with the macro flag USE_ALTREF_FOR_ONE_PASS on. Change-Id: I9318c5da8521f017bf54919cd652438b3a6313d1	2016-12-21 12:12:55 -08:00
Marco	61b569b461	vp9 denoiser: Fix the logic for re-evaluating zeromv after denoising. Correctly set interp_filter to SWITCHABLE for INTRA mode. Also reduce threshold on noise level for re-evaluating zeromv. Change-Id: Id32c01e193209fb380aa07204f0be3babf29f70a	2016-12-19 09:30:16 -08:00
Marco	4260a7f2b3	vp9: Change condition to enable recheck_zeromv_after_denoising. For when denoising enabled: change condition to enable the recheck_zeromv_after_denoising for only very high noise level. This is causing an issue, so enabling it for very high noise to effectively shut it off. Change-Id: Ic40d6025f3f398338cedd270d17c0ccd9a3daa84	2016-12-16 15:00:21 -08:00
Marco	b6597745f9	vp9: Use more aggressive skip when short_circuit_low_temp_var = 1. Use the same feature as https://chromium-review.googlesource.com/#/c/411327/, but allow it to be used for speed = 6 and 7, where short_circuit_low_temp_var = 1. Speed up of ~2-3% for speed 7, with little/no loss in compression. Change-Id: I263a0f261ad9929034392d68f0153dc6376fdb5f	2016-11-22 14:54:28 -08:00
Jerome Jiang	360217a233	vp9: Speed 8: More aggresive golden skip for low res. Add a new, more aggresive short circuit: short_circuit_low_temp_var = 3 to skip golden of any mode when variance is lower than threshold for low res. This change only affects speed = 8, low resolution. Metrics for avgPSNR/SSIM on rtc_derf (low resolution) show loss of 0.27/0.31%. On Nexus 6, the encoding time is reduced by ~2.3% on average across all low-res clips. Visually little change on rtc_derf clips. Change-Id: Ia8f7366fc2d49181a96733a380b4dbd7390246ec	2016-11-15 13:56:27 -08:00
Marco	da9f762e24	vp9: Non-rd pickmode: fix logic in reference masking. Add condition that usable_ref_frame > LAST. This is to avoid potentially skipping all last-nonzero mv modes, if golden is used as a reference but skipped completely for the current block. This has no effect currenty, as we always consider testing golden mode for each block. Change-Id: I3182cf44664081935a90ed43aa7b32e710e60e22	2016-11-03 10:32:57 -07:00
Marco	57c6bf291e	1 pass vbr: Allow for lookahead alt-ref in real-time mode. For 1 pass vbr real-time mode: Allow for the usage of alt-ref frame when non-zero lag-in-frames is used. Use non-filtered alt-ref, and select usage based on fast scene/content analysis/detection within the lag of frames. Positive gains on ytlive set: overall avgPSNR ~3-4%. Several clips are up between 5-14%, a few clips are neutral/small change. Current speed decrease is about ~5-10%. Use the flag USE_ALTREF_FOR_ONE_PASS to enable this feature (off by default for now). Change-Id: I802d2bf3d44f9cf01f6d15c76be9c90192314769	2016-10-11 10:13:17 -07:00
clang-format	5f6d143b41	apply clang-format Change-Id: I501597b7c1e0f0c7ae2aea3ee8073f0a641b3487	2016-09-15 15:07:53 -07:00
paulwilkins	3e9e77008c	Casts to remove some warnings. Added casts to remove warnings: BUG=webm:1274 In regards to the safety of these casts they are of two types:- - Normalized bits per (16x16) MB stored in a 32 bit int (This is safe as bits per MB even with << 9 normalization cant overflow 32 bits. Even raw 12 bits hdr source even would only be 29 bits :- (4+4+12+9) and the encoder imposes much stricter limits than this on max bit rate. - Cast as part of variance calculations. There is an internal cast up to 64 bit for the Sum X Sum calculation, but after normalization dividing by the number of points the result will always be <= the SSE value. Change-Id: I4e700236ed83d6b2b1955e92e84c3b1978b9eaa0	2016-09-01 16:10:12 +01:00
James Zern	149d082377	vp9_pickmode: quiet float conversion warnings Change-Id: I591e4f958955b3f2edb2f95a83c54cd83c8ef075	2016-08-19 01:28:01 -07:00
JackyChen	8be7e572a7	vp9 svc: SVC encoder speed up. Bias towards base_mv and skip 1/4 pixel motion search when using base mv. 2~3% speed up for 2 spatial layers, 3~5% speed up for 3 spatial layers. PSNR loss: (2 layers) 0.07dB for gips_stationary, 0.04dB for gips_motion; (3 layers) 0.07dB for gips_stationary, 0.06dB for gips_motion. Change-Id: I773acbda080c301cabe8cd259f842bcc5b8bc999	2016-08-18 11:25:45 -07:00
Marco	7eb7d6b227	vp9 non-rd pickmode: Add limit on newmv-last and golden bias. Add option, for newmv-last, to limit the rd-threshold update for early exit, under a source varianace condition. This can improve visual quality in low texture moving areas, like forehead/faces. Also add bias against golden to improve the speed/fps, will little/negligible loss in quality. Only affects CBR mode, non-svc, non-screen-content. Change-Id: I3a5229eee860c71499a6fd464c450b167b07534d	2016-08-17 14:33:44 -07:00
Alex Converse	6554333b59	Refactor mv limits. Change-Id: Ifebdc9ef37850508eb4b8e572fd0f6026ab04987	2016-08-08 11:54:00 -07:00
clang-format	e0cc52db3f	vp9/encoder: apply clang-format Change-Id: I45d9fb4013f50766b24363a86365e8063e8954c2	2016-08-02 16:47:11 -07:00
Alex Converse	34201e50c1	Unfork 8-bit in HBD path in vp9_model_rd_from_var_lapndz callers. BUG=b/29583530 Change-Id: Ia88a75f9572e08f228559ab84b8a77efb5aff0af	2016-07-26 21:57:58 +00:00
Scott LaVarnway	c969b2b02b	VP9: get_pred_context_switchable_interp() -- encoder side Change-Id: I7217c90d5cf38c51b76759a2dc4f10070f3a40ac	2016-07-21 11:47:51 -07:00
Yaowu Xu	5adb43b8be	Fix non-highbitdepth coding path for HBD build Change-Id: I38eb42b8d051924a7cd1ccc3421a4057cf6e170f	2016-07-08 11:26:34 -07:00
Yaowu Xu	dc008cc17d	Merge "Enable HBD support in real time encoding path"	2016-07-07 22:32:48 +00:00
Marco	f451b404ea	vp9: Adjustment to mv bias for non-rd pickmode. Replace the existing mv bias with a bias only for NEWMV, and based on the motion vector difference of its top/left neighbors. For cbr non-screen-content mode. Change-Id: I8a8cf56347cfa23e9ffd8ead69eec8746c8f9e09	2016-07-07 10:33:06 -07:00
Yaowu Xu	884c2ddc48	Enable HBD support in real time encoding path BUG=webm:1223 Change-Id: If83a613784e3b2a33c9c93f9ad0ba39dd4d23056	2016-07-06 14:18:37 -07:00
JackyChen	2678aefc48	vp9: Choose the scheme for modeling rd for 32x32 based on skin color. For real time CBR mode, use model_rd_for_sb_y for 32x32 if the sb is a skin sb to avoid visual regression on the slowly moving face. Refer to the cl: https://chromium-review.googlesource.com/#/c/356020/ Change-Id: I42c36666b2b474ce5ee274239d52ae8ab400fd46	2016-07-06 11:12:03 -07:00
Jingning Han	51aad61c8c	Merge "Remove txfrm_block_to_raster_xy() from vp9 encoder"	2016-07-06 16:00:18 +00:00
Jingning Han	14011f037d	Remove txfrm_block_to_raster_xy() from vp9 encoder The transform block row and column positions are always available outside the callees. There is no need to re-compute these values again. This approach has been used by the decoder. This commit removes txfrm_block_to_raster_xy() function. Change-Id: I5b90f91a0d8b7c35cfa7d171da9edf8202630108	2016-07-04 18:41:47 -07:00
JackyChen	5fc2d6cb9f	vp9: Change the scheme for modeling rd for 32x32 on newmv_last mode. For real time CBR mode, use model_rd_for_sb_y for 32x32 if the mode is newmv last, which is less aggressive in skipping transform and quantization, to avoid quality regression in some conditions. Change-Id: Ifa30be587f2a8a4a7f182a172de6ce277c0f8556	2016-06-29 16:28:15 -07:00
Yaowu Xu	b9ec759bc2	Fix ubsan warnings: vp9/encoder/vp9_pickmode.c This commit fixes a number of integer out of range issue in HBD build. BUG=webm:1219 Change-Id: Ib4192dc74a500e1b86c37a399114c7f6d4ed5185	2016-06-27 05:53:46 +00:00
James Zern	1c0a9f36f1	vp9_pickmode: revert rd modeling change for hbd Avoids a segfault in high-bitdepth builds. This restores the condition to its state prior to: 7991241 vp9: Change the scheme for modeling rd for bsize 32x32. BUG=webm:1250 Change-Id: I6183d5b34cb89dfbf27b7bb589812148a72cd7de	2016-06-25 11:40:26 -07:00
jackychen	7991241a50	vp9: Change the scheme for modeling rd for bsize 32x32. For real-time CBR mode, use model_rd_for_sb_y_large instead of model_rd_for_sb_y for 32x32 block. In the former model, transform might be skipped more aggressively in some condtions, which speeds up encoding time with only a little PSNR/SSIM drop on rtc test set. No obvious visual quality regression. PSNR effect on different speed settings: speed 8 rtc: 0.129% overall PSNR drop, 0.137% SSIM drop speed 7 rtc: 0.135% overall PSNR drop, 0.062% SSIM drop speed 5 rtc_derf: 0.105% overall PSNR drop, 0.095% SSIM drop Speed up: gips_motion_WHD, 1mbps: 3.29% faster on speed 7, 2.56% faster on speed8 gips_stat_WHD, 1mbps: 2.17% faster on speed 7, 1.62% faster on speed8 BUG=webm:1250 Change-Id: I818babce5b8549b4b1a7c3978df8591bffde7173	2016-06-24 12:09:13 -07:00
James Zern	d4596485be	Revert "vp9: Change the scheme for modeling rd for bsize 32x32." This reverts commit 5c29ee726e622baf6f950dfe980df0d4c116c1a8. Causes segfaults in VP9/EndToEndTestLarge.EndtoEndPSNRTest. BUG=webm:1250 Change-Id: I8a30e97be30589abdb76820b5c3c37c46cd6cafb	2016-06-23 15:59:25 -07:00
Angie Chiang	424982bc41	Merge "set interp_filter to SWITCHABLE_FILTER for intra block"	2016-06-23 18:56:27 +00:00
Angie Chiang	d9c417cb49	set interp_filter to SWITCHABLE_FILTER for intra block In vp9_pick_inter_mode(), instead of using vp9_get_pred_context_switchable_interp(xd) to assign filter_ref, we use a less strict condition on assigning filter_ref. This is to reduce the probabily of entering the flow of not assigning filter_ref and then skipping filter search. Overall PSNR gain 0.074% for rtc dataset Details: Low Mid High 0.185% -0.008% -0.082% Change-Id: Id5c5ab38d3766c213d5681e17b4d1afd1529e676	2016-06-22 17:19:43 -07:00
Jacky Chen	8496390e73	Merge "vp9: Change the scheme for modeling rd for bsize 32x32."	2016-06-22 23:50:46 +00:00
jackychen	5c29ee726e	vp9: Change the scheme for modeling rd for bsize 32x32. For real-time CBR mode, use model_rd_for_sb_y_large instead of model_rd_for_sb_y for 32x32 block. In the former model, transform might be skipped more aggressively in some condtions, which speeds up encoding time with only a little PSNR/SSIM drop on rtc test set. No obvious visual quality regression. PSNR effect on different speed setting: speed 8 rtc: 0.129% overall PSNR drop, 0.137% SSIM drop speed 7 rtc: 0.135% overall PSNR drop, 0.062% SSIM drop speed 5 rtc_derf: 0.105% overall PSNR drop, 0.095% SSIM drop Speed up: gips_motion_WHD, 1mbps: 3.29% faster on speed 7, 2.56% faster on speed8 gips_stat_WHD, 1mbps: 2.17% faster on speed 7, 1.62% faster on speed8 Change-Id: I902f62def225ea01c145d7e5a93497398b8f5edf	2016-06-22 11:17:56 -07:00
Yaowu Xu	87bf1a149c	Fix ubsan warnings: vp9/encoder/vp9_mcomp.c This commit fixes a number of ubsan warnings in HBD build. BUG=webm:1219 Change-Id: I05f0fd0ef50e93db4ba34205005c54af1ed32acc	2016-06-21 15:37:59 -07:00
James Zern	cf2bfcd9d8	Merge "vp9_pickmode: quiet int64->int conversion warning"	2016-06-17 03:58:46 +00:00
Marco Paniconi	8870756ac6	Merge "vp9: Adjustments to nonrd-pickmode for vbr"	2016-06-16 16:07:06 +00:00
James Zern	ff8edd3509	vp9_pickmode: quiet int64->int conversion warning since: 7901a05 VP9: block_yrd(): Use RD_COST * instead Change-Id: I808be7a7bc96b81eb7d6255ae96cea7e9b1ecf1f	2016-06-15 23:34:09 -07:00
Marco	8e070558b0	vp9: Adjustments to nonrd-pickmode for vbr For VBR: (1) allow newmv mode for golden ref to select interpolation filter (as in last ref case), and (2) don't use the more aggressive tx-skip testing logic for large blocks. Only affects 1 pass real-time vbr mode (speed >= 5). PSNR/SSIM metrics on ytlive set are all positive, ~0.5-2% gain. Change-Id: I0ffbb0a9755563a5acd6230c58236e4f19a47266	2016-06-15 14:31:28 -07:00

1 2 3 4 5 ...

440 Commits