generic-library/vpx

Author	SHA1	Message	Date
Ranjit Kumar Tulabandu	bf15ca1091	Fix for out of range motion vector bug in sub-pel motion estimation BUG=webm:1397 (yunqingwang) To verify that this patch wouldn't cause much performance change, the Borg tests were run. Here was the result: avg_psnr overall_psnr ssim hdres: -0.002 0.006 0.013 midres: 0 0 0 lowres: 0 0 0 Change-Id: Iae395ae7b741e0513cf5bab9dcace110b792a67d	2017-04-03 16:16:49 +00:00
Yunqing Wang	002cf38837	Merge "Enhance the row mt sync read to accept the sync_range greater than 1"	2017-04-03 15:59:51 +00:00
Yunqing Wang	f1600db3e4	Enhance the row mt sync read to accept the sync_range greater than 1 The row mt sync read uses sync_range = 1, and wouldn't work if we want to use a sync_range that is greater than 1. To make it work, this sync read code is modified. Pass in col instead of col - 1 to make it consistent with other row mt code in VP9, and then add 1 in "while" codition. Change-Id: I4a0e487190ac5d47b8216368da12d80fec779c1a	2017-03-31 10:48:38 -07:00
Marco	c824eda6cc	vp9: SVC: Fix issue with artifact for svc-denoising. Issue/bug happens for denoising with spatial layers, where the golden (spatial) reference is used in pickmode, but denoising is only done wrt to last (temporal). Fix is to make sure set_ref_ptrs is set before build predictors in denoiser. Change-Id: I793cf441341edf7c4a88b8ab1e1b22b3cb0eb508	2017-03-31 10:05:32 -07:00
Marco	fc83fcb7c4	vp9: SVC: fix to allow output of denoised result. Change-Id: Iaf55cfb5e9621d074eb33d6a32f184e4777968f8	2017-03-29 14:02:54 -07:00
Marco	32b3d2f174	vp9: 1 pass SVC: Modify condition for intra-mode search. Temporary override to condition for disallowing intra-search in SVC, since golden (spatial) reference is currently suppressed due to artifact issue. Change-Id: I28ed7fdddc9fcdbcc0a4175a247a3ecc94c11767	2017-03-29 09:24:50 -07:00
Marco	0169a985d9	vp9: Speed >= 8: avoid chrome check under some condition. For non-rd variance partition, avoid the chrome check unless y_sad is below some threshold. Small decrease in avgPSNR (~0.3) on RTC set. Small/negligible decrease on RTC_derf. Change-Id: I7af44235af514058ccf9a4f10bb737da9d720866	2017-03-27 13:18:21 -07:00
Marco	66c6b4d6fc	vp9: 1 pass: Move source sad computation into encodeframe loop. Refactor to split the 1 passs source sad computation into scene detection (currently used for VBR and screen-content mode), and superblock based source sad computation (used in non-rd CBR mode). This allows the source sad computation for CBR mode to be multi-threaded. No change in compression. Change-Id: I112f2918613ccbd37c1771d852606d3af18c1388	2017-03-27 11:11:05 -07:00
Marco	07ad5a15c2	vp9: Fix to condition on using source_sad for 1 pass real-time. Make the source_sad feature work properly for cases of VBR or screen_content with SVC. Added unittest for SVC with screen-content on. Change-Id: Iba5254fd8833fb11da521e00cc1317ec81d3f89b	2017-03-24 10:21:47 -07:00
Alex Converse	d7b220b467	Merge changes Ie989e60c,Ifc110b12 * changes: Backport "Optimize the use case of token_cost table" to VP9 Drop vp9_get_token_extracost	2017-03-23 18:05:13 +00:00
Marco	4863e07c01	vp9: Non-rd partition: avoid unneeded call to chrome_check Since y_sad is not computed yet (on the early exit due to source_sad), no need to check for setting color_sensitiviy. Only affects speed >=8. No change in behavior. Change-Id: I3a6f2d20fed38d8b8ec51b75bcacf9a21f2db916	2017-03-22 22:40:28 -07:00
James Zern	f16ea6a6eb	Merge "vp9_rdopt: correct size to vpx_sum_squares_2d_i16"	2017-03-23 00:53:22 +00:00
Marco Paniconi	ff0e0a76e8	Merge "vp9: Adjust some speed settings for speed 8."	2017-03-22 22:56:17 +00:00
Marco	4d50991320	vp9: Adjust some speed settings for speed 8. Allow for simple_block_rd for VGA resoln, and reduce adaptive_rd_thresh to 1. On average no loss on RTC set, ~4% speedup on mac. Change-Id: Ib549c4061c853776062b5e34040f839d470fbebc	2017-03-22 15:16:15 -07:00
Jerome Jiang	dcd6c87b80	Merge "vp9: Enable adaptive_rd_threshold for row mt for realtime speed 8."	2017-03-22 22:02:24 +00:00
James Zern	5661cd8ff4	vp9_rdopt: correct size to vpx_sum_squares_2d_i16 the current implementations expect pixel size, not the block type BUG=webm:1392 Change-Id: Ib91e9f30a1f56e13566b1fb76f089dae9bb50cdc	2017-03-22 12:04:33 -07:00
Johann	36d732c22b	vp9 temporal filter: add const to function prototype The input frames are not modified. Change-Id: Ideb810e3c5afeb4dbdc4c7d54024c43a8129ad39	2017-03-22 18:14:21 +00:00
Jerome Jiang	20c2892693	vp9: Enable adaptive_rd_threshold for row mt for realtime speed 8. Change it to row based array to avoid the slow down cause by sync. row-mt on, speed 8, 2 threads: ~4% speedup for VGA on ARM benefited from adaptive_rd_threshold. Change-Id: I887e65a53af20a6c4f48d293daaee09dab3512cf	2017-03-21 18:49:47 -07:00
Jerome Jiang	dbed479d79	Fix the data race caused by vp9 denoiser. BUG=webm:1391 Change-Id: I9669ae62fe9c695d4c6f9973094cb0f39bed51c7	2017-03-21 15:46:25 -07:00
Yunqing Wang	1935dfb294	Code refactoring in the partition search Computed the partition search early termination score in a separate function. Change-Id: I1894b517ff179a38b1c05e054d373ac4b7f4cbb4	2017-03-21 10:00:44 -07:00
Marco Paniconi	05c7259525	Merge "vp9: Nonrd variance partition: improve split to 16x16."	2017-03-21 00:17:35 +00:00
Yunqing Wang	bf43b4c4b4	Merge "Record the sum of tx block eobs in the partition block"	2017-03-20 23:20:12 +00:00
Marco	3135b85423	vp9: Nonrd variance partition: improve split to 16x16. Add additional condition to split to 16x16, for resolutions <= 360p, reduces dragging artifact near moving boundary. Small/no change on RTC metrics. Change-Id: I314694f2166435d918f74e7ab42f002b07f40dae	2017-03-20 15:44:46 -07:00
Marco	06c8713e89	vp9: Use sb content measure to bias against golden. For each superblock, keep track of how far from current frame was the last significant content change, and use that (along with GF distance), to turnoff GF search in non-rd pickmode. Only enabled for speed >= 8. avgPNSR on RTC/RTC_derf down by ~0.9/1.2. Speedup on mac: ~3-5%. Speedup on arm: 3.6% for VGA and 4.4% for HD. Change-Id: Ic3f3d6a2af650aca6ba0064d2b1db8d48c035ac7	2017-03-20 12:42:26 -07:00
Yunqing Wang	9c2552a1c1	Record the sum of tx block eobs in the partition block The sum of tx bloxk eobs is needed in the machine learning based partition early termination. The eobs are first accumulated during tx search, and then the value associated with the best tx_size is copied to ctx for later use. After the sum of eobs are calculated correctly, re-enabled ml_partition_search_early_termination speed feature. Re-did the quality/speed test to check the impact of the fix. 1. Borg test BDRATE result: 4k set: PSNR: +0.183%; SSIM: +0.100%; hdres set: PSNR: +0.168%; SSIM: +0.256%; midres set: PSNR: +0.186%; SSIM: +0.326%; 2.Average speed gain result: 4k clips: 21%; hd clips: 26%; midres clips: 15%. The result is in line with the original result. Change-Id: I4209a95c89be03b4cbfb6a95b16885f89feddbda	2017-03-20 17:12:15 +00:00
Jingning Han	ca9bedd538	Backport "Optimize the use case of token_cost table" to VP9 cherry picked from nextgenv2 90ea281f29df747282e56d3068a3ddbdde30cdd0 Change-Id: Ie989e60c6479ac3251cadaac9c7e795ccba52f4e	2017-03-17 16:54:22 -07:00
Alex Converse	ab71181545	Drop vp9_get_token_extracost vp9_get_token_cost does the same thing with one fewer lookup. Change-Id: Ifc110b12403cb1a04a3f91357ab435c67b4815d6	2017-03-17 16:53:09 -07:00
Alex Converse	0842daa24e	Merge "vp9_optimize_b: Combine extrabits cost with token lookup"	2017-03-17 16:18:21 +00:00
Marco	02975a604c	vp9: Fix speed 8 condition for enabling copy_partition. Change-Id: I2c090e6ba853a30fef1957b620853315f9471753	2017-03-16 17:08:37 -07:00
Alex Converse	3a6ec9ea72	vp9_optimize_b: Combine extrabits cost with token lookup About 0.6% fewer cycles spent in vp9_optimize_b. Change-Id: I2ae62a78374c594ed81d4e3100a5848e2f6f2c4e	2017-03-16 17:03:22 -07:00
Gabriel Marin	976ddb61d3	Add a vector form of routine vp9_model_rd_from_var_lapndz Add routine vp9_model_rd_from_var_lapndz_vec and call it from model_rd_for_sb to model the rate and distortion for MAX_MB_PLANE Laplacian sources in parallel. The caller ensures that all sources have non-zero variance. Measured a 18% to 25% reduction in retired instructions, and 17% to 24% reduction in instruction execution cost with different compilers for the Laplacian modeling. No change in behavior. TEST=Verified that encoded files match bit for bit, with and without this change. BUG=b/33678225 Change-Id: I6b76947f21c659a349adb896e13e99f6e3f951e6	2017-03-16 22:19:44 +00:00
Marco	bc7d4935bb	vp9: Fixes in non-rd pickmode for denoising with SVC. Don't denoise spatial layer frames whose base layer is a key frame. Disallow golden reference for SVC with denoising on frames that will be denoised (highest layer), as this removes bad artifact. Will re-enable when issue is resolved. Change-Id: I87a6597812330500966458172acfce54af65f70f	2017-03-16 12:59:41 -07:00
Jerome Jiang	bf40776aa4	Merge "Refactor: Change cpi->resize_state to enum values."	2017-03-16 16:43:42 +00:00
Marco Paniconi	cd47c1942e	Merge "vp9: Fix some issues with denoiser and SVC."	2017-03-16 02:42:55 +00:00
Marco	a340c64a79	vp9: Fix some issues with denoiser and SVC. Fix the update of the denoiser buffer when the base spatial layer is a key frame. And allow for better/lower QP on high spatial layers when their base layer is key frame. Change-Id: I96b2426f1eaa43b8b8d4c31a68b0c6d68c3024a2	2017-03-15 17:19:17 -07:00
Jerome Jiang	b5f7f7737a	Refactor: Change cpi->resize_state to enum values. Change-Id: Iab1409b0fc1175bc5a14afc4749a08c536c98c41	2017-03-15 17:16:17 -07:00
Marco	2c8430e223	vp9: Turn off ml_partition_search_early_termination. Fails on nightly ubsan, valgrind tests. Enabled on commit:6701014 Change-Id: Ied3f5cb38e39cba54ac134f4514107cdfdfce159	2017-03-15 15:00:38 -07:00
Jerome Jiang	27d5a57072	Merge "vp9: Using source sad for speedup for dynamic resizing."	2017-03-15 00:03:52 +00:00
Jerome Jiang	2fa7092808	Merge "vp9: Enable row multithreading for SVC in real-time mode."	2017-03-14 23:29:46 +00:00
Jerome Jiang	02463273c9	vp9: Using source sad for speedup for dynamic resizing. Only for speed >= 7. Change-Id: I3ac85fbb4023cf7e6f8333806b345b0174382a09	2017-03-14 15:47:19 -07:00
James Zern	1b91f41935	Merge "vp9/encoder: fix segfault on win32 using vs < 2015"	2017-03-14 19:21:42 +00:00
Yunqing Wang	c3e290963d	Merge "Apply machine learning-based early termination in VP9 partition search"	2017-03-14 18:07:05 +00:00
Marco Paniconi	78a6946904	Merge "vp9: Speed >= 8: Enable simple_block_yrd speed feature."	2017-03-14 17:50:17 +00:00
Marco	c0c789ab50	vp9: Adjust copy partition threshold, for speed 8. Reduce it from 5 to 4, small/no change in metrics or speed. Small reduction in dragging artifact near moving head. Change-Id: Ic3bc5ca67c70bf0c89fc2ed14454840a28ae5b6a	2017-03-14 09:18:53 -07:00
Marco	c216c8d6f2	vp9: Speed >= 8: Enable simple_block_yrd speed feature. Enable speed feature for resolutions > VGA. avgPSNR on RTC down by ~1.7%. Speedup on ARM: ~5%. Change-Id: I7a3fe5f7425aa8df3f4a2eced1afa355bc0d4c95	2017-03-14 09:10:28 -07:00
Marco	f0a22b23fe	vp9: Fix to source_sad feature for SVC. Allow speed feature sf->use_source_sad to be used on highest spatial layer for SVC. Change-Id: I260eb0478902764f49f83e43b17024fe86ff3b22	2017-03-13 11:00:40 -07:00
Yunqing Wang	670101439f	Apply machine learning-based early termination in VP9 partition search This patch was based on Yang Xian's intern project code. Further modifications were done. 1. Moved machine-learning related parameters into the context structure. 2. Corrected the calculation of sum_eobs. 3. Removed unused parameters and calculations. 4. Made it work with multiple tiles. 5. Added a speed feature for the machine-learning based partition search early termination. 6. Re-organized the code. The patch was rebased to the top-of-tree. Borg test BDRATE result: 4k set: PSNR: +0.144%; SSIM: +0.043%; hdres set: PSNR: +0.149%; SSIM: +0.269%; midres set: PSNR: +0.127%; SSIM: +0.257%; Average speed gain result: 4k clips: 22%; hd clips: 23%; midres clips: 15%. Change-Id: I0220e93a8277e6a7ea4b2c34b605966e3b1584ac	2017-03-13 09:54:18 -07:00
Marco	8c18df7fcd	vp9: Fix condition for intra search in non-rd pickmode. Fixes an issue when the LAST and golden is not used as a reference, in which case its possible no encoding mode is set (since intra may be skipped under certain codtions). Fix is to make sure intra is searched if no inter mode is checked. Issue can happen for temporal layer pattern#7 in vpx_temporal_svc_encoder.c Change-Id: I5ab4999b2f9dbd739044888e0916b5ec491d966b	2017-03-12 22:30:39 -07:00
James Zern	c09b290cea	vp9/encoder: fix segfault on win32 using vs < 2015 shift the bsse[] member of the macroblock struct to the front to avoid an incorrect offset (0) to the upper half of bsse[0] which leads to a negative resulting in a crash. restrict this to visual studio versions before 2015 (the bug was observed with 2013, fixed in 2015) to avoid any potential cache impact on other platforms. https://connect.microsoft.com/VisualStudio/feedback/details/2396360/bad-structure-offset-in-32-bit-code BUG=webm:1054 Change-Id: I40f68a1d421ccc503cc712192263bab4f7dde076	2017-03-10 17:37:17 -08:00
Marco	ffb3c50da1	vp9: Enable row multithreading for SVC in real-time mode. Enable row-mt for SVC for real-time mode, speed >=5. Add the controls to the sample encoders, but keep it off for now. Add the control and enable it for the 1 pass CBR unittests. For speed 7, 3 layer SVC, 2 threads, row-mt enabled gives about ~5% speedup. Change-Id: Ie8e77323c17263e3e7a7b9858aec12a3a93ec0c1	2017-03-10 01:01:07 +00:00

... 4 5 6 7 8 ...

6746 Commits