generic-library/vpx

Author	SHA1	Message	Date
Yunqing Wang	bfd0f41f9b	Force the bit exactness in the first pass Originally, for the purpose of keeping a fast first pass, the first-pass stats between row_mt_mode = 0 and row_mt_mode = 1 are not bit exact, but that difference is very small that doesn't cause a mismatch between the final bitstreams. However, if the encoder changes, this minor difference may cause a mismatch. Thus, this patch always forces the first pass to be bit exact. BUG=webm:1453 Change-Id: I2b67cf529dee81f660f9d9e7fe9a60ea3c7b12b8	2017-08-02 15:58:39 -07:00
Jerome Jiang	f027908ad0	Revert "Revert "vp9: Speed feature to adapt partition based on source_sad."" This reverts commit `c9266b8547`. Disable source_sad when resolution > 1080P. The test should pass now. BUG=webm:1452 Change-Id: I72dde88e66590ff9e41da5e5dd83f5550a83f082	2017-07-30 19:49:31 -07:00
James Zern	c9266b8547	Revert "vp9: Speed feature to adapt partition based on source_sad." This reverts commit `064fc570ff`. This causes an assertion failure in vp9_mcomp.c when running gtest_filter=VP9/MotionVectorTestLarge.OverallTest/41: `mv->col >= -((1 << (11 + 1 + 2)) - 1) && mv->col < ((1 << (11 + 1 + 2)) - 1)' Change-Id: I449e777bf18b661cb3f1d82253610c55c51687f6	2017-07-29 11:36:58 -07:00
Marco	064fc570ff	vp9: Speed feature to adapt partition based on source_sad. Move the source_sad feature to speed 6 (from speed 7), and add speed feature to switch from the variance-based partition to reference_partition (which uses nonrd-pickmode for bsize selection) if source_sad is high. Currently used only for speed 6 for resoln <= 360p. About 4-5% improvement on 360p in RTC set. Some speed slowdown, but still ~30% faster than speed 5. Change-Id: Ib0330ee5fe9fdd2608aed91359a2a339d967491c	2017-07-29 00:20:26 +00:00
Johann	109faffe9b	remove vp9_full_sad_search This code is unused in vp9. Only vp8 still contains references to vpx_sad_NxMx[3\|8] and only for sizes 16x16, 16x8, 8x16, 8x8 and 4x4. Remove the remaining sizes and all the highbitdepth versions. BUG=webm:1425 Change-Id: If6a253977c8e0c04599e25cbeb45f71a94f563e8	2017-07-10 11:20:35 -07:00
Marco	88d11f473c	vp9: Speed >= 8: Remove logic on reducing subpel. Existing logic was only affecting resolutions above 720p. Needs more testing for reducing subpel for speed >= 8. No change on RTC metrics. Change-Id: I2f4bf9f25891614aafa9a86aa5a5063a3ccfce4d	2017-06-27 20:27:02 -07:00
Marco Paniconi	737aa5c9e4	Merge "vp9: SVC: Rework the usage of base_mv for SVC."	2017-06-20 03:08:32 +00:00
Marco	ff7fb4b280	vp9: Speed >= 8: Adjust resolution threshold for subpel. Get some quality gain on RTC metrics (~7%), with ~5-8% speed slowdown. Change-Id: I0d02942a77074424ee0326b6e110ddff09f2df5e	2017-06-19 13:58:08 -07:00
Marco	112cd95507	vp9: SVC: Rework the usage of base_mv for SVC. Set the base_mv_aggressive for temporal enhancement layers (TL > 0). Under the aggressive mode, skip the NEWMV depending on the SSE of the base_mv. Also reduce the subpel motion to 1/2 under aggressive mode if base_mv is good. Speedup ~3% with small/negligible loss in quality on RTC. Affects speed >= 6. Change-Id: I89341b279cad6da2a04b76d5e726016191dacdb8	2017-06-18 22:35:46 -07:00
Marco	e540ca7155	vp9: SVC: Use prune_evenemore only for non_reference. Set subpel prune_evenmore only for non_reference frames, instead of all TL > 0 frames. Gain some quality back at cost of small speed loss (~1-2%). Change only effects SVC encoding at speed >= 7. Change-Id: I5b9f51e51dccfd7050521a66996176b0415ca3f9	2017-06-09 17:52:20 -07:00
Marco	14d4718043	vp9: SVC: Enable simple_block_yrd for temporal layers. Enable simple_block_yrd for temporal enhancement layers (TL > 0). And remove block size condiiton for SVC mode. Only affects speed >= 7 SVC. Speedup ~3-4%. avgPSNR regression on RTC for (3 spatial, 3 temporal) layers: ~1%. Change-Id: Iff4fc191623b71c69cd373e7c0823385e7ac67ed	2017-06-07 11:41:50 -07:00
Marco	7d2f5f8e9d	vp9: SVC: Adjust some speed settings for SVC speed >= 7. Keep the 1/4subpel for all frames, use SUBPEL_TREE_PRUNED_EVENMORE for all temporal enhancement layer frames. Change-Id: Ibc681acbb6fc75b7b3c57fc483fcb11d591dfc9a	2017-06-06 15:30:24 -07:00
Marco	e30781ff80	vp9: SVC: Force subpel search off under certain conditions. For SVC 1 pass non-rd mode: Force subpel seach off for SVC for non-reference frames under motion threshold. Add flag to svc context to indicate if the frame is not used as a reference. Little/no quaity loss, ~2% speedup. Change-Id: Ic433c44b514d19d08b28f80ff05231dc943b28e9	2017-06-01 20:48:52 -07:00
Marco	8c6fa5c5e3	vp9: Speed >8: Set subpel_search_method for low motion. Speed >=8: for resolutions above CIF, and for low motion content, set subpel_search_method to SUBPEL_TREE_PRUNED_EVENMORE. Small speed gain (~2%) on vga clips, RTC metrics up by ~2-3% on average. Change-Id: Ie26ba0264589652f92dfe74308740debf94cf0cc	2017-06-01 16:16:13 -07:00
Marco	747cf7a505	vp9: SVC: Enable copy partition for SVC speed >= 7. Adjust the max_copied_frame setting for temporal layers. Keep the same setting for non-SVC at speed 8. This change also enables copy_partiton for non-SVC at speed 7, but with smaller value of max_copied_frame (=2). ~2% speedup for SVC speed 7, 3 layers, with little/no quality loss. Change-Id: Ic65ac9aad764ec65a35770d263424b2393ec6780	2017-05-25 12:21:46 -07:00
Marco	ff9395eb3b	vp9: Speed >= 8: Modify condition for low-resoln. No change on RTC metrics. Change-Id: I5abc573cb56572188d900645d13ba479f55a1ea0	2017-05-21 22:14:38 -07:00
Marco	2ba4729ef8	vp9: Make copy partition work for SVC and dynamic resize. Only affects speed 8. Make changes to copy partition to fix a bug in setting microblock offset. Avg PSNR shows 0.02% gain on rtc_derf and 0.08% loss on rtc. Change-Id: I61c3e5914dde645331344388e7437e5638acd4f3	2017-05-18 11:33:56 -07:00
Jerome Jiang	1fcd5cca3c	vp9: speed 8: Fix seg fault in partition copy when drop frames. BUG=webm:1433 Change-Id: I4f3984ef28660d3218d48007d7c977bdbdaf8af6	2017-05-12 15:57:23 -07:00
Marco Paniconi	629279a45c	Merge "vp9: Adjust speed features for speed 8 at low resoln."	2017-05-12 00:35:40 +00:00
Jerome Jiang	04de501229	vp9: Fix condition for disabling adaptive_rd_thresh. Add speed constrains for disabling adaptive_rd_thresh when row_mt_bit_exact is set. Change-Id: I2445115c2f9a2e46b8a0966031a0fea488d4964e	2017-04-28 10:26:20 -07:00
Jerome Jiang	43e0e082d1	vp9: Don't force disabling of adaptive_rd_thresh for realtime. Don't force disabling of adaptive_rd_thresh for realtime when row_mt_bit_exact is set. Row based adaptive rd is made usable in CL 454882(https://chromium-review.googlesource.com/c/454882) for REALTIME. Change-Id: Ief023414f0fd6eb86f299dd46ae58f4436875af5	2017-04-26 13:17:57 -07:00
Yunqing Wang	b68f14d0ed	Merge "Make the row based multi-threaded encoder deterministic"	2017-04-26 16:12:14 +00:00
Marco	c614164cb6	vp9: SVC: Adjust some speed settings for temporal layers. Make some speed setting changes for temporal enhancement layers, and remove the switch in subpel_force_stop for the aggressive_base_mv in non-rd pickmode. Gain some 2-3% speed with little/negligible quality loss. Change-Id: I3e2a7f80ff45f38c0a6ceb01b34dbca2f53edbf0	2017-04-25 16:27:01 -07:00
Yunqing Wang	10a497bd38	Make the row based multi-threaded encoder deterministic This patch followed allow_exhaustive_searches feature modification and continued to modify the encoder to achieve the determinism in the row based multi-threaded encoding. While row-mt = 1 and using multiple threads, the adaptive feature in encoder was disabled, which gave BDRate gain(at speed 1, -0.6% ~ -0.7%; at speed 2, -0.46% ~ -0.59%), but some encoder speed losses(7% ~ 10% at speed 1 and 3% ~ 6% at speed 2). These speed losses were acceptable considering the speed gains obtained from row-mt. Change-Id: I60d87a25346ebc487a864b57d559f560b7e398bb	2017-04-24 16:28:27 -07:00
Yunqing Wang	bca4564683	Make allow_exhaustive_searches feature no longer adaptive A previous patch turned on allow_exhaustive_searches feature only for FC_GRAPHICS_ANIMATION content. This patch further modified the feature by removing the exhaustive search limit, and made it no longer adaptive. As a result, the 2 counts that recorded the number of motion searches were removed, which helped achieve the determinism in the row based multi-threading encoding. Tests showed that this patch didn't cause the encoder much slower. Used exhaustive_searches_thresh for this speed feature, and removed allow_exhaustive_searches. Also, refactored the speed feature code to follow the general speed feature setting style. Change-Id: Ib96b182c4c8dfff4c1ab91d2497cc42bb9e5a4aa	2017-04-21 11:14:02 -07:00
Yunqing Wang	e96e49c2f9	Only allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content The allow_exhaustive_searches feature improves the encoding quality of FC_GRAPHICS_ANIMATION content a lot. For non-FC_GRAPHICS_ANIMATION content, the quality test result is almost neutral. This patch makes this feature to be used only for FC_GRAPHICS_ANIMATION content. The motivation of doing that is to make this feature no longer adaptive, which will be implemented in the following patch. Change-Id: Ic911df6dd757402b6480789cc247801e99840369	2017-04-20 00:03:27 +00:00
Marco	5f39262dcc	vp9: Adjust speed features for speed 8 at low resoln. For low resolutions (<= CIF): use quarter-pixel and simple_block_yrd. ~5% gain on RTC_derf. ~6-7% slowdown on ARM. Change-Id: I4439ebd1116b9decac04786503f978840b68a60c	2017-04-14 11:35:47 -07:00
Jerome Jiang	f16f08e55b	vp9: speed >= 8: Adjust speed settings on ARM. Set adaptive_rd_thresh to 2 when simple block yrd is not used. Fix regression caused by computing y sad without int_pro_motion_estimation on low res motion clips. Overall 0.07% quality loss on rtc_derf. Change only affects low res on speed 8. Change-Id: Ic6a188a56529f1034d6431005fb4b0e24e8a7e27	2017-04-11 00:26:56 +00:00
Yunqing Wang	f496032686	Merge "VP9 motion vector unit test"	2017-04-07 16:46:22 +00:00
Yunqing Wang	1aa46abbdf	VP9 motion vector unit test To prevent the motion vector out of range bug, added a motion vector unit test in VP9. In the 4k video encoding, always forced to use extreme motion vectors and also encouraged to use INTER modes. In the decoding, checked if the motion vector was valid, and also checked the encoder/decoder mismatch. The tests showed that this unit test could reveal the issue we saw before. Change-Id: I0a880bd847dad8a13f7fd2012faf6868b02fa3b4	2017-04-06 00:50:56 +00:00
Jerome Jiang	58ba880b94	Refactor: Clean memory allocation for copy partition. Move the memory allocation from setting speed features. Change-Id: I2e89dfaeb46daee63effe5a5df62feed732aa990	2017-04-05 15:33:24 -07:00
Marco	66c6b4d6fc	vp9: 1 pass: Move source sad computation into encodeframe loop. Refactor to split the 1 passs source sad computation into scene detection (currently used for VBR and screen-content mode), and superblock based source sad computation (used in non-rd CBR mode). This allows the source sad computation for CBR mode to be multi-threaded. No change in compression. Change-Id: I112f2918613ccbd37c1771d852606d3af18c1388	2017-03-27 11:11:05 -07:00
Marco Paniconi	ff0e0a76e8	Merge "vp9: Adjust some speed settings for speed 8."	2017-03-22 22:56:17 +00:00
Marco	4d50991320	vp9: Adjust some speed settings for speed 8. Allow for simple_block_rd for VGA resoln, and reduce adaptive_rd_thresh to 1. On average no loss on RTC set, ~4% speedup on mac. Change-Id: Ib549c4061c853776062b5e34040f839d470fbebc	2017-03-22 15:16:15 -07:00
Jerome Jiang	20c2892693	vp9: Enable adaptive_rd_threshold for row mt for realtime speed 8. Change it to row based array to avoid the slow down cause by sync. row-mt on, speed 8, 2 threads: ~4% speedup for VGA on ARM benefited from adaptive_rd_threshold. Change-Id: I887e65a53af20a6c4f48d293daaee09dab3512cf	2017-03-21 18:49:47 -07:00
Yunqing Wang	bf43b4c4b4	Merge "Record the sum of tx block eobs in the partition block"	2017-03-20 23:20:12 +00:00
Marco	06c8713e89	vp9: Use sb content measure to bias against golden. For each superblock, keep track of how far from current frame was the last significant content change, and use that (along with GF distance), to turnoff GF search in non-rd pickmode. Only enabled for speed >= 8. avgPNSR on RTC/RTC_derf down by ~0.9/1.2. Speedup on mac: ~3-5%. Speedup on arm: 3.6% for VGA and 4.4% for HD. Change-Id: Ic3f3d6a2af650aca6ba0064d2b1db8d48c035ac7	2017-03-20 12:42:26 -07:00
Yunqing Wang	9c2552a1c1	Record the sum of tx block eobs in the partition block The sum of tx bloxk eobs is needed in the machine learning based partition early termination. The eobs are first accumulated during tx search, and then the value associated with the best tx_size is copied to ctx for later use. After the sum of eobs are calculated correctly, re-enabled ml_partition_search_early_termination speed feature. Re-did the quality/speed test to check the impact of the fix. 1. Borg test BDRATE result: 4k set: PSNR: +0.183%; SSIM: +0.100%; hdres set: PSNR: +0.168%; SSIM: +0.256%; midres set: PSNR: +0.186%; SSIM: +0.326%; 2.Average speed gain result: 4k clips: 21%; hd clips: 26%; midres clips: 15%. The result is in line with the original result. Change-Id: I4209a95c89be03b4cbfb6a95b16885f89feddbda	2017-03-20 17:12:15 +00:00
Marco	02975a604c	vp9: Fix speed 8 condition for enabling copy_partition. Change-Id: I2c090e6ba853a30fef1957b620853315f9471753	2017-03-16 17:08:37 -07:00
Jerome Jiang	b5f7f7737a	Refactor: Change cpi->resize_state to enum values. Change-Id: Iab1409b0fc1175bc5a14afc4749a08c536c98c41	2017-03-15 17:16:17 -07:00
Marco	2c8430e223	vp9: Turn off ml_partition_search_early_termination. Fails on nightly ubsan, valgrind tests. Enabled on commit:6701014 Change-Id: Ied3f5cb38e39cba54ac134f4514107cdfdfce159	2017-03-15 15:00:38 -07:00
Jerome Jiang	27d5a57072	Merge "vp9: Using source sad for speedup for dynamic resizing."	2017-03-15 00:03:52 +00:00
Jerome Jiang	02463273c9	vp9: Using source sad for speedup for dynamic resizing. Only for speed >= 7. Change-Id: I3ac85fbb4023cf7e6f8333806b345b0174382a09	2017-03-14 15:47:19 -07:00
Yunqing Wang	c3e290963d	Merge "Apply machine learning-based early termination in VP9 partition search"	2017-03-14 18:07:05 +00:00
Marco Paniconi	78a6946904	Merge "vp9: Speed >= 8: Enable simple_block_yrd speed feature."	2017-03-14 17:50:17 +00:00
Marco	c0c789ab50	vp9: Adjust copy partition threshold, for speed 8. Reduce it from 5 to 4, small/no change in metrics or speed. Small reduction in dragging artifact near moving head. Change-Id: Ic3bc5ca67c70bf0c89fc2ed14454840a28ae5b6a	2017-03-14 09:18:53 -07:00
Marco	c216c8d6f2	vp9: Speed >= 8: Enable simple_block_yrd speed feature. Enable speed feature for resolutions > VGA. avgPSNR on RTC down by ~1.7%. Speedup on ARM: ~5%. Change-Id: I7a3fe5f7425aa8df3f4a2eced1afa355bc0d4c95	2017-03-14 09:10:28 -07:00
Marco	f0a22b23fe	vp9: Fix to source_sad feature for SVC. Allow speed feature sf->use_source_sad to be used on highest spatial layer for SVC. Change-Id: I260eb0478902764f49f83e43b17024fe86ff3b22	2017-03-13 11:00:40 -07:00
Yunqing Wang	670101439f	Apply machine learning-based early termination in VP9 partition search This patch was based on Yang Xian's intern project code. Further modifications were done. 1. Moved machine-learning related parameters into the context structure. 2. Corrected the calculation of sum_eobs. 3. Removed unused parameters and calculations. 4. Made it work with multiple tiles. 5. Added a speed feature for the machine-learning based partition search early termination. 6. Re-organized the code. The patch was rebased to the top-of-tree. Borg test BDRATE result: 4k set: PSNR: +0.144%; SSIM: +0.043%; hdres set: PSNR: +0.149%; SSIM: +0.269%; midres set: PSNR: +0.127%; SSIM: +0.257%; Average speed gain result: 4k clips: 22%; hd clips: 23%; midres clips: 15%. Change-Id: I0220e93a8277e6a7ea4b2c34b605966e3b1584ac	2017-03-13 09:54:18 -07:00
Marco	ea3c817ac2	vp9: Enable two speed features for SVC real-time mode. Enable short_circuit_low_temp_var and limit_newmv_early_exit for SVC, 1 pass CBR mode. Change-Id: I77df2b2c6cc40657bb8ea76e19dfc2fdaad6389e	2017-03-08 16:13:59 -08:00

1 2 3 4 5 ...

279 Commits