generic-library/vpx

Author	SHA1	Message	Date
Yunqing Wang	099e9bf1ff	Make the partition search early termination feature to be frame size dependent The 2 thresholds(i.e. partition_search_breakout_dist_thr and partition_search_breakout_rate_thr) are used as the partition search early termination speed feature. This refactoring patch made this feature to be frame size dependent consistently throughout the code. Change-Id: Idaa0bd8400badaa0f8e2091e3f41ed2544e71be9	2017-03-08 12:56:41 -08:00
Vignesh Venkatasubramanian	9e7140b451	Merge "vp9,realtime: Enable row multithreading for non-rd"	2017-03-03 19:05:52 +00:00
Marco	b60617f5ff	vp9: Speed 8: reduce the adaptive_rd_thresh level. Reduce the level from 4 to 2. This gives ~1-2% quality gain on RTC set, with small decreaee in speed (~1-2% on mac). Change-Id: I7d959731badcee3d45b2f4a08efe378765016a13	2017-03-02 13:34:10 -08:00
Vignesh Venkatasubramanian	453f18040f	vp9,realtime: Enable row multithreading for non-rd Enable row level multithreading for realtime encodes where non-rd path is used (speed >= 5). Change-Id: I5439cb49a02171166d8e1de06c7d5e6f8e819a41	2017-03-02 11:03:56 -08:00
Vignesh Venkatasubramanian	5881601488	vp9: Rename new_mt to row_mt new_mt is a very generic name that will get obsolete soon enough. Since this is exposed as a codec control, renaming it to row_mt to signify row level paralellism. Also renaming the ETHREAD_BIT_MATCH codec control to ROW_MT_BIT_EXACT. Change-Id: Ic7872d78bb3b12fb4cf92ba028ec8e08eb3a9558	2017-02-27 09:43:26 -08:00
Jerome Jiang	a6b6258284	Merge "vp9: Non-rd pickmode: use simple block_yrd under some conditons."	2017-02-22 23:19:29 +00:00
Marco	7e7d820d5b	vp9: Non-rd pickmode: use simple block_yrd under some conditons. For speed 8 only. 3% speed up for QVGA and 6.3% for VGA on Nexus 6. ~3% avgPSNR decrease on rtc_derf and 2.9% on rtc. Disabled for now. Change-Id: I70133f1f6c804d663d594df437bfe7fdb0030d6a	2017-02-22 13:22:53 -08:00
Marco	7f2daa74a0	vp9: Incorporate source sum_diff into non-rd partition thresholds. Increase the variance partition thresholds for superblocks that have low sum-diff (from source analysis prior to encoding frame). Use it for now only for speed >= 7 or for denoising on. Small change on metrics for rtc set: less than ~0.1 avgPNSR decrease on RTC set, for both speed 7 and 8. Change-Id: I38325046ebd5f371f51d6e91233d68ff73561af1	2017-02-21 17:22:11 -08:00
Yunqing Wang	f2c1aea118	Merge "Row based multi-threading of encoding stage"	2017-02-15 00:54:10 +00:00
Ranjit Kumar Tulabandu	71061e9332	Row based multi-threading of encoding stage (Yunqing Wang) This patch implements the row-based multi-threading within tiles in the encoding pass, and substantially speeds up the multi-threaded encoder in VP9. Speed tests at speed 1 on STDHD(using 4 tiles) set show that the average speedups of the encoding pass(second pass in the 2-pass encoding) is 7% while using 2 threads, 16% while using 4 threads, 85% while using 8 threads, and 116% while using 16 threads. Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de	2017-02-15 00:49:34 +00:00
clang-format	4b402746ca	apply clang-format Change-Id: I75e4a9e0b37bd4586f26c8d6c1fa27f3f6ff1bce	2017-02-14 12:45:52 -08:00
Marco	219cdab676	vp9: Add feature to use block source_sad for realtime mode. Only for speed >= 7, and affects skipping of intra modes. Threshold is set low for now, needs to be tuned. Small/no difference in metrics on rtc clips. Change-Id: If9bdbd43f08d1f80407cdd2e9e5e96780dcd2424	2017-01-20 11:57:02 -08:00
Jerome Jiang	ee5b29ae30	vp9: Stop copying partition every a fixed number of frames. Avoid quality loss when copying partition of superblock with large motions. Maximum consecutively copied frames can be set (currently 5). Change-Id: I11c30575514f02194c0f001444cf4021609e5049	2017-01-18 11:23:59 -08:00
Jerome Jiang	9152d434dc	vp9: Disable partition copy when resizing is enabled. Change-Id: I4fa3262e0f1c4018604c954b020ec5d1e3d1465c	2017-01-17 18:21:31 -08:00
Jerome Jiang	255866419d	Merge "vp9: Set low variance flag when partition is copied."	2017-01-17 21:02:52 +00:00
Jerome Jiang	0c65aed099	vp9: Set low variance flag when partition is copied. Also set the flag to 1 when exit early choosing 64x64 block such that skipping new mv for golden works in these scenerios. Change the size of prev_segment_id to the number of superblocks to save memory. Borg test shows quality regression of 0.012% on average PSNR and 0.035% on SSIM. Change-Id: I5014224c8617d439d35c66ece3fed9ae30b31d23	2017-01-17 11:14:50 -08:00
Marco	159cc3b33c	vp9: Add speed feature flag for computing average source sad. If enabled will compute source_sad for every superblock on every frame, prior to encoding. Off by default, only on for speed=8 when copy_partition is set. Change-Id: Iab7903180a23dad369135e8234b7f896f20e1231	2017-01-13 11:52:12 -08:00
Jerome Jiang	f129e09529	vp9: Turn on the partition copy for speed 8. Tune threshold. For speed 8, it speeds up the encoding on android by 6% for QVGA and 7.4% for VGA with the new threshold. Overall PSNR is improved by 0.667 for rtc. Change-Id: I4a644560b32c0b5b4e9f49ffb953d000413a3732	2017-01-11 10:48:16 -08:00
Jerome Jiang	198b834c97	vp9: Set less aggresive short_circuit_low_temp_var for HD at speed 8. Quality improved by 1.866 and 0.386 for two noisy clips (dark720p and marcooffice720p), respectively. Change-Id: Ib33a7672ae9ca53da156208f7cd13f07b5543e44	2017-01-09 16:44:07 -08:00
Jerome Jiang	267e73446c	vp9: Enable more aggresive short circuit for speed 8. Set short_circuit_low_temp_var to 3 for speed 8 for all res. No strong visual difference on all clips. Change-Id: Ia6d9a314291ab1c14d5421bbdd769974083aeb2a	2017-01-06 10:23:34 -08:00
Jerome Jiang	72746c079d	vp9: Set short circuit to level 3 for VGA for speed 8. vp9: Set short circuit to level 3 for VGA for speed 8. Also change the threshold_32x32 to 5/8*thresholds[1] to improve quality regression caused to VGA clips. Change-Id: Ia1590e91e7cb22be78d5b85013387bb1be4272e3	2017-01-04 11:28:31 -08:00
Jerome Jiang	1d5ca84df6	vp9: Add feature to copy partition from the last frame. Add feature to copy partition from the last frame. The copy is only done under certain conditions that SAD is below threshold. Feature is currently disabled, until threshold is tuned. Feature will be initially used for Speed 8 (ARM). Under extreme case of always copying partition for speed 8: Encode time is reduced by 5.4% on rtc_derf and 7.8% on rtc. Overall PSNR reduced by 2.1 on rtc_derf and 0.968 on rtc. Change-Id: I1bcab515af3088e4d60675758f72613c2d3dc7a5	2016-12-19 16:24:03 -08:00
Jingning Han	f473e892f7	Merge "Enable asymptotic closed-loop encoding decision"	2016-11-19 04:12:55 +00:00
Jerome Jiang	360217a233	vp9: Speed 8: More aggresive golden skip for low res. Add a new, more aggresive short circuit: short_circuit_low_temp_var = 3 to skip golden of any mode when variance is lower than threshold for low res. This change only affects speed = 8, low resolution. Metrics for avgPSNR/SSIM on rtc_derf (low resolution) show loss of 0.27/0.31%. On Nexus 6, the encoding time is reduced by ~2.3% on average across all low-res clips. Visually little change on rtc_derf clips. Change-Id: Ia8f7366fc2d49181a96733a380b4dbd7390246ec	2016-11-15 13:56:27 -08:00
Jingning Han	44f8ee7258	Enable asymptotic closed-loop encoding decision This commit enables asymptotic closed-loop encoding decision for the key frame and alternate reference frame. It follows the regular rate control scheme, but leaves out additional iteration on the updated frame level probability model. It is enabled for speed 0. The compression performance is improved: lowres 0.2% midres 0.35% hdres 0.4% Change-Id: I905ffa057c9a1ef2e90ef87c9723a6cf7dbe67cb	2016-11-14 09:22:55 -08:00
Marco	a7d116aa67	vp9: Speed=8 real-time: Keep the bias_golden feature on. Small/no change in metrics on RTC set, speed increase by 2-3%. Change-Id: Iee997bd7433e8e508216e9267b1c31c5a9aa5121	2016-10-20 17:03:51 -07:00
Marco	57c6bf291e	1 pass vbr: Allow for lookahead alt-ref in real-time mode. For 1 pass vbr real-time mode: Allow for the usage of alt-ref frame when non-zero lag-in-frames is used. Use non-filtered alt-ref, and select usage based on fast scene/content analysis/detection within the lag of frames. Positive gains on ytlive set: overall avgPSNR ~3-4%. Several clips are up between 5-14%, a few clips are neutral/small change. Current speed decrease is about ~5-10%. Use the flag USE_ALTREF_FOR_ONE_PASS to enable this feature (off by default for now). Change-Id: I802d2bf3d44f9cf01f6d15c76be9c90192314769	2016-10-11 10:13:17 -07:00
Marco	d017548be6	vp9 real-time mode: Change loopfilter speed feature at speed 8. For real-time mode at speed 8: turn off MINIMAL_LF at speed 8, for non-screen content mode. Visually better, avgPSNR/SSIM on rtc set go up by ~4-5%. Speed decrease of about ~3%. Change-Id: I8eb69330f02e0ceece1507d43cfc8a049a1d8291	2016-09-29 12:59:01 -07:00
paulwilkins	6fc07a217d	Modified resize loop constraints. Using a tighter resize constraint on undershoot seems to help results (especially SSIM) as significant undershoot on a frame seems to have more of a damaging impact than overshoot. This patch has been tuned so that in local testing using the derf set it is encode speed neutral for speed setting 2. Average quality result for speed 2 (psnr,ssim) were as follows:- lowres 0.039, 0.453 midres 0.249, 0.853 hdres 0.159, 0.659 NetFlix -0.241, 0.360 Change-Id: Ie8d3a0d7d6f7ea89d9965d1821be17f8bda85062	2016-08-31 12:45:49 +01:00
Paul Wilkins	129814fcb4	Merge "Adjust coefficient optimization and tx_domain rd speed features."	2016-08-30 16:54:40 +00:00
Paul Wilkins	badd32d914	Merge "Add ALLOW_RECODE_FIRST speed mode."	2016-08-26 15:46:45 +00:00
paulwilkins	dc42f343ae	Add ALLOW_RECODE_FIRST speed mode. This patch is to address concerns that changes to allow recodes on the first frame in each ARF group do not give a good enough speed quality trade off for speed 2. Though the average impact on encode speed is 1-2%, for some hard clips it is > 5% rise. For speed 1 this is less an issue and for Speed 0 the previous patch actually improves speed. Change-Id: Ie1bcefdbfdf846d3f4428590173f621465dffe3a	2016-08-26 11:43:47 +01:00
paulwilkins	635ae8bdc1	Adjust coefficient optimization and tx_domain rd speed features. Previously Tx domain rd was used in all cases above speed 0. Coefficient optimization was only enabled for best and speed 0. This patch selectively sets these features at other speed settings based on block complexity. For the Netflix and HD sets in particular the quality gains are large compared to the speed hit. At speed 1 the average psnr gain in the NF set is > 2.5% with one clip coming in at 18% and some points almost 30%. Average gains for the lower resolution test sets are around 1%. The gains are biggest at low Q so some further optimization may be possible. Change-Id: I340376c7b2a78e5389a34b7ebdc41072808d0576	2016-08-25 15:36:16 +01:00
Yunqing Wang	ef98f49cb0	Disable split mode in 4k video encoding Disabled the split mode while encoding 4k video to speed up the encoder. Borg test result on 4k set: Overall PSNR: +0.029%; SSIM: +0.009%. Average encoder speedup at speed 2 is 2.5%. Change-Id: I1519c658f07c3ac838affbe5aff0ed9b94f3f8f4	2016-08-22 19:46:44 -07:00
Yunqing Wang	37169c0bd4	Merge "Adjust speed features for 4k video encoding"	2016-08-19 23:11:05 +00:00
Yunqing Wang	fe488cceff	Adjust speed features for 4k video encoding Adjusted speed 2 features to speed up 4k video encoding. BDBR results from borg test: PSNR: +0.313%; SSIM: +0.268%. Average speedup: 8.5% Change-Id: I1e2695a01fb3f3817c1df4480e184c2aed8f2eba	2016-08-19 09:30:32 -07:00
JackyChen	8be7e572a7	vp9 svc: SVC encoder speed up. Bias towards base_mv and skip 1/4 pixel motion search when using base mv. 2~3% speed up for 2 spatial layers, 3~5% speed up for 3 spatial layers. PSNR loss: (2 layers) 0.07dB for gips_stationary, 0.04dB for gips_motion; (3 layers) 0.07dB for gips_stationary, 0.06dB for gips_motion. Change-Id: I773acbda080c301cabe8cd259f842bcc5b8bc999	2016-08-18 11:25:45 -07:00
Marco	7eb7d6b227	vp9 non-rd pickmode: Add limit on newmv-last and golden bias. Add option, for newmv-last, to limit the rd-threshold update for early exit, under a source varianace condition. This can improve visual quality in low texture moving areas, like forehead/faces. Also add bias against golden to improve the speed/fps, will little/negligible loss in quality. Only affects CBR mode, non-svc, non-screen-content. Change-Id: I3a5229eee860c71499a6fd464c450b167b07534d	2016-08-17 14:33:44 -07:00
paulwilkins	5d881770e5	Change default recode rule for good speed 0 and best. Changes the default recode rule for Speed 0 and best quality from ALLOW_RECODE to ALLOW_RECODE_KFARFGF. Tested on the NF, hdres, midres and lowres test sets, this setting when combined with patch I40cb559... now performs "as well" in metrics terms (in fact it came out a tiny amount better overall) but encode time is 9.6% faster (measured as the average from 27 mid rate local encodes on clips in the derf/lowres set. Change-Id: I8c781c0cdfa3a9929cd9406d15582fce47d6ae3b	2016-08-15 10:52:54 +01:00
clang-format	e0cc52db3f	vp9/encoder: apply clang-format Change-Id: I45d9fb4013f50766b24363a86365e8063e8954c2	2016-08-02 16:47:11 -07:00
Jingning Han	efccbc9fb5	Disable trellis optimization when lossless is on Disable trellis coefficient optimization when the lossless mode is turned on. Change-Id: I9001bf626e86dc3c8c32331ede04fd39036e5f7c	2016-07-12 09:00:16 -07:00
Jingning Han	62aa642d71	Enable uniform quantization with trellis optimization in speed 0 This commit allows the inter prediction residual to use uniform quantization followed by trellis coefficient optimization in speed 0. It improves the coding performance by lowres 0.79% midres 1.07% hdres 1.44% Change-Id: I46ef8cfe042a4ccc7a0055515012cd6cbf5c9619	2016-07-07 12:25:33 -07:00
Jingning Han	e357b9efe0	Support measure distortion in the pixel domain Use pixel domain distortion metric in speed 0. This improves the compression performance by 0.3% for both low and high resolution test sets. Change-Id: I5b5b7115960de73f0b5e5d0c69db305e490e6f1d	2016-07-06 18:25:17 -07:00
JackyChen	f9c0587200	vp9: Encoding cycle reduction for speed 8. 1. Skip golden non-zeromv and newmv-last for bsize >= 16x16 if the temporal variance obtained from choose_partitioning is very low. 2. Skip horz and vert INTRA mode for speed 8. This change works best on the clips with little noise and with some motion (e.g. gips_motion which has > 5% speed up). PSNR drop is 1.78% on rtc test set, no obvious visual quality regression found. Change-Id: Ib43b5b20e67809d03c5a6890818ddff59e1fc94a	2016-06-13 09:33:22 -07:00
JackyChen	891dbe1e52	vp9: Fix valgrind failure for short circuit on low temporal vaiance block. Add check for actual split before using the variance of the split. Change-Id: If0f93248be0b16d17738675d16c90516054dad2b	2016-06-02 15:56:58 -07:00
JackyChen	a32f341539	Disable short circuit feature for low temporal variance. The featrue fails in libvpx_unit_tests-valgrind. Will re-enable it after fixing the issue. Change-Id: I8ba132f04e98f4615b31fbff2097eda83c5e42bc	2016-06-02 09:45:00 -07:00
jackychen	bacc67f4a8	vp9: Skip some modes when variance is low for big blocks, for 1 pass real-time. Skip intra-mode and some inter-modes (newmv, nearmv, nearestmv) for golden frame if the variance got from choose_partitioning is very low. Only for 1 pass real-time CBR mode and bsize >= 32x32, it has ~2.5% speed up with less than 0.1% PSNR drop for rtc test set. Don't see visual regression. Change-Id: I70efbc95a1007231ae36f02c5b2fbf6cd35077ad	2016-06-01 13:54:18 -07:00
Marco	f94124cf31	vp9: 1 pass vbr mode at speed 5: switch to use mv.search to NSTEP. Change only affects 1 pass, vbr, speed = 5 (real-time mode). Some improvement for high motion content. AvgPSNR/SSIM metrics for ytlive set all up, on average ~2%, some clips (high motion ones) up 4/5%. Encoder speed down: on mynintendo_x1.1280_720.y4m: 47fps -> 44fps. Change-Id: I9e3eaa6392dcb6b5b44ee6f43004f97ba859bc11	2016-03-25 15:33:55 -07:00
Alex Converse	55859e8428	Use whole pixel only at speed 8 screen content. +5.857% BD-RATE on SCREEN_CONTENT Leaving this off for non-screen content because: +25.300% on TWITCH120 +37.833% BD-RATE on RTC Change-Id: Ie0a312182d6cc859fb04298e4cd81d02b39e23fe	2016-03-15 15:04:48 -07:00
Marco	89cc682528	vp9-real-time mode: Fix condition for allowing reference masking. Add frame-level condition for reference masking: under external or internal dynamic resize, allow for reference masking if none of the references have been scaled. Peviously, reference masking was turned off for the stream if dynamic resize feature was enabled or an external resize event occurred. reference_masking gives speed up with little/no loss in compression. For speed 7 on rtc set: encoding time decreases by about 5-7%, avgPSNR/SSIM goes down ~0.2%. Change-Id: Ie4444577451ef954414d8fb4b2c99d65cadf1746	2016-02-16 13:10:27 -08:00
Marco	34d12d1160	vp9-resize: Force reference masking off for external dynamic-resizing. An issue exists with reference_masking in non-rd pickmode for spatial scaling. It was kept off for internal dynamic resizing and svc, this change is to keep it off also for external dynamic resizing. Update to external resize test, and update TODO to re-enable this at frame level when references have same scale as source. Change-Id: If880a643572127def703ee5b2d16fd41bdbf256c	2016-02-11 08:35:57 -08:00
Alex Converse	7da6324cab	Short circuit flat blocks when coding screen content at realtime speed. In inter mode search skip all modes except NEARESTMV and DC_PRED. 10% less encode latency for large frames using the chromium remoting_perftests. +0.313% BDRATE on the screencast set at speed -6. Change-Id: Ib97a39dd8bcdeab545509e0e02d78ce7033f8c63	2016-01-22 12:40:45 -08:00
Marco	c8a2c31ec1	Non-rd speed >=5: Include H/V intra for bsize=16x16. H/V intra mode was only enabled for bsize < 16x16, enable it also for bsize=16x16. Metrics are neutral with this change: Overall very small gain (0.1%), small visual gain on some RTC clips. Change-Id: Ib2d7a44382433bfc11cf324aa3cc5c382ea9e088	2015-12-17 17:18:44 -08:00
Marco	ad7e765319	vp9 denoiser: Fix to re-evaluate mode selection. This fix allows to enable reuse_inter_pred. Change-Id: I53f2bf1163bb0036ffb6df92117a86debdca11d1	2015-11-30 08:59:10 -08:00
Marco	5b0ddb931d	vp9 denoiser: Re-evaluate ZEROMV after denoiser filtering. For denoising, and for noise level above threshold, re-evaluate ZEROMV for mode selection after denoising. Current change only does this check if selected best mode (before denoising) was intra. Change-Id: I4b1435b68d26c78f7597b995ee7bff0ddd5f9511	2015-11-24 17:30:32 -08:00
paulwilkins	8ba98516fd	Changes to best quality settings. Small changes to the best quality default speed trade off. Some speedup settings are worth while even for best quality as they have only a very small impact on quality but a significant impact on encode time. These changes give as much as a further 50-60% increase in encode speed for my test animations clip with minimal impact on quality. For this sequence these changes improve the best quality encode speed to about the same level as good quality speed 0 in Q3 2015 whilst retaining the large quality gain of over 1 db For many natural videos though the quality difference from good 0 to best is much smaller. Change-Id: I28b3840009d77e129817a78a7c41e29cb03e1132	2015-11-17 16:20:20 +00:00
paulwilkins	0149fb3d6b	Changes to exhaustive motion search. This change alters the nature and use of exhaustive motion search. Firstly any exhaustive search is preceded by a normal step search. The exhaustive search is only carried out if the distortion resulting from the step search is above a threshold value. Secondly the simple +/- 64 exhaustive search is replaced by a multi stage mesh based search where each stage has a range and step/interval size. Subsequent stages use the best position from the previous stage as the center of the search but use a reduced range and interval size. For example: stage 1: Range +/- 64 interval 4 stage 2: Range +/- 32 interval 2 stage 3: Range +/- 15 interval 1 This process, especially when it follows on from a normal step search, has shown itself to be almost as effective as a full range exhaustive search with step 1 but greatly lowers the computational complexity such that it can be used in some cases for speeds 0-2. This patch also removes a double exhaustive search for sub 8x8 blocks which also contained a bug (the two searches used different distortion metrics). For best quality in my test animation sequence this patch has almost no impact on quality but improves encode speed by more than 5X. Restricted use in good quality speeds 0-2 yields significant quality gains on the animation test of 0.2 - 0.5 db with only a small impact on encode speed. On most clips though the quality gain and speed impact are small. Change-Id: Id22967a840e996e1db273f6ac4ff03f4f52d49aa	2015-11-13 10:16:31 +00:00
paulwilkins	cdc359989a	Changes to partition breakout rules. Changes to the breakout behavior for partition selection. The biggest impact is on speed 0 where encode speed in some cases more than doubles with typically less than 1% impact on quality. Speed 0 encode speed impact examples Animation test clip: +128% Park Joy: +59% Old town Cross: + 109% Change-Id: I222720657e56cede1b2a5539096f788ffb2df3a1	2015-10-13 14:19:06 -07:00
Marco	6ddbc845cc	Remove unneeded/incorrect comment. Change-Id: I5c923223c284ad4fda0c45572a66bebc8528dd1d	2015-09-11 08:49:13 -07:00
Johann	c5f11912ae	Include vpx_dsp_common.h when using VPXMIN/MAX Change-Id: I2e387a06484a06301f3cd6600c4ba2f4335b61ee	2015-08-31 14:36:35 -07:00
James Zern	5e16d397bd	vpx_dsp_common: add VPX prefix to MIN/MAX prevents redeclaration warnings; vp8 has its own define which will be resolved in a future commit Change-Id: Ic941fef3dd4262fcdce48b73075fe6b375f11c9c	2015-08-26 20:11:32 -07:00
Marco	3d181a4516	Adjust speed setting for temporal layers in 1 pass non-rd mode. For speed 7, real-time mode: Base layer frames are further apart (for #temporal layers = 3, this is every 4 frames) so worth keeping same motion search parameters (as in speed 6) on the base layer frames. Change-Id: Idebf49dda6ef4f3d9a55aee55129a68253f692fb	2015-08-11 11:21:01 -07:00
Alex Converse	af6d2c7d42	Turn off simple_model_rd_from_var at speed 4. This got erroneously changed during the refactor. This fixes SvcTest.TwoPassEncode2TemporalLayersWithMultipleFrameContextsAndTiles. Change-Id: Ifa5ab0e098396c5e2d10478db87df256eadfa4c7	2015-07-31 15:50:17 -07:00
Alex Converse	c827c59eaf	Convert simple_model_rd_from_var from a speed check to a speed feature. Change-Id: I8877025e172fff29bc4e270790211463b676b4d7	2015-07-30 13:53:26 -07:00
paulwilkins	7d15444d07	Fix bug in setting sf->use_square_partition_only. Fix bug in setting this flag for animated content. The bug did cause quality to increase because far more frames are not boosted than boosted. However, the speed trade off to gain is a lot less favorable and the behavior was not as intended. Change-Id: I89fb70419c88b26f40b3534de0481730a1b3fcfa	2015-07-16 16:20:39 +01:00
paulwilkins	8dd466edc8	Changes to use of rectangular partitions. Changes to allow more use of rectangular partitions at speeds 1 and 2 for content classed by the first pass as animation and for blocks near the active image edge. This has quite a big impact in quality for the animated test sequence but also hurts encode speed for speed 2. For other content types the impact on both speed and quality is small. Added some plumbing for detection of internal vertical image edges. Change-Id: I3fc48de2349f8cb87946caaf0b06dbb0ea261a9a	2015-07-08 18:14:12 +01:00
paulwilkins	a126b6ce7d	Change speed and rd features for formatting bars. Change speed features / behavior for split mode when there is an internal active edge (e.g. formatting bars). Remove some threshold constraints in rd code near the active edge of the image. Add some plumbing for left and right active edge detection. Patch set 5. Limit rd pass through for sub 8x8 to internal active edges. This takes away any speed penalty for most clips but keeps the enhanced edge coding for the more critical case of internal image edges Change-Id: If644e4762874de4fe9cbb0a66211953fa74c13a5	2015-07-08 17:51:42 +01:00
Paul Wilkins	4a28da5843	Enable more split modes for animated content. For content that is identified as likely to contain some animation or graphics content, increase the availability of split modes for good quality speeds 1-3. On a problem test animation clip this improves metrics results by about 0.25 db and makes a noticeable difference visually. It also causes a small drop in file size (~0.5%) but a rise in encode time of about 5-6% at speed 2. For more normal content it should have no effect. Change-Id: Ic4cd9a8de065af9f9402f4477a17442aebf0e439	2015-06-09 14:50:44 +01:00
Paul Wilkins	b19b16cfa1	Merge "Animation and dead zone detection."	2015-06-08 14:26:07 +00:00
Marco	8710cceb45	Fix to spatial svc: set reference_frame masking. For real-time mode: keep reference_frame masking off for spatial svc. Change-Id: I15e123c06f67ea040172b8d4042a672f3525b9d8	2015-06-05 08:25:33 -07:00
Paul Wilkins	668e804504	Animation and dead zone detection. Adds code to detect dead zone bars at the top and bottom of reformatted letterbox video (note that the code only looks at the top of the image and assumes any dead zone is symmetrical). Use of this to adapt rate control etc. will follow in a subsequent patch. Also counts other blocks (excluding the dead zone) that have no intra signal. The presence of a significant number of such blocks can be used as a identify that the frame may be artificial (e.g. animation, screen capture, graphics). This patch contains plumbing only and does not use the signal. Change-Id: I59bc93529cd4065416cef773e405fda3ae006a20	2015-06-04 01:01:20 +01:00
Marco	e88de49faa	Change tx_size_search_method setting for non-rd speed 5. Use the same settting as in speed >=6. This will use same logic for tx_size selecton as in speed >=6, which limits the transform size and reduces ringing artifact. Also metrics go up on average with this change: ~2% for PSNR, ~10% for SSIM. Change-Id: Ia2d50db236ae1cc72f742bfa6c9ec5ea50ff0e0a	2015-05-15 11:12:47 -07:00
paulwilkins	aecb1770d5	Merge "Image size restriction to rd auto partition search."	2015-05-07 14:12:14 +00:00
paulwilkins	af76953448	Merge "Remove CONSTRAIN_NEIGHBORING_MIN_MAX."	2015-05-05 09:32:11 +00:00
Marco	b9a72d3c4d	Allow for H and V intra modes for non-rd mode. For non-rd mode (speed >=5): use mask based on prediction block size, and (for non-screen content mode) allow for checking horiz and vert intra modes for blocks sizes < 16x16. Avg psnr/ssim metrics go up by about ~0.2%. Only allowing H/V intra on block sizes below 16x16 for now, to keep encoding time increase very small, and also when allowing H/V on 16x16 blocks, metrics went down on a few clips which need to be further examined. Change-Id: I8ae0bc8cb2a964f9709612c76c5661acaab1381e	2015-05-04 09:48:41 -07:00
paulwilkins	4a7dcf8eb2	Image size restriction to rd auto partition search. Impose a limit on the rd auto partition search based on the image format. Smaller formats require that the search includes includes a smaller minimum block size. This change is intended to mitigate the visual impact of ringing in some problem clips, for smaller image formats. Change-Id: Ie039e5f599ee079bbef5d272f3e40e2e27d8f97b	2015-05-01 16:16:02 +01:00
paulwilkins	287b0c6da9	Remove CONSTRAIN_NEIGHBORING_MIN_MAX. Remove one of the auto partition size cases. This case can behaves badly in some types of animated content and was only used for the rd encode path. A subsequent patch will add additional checks to help further improve visual quality. Change-Id: I0ebd8da3d45ab8501afa45d7959ced8c2d60ee4e	2015-05-01 15:15:16 +01:00
Marco	fa20a60f0d	Speed 5: use non-rd mode for key frame coding. Metrics on RTC set go down by ~1.5% on average. Key frame encoding time goes down by factor of ~5. Change-Id: Ia83acc55848613870e5ac6efe7f3d904d877febb	2015-03-27 16:19:26 -07:00
Adrian Grange	23ebacdb81	Auto-adaptive encoder frame resizing logic Note: This feature is still in development. Add an option for the encoder to decide the resolution at which to encode each frame. Each KF/GF/ARF goup is tested to see if it would be better encoded at a lower resolution. At present, each KF/GF/ARF is coded first at full-size and if the coded size exceeds a threshold (twice target data rate) at the maximum active Q then the entire group is encoded at lower resolution. This feature is enabled in vpxenc by setting: --resize-allowed=1 In addition, if the vpxenc command line also specifies valid frame dimensions using: --resize-width=XXXX & --resize_height=YYYY then all frames will be encoded at this resolution. Change-Id: I13f341e0a82512f9e84e144e0f3b5aed8a65402b	2015-02-10 09:59:32 -08:00
Yaowu Xu	65a1a3e85d	adjust rtc setting and threshold 1. Adjusted the threshold for coef update computation based on counts of tx used, avoid coef update computation when count is low (<20) 2. Move sf->lpf_pick = LPF_PICK_MINIMAL_LPF to speed 8. Change-Id: I02b44309e40fcdbf135c7934ae067a3f42502d30	2015-02-02 17:43:46 -08:00
Adrian Grange	527e073163	Remove elevate_newmv_thresh from SPEED_FEATURES (unused) Change-Id: I78ef7f89586a329787f6bc4c58ec83af210989a3	2015-01-22 16:12:50 -08:00
Yaowu Xu	a16f075375	Corrected value range of --cpu-used for vp9 This commit removes undefined value options of cpu-used for VP9 and changed vpxenc prompt to reflect the usable range of [-8,8] Change-Id: Ib80fef3dbb6ec9aabac45ed13e8ab6fbaf94f55e	2014-12-17 15:18:01 -08:00
Jingning Han	74ded4863e	Enable conditional skip path in rd_pick_intra_sby_mode These speed-up features for key frame coding are only turned on in the settings of hybrid non-RD and RD mode decision. It provides about 20% speed-up to the hybrid key frame coding at the expense of certain compression performance loss. For vidyo1, the key frame coding statistics are changed 9838F, 35.020 dB, 61677 us -> 9920F, 34.834 dB, 47556 us Overall rtc set compression performance is down by -0.257%. Change-Id: I0025447fda26bb7855e982955642b5f55d71b51f	2014-12-05 09:36:09 -08:00
Jingning Han	07711e9b27	Use hybrid RD and non-RD coding flow for key frame coding When block size is below 16x16, the encoder swap from non-RD to RD mode for key frame coding. This largely brough back the key frame compression performance. For vidyo1 at 1000 kbps, the key frame coding statistics are changed 9978F, 34.183 dB, 36807 us -> 9838F, 35.020 dB, 61677 us As compared to the full RD case 7187F, 34.930 dB, 214470 us The overall rtc set coding performance (single key frame setting) is improved by 1.5%. Change-Id: I78a4ecf025d7b24ec911e85be94e01da05e77878	2014-12-05 09:35:27 -08:00
Jingning Han	228ec17ff2	Merge "Rework coeff probability model update for rtc coding"	2014-12-03 11:34:35 -08:00
Marco	8fd3f9a2fb	Enable non-rd mode coding on key frame, for speed 6. For key frame at speed 6: enable the non-rd mode selection in speed setting and use the (non-rd) variance_based partition. Adjust some logic/thresholds in variance partition selection for key frame only (no change to delta frames), mainly to bias to selecting smaller prediction blocks, and also set max tx size of 16x16. Loss in key frame quality (~0.6-0.7dB) compared to rd coding, but speeds up key frame encoding by at least 6x. Average PNSR/SSIM metrics over RTC clips go down by ~1-2% for speed 6. Change-Id: Ie4845e0127e876337b9c105aa37e93b286193405	2014-12-03 09:18:08 -08:00
Jingning Han	8fe50191c6	Rework coeff probability model update for rtc coding This commit reworks the ONE_LOOP_REDUCED coefficient probability model update process. It allows model update for every coefficient across the spectrum at a coarser resolution, instead of performing precise update only for certain subset of probability models. The overall runtime remains nearly same (<1% change) for speed -6. The compression performance is improved by 7.5% in PSNR for speed -5 and 4.57% for speed -6, respectively. Change-Id: Ifb17136382ee7e39a9f34ff4a4f09a753125c8d1	2014-12-03 09:15:25 -08:00
Jingning Han	a6df0cbcca	Remove repeated search_type_check_frequency assign This parameter is initialized as 50. No need to re-assign the same value in speed -6. Change-Id: I8735a5593412df2fdcee53ae45c8ebd1c3d792e7	2014-11-25 18:36:41 -08:00
Yunqing Wang	edbd61e136	vp9_ethread: modify VP9_COMP structure This patch modified struct VP9_COMP. Created a struct ThreadData to include data that need to be copied for each thread. In multiple thread case, one thread processes one tile. all threads share one copy of VP9_COMP, (refer to VP9_COMP cpi in the code) but each thread has its own copy of ThreadData, (refer to ThreadData td in the code). Therefore, within the scope of encode_tiles(), both cpi and td need to be passed as function parameters. In single thread case, the FRAME_COUNTS pointer in ThreadData points to "counts" in VP9_COMMON. Change-Id: Ib37908b2d8e2c0f4f9c18f38017df5ce60e8b13e	2014-11-24 17:57:38 -08:00
Jingning Han	2fbdfd2c66	Key frame non-RD mode decision process This commit makes a non-RD coding mode decision process for key frame coding. It can be optionally turned on in speed -6 and above. Change-Id: I0847258b392877a0210b4768bef88ebc9ad009b5	2014-11-24 09:04:28 -08:00
Alex Converse	bc1b3d8412	Allow DC/H/V/TM on screen content. 6.3% better compression less than 1% compression time increase Change-Id: Ie83c059436e54c09de9e7c87e06e0a6d40dc38fe	2014-11-20 18:04:57 -08:00
Alex Converse	722e9d611b	Drop special inter mode selection for screen content. Better mode selection was implemented for all content. Change-Id: I479778ed21d3968892f4dce396c83733583f4f23	2014-11-20 18:04:57 -08:00
Yunqing Wang	54ba65a63e	Merge "vp9_ethread: move max/min partition size to mb struct"	2014-11-20 14:00:37 -08:00
Yunqing Wang	ad7586a9e1	vp9_ethread: move max/min partition size to mb struct The max_partition_size and max_partition_size are set at the beginning while setting speed features, and then adjusted at SB level. Moving them to mb struct ensures there is a local copy for each thread. Change-Id: I7dd08dc918d9f772fcd718bbd6533e0787720ad4	2014-11-20 09:24:50 -08:00
Yunqing Wang	70c9d2983b	Revert "vp9_ethread: include a pointer to mb in VP9_COMP" This reverts commit `6906d218dd`. Another way will be used to handle mb struct. Change-Id: Ic1111a46b2b1ee00f8f9e3fcd4cf3eb6030b2dc4	2014-11-20 08:31:12 -08:00
Yaowu Xu	1687c47bfd	change to call vp9_refining_search_sad() directly The function pointer in compressor instance does not change, so this commit changes to call the function directly. Change-Id: I9c9c460e3475711c384b74c9842f0b4f3d037cc5	2014-11-17 11:30:17 -08:00
Yunqing Wang	6906d218dd	vp9_ethread: include a pointer to mb in VP9_COMP Modified VP9_COMP struct to include MACROBLOCK *mb. This change makes it feasible in multi-thread case to allocate a mb for each thread. Change-Id: I624d6d1aa9c132362200753e5d90b581b1738d6e	2014-11-14 12:31:06 -08:00
Adrian Grange	35de9db312	Merge "Prepare for dynamic frame resizing in the recode loop"	2014-11-13 15:01:49 -08:00
Adrian Grange	0d085ebc0a	Prepare for dynamic frame resizing in the recode loop Prepare for the introduction of frame-size change logic into the recode loop. Separated the speed dependent features into separate static and dynamic parts, the latter being those features that are dependent on the frame size. Change-Id: Ia693e28c5cf069a1a7bf12e49ecf83e440e1d313	2014-11-13 11:41:20 -08:00
Jingning Han	e717d22b63	Use reconstructed pixels for intra prediction This commit makes the speed -6 and above use the reconstructed boundary pixels for precise intra prediction. This allows more intra prediction modes to be tested in the non-RD coding process. Enabling horizontal and vertical intra prediction modes can improve the speed -6 compression performance for rtc set by 0.331%. Change-Id: I3a99f9d12c6af54de2bdbf28c76eab8e0905f744	2014-11-11 10:04:43 -08:00

1 2 3 4 5 ...

279 Commits