generic-library/vpx

Author	SHA1	Message	Date
Jerome Jiang	b5f7f7737a	Refactor: Change cpi->resize_state to enum values. Change-Id: Iab1409b0fc1175bc5a14afc4749a08c536c98c41	2017-03-15 17:16:17 -07:00
Vignesh Venkatasubramanian	5881601488	vp9: Rename new_mt to row_mt new_mt is a very generic name that will get obsolete soon enough. Since this is exposed as a codec control, renaming it to row_mt to signify row level paralellism. Also renaming the ETHREAD_BIT_MATCH codec control to ROW_MT_BIT_EXACT. Change-Id: Ic7872d78bb3b12fb4cf92ba028ec8e08eb3a9558	2017-02-27 09:43:26 -08:00
Marco	7f2daa74a0	vp9: Incorporate source sum_diff into non-rd partition thresholds. Increase the variance partition thresholds for superblocks that have low sum-diff (from source analysis prior to encoding frame). Use it for now only for speed >= 7 or for denoising on. Small change on metrics for rtc set: less than ~0.1 avgPNSR decrease on RTC set, for both speed 7 and 8. Change-Id: I38325046ebd5f371f51d6e91233d68ff73561af1	2017-02-21 17:22:11 -08:00
Ranjit Kumar Tulabandu	71061e9332	Row based multi-threading of encoding stage (Yunqing Wang) This patch implements the row-based multi-threading within tiles in the encoding pass, and substantially speeds up the multi-threaded encoder in VP9. Speed tests at speed 1 on STDHD(using 4 tiles) set show that the average speedups of the encoding pass(second pass in the 2-pass encoding) is 7% while using 2 threads, 16% while using 4 threads, 85% while using 8 threads, and 116% while using 16 threads. Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de	2017-02-15 00:49:34 +00:00
Yunqing Wang	318ca07657	The bitstream bit match test in multi-threaded encoder While the new-mt mode is enabled(namely, allowing to use row-based multi-threading in encoder), several speed features that adaptively adjust encoding parameters during encoding would cause mismatch between single-thread encoded bitstream and multi-thread encoded bitstream. This patch provides a set_control API to disable these features, so that the bit match bitstream is obtained in the unit test. Change-Id: Ie9868bafdfe196296d1dd29e0dca517f6a9a4d60	2017-02-13 13:02:26 -08:00
Yunqing Wang	dbc5090b5e	Merge "Changes to facilitate multi-threading of encoding stage"	2017-02-04 01:02:29 +00:00
Ranjit Kumar Tulabandu	12ec948490	Changes to facilitate multi-threading of encoding stage Modified the encoding stage to have row level entry points with relevant initializations and to access the token information at row level Change-Id: Ife10e55a7c1a420ee906d711caf75002688d9e39	2017-02-02 14:47:13 +05:30
Ranjit Kumar Tulabandu	359a6796da	Changes to facilitate row based multi-threading of ARNR filtering Change-Id: I2fd72af00afbbeb903e4fe364611abcc148f2fbb	2017-02-01 13:03:52 -08:00
Ranjit Kumar Tulabandu	8b0c11c358	Multi-threading of first pass stats collection (yunqingwang) 1. Rebased the patch. Incorporated recent first pass changes. 2. Turned on the first pass unit test. Change-Id: Ia2f7ba8152d0b6dd6bf8efb9dfaf505ba7d8edee	2017-01-24 15:48:02 -08:00
Marco	219cdab676	vp9: Add feature to use block source_sad for realtime mode. Only for speed >= 7, and affects skipping of intra modes. Threshold is set low for now, needs to be tuned. Small/no difference in metrics on rtc clips. Change-Id: If9bdbd43f08d1f80407cdd2e9e5e96780dcd2424	2017-01-20 11:57:02 -08:00
Jerome Jiang	ee5b29ae30	vp9: Stop copying partition every a fixed number of frames. Avoid quality loss when copying partition of superblock with large motions. Maximum consecutively copied frames can be set (currently 5). Change-Id: I11c30575514f02194c0f001444cf4021609e5049	2017-01-18 11:23:59 -08:00
Jerome Jiang	0c65aed099	vp9: Set low variance flag when partition is copied. Also set the flag to 1 when exit early choosing 64x64 block such that skipping new mv for golden works in these scenerios. Change the size of prev_segment_id to the number of superblocks to save memory. Borg test shows quality regression of 0.012% on average PSNR and 0.035% on SSIM. Change-Id: I5014224c8617d439d35c66ece3fed9ae30b31d23	2017-01-17 11:14:50 -08:00
Marco	7e3a82c384	vp9: Make the denoiser work with spatial SVC. If enabled denoiser will only denoise the top spatial layer for now. Added unittest for SVC with denoising. Change-Id: Ifa373771c4ecfa208615eb163cc38f1c22c6664b	2017-01-10 17:23:58 -08:00
hui su	337ad83e58	Add support for VP9 level targeting Constraints on encoder config: -target_bandwidth is no larger than 80% of level bitrate limit -target_bandwidth * (1 + max_over_shoot_pct) is no larger than 88% of level bitrate limit -min_gf_interval is no smaller than level limit -tile_columns is no larger than level limit Constraints on rate control: -current frame size plus previous three frames' size is no larger than the CPB level limit -current frame size is no larger than 50%/40%/20% of the CPB level limit if it's a key/alt-ref/other frame. Change-Id: I84d1a2d6d6e3c82bfd533b3309ce999cfaba2c8b	2017-01-06 10:07:31 -08:00
Jerome Jiang	1d5ca84df6	vp9: Add feature to copy partition from the last frame. Add feature to copy partition from the last frame. The copy is only done under certain conditions that SAD is below threshold. Feature is currently disabled, until threshold is tuned. Feature will be initially used for Speed 8 (ARM). Under extreme case of always copying partition for speed 8: Encode time is reduced by 5.4% on rtc_derf and 7.8% on rtc. Overall PSNR reduced by 2.1 on rtc_derf and 0.968 on rtc. Change-Id: I1bcab515af3088e4d60675758f72613c2d3dc7a5	2016-12-19 16:24:03 -08:00
Yunqing Wang	c192def8f3	Change 2 motion search counts to be tile data This patch modified the motion search counts used in: https://chromium-review.googlesource.com/#/c/305640/ These 2 counts were originally added as thread data, and used to make decisions in motion search. The tile encoding order can be inconsistent while using different number of threads, which can cause bitstream mismatch. Here moved them to tile data to solve the issue. BUG=webm:1322 Change-Id: Iedc4477aef1746aa0a4f84d88a1156296fd3ba87	2016-10-25 10:12:41 -07:00
Vignesh Venkatasubramanian	5deffa1175	vp9_bitstream: Encode tiles in parallel Re-use the tile worker threads to pack the bitstream in parallel on a per-tile basis. Restricting this to real-time only for now (further testing is needed to ensure this does not make 2-pass worse in any case). BUG=webm:1309 Change-Id: I8a80da7c5089b837d0df79a5c49d5e3022dfc8ec	2016-10-21 17:35:03 -07:00
James Zern	7f31bfeddb	Revert "vp9_bitstream: Encode tiles in parallel" This reverts commit `9e8efa5b18`. this change causes ubsan warnings, failures in vpxenc_vp9_webm_rt_multithread_tiled BUG=webm:1309 Change-Id: I020c7be985c771bfff4b3de1afe51cc8edb980da	2016-10-18 22:47:48 -07:00
Vignesh Venkatasubramanian	9e8efa5b18	vp9_bitstream: Encode tiles in parallel Re-use the tile worker threads to pack the bitstream in parallel on a per-tile basis. Restricting this to real-time only for now (further testing is needed to ensure this does not make 2-pass worse in any case). BUG=webm:1309 Change-Id: Ia2c982da56697756e12f02643f589189b3271d98	2016-10-17 10:42:03 -07:00
Marco	57c6bf291e	1 pass vbr: Allow for lookahead alt-ref in real-time mode. For 1 pass vbr real-time mode: Allow for the usage of alt-ref frame when non-zero lag-in-frames is used. Use non-filtered alt-ref, and select usage based on fast scene/content analysis/detection within the lag of frames. Positive gains on ytlive set: overall avgPSNR ~3-4%. Several clips are up between 5-14%, a few clips are neutral/small change. Current speed decrease is about ~5-10%. Use the flag USE_ALTREF_FOR_ONE_PASS to enable this feature (off by default for now). Change-Id: I802d2bf3d44f9cf01f6d15c76be9c90192314769	2016-10-11 10:13:17 -07:00
Yury Gitman	292d221fed	Create interface for the ALT_REF_AQ class Current commit is just an API template for the rest of the code, and I will add inner logic later. Altref frames generate a lot of bitrate and at the same time other frames refer to them a lot, so it makes sense to apply special compensation-based adaptive quantization scheme for altref frames. E.g., for blocks that are good predictors for the future apply rate-control chosen quantizer while for bad predictors apply worse one. Change-Id: Iba3f8ec349470673b7249f6a125f6859336a47c8	2016-08-25 10:55:14 -07:00
Yury Gitman	d7c20079a6	Add --alt-ref-aq=<int> option In the future this option will activate adaptive quantization special for altref frames. Encoder will create the adaptive quantization map on the basis of lookahead buffers similarity which is the estimate of the future motion compensation performance. Change-Id: Ia0088b3babb0f9a4899c79d8d819947ba5a03df2	2016-08-24 15:49:25 -07:00
Yury Gitman	c37d012ada	Merge "Add cpi parameter for forcing segmentation update"	2016-08-08 21:29:42 +00:00
Yury Gitman	7a730d5901	Add cpi parameter for forcing segmentation update Change-Id: I1b0bcb1ffe7604117bfaa0b9989d0e25ff04d28c	2016-08-08 13:20:42 -07:00
James Zern	7104833085	vp9: normalize vpx_enc_frame_flags_t usage quiets -Wshorten-64-to-32 warnings Change-Id: Ice037acb675d1d81bfedf2dfcfa91a8a29a19dfd	2016-08-04 23:37:49 -07:00
clang-format	e0cc52db3f	vp9/encoder: apply clang-format Change-Id: I45d9fb4013f50766b24363a86365e8063e8954c2	2016-08-02 16:47:11 -07:00
paulwilkins	be013eb396	Add experimental spatial de-noise filter on key frames. For forced key frames in particular this helps to make them blend better with the surrounding frames where noise tends to be suppressed by a combination of quantization and alt ref filtering. Currently disabled by default under and IFDEF flag pending wider testing. Change-Id: I971b5cc2b2a4b9e1f11fe06c67ef073f01b25056	2016-06-29 17:25:41 +01:00
James Zern	b34705f64f	Merge "cosmetics: Beautify whitespaces and line wrapping"	2016-06-24 21:51:01 +00:00
James Zern	efad6feb9a	Merge "cosmetics: Change few types to their posix version"	2016-06-24 21:50:45 +00:00
Yury Gitman	67611119b5	cosmetics: Beautify whitespaces and line wrapping Change-Id: I9afa02cae671bd3527cf344695e53d0cc767f549	2016-06-24 10:18:06 -07:00
Yury Gitman	3b2e2f2f77	cosmetics: Change few types to their posix version Change-Id: I6d7bc9ed7396e7b0d63ee97bfa473fdea002f9ee	2016-06-24 10:18:06 -07:00
hui su	72d4890caf	Add vp9 encoder API VP9E_GET_LEVEL to provide bitstream level Change-Id: I1ef3df0192491035728fe9d5eb25cc66dc2965de	2016-06-15 12:53:28 -07:00
hui su	be3f0698b0	Add VP9 encoder API for level specification. Add control API VP9E_SET_TARGET_LEVEL that allows the encoder to control the output bitstream level and/or keep level related statistics. Usage: 255 do not care about level (default) 0 keep level related stats only 10 target for level 1 11 target for level 1.1 . . . 62 target for level 6.2 Usage for vpxenc: --target-level=0/255/10/11... Change-Id: I31d1aeca19358b893e7577b4e63748c8e614034a	2016-05-10 11:48:16 -07:00
hui su	667f6320b0	Fix comment for target_bandwidth in VP9 and VP10 Unlike in VP8, it is in units of bits per second in VP9 and VP10. Change-Id: Iee1936cc58cdfaff205624c2fe87cecdf7eda123	2016-05-09 16:43:02 -07:00
Marco	adf8533cee	vp9: Move consec_zero_mv from cyclic refresh to cpi struct. So it can be used even with aq-mode=3 not enabled. Also cleans up some code in the places where its used. No change in behavior. Change-Id: Ib6b265308dbd483f691200da9a0be4da4b380dbc	2016-04-22 08:09:39 -07:00
Marco	c83bcb3474	vp9-svc: Allow for 2 stage downscaling for spatial layers. For 1 pass cbr mode: allow for two-stage 1:2 scaling (which will use the 1:2 optimized scaler) if the spatial layer is 1/4x1/4 of souce. Without this change, the base layer for 3 spatial layers would be using the non-normative scaler which is un-optimized/C code. Change-Id: I9d73f92a4a96927d0f1d6bf75315c1e60513226a	2016-03-01 15:48:42 -08:00
James Zern	8062e10162	Revert "vp9-svc: Fix speed issue with source downscaling for spatial layers." This reverts commit `f51f0998e1`. This causes datarate tests to fail. Some are due to the new default keyframe distance, another causes an assert even forcing 9999: [ RUN ] VP9/DatarateOnePassCbrSvc.OnePassCbrSvc3SpatialLayers/0 test_libvpx: vpx_dsp/x86/vpx_subpixel_8t_intrin_ssse3.c:853: scaledconvolve2d: Assertion `y_step_q4 <= 32' failed. Change-Id: I4ee4fea97f47e4f1a23b82a62e6afc6280961e38	2016-02-26 16:53:26 -08:00
Marco	f51f0998e1	vp9-svc: Fix speed issue with source downscaling for spatial layers. For 1 pass cbr mode: allow for two-stage 1:2 scaling (which will use the 1:2 optimized scaler) if the spatial layer is 1/4x1/4 of souce. Without this change, the base layer for 3 spatial layers would be using the non-normative scaler which is un-optimized/C code. Change-Id: Ifcf526ec2aaf3e5fa7924588d9dd8660bf02fb46	2016-02-26 08:11:37 -08:00
Marco	34d12d1160	vp9-resize: Force reference masking off for external dynamic-resizing. An issue exists with reference_masking in non-rd pickmode for spatial scaling. It was kept off for internal dynamic resizing and svc, this change is to keep it off also for external dynamic resizing. Update to external resize test, and update TODO to re-enable this at frame level when references have same scale as source. Change-Id: If880a643572127def703ee5b2d16fd41bdbf256c	2016-02-11 08:35:57 -08:00
Marco	734dc36173	vp9: Add flag to control usage of skin detection. Set off as default; on for 1 pass cbr mode, speed >=5, non-screen-content. Change-Id: I03f2497e4028b354fd83b8a7d0e072c2a6bec878	2016-02-01 11:57:56 -08:00
Debargha Mukherjee	02345be986	Adding an aq mode for 360 videos Different quality levels are used for different regions in the frame depending on how far they are vertically from the center. Specifically, three segments are used based on the mi_row index with respect number to the number of mi_rows in the frame. Change-Id: Ifc8b777bc58ea8521dffc4640360c67d99f8d381	2016-01-13 16:17:37 -08:00
paulwilkins	0149fb3d6b	Changes to exhaustive motion search. This change alters the nature and use of exhaustive motion search. Firstly any exhaustive search is preceded by a normal step search. The exhaustive search is only carried out if the distortion resulting from the step search is above a threshold value. Secondly the simple +/- 64 exhaustive search is replaced by a multi stage mesh based search where each stage has a range and step/interval size. Subsequent stages use the best position from the previous stage as the center of the search but use a reduced range and interval size. For example: stage 1: Range +/- 64 interval 4 stage 2: Range +/- 32 interval 2 stage 3: Range +/- 15 interval 1 This process, especially when it follows on from a normal step search, has shown itself to be almost as effective as a full range exhaustive search with step 1 but greatly lowers the computational complexity such that it can be used in some cases for speeds 0-2. This patch also removes a double exhaustive search for sub 8x8 blocks which also contained a bug (the two searches used different distortion metrics). For best quality in my test animation sequence this patch has almost no impact on quality but improves encode speed by more than 5X. Restricted use in good quality speeds 0-2 yields significant quality gains on the animation test of 0.2 - 0.5 db with only a small impact on encode speed. On most clips though the quality gain and speed impact are small. Change-Id: Id22967a840e996e1db273f6ac4ff03f4f52d49aa	2015-11-13 10:16:31 +00:00
hui su	6ab6ac450b	Use accurate bit cost for uv_mode in UV intra mode RD selection On derflr, +0.1% for VP10; however, -0.03% on VP9. Change-Id: I09c724232ede74254043d61d3cadc506256af0af	2015-11-06 14:45:43 -08:00
Marco	c7da053d4b	Move noise level estimate outside denoiser. Source noise level estimate is also useful for setting variance encoder parameters (variance thresholds, qp-delta, mode selection, etc), so allow it to be used also if denoising is not on. Change-Id: I4fe23d47607b4e17a35287057f489c29114beed1	2015-11-02 12:15:26 -08:00
Yaowu Xu	568429512e	Add a new enum type vpx_color_range_t to make meaning of color_range obvious. Change-Id: I303582e448b82b3203b497e27b22601cc718dfff	2015-10-16 16:27:18 -07:00
Ronald S. Bultje	812945a8f1	vp9/10: improve support for render_width/height. In the decoder, map this to the output variable vpx_image_t.r_w/h. This is intended as an improved version of VP9D_GET_DISPLAY_SIZE, which doesn't work with parallel frame decoding. In the encoder, map this to a codec control func (VP9E_SET_RENDER_SIZE) that takes a w/h pair argument in a int[2] (identical to VP9D_GET_DISPLAY_SIZE). Also add render_size to the encoder_param_get_to_decoder unit test. See issue 1030. Change-Id: I12124c13602d832bf4c44090db08c1009c94c7e8	2015-09-25 22:18:22 -04:00
Ronald S. Bultje	eeb5ef0a24	Add support for color-range. In decoder, export (eventually) into vpx_image_t.range field. In encoder, use oxcf->color_range to set it (same way as for color_space). See issue 1059. Change-Id: Ieabbb2a785fa58cc4044bd54eee66f328f3906ce	2015-09-16 06:41:46 -04:00
Marco	4d1424faf9	For 1 pass: always use the normative filter in vp9_scale_if_required() The normative (convolve8) filter is optimized/faster than the nonnormative one. Pass usage of scaler (normative/nonomorative) to vp9_scale_if_required(), and always use normative one for 1 pass. Change-Id: I2b71d9ff18b3c7499b058d1325a9554de993dd52	2015-09-14 13:13:32 -07:00
James Zern	5e35c3c9a0	vp9_encoder: make vp9_alloc_compressor_data private Change-Id: I38b4de692f4f7e880766316783981cbd1134bed9	2015-08-28 18:53:57 -07:00
Marco	93ffe9d6dc	Update to dynamic resize for 1 pass CBR: source scaling. Switch to use the normative (convolve8) filter for source scaling, only for 1/2x1/2 scaling for now. This is faster and has better quality than either the vpx_scale_frame or the nonnormative scaler. Remove the vp9_scale_if_required_fast, which is now not used. Change-Id: I2f7d73950589d19baafb1fa650eac987d531bcc8	2015-08-20 16:34:01 -07:00

1 2 3 4 5

213 Commits