generic-library/vpx

Author	SHA1	Message	Date
Jerome Jiang	0afa2dad76	Fix vp8 race when build --enable-vp9-highbitdepth. Split vp8/vp9 implementations on yv12_copy_frame_c. Remove high-bitdepth codes from vp8_yv12_extend_frame_borders_c. Clean up vp8 codes usage in vp9. BUG=webm:1435 Change-Id: Ic68e79e9d71e1b20ddfc451fb8dcf2447861236d	2017-05-26 09:45:01 -07:00
Marco	146005a911	vp9: SVC: Fix to condition on using source_sad. Fix the condition on usage of source_sad for temporal layers. Fix allows it to be used for the case of 1 temporal layer. Change-Id: I02b1b0ade67a7889d1b93cee66d27c0951131fc3	2017-05-26 08:46:50 -07:00
Marco Paniconi	9ec9415fd9	Merge "vp9: Use source_sad only on top temporal enhancement layer."	2017-05-26 05:24:06 +00:00
Marco	ea914456af	vp9: Use source_sad only on top temporal enhancement layer. For 1 pass CBR SVC mode. Change-Id: Ic026740f9d0ec5eee7c5845be9c5b15884fec48d	2017-05-25 16:32:05 -07:00
Marco	747cf7a505	vp9: SVC: Enable copy partition for SVC speed >= 7. Adjust the max_copied_frame setting for temporal layers. Keep the same setting for non-SVC at speed 8. This change also enables copy_partition for non-SVC at speed 7, but with smaller value of max_copied_frame (=2). ~2% speedup for SVC speed 7, 3 layers, with little/no quality loss. Change-Id: Ic65ac9aad764ec65a35770d263424b2393ec6780	2017-05-25 12:21:46 -07:00
Marco Paniconi	b3bf91bdc6	Merge "vp9: Adjustments to cyclic refresh for high motion."	2017-05-22 06:27:30 +00:00
Marco	2adc0443dd	vp9: Adjustments to cyclic refresh for high motion. For aq-mode=3: refactor the condition for turning off the refresh. Add some adjustments for high motion content. No/little change in RTC metrics, only affects high motion case. Change-Id: I7da8eabfb0e61db014be4562806f72ee5ef4a43b	2017-05-21 22:21:44 -07:00
Marco	ff9395eb3b	vp9: Speed >= 8: Modify condition for low-resoln. No change on RTC metrics. Change-Id: I5abc573cb56572188d900645d13ba479f55a1ea0	2017-05-21 22:14:38 -07:00
Paul Wilkins	a7977ece93	Merge "Changes to modified error."	2017-05-19 12:24:32 +00:00
Marco	1205e3207e	vp9: SVC: Modify condition to allow for copy partition. When temporal layers are used, only allow for copy partition on the top temporal enhancement layer frames. Change-Id: I5472abdc0f9f6c8dafa75a7a84c615e08ae22af8	2017-05-18 14:19:31 -07:00
Jerome Jiang	6b6ff9c969	Merge "vp9: Make copy partition work for SVC and dynamic resize."	2017-05-18 19:37:30 +00:00
Marco	2ba4729ef8	vp9: Make copy partition work for SVC and dynamic resize. Only affects speed 8. Make changes to copy partition to fix a bug in setting microblock offset. Avg PSNR shows 0.02% gain on rtc_derf and 0.08% loss on rtc. Change-Id: I61c3e5914dde645331344388e7437e5638acd4f3	2017-05-18 11:33:56 -07:00
paulwilkins	5680b4517f	Changes to modified error. The modified error was a derivative of the "coded_error" that was used to allocate bits between different frames on the assumption that the allocation should be linear in terms of this modified error. I.e. a frame with double the modified error score should all things being equal get double the number of bits. The code also included upper and lower caps derived from input VBR parameters. This patch improves the initial calculation of the clip mean error (now called "mean_mod_score" as it is no longer a prediction error) used as the midpoint for the rate distribution function and normalizes the output "modified scores" scores such that 1.0 indicates a frame in the middle of the distribution. The VBR upper and lower caps are then applied directly to a frame's normalized score. This refactoring is intended to make it easier to drop in alternative distribution functions or to base the rate allocation on a corpus wide midpoint (rather than the clip mean). Change-Id: I4fb09de637e93566bfc4e022b2e7d04660817195	2017-05-18 12:56:02 +01:00
Yaowu Xu	bde2c04fb7	Merge "Experiment. Store first pass errors as per MB values."	2017-05-17 17:38:15 +00:00
paulwilkins	42e5073f94	Experiment. Store first pass errors as per MB values. Most existing first pass stats are stored in a form normalized to a macro-block scale. However the error scores for intra / inter etc were stored as frame level values but mainly used as MB level values. This change fixes that. Normalized per MB values make comparisons between different formats easier and in any case this is usually what is wanted. Any change in results should be limited to slight differences in rounding. *** Change after patch 8 +2 requiring new approval. Final pre-submit testing showed one 4K clip with above expected change. Investigation showed this was due to a value used to test for ultra low intra complexity in key frame detection. This was a per frame not per MB value but also did not scale with frame size. Replacement with a small per MB value (based on original per frame value and cif frame size) resolved the KF detection problem. Also converted kf_group_error_left to a double in line with other error values to reduce rounding problems in KF group bit allocation. All clips and sets now show nominal (or 0) change as expected. Change-Id: Ic2d57980398c99ade2b7380e3e6ca6b32186901f	2017-05-17 12:00:18 +01:00
Johann Koenig	8739a182c8	Merge "move neon load/stores to a new file"	2017-05-15 18:15:27 +00:00
Johann	1088b4f87c	move neon load/stores to a new file Move the tran_low_t helper functions to a new file. Additional load/store functions will be added here. Change-Id: I52bf652c344c585ea2f3e1230886be93f5caefc3	2017-05-15 08:29:43 -07:00
Jerome Jiang	6b9d130214	Merge "vp9: speed 8: Fix seg fault in partition copy when drop frames."	2017-05-13 03:20:49 +00:00
Cheng Chen	4c0655f26b	Merge "Speed up encoding by skipping altref recode"	2017-05-13 01:29:59 +00:00
Jerome Jiang	1fcd5cca3c	vp9: speed 8: Fix seg fault in partition copy when drop frames. BUG=webm:1433 Change-Id: I4f3984ef28660d3218d48007d7c977bdbdaf8af6	2017-05-12 15:57:23 -07:00
Marco Paniconi	629279a45c	Merge "vp9: Adjust speed features for speed 8 at low resoln."	2017-05-12 00:35:40 +00:00
Marco Paniconi	c64667c338	Merge "vp9: SVC: Increase the partition and acskip thresholds"	2017-05-11 23:37:32 +00:00
Marco Paniconi	37cdd3bfc2	Merge "vp9: Adjust noise estimation thresholds."	2017-05-11 21:58:40 +00:00
Marco	c5c31b9eb6	vp9: SVC: Increase the partition and acskip thresholds Increase the partition and acskip thresholds for temporal enhancement layers. ~1-2% speedup, with negligible loss in quality. Change-Id: Id527398a05855298ad9ddac10ada972482415627	2017-05-11 12:28:19 -07:00
Marco	c5a4376aed	vp9: SVC: allow for setting the interp_filter in non-rd pickmode. For SVC 1 pass non-rd pickmode, the interpolation filter for the upsampling of the golden (spatial) reference was not being explicitly set and instead was taking whatever value was set in the previous mode/block (which would be either EIGHTTAP or EIGHTTAP_SMOOTH). Fix it to the default EIGHTTAP for now, to be updated/selected adaptively in a later change. Minor adjustment to rate targeting thresholds in datarate unittests. Change-Id: I52085048674072c6cfb7163e11e9a2658d773826	2017-05-11 11:45:09 -07:00
Paul Wilkins	3caaf21c5b	Merge "Tuning of factor used to calculate Q range in two pass."	2017-05-11 18:25:45 +00:00
Jerome Jiang	d35541fe29	Merge "vp9: Fix ubsan failure in denoiser."	2017-05-11 16:38:59 +00:00
paulwilkins	9a7625652c	Tuning of factor used to calculate Q range in two pass. A more detailed explanation of the experimentation leading to this change can be found in:- https://docs.google.com/a/google.com/document/d/13lsYhxgPyxUHvEess6wg9nikaonIZKY9Ak_Lpafv5Mo/edit?usp=sharing This change gives gains across all our standard test sets for overall psnr, ssim, fast ssim and psnr-HVS. Values expressed as % reduction in bitrate. Low res set -0.257, -0.192, -0.173, -0.101 Mid res set -0.233, -0.336, -0.367, -0.139 High res set -0.999, -1.039, -1.111, -0.567 NetFlix 2K set -0.734, -0.174, -0.389, -0.820 Netflix 4K set -0.814, -0.485, -0.796, -0.839 Change-Id: Ie981fb3c895c9dfcfc8682640d201a86375db5c8	2017-05-11 16:19:59 +01:00
Cheng Chen	76567d84ce	Speed up encoding by skipping altref recode Speed up for speed 0. Reduce 10+% of encoding time for hdres in speed 0, with less than 0.1% PSNR loss. Compute total difference of previous and current frame context probability model. If the diff is less than the threshold, skip recoding the frame. Borg test (positive number means performance loss): lowres midres hdres PSNR: 0.030 0.032 0.065 Local speed test: bitrate set at 1200 blue_sky pedestrian rush_hour Encoding time: -10.0% -16.5% -16.5% Change-Id: I4e2d200ea3115d48b2c3e890143596b31b8ef9e9	2017-05-10 22:15:01 -07:00
Marco	2f11a65c99	vp9: Adjust noise estimation thresholds. Change-Id: Ia41a11df18e5a58d2b8bbecd11c249d357de2a8f	2017-05-10 16:48:10 -07:00
Jerome Jiang	597d1f4c03	vp9: Fix ubsan failure in denoiser. Fix the overflow for subtraction between two unsigned integers. BUG=webm:1432 Change-Id: I7b665e93ba5850548810eff23258782c4f5ee15a	2017-05-10 13:43:17 -07:00
Jerome Jiang	2574573fea	vp9: Wrap threshold tuning for HD only when denoiser is enabled. Fixes a speed regression. Change-Id: I23d942e4af17fa81fe4a366c7369b3ad537e59b0	2017-05-10 12:15:41 -07:00
Marco Paniconi	db2fad7516	Merge "vp9: Adjustment to noise estimation."	2017-05-10 17:11:18 +00:00
Marco	1b59964162	vp9: Adjustment to noise estimation. When the noise estimate is forced off due to large motion, reset the counter and set smaller window for next estimate. Change-Id: Ifa4ec95396134173a00d48353ad52f1b6a40c217	2017-05-10 09:39:08 -07:00
Marco	4e23998fb4	vp9: SVC: Add option to set downsampling filter type. Add option in SVC to set the filter type and phase for the frame level downsampling filters. For 3 spatial layers: set downsampling filter type to bilinear and set phase to 8, for lowest spatial layer. Change-Id: Id81f4b1ba93db19c1cd37b6a46d1281a2c61bc43	2017-05-09 17:22:44 -07:00
Marco	9586d5e682	vp9: SVC: Modify condition for setting downsample filter type. Base the condition on the resolution of the spatial layer. And remove restriction on scaling factor. Change-Id: Iad00177ce364279d85661654bff00ce7f48a672e	2017-05-08 14:13:49 -07:00
Linfeng Zhang	2c3a2ad6f1	Merge changes I0cfe4117,I3581d80d,Ida62c941 * changes: Split dsp/x86/inv_txfm_sse2.c Update highbd idct functions arguments to use uint16_t dst Clean CONVERT_TO_BYTEPTR/SHORTPTR in idct	2017-05-08 16:15:57 +00:00
Marco Paniconi	f4653c1efc	Merge "vp9: SVC: Set downsample filtertype for lowest spatial layer."	2017-05-06 02:31:00 +00:00
Marco	9b729748ac	vp9: SVC: Set downsample filtertype for lowest spatial layer. For lowest spatial layer, in 3 layer SVC, set the downsampling filtertype to get averaging filter. Needed for reducing aliasing on low-res layer, small increase in overall encoder time. Change-Id: Ia31460123bd91b72eca49b46dd924b9f226d4563	2017-05-05 19:29:09 -07:00
Jerome Jiang	3453c8d6c4	Merge "vp9: Neon optimization for denoiser. Add unit tests."	2017-05-06 01:28:32 +00:00
Jerome Jiang	069eedb3a0	vp9: Neon optimization for denoiser. Add unit tests. Denoiser on Neon is 5x faster than C code. BUG=webm:1420 Change-Id: I805ab64f809ff2137354116be6213e7ec29c1dcb	2017-05-05 16:40:52 -07:00
Marco	34cce144d8	vp9: Adjust some thresholds for noise estimation. Adjust thresholds for noise estimation, for resolutions above VGA. Tends to push cleaner/low noise clips to LowLow state. No change in RTC metrics. Change-Id: I739ca6b797d0a60ccd1c6c6a2775269b1f007e5e	2017-05-05 12:00:12 -07:00
Jerome Jiang	af69ed20c4	vp9: Enable noise estimation on low res. Set noise level to kLowLow for high motion low res clips. Change the normalization in noise metric for low res. Reduce the initial time-window for all resolutions. Change-Id: Iaed39dbb50b205cd9c735dc5b84822304fb01987	2017-05-04 15:38:23 -07:00
Linfeng Zhang	d5de63d2be	Update highbd idct functions arguments to use uint16_t dst BUG=webm:1388 Change-Id: I3581d80d0389b99166e70987d38aba2db6c469d5	2017-05-03 13:59:16 -07:00
Linfeng Zhang	081b39f2b7	Clean CONVERT_TO_BYTEPTR/SHORTPTR in idct BUG=webm:1388 Change-Id: Ida62c941f2b836d6c9e27b427a7d5008ab6dc112	2017-05-03 13:58:31 -07:00
Hui Su	5048d6e7ee	Merge "vp9 level: add tentative max cpb values for high levels"	2017-05-03 20:51:03 +00:00
Hui Su	f701a44305	Merge "Adjust alt-ref selection in define_gf_group()"	2017-05-03 20:50:29 +00:00
Johann Koenig	240a5a15ef	Merge "block error sse2: sum in 32 bits when possible"	2017-05-02 14:16:47 +00:00
Johann	cd94d5f68e	block error avx2: rename variables Change-Id: I2b8a9253f2c3d1fd85304c2970ebe70213870fe9	2017-05-01 17:54:29 -07:00
Johann Koenig	b1a31f8066	Merge "block error avx2: sum in 32 bits when possible"	2017-05-02 00:52:59 +00:00
Marco Paniconi	1e112bce37	Merge "vp9: SVC: Early exit on golden ref in non-rd pickmode."	2017-05-01 21:04:52 +00:00
Linfeng Zhang	e8655d49f5	Merge "Clean vp9_highbd_build_inter_predictor() and highbd_inter_predictor()"	2017-05-01 19:54:40 +00:00
Kyle Siefring	760c214519	block error avx2: sum in 32 bits when possible Add 31bit pairs before unpacking in x86 block error code AVX2 code provides a very minor performance improvement. BUG=webm:1210 Change-Id: I4c82308eaf65741dca2f5c6db9be9c85f905073a	2017-05-01 12:51:33 -07:00
Marco	ae0215f945	vp9: SVC: Early exit on golden ref in non-rd pickmode. For SVC 1 pass real-time: add condition to skip the golden (spatial) reference mode in non-rd pickmode. Condition is to skip golden if the sse of zeromv-last mode is below threshold. And change order in ref_mode_set_svc to make sure golden zeromv is tested after last-nearest. Speedup ~3-4% with little/negligible quality loss. Change-Id: I6cbe314a93210454ba2997945f714015f1b2fca3	2017-05-01 10:36:54 -07:00
Kyle Siefring	8394990b27	block error sse2: sum in 32 bits when possible Add 31bit pairs before unpacking in x86 block error code BUG=webm:1210 Change-Id: I5ca8c7f7775585a17fe09d6bbfc25e1f2955eb0a	2017-05-01 09:59:18 -07:00
Johann	2ff01aa1e4	move vp9_error_intrin_avx2.c There is only one avx2 implementation. Drop '_intrin' Change-Id: I887a0d27d58567eaad49f749f127eca61313f312	2017-05-01 09:13:01 -07:00
Johann Koenig	ef5918098d	Merge "Use uint32_t for accumulator"	2017-04-28 18:32:09 +00:00
Jerome Jiang	ce2e278059	Merge "vp9: Fix condition for disabling adaptive_rd_thresh."	2017-04-28 18:10:36 +00:00
Jerome Jiang	04de501229	vp9: Fix condition for disabling adaptive_rd_thresh. Add speed constraints for disabling adaptive_rd_thresh when row_mt_bit_exact is set. Change-Id: I2445115c2f9a2e46b8a0966031a0fea488d4964e	2017-04-28 10:26:20 -07:00
Johann	657f3e9f14	Use uint32_t for accumulator Be specific about the data type size. Use convenience macro vp9_zero_array. Change-Id: I5fadf7dbd408befb73820d85db0be4832e8cfcbd	2017-04-28 06:36:59 -07:00
Johann Koenig	94ebdba71d	Merge "vp9 temporal filter: sse4 implementation"	2017-04-28 13:22:41 +00:00
Yaowu Xu	0e8fea6c13	Merge "VP9: enable trellis for high bitdepth intra"	2017-04-28 00:16:56 +00:00
Johann	6dfeea6592	vp9 temporal filter: sse4 implementation Approximates division using multiply and shift. Speeds up both sizes (8x8 and 16x16) by 30 times. Fix the call sites to use the RTCD function. Delete sse2 and mips implementation. They were based on a previous implementation of the filter. It was changed in Dec 2015: `ece4fd5d22` BUG=webm:1378 Change-Id: I0818e767a802966520b5c6e7999584ad13159276	2017-04-26 22:03:05 -07:00
Jerome Jiang	43e0e082d1	vp9: Don't force disabling of adaptive_rd_thresh for realtime. Don't force disabling of adaptive_rd_thresh for realtime when row_mt_bit_exact is set. Row based adaptive rd is made usable in CL 454882(https://chromium-review.googlesource.com/c/454882) for REALTIME. Change-Id: Ief023414f0fd6eb86f299dd46ae58f4436875af5	2017-04-26 13:17:57 -07:00
Yunqing Wang	b68f14d0ed	Merge "Make the row based multi-threaded encoder deterministic"	2017-04-26 16:12:14 +00:00
Linfeng Zhang	54c4e0f7a5	Merge "Update highbd convolve functions arguments to use uint16_t src/dst"	2017-04-26 15:50:46 +00:00
Marco Paniconi	004fab120a	Merge "vp9: SVC: Adjust some speed settings for temporal layers."	2017-04-26 15:45:06 +00:00
Peter de Rivaz	66117b97c5	VP9: enable trellis for high bitdepth intra BUG=webm:1409 Change-Id: I5236595aac1c09386c60ffe8ad621e01422ed5a7	2017-04-26 11:43:01 +01:00
hui su	d01c9febe9	vp9 level: add tentative max cpb values for high levels Add tentative max cpb size values for levels 5.2 and up. Otherwise encoding will fail when targeting for these levels. Change-Id: Ib7e0ba4b9836ea1ac900b6822543812843d48463	2017-04-25 18:03:55 -07:00
hui su	8069f31076	Adjust alt-ref selection in define_gf_group() `107de19698` changes the encoder alt-ref selection behavior. Assuming min_gf_interval = max_gf_interval = 4, the frame order would be frm_1 arf_1 frm_2 frm_3 frm_4 frm_5 arf_2 before 107de19698; frm_1 arf_1 frm_2 frm_3 frm_4 arf_2 frm_5 after `107de19698`. This patch reverts such alt-ref placement change. Change-Id: I93a4a65036575151286f004d455d4fcea88a1550	2017-04-25 18:03:47 -07:00
Jerome Jiang	997e54ea43	Merge "vp9: speed >= 8: Skip uv variance in model_rd_sb_y_large"	2017-04-26 00:09:22 +00:00
Marco	c614164cb6	vp9: SVC: Adjust some speed settings for temporal layers. Make some speed setting changes for temporal enhancement layers, and remove the switch in subpel_force_stop for the aggressive_base_mv in non-rd pickmode. Gain some 2-3% speed with little/negligible quality loss. Change-Id: I3e2a7f80ff45f38c0a6ceb01b34dbca2f53edbf0	2017-04-25 16:27:01 -07:00
Jerome Jiang	69b0242e9a	vp9: speed >= 8: Skip uv variance in model_rd_sb_y_large For speed >= 8 and color_sensitivity not set, skip the transform skipping test in UV planes. Add a new condition to check noise level to skip chroma check for speed >= 8 if y_sad is high. 1~2% speedup on ARM for speed 8. Borg tests show neutral results in both rtc and rtc_derf. Change-Id: Idecd3ff6e28c97757a43bb6f3a7082c85f72109c	2017-04-25 16:21:36 -07:00
Linfeng Zhang	4758d20227	Clean vp9_highbd_build_inter_predictor() and highbd_inter_predictor() BUG=webm:1388 Change-Id: I7ee32e0c08f0fb41712a8cc640b2c5bba872421d	2017-04-25 14:32:20 -07:00
Linfeng Zhang	51dc998f3a	Update highbd convolve functions arguments to use uint16_t src/dst BUG=webm:1388 Change-Id: I6912de2639895d817ce850da8ea9f6c8fe21da42	2017-04-25 14:22:19 -07:00
Marco	92ec0674fd	vp9: Reduce artifact in non-rd pickmode for lighting changes. Add a low-variance high-sumdiff to the superblock content state and use it to limit the mv and bias some decisions in non-rd pickmode. Only affects speed >= 6. Reduces artifact for lighting changes. Small/no difference in metrics on RTC set. Change-Id: Ic84b2379fe0ae3fa71ae826ee6bae3eaf551a25b	2017-04-24 17:08:43 -07:00
Yunqing Wang	10a497bd38	Make the row based multi-threaded encoder deterministic This patch followed allow_exhaustive_searches feature modification and continued to modify the encoder to achieve the determinism in the row based multi-threaded encoding. While row-mt = 1 and using multiple threads, the adaptive feature in encoder was disabled, which gave BDRate gain(at speed 1, -0.6% ~ -0.7%; at speed 2, -0.46% ~ -0.59%), but some encoder speed losses(7% ~ 10% at speed 1 and 3% ~ 6% at speed 2). These speed losses were acceptable considering the speed gains obtained from row-mt. Change-Id: I60d87a25346ebc487a864b57d559f560b7e398bb	2017-04-24 16:28:27 -07:00
Yunqing Wang	c530208ae3	Merge "Make allow_exhaustive_searches feature no longer adaptive"	2017-04-24 17:41:10 +00:00
Marco Paniconi	b35f64241f	Merge "vp9: SVC: fix condition for partition/skip threshold when denoising."	2017-04-21 21:28:17 +00:00
Yunqing Wang	bca4564683	Make allow_exhaustive_searches feature no longer adaptive A previous patch turned on allow_exhaustive_searches feature only for FC_GRAPHICS_ANIMATION content. This patch further modified the feature by removing the exhaustive search limit, and made it no longer adaptive. As a result, the 2 counts that recorded the number of motion searches were removed, which helped achieve the determinism in the row based multi-threading encoding. Tests showed that this patch didn't make the encoder much slower. Used exhaustive_searches_thresh for this speed feature, and removed allow_exhaustive_searches. Also, refactored the speed feature code to follow the general speed feature setting style. Change-Id: Ib96b182c4c8dfff4c1ab91d2497cc42bb9e5a4aa	2017-04-21 11:14:02 -07:00
Jerome Jiang	58fe1bde59	Merge "vp9: Non-rd pickmode: Avoid computation duplication."	2017-04-21 00:51:47 +00:00
Marco	5de0e9ed08	vp9: SVC: fix condition for partition/skip threshold when denoising. The more aggressive settings should only be used when denoise_svc condition is satisfied (which means top spatial layer). Change-Id: Ia8e3515b27f31bf21b1976ca80a2fa826daece3a	2017-04-20 16:36:55 -07:00
Jerome Jiang	7ae1e321a1	vp9: Non-rd pickmode: Avoid computation duplication. In non-rd pickmode (speed >= 5), avoid duplication of computations in model_rd_for_sb_y when the speed feature use_simple_block_yrd is enabled (or for high bitdepth build under certain conditions). QVGA, VGA and HD have 1.23%, 2.68% and 1.7% speedup on ARM for speed 8, respectively. Encoding results are bitexact for speed >= 5. Change-Id: I3f9130810c21439f5ad7e159e21cb2243dcd05f1	2017-04-20 16:20:59 -07:00
Marco	29938b3a5a	vp9: 1 pass SVC: Fix comment and condition for up-sampling reference. No change in behavior. Change-Id: I218fb30289091da623acb23324027435b8510d0e	2017-04-20 14:21:05 -07:00
Yunqing Wang	30ef50b522	Merge "Only allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content"	2017-04-20 19:57:46 +00:00
Marco	3134a52d26	vp9: SVC: Redefine the source downsample filter choice. Rename the source downsampling filter, and define it per spatial layers. Used 1 pass CBR SVC. Change-Id: I8135f2ab89c535c53429b9c58b586f746bb668c7	2017-04-20 10:17:13 -07:00
Yunqing Wang	e96e49c2f9	Only allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content The allow_exhaustive_searches feature improves the encoding quality of FC_GRAPHICS_ANIMATION content a lot. For non-FC_GRAPHICS_ANIMATION content, the quality test result is almost neutral. This patch makes this feature to be used only for FC_GRAPHICS_ANIMATION content. The motivation of doing that is to make this feature no longer adaptive, which will be implemented in the following patch. Change-Id: Ic911df6dd757402b6480789cc247801e99840369	2017-04-20 00:03:27 +00:00
Linfeng Zhang	fbbdba3b04	Merge changes I9e18a73b,Ie47c8cd4 * changes: Clean CONVERT_TO_BYTEPTR/SHORTPTR in convolve Create CAST_TO_BYTEPTR/SHORTPTR	2017-04-19 23:55:58 +00:00
Linfeng Zhang	bf8a49abbd	Clean CONVERT_TO_BYTEPTR/SHORTPTR in convolve Replace by CAST_TO_BYTEPTR/SHORTPTR. The rule is: if a short ptr is casted to a byte ptr, any offset operation on the byte ptr must be doubled. We do this by casting to short ptr first, adding offset, then casting back to byte ptr. BUG=webm:1388 Change-Id: I9e18a73ba45ddae58fc9dae470c0ff34951fe248	2017-04-19 12:13:49 -07:00
Marco	348bdc0195	vp9: Add phase to get averaging filter for 1:2 downsampling. The scaling filter with zero shift will give sub-sampling for 2x downsampling. Allow for a phase shift to get an averaging filter. Usage is for source scaling in 1 pass SVC mode for 1:2 downscale. Reduces aliasing in downsampled image. Keep the phase to 0/off for now. Change-Id: Ic547ea0748d151b675f877527e656407fcf4d51e	2017-04-18 16:56:15 -07:00
Marco	ad2e3598d2	vp9: Add key_frame condition to is_reference check for loopfilter. This condition is not needed as key_frame should set the refresh of the reference frames, but good to have for clarity in condition. Change-Id: Icf9838e7e4f0ff5cf0a9562ae3b5d6c7e6f78702	2017-04-17 15:18:46 -07:00
Marco Paniconi	9aa429a66d	Revert "Revert "vp9: Avoid encoder loopfilter for non-reference frames."" This reverts commit `e9b7f98c56`. Reason for revert: Commit `d578bdad` fixes the issue (encoder/decoder mismatch in 3TL datarate test) that causes the original revert. Original change's description: > Revert "vp9: Avoid encoder loopfilter for non-reference frames." > > This reverts commit `863f860bfc`. > > This causes encoder / decoder mismatches in various > VP9/DatarateTestVP9Large.BasicRateTargeting3TemporalLayers tests > > BUG=webm:1408 > > Change-Id: Ic200c39d7ed9c0b0247ef562f5d6f7b2625f7e14 > TBR=jzern@google.com,marpan@google.com,builds@webmproject.org,jianj@google.com BUG=webm:1408 Change-Id: Ifeb81460856d1d56482d4e0477a70ee98f8bfaa6	2017-04-17 11:02:02 -07:00
James Zern	e9b7f98c56	Revert "vp9: Avoid encoder loopfilter for non-reference frames." This reverts commit `863f860bfc`. This causes encoder / decoder mismatches in various VP9/DatarateTestVP9Large.BasicRateTargeting3TemporalLayers tests BUG=webm:1408 Change-Id: Ic200c39d7ed9c0b0247ef562f5d6f7b2625f7e14	2017-04-14 11:50:06 -07:00
Marco	5f39262dcc	vp9: Adjust speed features for speed 8 at low resoln. For low resolutions (<= CIF): use quarter-pixel and simple_block_yrd. ~5% gain on RTC_derf. ~6-7% slowdown on ARM. Change-Id: I4439ebd1116b9decac04786503f978840b68a60c	2017-04-14 11:35:47 -07:00
Marco Paniconi	b937f1c839	Merge "vp9: SVC: fix to allow use_base_mv to be used for 3 layers."	2017-04-14 17:12:58 +00:00
Marco	adb9b4eddf	vp9: SVC: fix to allow use_base_mv to be used for 3 layers. Allow use_base_mv to be used for 3 spatial layers where base is 4x4 scale from the top layer. Change-Id: If6641baf8b8e4d0fd5dc67619d873c6d75065f43	2017-04-13 20:43:43 -07:00
Marco Paniconi	f0ccaff553	Merge "vp9: Avoid encoder loopfilter for non-reference frames."	2017-04-14 00:45:42 +00:00
Marco	6bff6cb5a9	vp9: 1 pass VBR: Fix to rate control at low min-q. Fix to avoid getting stuck at very low Q even though content is changing, which can happen for --min-q=0. Fix is to more aggressively increase active_worst_quality when detecting significant rate_deviation at very low Q. Change will only affect 1 pass VBR for --min-q < 4, so no change in ytlive metrics for --min-q >= 4. Change-Id: I4dd77dd7c08a30a4390da0ff2c8bda6fccfa76d7	2017-04-13 11:44:35 -07:00
Marco	863f860bfc	vp9: Avoid encoder loopfilter for non-reference frames. Useful for SVC, where the top layer enhancement frames may not update any reference buffers, as is the case for the patterns in the 1 pass CBR SVC when #temporal_layers > 1. ~3% encoder speedup for SVC patterns with temporal layers in 1 pass CBR mode. Updated the SVC datarate tests for the mismatch frames. Set the frame-dropper off in some tests with #temporal_layers > 1 so we can correctly set #mismatch frames. Adjusted rate target threshold for tests where frame-dropper was turned off. Change-Id: Ia0c142f02100be0fed61cd2049691be9c59d6793	2017-04-13 09:51:55 -07:00
Yunqing Wang	f22b828d68	Fix an integer overflow in vp9_mcomp.c The MV unit test revealed an integer overflow issue in vp9_mcomp.c. This was caused if the MV was very large. In mv_err_cost(), when mv->row = 8184, mv->col = 8184 and ref_mv is 0, mv_cost = 34363 and error_per_bit = 132412, causing the overflow. BUG=webm:1406 Change-Id: I35f8299f22f9bee39cd9153d7b00d0993838845e	2017-04-10 18:09:50 -07:00
Jerome Jiang	2420f44342	Merge "vp9: speed >= 8: Adjust speed settings on ARM."	2017-04-11 00:45:21 +00:00
Jerome Jiang	f16f08e55b	vp9: speed >= 8: Adjust speed settings on ARM. Set adaptive_rd_thresh to 2 when simple block yrd is not used. Fix regression caused by computing y sad without int_pro_motion_estimation on low res motion clips. Overall 0.07% quality loss on rtc_derf. Change only affects low res on speed 8. Change-Id: Ic6a188a56529f1034d6431005fb4b0e24e8a7e27	2017-04-11 00:26:56 +00:00
Marco	6557baf336	vp9: 1 pass CBR: avoid nonrd_pick_partition on segment. For speed 5, 1 pass CBR: Don't use the nonrd_pick_partition on the segment, rather use choose_partitioning followed by nonrd_select_partition (as is done on base segment). Little/no quality loss on RTC and RTC_derf (< 0.3%), speedup of at least 5%. Change-Id: I5273d5f950e60adf5e437b4ca8c4f63964641e83	2017-04-10 15:02:49 -07:00
Marco Paniconi	ff1fef9607	Merge "vp9: Fix to noise estimation for temporal denoising."	2017-04-07 17:13:22 +00:00
Yunqing Wang	f496032686	Merge "VP9 motion vector unit test"	2017-04-07 16:46:22 +00:00
Marco	349c3118bd	vp9: Fix to noise estimation for temporal denoising. If the noise estimation is avoided due to large motion, the last_source for denoising should still be updated. Change-Id: I67155ea7dbe9ac2785978e64a27bdafd7d57aac0	2017-04-07 09:23:30 -07:00
Marco	18b54ef468	vp9: Adjust consec_zeromv threshold for aq-mode=3. To reduce refresh on partial super-blocks on boundary, for noisy input. Reduces some artifacts on noisy input. Change-Id: I10b5808a296874e08c7f378b3df58466591d8dbe	2017-04-07 08:54:09 -07:00
James Zern	04e9456567	Merge changes from topic 'Wshorten' * changes: configure: enable -Wshorten-64-to-32 for hbd vp9_encodeframe: resolve -Wshorten-64-to-32 in hbd Resolve -Wshorten-64-to-32 in highbd variance.	2017-04-07 07:32:14 +00:00
Marco	3227a9be5f	vp9: Move the denoising condition for speed 5. Move the condition for effectively disabling the denoising for speed 5 into the vp9_denoiser_denoise(). This is cleaner, and also moving the condition into vp9_denoiser_denoise will keep the denoiser buffer updated with the current source. This allows for more consistent behavior if speed is changed midstream. Change-Id: Ia001f591c56e454bf724c3ae73c024badb183ef8	2017-04-06 11:03:04 -07:00
Jerome Jiang	c9fbb1881a	Merge "vp9: speed 8: Compute y sad without int_pro_motion_estimation."	2017-04-06 02:57:16 +00:00
Jerome Jiang	705fc9f107	Merge "Refactor: Clean memory allocation for copy partition."	2017-04-06 02:57:08 +00:00
Yunqing Wang	1aa46abbdf	VP9 motion vector unit test To prevent the motion vector out of range bug, added a motion vector unit test in VP9. In the 4k video encoding, always forced to use extreme motion vectors and also encouraged to use INTER modes. In the decoding, checked if the motion vector was valid, and also checked the encoder/decoder mismatch. The tests showed that this unit test could reveal the issue we saw before. Change-Id: I0a880bd847dad8a13f7fd2012faf6868b02fa3b4	2017-04-06 00:50:56 +00:00
James Zern	b3e2eb14c5	vp9_encodeframe: resolve -Wshorten-64-to-32 in hbd vp9_high_get_sby_perpixel_variance: the variance operated on is already in 32 bits Change-Id: I97006eb9c08dbd0f88ee35e1a1ca205737508296	2017-04-05 17:34:06 -07:00
Jerome Jiang	288d73c861	vp9: speed 8: Compute y sad without int_pro_motion_estimation. Little change in overall PSNR in rtc. 2-4% speedup on VGA on ARM. Change-Id: I3395806d7afd456deacd4077c330adca13ab0645	2017-04-05 17:25:47 -07:00
Marco	2136de9374	vp9: Temporal denoising: avoid denoising for speed <= 5. Temporal denoiser runs in non-rd pickmode, so it is only used for speed >= 5. Regression exists for speed 5, due to use of reference_partition (which uses non-rd pickmode for partitioning). Avoid denoising for now at speed 5. Change-Id: I74a74d2e1404d7cfd33dcf4ec06dd2e503256cf0	2017-04-05 16:43:39 -07:00
Jerome Jiang	58ba880b94	Refactor: Clean memory allocation for copy partition. Move the memory allocation from setting speed features. Change-Id: I2e89dfaeb46daee63effe5a5df62feed732aa990	2017-04-05 15:33:24 -07:00
Jerome Jiang	fb60204d4c	vp9: Remove legacy comments for avg_source_sad. Change-Id: Ia6e8614535a097f17f37fc382cef8e22e03b70f6	2017-04-04 16:28:27 -07:00
Marco	8097b49997	vp9: Adjust condition of golden update with cyclic refresh. Base the low_content_frame metric on the motion vectors, and adjust the logic for preventing golden update. Small change in behavior: small positive gain (~0.2-1%) on clips with high activity. Change-Id: I0b861c8e9666cd82b45cde5ee57ee8a1e5ab453c	2017-04-04 09:55:24 -07:00
Marco	6b3f4bc794	vp9: 1 pass CBR: cleanup to cyclic refresh. Code cleanup: merged two functions that were doing postencode update for cyclic refresh, remove some unused code and fix comments. No change in behavior. Change-Id: I9be0d7e346d34dec29bf4e5bb380a7bf81c8480a	2017-04-03 16:37:45 -07:00
Yunqing Wang	41fac44707	Merge "Fix for out of range motion vector bug in sub-pel motion estimation"	2017-04-03 18:27:57 +00:00
Marco Paniconi	9d403d6f48	Merge "vp9: SVC: Fix issue with artifact for svc-denoising."	2017-04-03 16:23:25 +00:00
Ranjit Kumar Tulabandu	bf15ca1091	Fix for out of range motion vector bug in sub-pel motion estimation BUG=webm:1397 (yunqingwang) To verify that this patch wouldn't cause much performance change, the Borg tests were run. Here was the result: avg_psnr overall_psnr ssim hdres: -0.002 0.006 0.013 midres: 0 0 0 lowres: 0 0 0 Change-Id: Iae395ae7b741e0513cf5bab9dcace110b792a67d	2017-04-03 16:16:49 +00:00
Yunqing Wang	002cf38837	Merge "Enhance the row mt sync read to accept the sync_range greater than 1"	2017-04-03 15:59:51 +00:00
Yunqing Wang	f1600db3e4	Enhance the row mt sync read to accept the sync_range greater than 1 The row mt sync read uses sync_range = 1, and wouldn't work if we want to use a sync_range that is greater than 1. To make it work, this sync read code is modified. Pass in col instead of col - 1 to make it consistent with other row mt code in VP9, and then add 1 in "while" condition. Change-Id: I4a0e487190ac5d47b8216368da12d80fec779c1a	2017-03-31 10:48:38 -07:00
Marco	c824eda6cc	vp9: SVC: Fix issue with artifact for svc-denoising. Issue/bug happens for denoising with spatial layers, where the golden (spatial) reference is used in pickmode, but denoising is only done wrt last (temporal). Fix is to make sure set_ref_ptrs is set before build predictors in denoiser. Change-Id: I793cf441341edf7c4a88b8ab1e1b22b3cb0eb508	2017-03-31 10:05:32 -07:00
Marco	fc83fcb7c4	vp9: SVC: fix to allow output of denoised result. Change-Id: Iaf55cfb5e9621d074eb33d6a32f184e4777968f8	2017-03-29 14:02:54 -07:00
Marco	32b3d2f174	vp9: 1 pass SVC: Modify condition for intra-mode search. Temporary override to condition for disallowing intra-search in SVC, since golden (spatial) reference is currently suppressed due to artifact issue. Change-Id: I28ed7fdddc9fcdbcc0a4175a247a3ecc94c11767	2017-03-29 09:24:50 -07:00
Marco	0169a985d9	vp9: Speed >= 8: avoid chroma check under some condition. For non-rd variance partition, avoid the chroma check unless y_sad is below some threshold. Small decrease in avgPSNR (~0.3) on RTC set. Small/negligible decrease on RTC_derf. Change-Id: I7af44235af514058ccf9a4f10bb737da9d720866	2017-03-27 13:18:21 -07:00
Marco	66c6b4d6fc	vp9: 1 pass: Move source sad computation into encodeframe loop. Refactor to split the 1 pass source sad computation into scene detection (currently used for VBR and screen-content mode), and superblock based source sad computation (used in non-rd CBR mode). This allows the source sad computation for CBR mode to be multi-threaded. No change in compression. Change-Id: I112f2918613ccbd37c1771d852606d3af18c1388	2017-03-27 11:11:05 -07:00
Marco	07ad5a15c2	vp9: Fix to condition on using source_sad for 1 pass real-time. Make the source_sad feature work properly for cases of VBR or screen_content with SVC. Added unittest for SVC with screen-content on. Change-Id: Iba5254fd8833fb11da521e00cc1317ec81d3f89b	2017-03-24 10:21:47 -07:00
Alex Converse	d7b220b467	Merge changes Ie989e60c,Ifc110b12 * changes: Backport "Optimize the use case of token_cost table" to VP9 Drop vp9_get_token_extracost	2017-03-23 18:05:13 +00:00
Marco	4863e07c01	vp9: Non-rd partition: avoid unneeded call to chroma_check Since y_sad is not computed yet (on the early exit due to source_sad), no need to check for setting color_sensitivity. Only affects speed >=8. No change in behavior. Change-Id: I3a6f2d20fed38d8b8ec51b75bcacf9a21f2db916	2017-03-22 22:40:28 -07:00
James Zern	f16ea6a6eb	Merge "vp9_rdopt: correct size to vpx_sum_squares_2d_i16"	2017-03-23 00:53:22 +00:00
Marco Paniconi	ff0e0a76e8	Merge "vp9: Adjust some speed settings for speed 8."	2017-03-22 22:56:17 +00:00
Marco	4d50991320	vp9: Adjust some speed settings for speed 8. Allow for simple_block_rd for VGA resoln, and reduce adaptive_rd_thresh to 1. On average no loss on RTC set, ~4% speedup on mac. Change-Id: Ib549c4061c853776062b5e34040f839d470fbebc	2017-03-22 15:16:15 -07:00
Jerome Jiang	dcd6c87b80	Merge "vp9: Enable adaptive_rd_threshold for row mt for realtime speed 8."	2017-03-22 22:02:24 +00:00
James Zern	5661cd8ff4	vp9_rdopt: correct size to vpx_sum_squares_2d_i16 the current implementations expect pixel size, not the block type BUG=webm:1392 Change-Id: Ib91e9f30a1f56e13566b1fb76f089dae9bb50cdc	2017-03-22 12:04:33 -07:00
Johann	36d732c22b	vp9 temporal filter: add const to function prototype The input frames are not modified. Change-Id: Ideb810e3c5afeb4dbdc4c7d54024c43a8129ad39	2017-03-22 18:14:21 +00:00
Jerome Jiang	20c2892693	vp9: Enable adaptive_rd_threshold for row mt for realtime speed 8. Change it to row based array to avoid the slow down caused by sync. row-mt on, speed 8, 2 threads: ~4% speedup for VGA on ARM benefited from adaptive_rd_threshold. Change-Id: I887e65a53af20a6c4f48d293daaee09dab3512cf	2017-03-21 18:49:47 -07:00
Jerome Jiang	dbed479d79	Fix the data race caused by vp9 denoiser. BUG=webm:1391 Change-Id: I9669ae62fe9c695d4c6f9973094cb0f39bed51c7	2017-03-21 15:46:25 -07:00
Yunqing Wang	1935dfb294	Code refactoring in the partition search Computed the partition search early termination score in a separate function. Change-Id: I1894b517ff179a38b1c05e054d373ac4b7f4cbb4	2017-03-21 10:00:44 -07:00
Marco Paniconi	05c7259525	Merge "vp9: Nonrd variance partition: improve split to 16x16."	2017-03-21 00:17:35 +00:00
Yunqing Wang	bf43b4c4b4	Merge "Record the sum of tx block eobs in the partition block"	2017-03-20 23:20:12 +00:00
Marco	3135b85423	vp9: Nonrd variance partition: improve split to 16x16. Add additional condition to split to 16x16, for resolutions <= 360p, reduces dragging artifact near moving boundary. Small/no change on RTC metrics. Change-Id: I314694f2166435d918f74e7ab42f002b07f40dae	2017-03-20 15:44:46 -07:00
Marco	06c8713e89	vp9: Use sb content measure to bias against golden. For each superblock, keep track of how far from current frame was the last significant content change, and use that (along with GF distance), to turn off GF search in non-rd pickmode. Only enabled for speed >= 8. avgPSNR on RTC/RTC_derf down by ~0.9/1.2. Speedup on mac: ~3-5%. Speedup on arm: 3.6% for VGA and 4.4% for HD. Change-Id: Ic3f3d6a2af650aca6ba0064d2b1db8d48c035ac7	2017-03-20 12:42:26 -07:00
Yunqing Wang	9c2552a1c1	Record the sum of tx block eobs in the partition block The sum of tx block eobs is needed in the machine learning based partition early termination. The eobs are first accumulated during tx search, and then the value associated with the best tx_size is copied to ctx for later use. After the sum of eobs is calculated correctly, re-enabled ml_partition_search_early_termination speed feature. Re-did the quality/speed test to check the impact of the fix. 1. Borg test BDRATE result: 4k set: PSNR: +0.183%; SSIM: +0.100%; hdres set: PSNR: +0.168%; SSIM: +0.256%; midres set: PSNR: +0.186%; SSIM: +0.326%; 2. Average speed gain result: 4k clips: 21%; hd clips: 26%; midres clips: 15%. The result is in line with the original result. Change-Id: I4209a95c89be03b4cbfb6a95b16885f89feddbda	2017-03-20 17:12:15 +00:00
Jingning Han	ca9bedd538	Backport "Optimize the use case of token_cost table" to VP9 cherry picked from nextgenv2 `90ea281f29` Change-Id: Ie989e60c6479ac3251cadaac9c7e795ccba52f4e	2017-03-17 16:54:22 -07:00
Alex Converse	ab71181545	Drop vp9_get_token_extracost vp9_get_token_cost does the same thing with one fewer lookup. Change-Id: Ifc110b12403cb1a04a3f91357ab435c67b4815d6	2017-03-17 16:53:09 -07:00
Alex Converse	0842daa24e	Merge "vp9_optimize_b: Combine extrabits cost with token lookup"	2017-03-17 16:18:21 +00:00
Marco	02975a604c	vp9: Fix speed 8 condition for enabling copy_partition. Change-Id: I2c090e6ba853a30fef1957b620853315f9471753	2017-03-16 17:08:37 -07:00
Alex Converse	3a6ec9ea72	vp9_optimize_b: Combine extrabits cost with token lookup About 0.6% fewer cycles spent in vp9_optimize_b. Change-Id: I2ae62a78374c594ed81d4e3100a5848e2f6f2c4e	2017-03-16 17:03:22 -07:00
Gabriel Marin	976ddb61d3	Add a vector form of routine vp9_model_rd_from_var_lapndz Add routine vp9_model_rd_from_var_lapndz_vec and call it from model_rd_for_sb to model the rate and distortion for MAX_MB_PLANE Laplacian sources in parallel. The caller ensures that all sources have non-zero variance. Measured a 18% to 25% reduction in retired instructions, and 17% to 24% reduction in instruction execution cost with different compilers for the Laplacian modeling. No change in behavior. TEST=Verified that encoded files match bit for bit, with and without this change. BUG=b/33678225 Change-Id: I6b76947f21c659a349adb896e13e99f6e3f951e6	2017-03-16 22:19:44 +00:00
Marco	bc7d4935bb	vp9: Fixes in non-rd pickmode for denoising with SVC. Don't denoise spatial layer frames whose base layer is a key frame. Disallow golden reference for SVC with denoising on frames that will be denoised (highest layer), as this removes bad artifact. Will re-enable when issue is resolved. Change-Id: I87a6597812330500966458172acfce54af65f70f	2017-03-16 12:59:41 -07:00
Jerome Jiang	bf40776aa4	Merge "Refactor: Change cpi->resize_state to enum values."	2017-03-16 16:43:42 +00:00
Marco Paniconi	cd47c1942e	Merge "vp9: Fix some issues with denoiser and SVC."	2017-03-16 02:42:55 +00:00
Marco	a340c64a79	vp9: Fix some issues with denoiser and SVC. Fix the update of the denoiser buffer when the base spatial layer is a key frame. And allow for better/lower QP on high spatial layers when their base layer is key frame. Change-Id: I96b2426f1eaa43b8b8d4c31a68b0c6d68c3024a2	2017-03-15 17:19:17 -07:00
Jerome Jiang	b5f7f7737a	Refactor: Change cpi->resize_state to enum values. Change-Id: Iab1409b0fc1175bc5a14afc4749a08c536c98c41	2017-03-15 17:16:17 -07:00
Marco	2c8430e223	vp9: Turn off ml_partition_search_early_termination. Fails on nightly ubsan, valgrind tests. Enabled on commit:6701014 Change-Id: Ied3f5cb38e39cba54ac134f4514107cdfdfce159	2017-03-15 15:00:38 -07:00
Jerome Jiang	27d5a57072	Merge "vp9: Using source sad for speedup for dynamic resizing."	2017-03-15 00:03:52 +00:00
Jerome Jiang	2fa7092808	Merge "vp9: Enable row multithreading for SVC in real-time mode."	2017-03-14 23:29:46 +00:00
Jerome Jiang	02463273c9	vp9: Using source sad for speedup for dynamic resizing. Only for speed >= 7. Change-Id: I3ac85fbb4023cf7e6f8333806b345b0174382a09	2017-03-14 15:47:19 -07:00
James Zern	1b91f41935	Merge "vp9/encoder: fix segfault on win32 using vs < 2015"	2017-03-14 19:21:42 +00:00
Yunqing Wang	c3e290963d	Merge "Apply machine learning-based early termination in VP9 partition search"	2017-03-14 18:07:05 +00:00
Marco Paniconi	78a6946904	Merge "vp9: Speed >= 8: Enable simple_block_yrd speed feature."	2017-03-14 17:50:17 +00:00
Marco	c0c789ab50	vp9: Adjust copy partition threshold, for speed 8. Reduce it from 5 to 4, small/no change in metrics or speed. Small reduction in dragging artifact near moving head. Change-Id: Ic3bc5ca67c70bf0c89fc2ed14454840a28ae5b6a	2017-03-14 09:18:53 -07:00
Marco	c216c8d6f2	vp9: Speed >= 8: Enable simple_block_yrd speed feature. Enable speed feature for resolutions > VGA. avgPSNR on RTC down by ~1.7%. Speedup on ARM: ~5%. Change-Id: I7a3fe5f7425aa8df3f4a2eced1afa355bc0d4c95	2017-03-14 09:10:28 -07:00
Marco	f0a22b23fe	vp9: Fix to source_sad feature for SVC. Allow speed feature sf->use_source_sad to be used on highest spatial layer for SVC. Change-Id: I260eb0478902764f49f83e43b17024fe86ff3b22	2017-03-13 11:00:40 -07:00
Yunqing Wang	670101439f	Apply machine learning-based early termination in VP9 partition search This patch was based on Yang Xian's intern project code. Further modifications were done. 1. Moved machine-learning related parameters into the context structure. 2. Corrected the calculation of sum_eobs. 3. Removed unused parameters and calculations. 4. Made it work with multiple tiles. 5. Added a speed feature for the machine-learning based partition search early termination. 6. Re-organized the code. The patch was rebased to the top-of-tree. Borg test BDRATE result: 4k set: PSNR: +0.144%; SSIM: +0.043%; hdres set: PSNR: +0.149%; SSIM: +0.269%; midres set: PSNR: +0.127%; SSIM: +0.257%; Average speed gain result: 4k clips: 22%; hd clips: 23%; midres clips: 15%. Change-Id: I0220e93a8277e6a7ea4b2c34b605966e3b1584ac	2017-03-13 09:54:18 -07:00
Marco	8c18df7fcd	vp9: Fix condition for intra search in non-rd pickmode. Fixes an issue when the LAST and golden are not used as a reference, in which case it's possible no encoding mode is set (since intra may be skipped under certain conditions). Fix is to make sure intra is searched if no inter mode is checked. Issue can happen for temporal layer pattern#7 in vpx_temporal_svc_encoder.c Change-Id: I5ab4999b2f9dbd739044888e0916b5ec491d966b	2017-03-12 22:30:39 -07:00
James Zern	c09b290cea	vp9/encoder: fix segfault on win32 using vs < 2015 shift the bsse[] member of the macroblock struct to the front to avoid an incorrect offset (0) to the upper half of bsse[0] which leads to a negative resulting in a crash. restrict this to visual studio versions before 2015 (the bug was observed with 2013, fixed in 2015) to avoid any potential cache impact on other platforms. https://connect.microsoft.com/VisualStudio/feedback/details/2396360/bad-structure-offset-in-32-bit-code BUG=webm:1054 Change-Id: I40f68a1d421ccc503cc712192263bab4f7dde076	2017-03-10 17:37:17 -08:00
Marco	ffb3c50da1	vp9: Enable row multithreading for SVC in real-time mode. Enable row-mt for SVC for real-time mode, speed >=5. Add the controls to the sample encoders, but keep it off for now. Add the control and enable it for the 1 pass CBR unittests. For speed 7, 3 layer SVC, 2 threads, row-mt enabled gives about ~5% speedup. Change-Id: Ie8e77323c17263e3e7a7b9858aec12a3a93ec0c1	2017-03-10 01:01:07 +00:00
James Zern	cb60e66085	Merge "move vp9_scale_and_extend_frame_c to vp9_frame_scale.c"	2017-03-09 22:51:08 +00:00
James Zern	2f31a16445	move vp9_scale_and_extend_frame_c to vp9_frame_scale.c this is similar to the x86 configuration and helps mitigate an issue with a circular dependency between this function and the ssse3 variant causing an outsized increase in binary size (~300K for chrome) chrome.dll: .text 255B000 -> 252B000 .data 7B000 -> 75000 -221184 bytes BUG=chromium:697956 Change-Id: Ic95b142ecd62dd4f1795788aa27dd8fab59b708c	2017-03-08 21:13:50 -08:00
Marco	ea3c817ac2	vp9: Enable two speed features for SVC real-time mode. Enable short_circuit_low_temp_var and limit_newmv_early_exit for SVC, 1 pass CBR mode. Change-Id: I77df2b2c6cc40657bb8ea76e19dfc2fdaad6389e	2017-03-08 16:13:59 -08:00
Yunqing Wang	099e9bf1ff	Make the partition search early termination feature to be frame size dependent The 2 thresholds(i.e. partition_search_breakout_dist_thr and partition_search_breakout_rate_thr) are used as the partition search early termination speed feature. This refactoring patch made this feature to be frame size dependent consistently throughout the code. Change-Id: Idaa0bd8400badaa0f8e2091e3f41ed2544e71be9	2017-03-08 12:56:41 -08:00
Marco	45de35fc58	vp9: Fix for denoising with SVC. Fix the condition for getting last_source when denoising is on. This avoids unneeded scaling in the case of SVC. No change in quality. Change-Id: I32c1c2c9085104da51af8535716bcc4d55fb0f42	2017-03-08 09:45:58 -08:00
Alex Converse	15dac923b9	Merge "Narrow cat6_high_cost tables to uint16_t"	2017-03-03 23:45:39 +00:00
Alex Converse	bcd12de6c3	Narrow cat6_high_cost tables to uint16_t Saves 2688 bytes of rodata. Change-Id: I46633b6e50c2845181c70fff6273a8e58fdd1e56	2017-03-03 23:09:12 +00:00
Vignesh Venkatasubramanian	9e7140b451	Merge "vp9,realtime: Enable row multithreading for non-rd"	2017-03-03 19:05:52 +00:00
Marco	b60617f5ff	vp9: Speed 8: reduce the adaptive_rd_thresh level. Reduce the level from 4 to 2. This gives ~1-2% quality gain on RTC set, with small decrease in speed (~1-2% on mac). Change-Id: I7d959731badcee3d45b2f4a08efe378765016a13	2017-03-02 13:34:10 -08:00
Vignesh Venkatasubramanian	453f18040f	vp9,realtime: Enable row multithreading for non-rd Enable row level multithreading for realtime encodes where non-rd path is used (speed >= 5). Change-Id: I5439cb49a02171166d8e1de06c7d5e6f8e819a41	2017-03-02 11:03:56 -08:00
James Zern	8697d14ec8	Revert "Fix for max qindex calculation of a gf interval" This reverts commit `d3db846cc5`. This change causes a large drop in psnr (4-5db) on low framerate difficult content (tested at 360/480p) BUG=b/35804225 Change-Id: I8e90012d3b9c8a0cddb062ba93b01b36c0e0c0a0	2017-02-28 16:26:13 -08:00
Marco	defe094e9e	vp9: Fix an issue with setting variance thresholds. From commit: https://chromium-review.googlesource.com/c/441393/ On non-segment the set_vbp_thresholds() should be called again to adjust thresholds based on content_state of superblock. This was the intended behavior from 441393. Small change in RTC metrics and speed. Change-Id: I45e5fbdc4af74db76b3cb4f13074fcae0eb2219e	2017-02-27 12:09:51 -08:00
Vignesh Venkatasubramanian	5881601488	vp9: Rename new_mt to row_mt new_mt is a very generic name that will get obsolete soon enough. Since this is exposed as a codec control, renaming it to row_mt to signify row level parallelism. Also renaming the ETHREAD_BIT_MATCH codec control to ROW_MT_BIT_EXACT. Change-Id: Ic7872d78bb3b12fb4cf92ba028ec8e08eb3a9558	2017-02-27 09:43:26 -08:00
Jerome Jiang	e96ab22462	Merge "Make vp9_scale_and_extend_frame_ssse3 work for hbd when bitdepth = 8."	2017-02-24 16:56:33 +00:00
Johann	904b957ae9	consolidate block_error functions vp9_highbd_block_error_8bit_c was a very simple wrapper around vp9_block_error_c. The SSE2 implementation was practically identical to the non-HBD one. It was missing some minor improvements which only went into the original version. In quick speed tests, the AVX implementation showed minimal improvement over SSE2 when it does not detect overflow. However, when overflow is detected the function is run a second time. The OperationCheck test seems to trigger this case and reverses any speed benefits by running ~60% slower. AVX2 on the other hand is always 30-40% faster. Change-Id: I9fcb9afbcb560f234c7ae1b13ddb69eca3988ba1	2017-02-24 05:25:26 +00:00
Johann Koenig	aa911e8b41	Merge "block error sse2: use tran_low_t"	2017-02-24 05:24:34 +00:00
Jerome Jiang	0998a146d4	Make vp9_scale_and_extend_frame_ssse3 work for hbd when bitdepth = 8. Only works for bitdepth = 8 when compiled with high bitdepth flag. 4x speed ups for handling 1:2 down/upsampling. Validated manually for: 1) Dynamic resize for a single layer encoding 2) SVC encoding with 3 spatial layers Results are bitexact with the patch and the speed gain (~4x) in the scaling was verified. BUG=webm:1371 Change-Id: I1bdb5f4d4bd0df67763fc271b6aa355e60f34712	2017-02-23 20:40:28 -08:00
Johann	3c16bbb73b	block error sse2: use tran_low_t Change-Id: Ib04990e4a7bda9fbf501f294da2057a2b2595deb	2017-02-24 01:33:35 +00:00
Marco Paniconi	1d12a125e7	Merge "vp9: 1pass CBR: modify condition for reducing loop filter."	2017-02-23 03:24:26 +00:00
Jerome Jiang	a6b6258284	Merge "vp9: Non-rd pickmode: use simple block_yrd under some conditions."	2017-02-22 23:19:29 +00:00
Marco	84f106f198	vp9: 1pass CBR: modify condition for reducing loop filter. The reduction showed improvement on RTC when aq-mode=3 is on. Add that (cyclic refresh enabled) to the condition. Only affects 1 pass CBR. Change-Id: I5d0843002d8e31d7c165098a62e7a71146b08664	2017-02-22 15:09:45 -08:00
Marco	7e7d820d5b	vp9: Non-rd pickmode: use simple block_yrd under some conditions. For speed 8 only. 3% speed up for QVGA and 6.3% for VGA on Nexus 6. ~3% avgPSNR decrease on rtc_derf and 2.9% on rtc. Disabled for now. Change-Id: I70133f1f6c804d663d594df437bfe7fdb0030d6a	2017-02-22 13:22:53 -08:00
Marco Paniconi	0acc270830	Merge "vp9: aq-mode=3: On key frame reset cr->reduce_refresh to 0."	2017-02-22 19:52:24 +00:00
Marco	7e79831016	vp9: aq-mode=3: On key frame reset cr->reduce_refresh to 0. This prevent possible reduction of cyclic refresh after key frame. Change-Id: Idd4e49b69cd95476e7eccfa31b2bd8669569e9e8	2017-02-22 10:50:08 -08:00
Jerome Jiang	3d1fa00fce	vp9: Only compute y_sad for golden in variance partition for speed < 8. Only affects speed 8. No obvious quality regression. Systematic speed ups by ~1% on Nexus 6. Change-Id: Ia904ca28ea041c3281c532911ec38fb7d7f46a17	2017-02-22 10:19:09 -08:00
Yunqing Wang	66f36f4735	Merge "Refactored the row based multi-threading code"	2017-02-22 16:55:04 +00:00
Jerome Jiang	b1dcaf7f1e	Merge "Fix segmentation fault caused by denoiser working with spatial SVC."	2017-02-22 04:44:55 +00:00
Marco	7f2daa74a0	vp9: Incorporate source sum_diff into non-rd partition thresholds. Increase the variance partition thresholds for superblocks that have low sum-diff (from source analysis prior to encoding frame). Use it for now only for speed >= 7 or for denoising on. Small change on metrics for rtc set: less than ~0.1 avgPSNR decrease on RTC set, for both speed 7 and 8. Change-Id: I38325046ebd5f371f51d6e91233d68ff73561af1	2017-02-21 17:22:11 -08:00
Johann Koenig	1e224dcb83	Merge "Drop zbin_ptr and quant_shift_ptr"	2017-02-21 18:16:38 +00:00
Jerome Jiang	0d1e5a21c4	Fix segmentation fault caused by denoiser working with spatial SVC. Re-enable the affected test. BUG=webm:1374 Change-Id: I98cd49403927123546d1d0056660b98c9cb8babb	2017-02-21 09:38:28 -08:00
Paul Wilkins	4d4231352c	Merge "Change to prediction decay calculation."	2017-02-21 09:42:38 +00:00
Marco	4e1ba35458	vp9: Fix for non-rd pickmode for high-bitdepth build. Use the simple block_yrd under certain conditions. The optimization code is completed but the speed is still slower (~6% on 720p) than the low-bitdepth build. For now, use the more complex block_yrd under certain conditions (always use it for speed <= 5, otherwise use it on key frames and for bsize >= 32x32). This gives about ~2-3% gain in quality for speed 7 on RTC set (over high bitdepth build), with about the same encoder fps as the low bitdepth build. Change-Id: Ibe92a1945d0bd635f880befb4c815727df62d754	2017-02-20 20:25:36 -08:00
Ranjit Kumar Tulabandu	97d6a4cbd1	Refactored the row based multi-threading code Modified the code to facilitate bit-match tests in first pass Added unit-tests to test the row based multi-threading behavior for bit-exactness Change-Id: Ieaf6a8f935bb1075597e0a3b52d9989c8546d7df	2017-02-20 16:13:45 +05:30
paulwilkins	a63adac604	Change to prediction decay calculation. This change subtracts out low complexity intra regions that are also low error in the inter domain, in the calculation of the frame prediction decay. The rationale here is that low complexity regions (such as sky) do not imply high prediction decay in the same way as high error intra or neutral blocks. The effect of this is small in most clips but in a few clips it can be > 10%. (E.g. In to tree) Change-Id: If67ac23d17fca14285cad2defa464c61c9ea861c	2017-02-17 09:29:24 +00:00
James Zern	b5bc9ee02d	Merge "cosmetics: Fix spelling mistake in compile flag name."	2017-02-17 00:04:42 +00:00
Johann Koenig	a9b81da575	Merge "block error avx2: use tran_low_t"	2017-02-16 23:51:14 +00:00
paulwilkins	d218b0914e	cosmetics: Fix spelling mistake in compile flag name. agressive -> aggressive after: `ce7b38459` Aggressive VBR method. Change-Id: Ie0f30b1bbc77ed9f32bec047b4a9b3d0cf4853f5	2017-02-16 14:51:31 -08:00
Johann	ca4e27f5da	Drop zbin_ptr and quant_shift_ptr vp9[_highbd]_quantize_fp[_32x32] and vp9_fdct8x8_quant do not make use of these parameters. scan is used for C code and iscan is used for SIMD implementations. Change-Id: I908a0ff7d3febac33da97e0596e040ec7bc18ca5	2017-02-16 13:20:32 -08:00
Johann	2104454607	block error avx2: use tran_low_t Change-Id: Ic5f3a1f569d6f82afeaf4fcd7235374bb460db3c	2017-02-16 12:39:02 -08:00
Johann Koenig	cc43012674	Merge changes I267050a5,Iebade0ef,Id96a8df3 * changes: quantize_fp_32x32 highbd ssse3: enable existing function quantize_fp highbd ssse3: use tran_low_t for coeff quantize_fp highbd sse2: use tran_low_t for coeff	2017-02-16 20:34:48 +00:00
Yunqing Wang	0bf6b51572	Merge "Structured the mode ordering code to avoid redundant memcpy"	2017-02-16 16:22:54 +00:00
Johann	4682130b60	quantize_fp highbd ssse3: use tran_low_t for coeff Change-Id: Iebade0efc0efbb0a80a0f3adbef4962e3a2f25e8	2017-02-16 07:40:56 -08:00
Johann	ac3996a6d1	quantize_fp highbd sse2: use tran_low_t for coeff Change-Id: Id96a8df33354a7987ce890a3d6798c7375ffa4aa	2017-02-16 07:40:55 -08:00
Johann	44600442dc	bitdepth conversion: really use num elements The previous implementation confused bit/bytes/elements. It was using '32' as the multiplier but that was mistakenly adopted because a 32x32 transform embedded the stride. Change-Id: Ieeb867a332416b9a40580b5e7c9b20088e9e691a	2017-02-16 15:02:48 +00:00
Ranjit Kumar Tulabandu	5127e58dab	Structured the mode ordering code to avoid redundant memcpy Change-Id: I4f5d6b54018bd1928cd9e5e42619e6f55b334803	2017-02-16 14:12:33 +00:00
Paul Wilkins	60a10116d1	Merge "Disconnect ARF breakout from frame boost."	2017-02-16 10:02:09 +00:00
Paul Wilkins	543ebc900f	Merge "Remove unnecessary factor."	2017-02-16 10:01:58 +00:00
Paul Wilkins	9216ba58d8	Merge "Bug in scale_sse_threshold()"	2017-02-16 10:01:46 +00:00
Paul Wilkins	e6c1993f1b	Merge "Additional first pass stats."	2017-02-16 09:39:29 +00:00
Marco	158b300952	vp9: Some code cleanup for aq-mode = 3. The weight segment needs to only be computed once per frame, so remove it from the function vp9_cyclic_refresh_rc_bits_per_mb(), which is called within a loop inside vp9_rc_regulate_q. Change-Id: Ia0e18b89abb97e42c466d4dbc47700d7f76555db	2017-02-15 14:07:04 -08:00
Marco	f82280820a	vp9. Use same source_sad threshold for all speeds. Only affects real-time mode. Change-Id: Iba836f110c4da936f5173cc0f54424d5b6121bff	2017-02-15 11:28:26 -08:00
Marco	716c1d5ff5	Vp9: Speed 8 aq-mode=3: Reduce computation in estimating bits per mb. vp9_compute_qdelta_by_rate has almost 2% overhead in profiling on Nexus 6. Reduce the calling of that function in speed 8 by estimating the delta-q. Both rtc and rtc_derf show little/no change in avg psnr/ssim. Encoding speed is 2~3% faster on Nexus 6. Change-Id: If25933715783f31104a18a5092ea347b1221b5f5	2017-02-15 09:28:16 -08:00
paulwilkins	cfc79a357a	Disconnect ARF breakout from frame boost. This small change replaces the frame boost check in the arf group length break out clause with a test against a prediction decay value. The boost value is in fact partly dependent on the decay value but this change means that the per frame boost calculation can be adjusted without influencing the group length calculation. The value chosen gives a close match on all the test sets with the previous code (on average) but it was noted that a lower threshold was slightly better for 1080P and up and a slightly higher value for small image sizes. Change-Id: I4d5b9f67d5b17b0d99ea3f796d3d6202fd61ee0c	2017-02-15 10:46:14 +00:00
paulwilkins	b89ba05ab4	Remove unnecessary factor. Removed unnecessary scaling factor to simplify. Change-Id: I3fc9c5975a2597e72f1324e09dd586dea1facfa7	2017-02-15 10:45:43 +00:00
paulwilkins	76550dfdc0	Bug in scale_sse_threshold() The function scale_sse_threshold() returns a threshold scaled if necessary for use with 10 and 12 bit from an 8 bit baseline. SSE error values would be expected to rise for the 10 and 12 bit cases where there are more bits of precision. Hence the threshold used for the test should also be scaled up. Change-Id: I4009c98b6eecd1bf64c3c38aaa56598e0136b03d	2017-02-15 10:45:03 +00:00
paulwilkins	945ccfee59	Additional first pass stats. Added counts that split the intra coded blocks into low and high variance. Change-Id: Ic540144b34d5141659081bb22f7ee16fd6861f14	2017-02-15 10:44:37 +00:00
Paul Wilkins	7635ee0f37	Merge "Aggressive VBR method."	2017-02-15 10:37:02 +00:00
Johann Koenig	61927ba4ac	Merge "vp9 fdct highbd neon: connect existing highbd calls"	2017-02-15 01:33:00 +00:00
Yunqing Wang	f2c1aea118	Merge "Row based multi-threading of encoding stage"	2017-02-15 00:54:10 +00:00
Ranjit Kumar Tulabandu	71061e9332	Row based multi-threading of encoding stage (Yunqing Wang) This patch implements the row-based multi-threading within tiles in the encoding pass, and substantially speeds up the multi-threaded encoder in VP9. Speed tests at speed 1 on STDHD(using 4 tiles) set show that the average speedups of the encoding pass(second pass in the 2-pass encoding) is 7% while using 2 threads, 16% while using 4 threads, 85% while using 8 threads, and 116% while using 16 threads. Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de	2017-02-15 00:49:34 +00:00
Johann	3e7aa8fda9	vp9 fdct highbd neon: connect existing highbd calls Change-Id: Ia8f822bd6e70b3911bc433a5a750bfb6f9a3a75c	2017-02-14 22:11:49 +00:00
Johann Koenig	9c2bb7f342	Merge "quantize_fp highbd neon: use tran_low_t for coeff"	2017-02-14 21:28:23 +00:00
clang-format	4b402746ca	apply clang-format Change-Id: I75e4a9e0b37bd4586f26c8d6c1fa27f3f6ff1bce	2017-02-14 12:45:52 -08:00
Johann	2b24aa87d9	quantize_fp highbd neon: use tran_low_t for coeff Change-Id: I90fd815f15884490ad138f35df575a00d31e8c95	2017-02-14 10:26:10 -08:00
Yunqing Wang	318ca07657	The bitstream bit match test in multi-threaded encoder While the new-mt mode is enabled(namely, allowing to use row-based multi-threading in encoder), several speed features that adaptively adjust encoding parameters during encoding would cause mismatch between single-thread encoded bitstream and multi-thread encoded bitstream. This patch provides a set_control API to disable these features, so that the bit match bitstream is obtained in the unit test. Change-Id: Ie9868bafdfe196296d1dd29e0dca517f6a9a4d60	2017-02-13 13:02:26 -08:00
James Zern	3c4ea94210	cosmetics,vp9_ratectrl: apply clang-format broken since: `c3f095c8b` Merge "Fix to avoid abrupt relaxation of max qindex in recode path" `5f21aba4b` Fix to avoid abrupt relaxation of max qindex in recode path the original change pre-dated the addition of .clang-format Change-Id: If5e399d9a805bcad9147360b13b36fbc8c560a7c	2017-02-13 11:29:39 -08:00
paulwilkins	ce7b38459a	Aggressive VBR method. VBR method that allows a wider Q range for the first normal frame in each ARF group and then centers the min - max range for the rest of the arf group on the chosen Q value for that first frame. This allows for quite rapid adjustment of the active Q range even if the initial estimate is poor. In some cases where the ARF frames themselves are tending to undershoot but the normal frames are overshooting this can still give net undershoot. This can be corrected by allowing a larger Q delta for arf frames but is usually a sign that the allocation to the arfs was too high. Change-Id: Icec87758925d8f7aeb2dca29aac0ff9496237469	2017-02-13 15:42:11 +00:00
Marco	22dcfa80aa	vp9: Non-rd mode: use simple block_yrd for 8 bit high bitdepth builds Temporary fix until optimization work for block_yrd is completed. This essentially reverts back to the state before the change: https://chromium-review.googlesource.com/c/433821/ Compression loss is about ~5-6% on RTC set. Speed-up (from using this simple/model-based block_yrd) over the low bitdepth builds (which uses more complex block_yrd) is ~5% on 720p. Change-Id: Ie0af9eb0d111e5595f587870c44f08317403b8d8	2017-02-10 10:15:35 -08:00
Paul Wilkins	c3f095c8b3	Merge "Fix to avoid abrupt relaxation of max qindex in recode path"	2017-02-09 17:17:55 +00:00
Paul Wilkins	82b88a7fd0	Merge "Fix for max qindex calculation of a gf interval"	2017-02-09 17:17:44 +00:00
Johann Koenig	b73f99745b	Merge "block_error_fp highbd sse2: use tran_low_t for coeff"	2017-02-07 23:26:10 +00:00
Marco Paniconi	71f5314993	Merge "vp9: Denoiser speed-up: increase partition and ac skip thresholds."	2017-02-07 22:25:00 +00:00
Yunqing Wang	b106abe570	Merge "Row based multi-threading of ARNR filtering stage"	2017-02-07 19:55:41 +00:00
Marco Paniconi	259e835b1b	Merge "vp9: Adjust rate_err threshold for setting active_worst factor."	2017-02-07 19:25:47 +00:00
Marco	1a5482d4d8	vp9: Denoiser speed-up: increase partition and ac skip thresholds. Add factor to increase variance partition and ac skip thresholds, under certain conditions (noise level and sum_diff), to increase denoiser speed. Change-Id: I7671140ef3598bf5f114a72623d68792bcd7b77b	2017-02-07 10:33:13 -08:00
Marco	3c2f076ad0	vp9: Adjust rate_err threshold for setting active_worst factor. Only affects 1 pass vbr. Small improvement on ytlive set. Change-Id: I09a7456fe658fbea82ece1035cf683bd8bd8bd14	2017-02-07 09:38:16 -08:00
Johann	537949a9df	block_error_fp highbd sse2: use tran_low_t for coeff BUG=webm:1365 Change-Id: Id2ed3ebaaaa6a4b68628c23e08b64ea5f1341761	2017-02-07 15:03:28 +00:00
Ranjit Kumar Tulabandu	91f01a2060	Row based multi-threading of ARNR filtering stage Change-Id: Ic238d32c7e10b730342224ab56712a89a6026a8f	2017-02-07 14:03:19 +05:30
Johann Koenig	85f3a82355	Merge "highbd x86: consolidate tran_low_t conversions"	2017-02-07 02:49:58 +00:00

6817 Commits
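
Commit 76550dfdc0 ("Bug in scale_sse_threshold()") says that a threshold tuned on an 8 bit baseline should be scaled up for 10 and 12 bit input, because SSE values rise with the extra precision. The following is a minimal sketch of that reasoning, not the libvpx code: the Python signature is hypothetical, and the scaling factor is an assumption derived from the fact that each extra bit doubles a sample difference and therefore quadruples its square.

```python
def scale_sse_threshold(sse_threshold: int, bit_depth: int) -> int:
    """Scale an SSE threshold tuned for 8-bit video to a higher bit depth.

    Hypothetical helper for illustration only; the commit message states
    that the threshold should be scaled up, not the exact factor.
    """
    if bit_depth not in (8, 10, 12):
        raise ValueError("bit_depth must be 8, 10 or 12")
    extra_bits = bit_depth - 8
    # A sample difference grows by 2**extra_bits, so a squared error
    # grows by 2**(2 * extra_bits). Shift left to scale the threshold UP.
    return sse_threshold << (2 * extra_bits)


if __name__ == "__main__":
    for depth in (8, 10, 12):
        print(depth, scale_sse_threshold(100, depth))
```

Scaling in the opposite direction (a right shift) would make the test stricter at higher bit depths, which is the kind of mistake the commit message describes.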