generic-library/vpx

Author	SHA1	Message	Date
paulwilkins	aecb1770d5	Merge "Image size restriction to rd auto partition search."	2015-05-07 14:12:14 +00:00
Marco	76fe5dfc67	Remove EIGHTTAP_SHARP filter check for non-rd mode. Using EIGHTTAP and EIGHTTAP_SMOOTH seems sufficient. Hard to see any visual gain from allowing EIGHTTAP_SHARP, and it is rarely selected. PSNR/SSIM metrics go up by ~0.18/0.14%. Change-Id: I96fa0d98f9321b913e3ebcd464d4ff3c63018791	2015-05-06 17:08:34 -07:00
Johann	d5d9289800	Move shared SAD code to vpx_dsp Create a new component, vpx_dsp, for code that can be shared between codecs. Move the SAD code into the component. This reduces the size of vpxenc/dec by 36k on x86_64 builds. Change-Id: I73f837ddaecac6b350bf757af0cfe19c4ab9327a	2015-05-06 16:58:20 -07:00
Yunqing Wang	36eabb1c3c	Add intra mode early termination in non-rd mode Added the intra mode early termination in order to speed up the mode search in non-rd case since we started to include more intra modes in the search list. Borg tests(rtc set) showed a 0.048% PSNR gain and 0.061 SSIM gain. No speed change. Change-Id: I6f255fe534dc50b736e6a66a726ad458eb9b4443	2015-05-05 16:31:36 -07:00
paulwilkins	af76953448	Merge "Remove CONSTRAIN_NEIGHBORING_MIN_MAX."	2015-05-05 09:32:11 +00:00
paulwilkins	4cd65e4f19	Merge "Adjust ARF min and max interval."	2015-05-05 09:31:38 +00:00
Marco	b9a72d3c4d	Allow for H and V intra modes for non-rd mode. For non-rd mode (speed >=5): use mask based on prediction block size, and (for non-screen content mode) allow for checking horiz and vert intra modes for blocks sizes < 16x16. Avg psnr/ssim metrics go up by about ~0.2%. Only allowing H/V intra on block sizes below 16x16 for now, to keep encoding time increase very small, and also when allowing H/V on 16x16 blocks, metrics went down on a few clips which need to be further examined. Change-Id: I8ae0bc8cb2a964f9709612c76c5661acaab1381e	2015-05-04 09:48:41 -07:00
Yunqing Wang	d31256cd38	Merge "Reduce intra_cost_penalty for BLOCK_8X8"	2015-05-01 18:29:38 +00:00
Yunqing Wang	57fefd5f9a	Merge "Adjust the vbp early termination threshold slightly"	2015-05-01 18:29:25 +00:00
paulwilkins	4a7dcf8eb2	Image size restriction to rd auto partition search. Impose a limit on the rd auto partition search based on the image format. Smaller formats require that the search includes a smaller minimum block size. This change is intended to mitigate the visual impact of ringing in some problem clips, for smaller image formats. Change-Id: Ie039e5f599ee079bbef5d272f3e40e2e27d8f97b	2015-05-01 16:16:02 +01:00
paulwilkins	287b0c6da9	Remove CONSTRAIN_NEIGHBORING_MIN_MAX. Remove one of the auto partition size cases. This case can behave badly in some types of animated content and was only used for the rd encode path. A subsequent patch will add additional checks to help further improve visual quality. Change-Id: I0ebd8da3d45ab8501afa45d7959ced8c2d60ee4e	2015-05-01 15:15:16 +01:00
paulwilkins	e0786c280e	Adjust ARF min and max interval. Previously limit on max interval set to 0.5 seconds. Though this helped some low frame rate material it appears to be a bit too aggressive for some 24 and 25 fps content. This patch relaxes the limit to 0.75 seconds. The patch also adds a new minimum interval variable to replace the current hard wired value. This allows us to impose a limit on the maximum number of primary arfs per second for high frame rate (e.g. 50 & 60fps) content. This is to address concerns regarding playback performance on some platforms if there is a high base frame rate and very frequent arfs. Change-Id: I373e8b6b2a8ef522eced6c6d2cceb234ff763fcf	2015-05-01 15:11:49 +01:00
Yunqing Wang	4907c29904	Reduce intra_cost_penalty for BLOCK_8X8 This patch reduced the BLOCK_8X8's intra_cost_penalty, which allows 8x8 blocks to conduct intra mode search. Borg test result(rtc set): 0.077% PSNR gain, 0.228% SSIM gain. No speed changes. Change-Id: Icfe90c4f6969de24bda8ecacbd3da50330bf22b2	2015-04-30 11:03:06 -07:00
Yunqing Wang	fd90ce2711	Merge "Improve golden frame refreshing in non-rd mode"	2015-04-30 15:57:55 +00:00
Yunqing Wang	a257e469e1	Adjust the vbp early termination threshold slightly Calculated cpi->vbp_threshold_sad from this frame's dequant value. The encoding quality and speed didn't change much. Borg test result: PSNR: -0.002%, SSIM: -0.003%. Change-Id: I97c9826986f39582f29910d637d08a69c90afdee	2015-04-30 08:51:02 -07:00
Yunqing Wang	d31698b0e0	Improve golden frame refreshing in non-rd mode The default golden frame interval was doubled. After encoding a frame, the background motion was measured. If the motion was high, the current frame was set as the golden frame. Currently, the changes were applied only while aq-mode 3 was on. Borg tests(rtc set) showed a 0.226% PSNR gain and 0.312% SSIM gain. No speed changes. Change-Id: Id1e2793cc5be37e8a9bacec1380af6f36182f9b1	2015-04-29 16:43:43 -07:00
James Zern	f58011ada5	vpx_mem: remove vpx_memset vestigial. replace instances with memset() which they already were being defined to. Change-Id: Ie030cfaaa3e890dd92cf1a995fcb1927ba175201	2015-04-28 20:00:59 -07:00
James Zern	f274c2199b	vpx_mem: remove vpx_memcpy vestigial. replace instances with memcpy() which they already were being defined to. Change-Id: Icfd1b0bc5d95b70efab91b9ae777ace1e81d2d7c	2015-04-28 19:59:41 -07:00
James Zern	fbd3b89488	vpx_mem: remove vpx_memmove vestigial. replace instances with memmove() which they already were being defined to. Change-Id: If396d3f9e3cf79c0ee5d7429615ef3d6b2a34afa	2015-04-28 19:59:40 -07:00
Yaowu Xu	b3e411e481	Add validation of UV partition size For color sampling formats other than 420, a valid partition size in Y may not work for the UV planes. This commit adds validation of the UV partition size before selecting the partition choice. This fixes a crash for real time encoding of 422 input. Change-Id: I1fe3282accfd58625e8b5e6a4c8d2c84199751b6	2015-04-24 12:34:18 -07:00
Jim Bankoski	a6e9ae9066	Adds worst frame metrics for a bunch of metrics. Change-Id: Ieaccc36ed1bee024bb644a9cfaafdaaa65d31772	2015-04-22 06:45:56 -07:00
paulwilkins	e07b141da0	Merge "Modified test for auto key frame detection."	2015-04-22 02:29:17 -07:00
paulwilkins	5d8877a944	Merge "Limit arf interval for low fpf clips."	2015-04-22 02:25:38 -07:00
Jim Bankoski	3b35e962e2	Merge "Adds a new temporal consistency metric to libvpx."	2015-04-21 16:11:11 -07:00
Scott LaVarnway	8b17f7f4eb	Revert "Remove mi_grid_* structures." (see I3a05cf1610679fed26e0b2eadd315a9ae91afdd6) For the test clip used, the decoder performance improved by ~2%. This is also an intermediate step towards adding back the mode_info streams. Change-Id: Idddc4a3f46e4180fbebddc156c4bbf177d5c2e0d	2015-04-21 11:16:45 -07:00
Jim Bankoski	ee87e20d53	Adds a new temporal consistency metric to libvpx. Change-Id: Id61699ebf57ae4f8af96a468740c852b2f45f8e1	2015-04-21 10:05:37 -07:00
paulwilkins	3606b78108	Modified test for auto key frame detection. The existing test was triggering a lot of false positives on some types of animated material with very plain backgrounds. These were triggering code designed to catch key frames in letter box format clips. This patch tightens up the criteria and imposes a minimum requirement on the % blocks coded intra in the first pass and the ratio between the % coded intra and the modified inter % after discounting neutral (flat) blocks that are coded equally well either way. On a particular problem animation clip this change eliminated a large number of false positives including some cases where the old code selected kf several times in a row. Marginal false negatives are less damaging typically to compression and in the problem clip there are now a couple of cases where "visual" scene cuts are ignored because of well correlated content across the scene cut. Replaced some magic numbers related to this with #defines and added explanatory comments. Change-Id: Ia3d304ac60eb7e4323e3817eaf83b4752cd63ecf	2015-04-21 12:50:11 +01:00
Yaowu Xu	b423a6b212	Resolve configuration conflict Between --enable-internal-stats and --enable-vp9-highbitdepth Change-Id: I36b741554e835033e69883270b6b0e5374a1aafa	2015-04-20 16:44:12 -07:00
Yaowu Xu	305492c375	Move declaration before statement Change-Id: Ib64786fcc0d6dc11c4e66f5b7f3e93b2a4fcb664	2015-04-20 09:50:59 -07:00
Jim Bankoski	03829f2fea	Merge "Adds a blockiness metric to internal stats."	2015-04-17 16:06:26 -07:00
Jim Bankoski	3d2f037a44	Merge "adds psnrhvs to internal stats."	2015-04-17 16:06:10 -07:00
Jim Bankoski	f2cbee9a04	Merge "Adds a fastssim metric to VPX internal stats."	2015-04-17 16:05:53 -07:00
Jim Bankoski	1777413a2a	Adds a blockiness metric to internal stats. Change-Id: Iedceeb020492050063acf3fd2326f96c29db9ae5	2015-04-17 11:13:18 -07:00
Jim Bankoski	9757c1aded	adds psnrhvs to internal stats. PSNR HVS is a human visual system weighted version of SNR that's gained some popularity from academia and apparently better matches MOS testing. This code is borrowed from the Daala Project but uses our FDCT code. Change-Id: Idd10fbc93129f7f4734946f6009f87d0f44cd2d7	2015-04-17 10:29:27 -07:00
Jim Bankoski	3f7f194304	Adds a fastssim metric to VPX internal stats. This code appeared in the Daala project first and was originally committed by Nathan Egge. Change-Id: Iadce416a091929c51b46637ebdec984cddcaf18c	2015-04-17 10:23:24 -07:00
Jingning Han	73bce9ec7e	Merge "Remove unnecessary backup token stream pointer"	2015-04-17 09:13:53 -07:00
Marco Paniconi	f76ccce5bc	Revert "Revert "Force_split on 16x16 blocks in variance partition."" This reverts commit `004b9d83e3` Change-Id: I2f2d0bdb9368c2c07f1d29a69cd461267a3a8743	2015-04-16 17:52:13 -07:00
Jingning Han	645c70f852	Remove unnecessary backup token stream pointer When the tokenization is not taking effect, the tokenization pointer remains unchanged. No need to re-assign the backup pointer value. Change-Id: I58fe1f6285aa3b4a88ceb864c11d5de8ac6235dd	2015-04-16 16:44:44 -07:00
Minghai Shang	29b5cf6a9d	Merge "[svc] Fix syntax error when encoding multiple tiles."	2015-04-16 13:43:44 -07:00
Minghai Shang	4aa9255efa	[svc] Fix syntax error when encoding multiple tiles. Change-Id: Ia77b551415f3b3386e22a6c805f244f2d13fe3e3	2015-04-16 12:56:30 -07:00
paulwilkins	effd974b16	Limit arf interval for low fpf clips. This patch limits the maximum arf interval length to approximately half a second. In some low fps animations in particular the existing code was selecting an overly long interval which was hurting visual quality. For a sample problem test clip (360P animation , 15fps, ~200Kbit/s) this change also improved metrics by >0.5 db. There may be some clips where this hurts metrics a little, but the worst case impact visually is likely to be less than having an interval that is much too long. On more normal material at 24 fps or higher, the impact is likely to be nil/minimal. Change-Id: Id8b57413931a670c861213ea91d7cc596375a297	2015-04-16 11:50:37 +01:00
Yunqing Wang	14e7203e7b	Merge "Fix Tsan errors"	2015-04-15 15:34:03 -07:00
Yunqing Wang	63c5bf2b9c	Fix Tsan errors This patch fixed 2 reported Tsan errors while running VP9 real-time encoder. Change-Id: Ib0278fe802852862c3ce87c4a500e544d7089f67	2015-04-15 12:33:39 -07:00
Johann	14ef4aeafb	Reorganize *_rtcd() calling conventions Change-Id: Ib1e17d8aae9b713b87f560ab5e49952ee2bfdcc2	2015-04-15 11:12:05 -04:00
Yunqing Wang	004b9d83e3	Revert "Force_split on 16x16 blocks in variance partition." This reverts commit `eb8c667570`. The patch caused mismatch while using multi-threads. Change-Id: Icd646340af25b5d91e32f03ed3ea212e00e3e0be	2015-04-14 15:19:31 -07:00
Marco	eb8c667570	Force_split on 16x16 blocks in variance partition. Force split on 16x16 block (to 8x8) based on the minmax over the 8x8 sub-blocks. Also increase variance threshold for 32x32, and add exit condition in choose_partition (with very safe threshold) based on sad used to select reference frame. Some visual improvement near moving boundaries. Average gain in psnr/ssim: ~0.6%, some clips go up ~1 or 2%. Encoding time increase (due to more 8x8 blocks) from ~1-4%, depending on clip. Change-Id: I4759bb181251ac41517cd45e326ce2997dadb577	2015-04-13 12:05:07 -07:00
Jingning Han	2404332c1b	Merge "Remove get_nonrd_var_based_fixed_partition function"	2015-04-09 14:45:19 -07:00
Jingning Han	4565812032	Merge "Compute prediction filter type cost only when needed"	2015-04-09 14:45:11 -07:00
Jingning Han	93d9c50419	Merge "SSSE3 assembly implementation of 8x8 Hadamard transform"	2015-04-09 11:16:11 -07:00
Jingning Han	208aa6158b	Remove get_nonrd_var_based_fixed_partition function This function has been replaced by other approaches and is not in use now. Change-Id: I387f45b5607d202539e482468ccc70e6c0f9341f	2015-04-09 09:49:55 -07:00
Debargha Mukherjee	59681be0a0	Merge "Improve accuracy of rate control in CQ mode"	2015-04-08 10:48:17 -07:00
James Zern	2ed0cf06f9	Merge "vp9_full_search_sadx[38]: align sad arrays"	2015-04-07 20:57:21 -07:00
Yaowu Xu	c88ce84bb5	Merge "Optimize the checking for transform skipping"	2015-04-07 16:29:51 -07:00
Yaowu Xu	90517b5e85	Merge "move ref_frame_cost computations into a function"	2015-04-07 16:29:45 -07:00
Debargha Mukherjee	60bd744c88	Improve accuracy of rate control in CQ mode Modifies a special handling that improves rate control accuracy in the constrained quality mode, when the undershoot and overshoot limits are set tighter. Change-Id: If62103f0ef3ed1cac92807400678c93da50cf046	2015-04-07 16:29:21 -07:00
James Zern	e1ff83f4b0	vp9_full_search_sadx[38]: align sad arrays the sse4 code expects 16-byte aligned arrays; vp8 already had a similar change applied: `b2aa401` Align SAD output array to be 16-byte aligned Change-Id: I5e902035e5a87e23309e151113f3c0d4a8372226	2015-04-07 14:34:06 -07:00
Jingning Han	927693a991	Merge "Enable Hadamard transform based cost estimate for all block sizes"	2015-04-07 12:51:27 -07:00
Jingning Han	6de407b638	Merge "Account for eob cost in the RTC mode decision process"	2015-04-07 12:50:30 -07:00
Jingning Han	25206e7b7f	Compute prediction filter type cost only when needed Skip redundant prediction filter type cost in filter search loop, if the rate value will be reset in Hadamard transform based rate distortion estimate. Change-Id: Ie5221f4bc8da9461c449df367251aeeac52c6e5d	2015-04-07 12:41:46 -07:00
Yaowu Xu	0bb897211d	Optimize the checking for transform skipping If U is not skippable, then do not perform the check on V. Change-Id: Iba5e8362bd42390197f373c44388a426a4404549	2015-04-06 17:54:05 -07:00
Jingning Han	7f629dfca4	SSSE3 assembly implementation of 8x8 Hadamard transform It uses about 10% less CPU cycles than the SSE2 intrinsic implementation. Change-Id: I91017c0c068679a214b98cdd4cff3a6facfb7499	2015-04-04 09:59:37 -07:00
Jingning Han	9922e4344a	Enable Hadamard transform based cost estimate for all block sizes This commit turns on the Hadamard transform based rate distortion estimate for all block sizes in RTC coding mode. It conditionally skips the rate distortion estimation if all zero block flag is set on. No significant encoding speed change is observed. The compression performance of speed -6 is improved by 1.7% over using it only for block sizes of 32x32 and below. Change-Id: I768145e6f05c737b05b5b5f1ee674e929532cafb	2015-04-04 09:58:45 -07:00
Yunqing Wang	b2baaa215b	Merge "Fix the scaling factor in UV skipping test"	2015-04-03 17:09:59 -07:00
Yunqing Wang	1a1114d21c	Fix the scaling factor in UV skipping test The threshold scaling factor was calculated wrong using partition size "bsize". Thank Yaowu for pointing it out. It was fixed and no speed change was seen. Change-Id: If7a5564456f0f68d6957df3bd2d1876bbb8dfd27	2015-04-03 16:07:43 -07:00
Jingning Han	30e9c091c0	Merge "Tune SSSE3 assembly implementation to improve quantization speed"	2015-04-03 11:24:28 -07:00
Jingning Han	60e01c6530	Account for eob cost in the RTC mode decision process This commit accounts for the transform block end of coefficient flag cost in the RTC mode decision process. This allows a more precise rate estimate. It also turns on the model to block sizes up to 32x32. The test sequences show about 3% - 5% speed penalty for speed -6. The average compression performance improvement for speed -6 is 1.58% in PSNR. The compression gains for hard clips like jimredvga, mmmoving, and tacomascmv at low bit-rate range are 1.8%, 2.1%, and 3.2%, respectively. Change-Id: Ic2ae211888e25a93979eac56b274c6e5ebcc21fb	2015-04-03 10:31:51 -07:00
Yunqing Wang	12cb30d4bd	Merge "Set vbp thresholds for aq3 boosted blocks"	2015-04-02 18:22:08 -07:00
Yaowu Xu	718feb0f69	move ref_frame_cost computations into a function Change-Id: Iebf2ad2b1db7e2874788fda8d55e67f4cb1149f1	2015-04-02 18:10:55 -07:00
Marco	f85f79f630	Merge "Code cleanup: put (8x8/4x4)fill_variance into separate function."	2015-04-02 17:33:01 -07:00
Yunqing Wang	cae03a7ef5	Set vbp thresholds for aq3 boosted blocks The vbp thresholds are set separately for boosted/non-boosted superblocks according to their segment_id. This way we don't have to force the boosted blocks to split to 32x32. Speed 6 RTC set borg test result showed some quality gains. Overall PSNR: +0.199%; Avg PSNR: +0.245%; SSIM: +0.802%. No speed change was observed. Change-Id: I37c6643a3e2da59c4b7dc10ebe05abc8abf4026a	2015-04-02 15:48:32 -07:00
Marco	77ea408983	Code cleanup: put (8x8/4x4)fill_variance into separate function. Code cleanup, no change in behavior. Change-Id: I043b889f8f0b3afb49de0da00873bc3499ebda24	2015-04-02 13:37:35 -07:00
Marco	6eb05c9ed0	Small fix to segment check in pickmode. Change-Id: Id5fd82a504def2523292466fbaad5dade9424c72	2015-04-02 09:55:13 -07:00
Jingning Han	2149f214d5	Merge "Reduce required xmm number by one in block_error_fp"	2015-04-01 15:46:22 -07:00
Jingning Han	657cabe0f7	Tune SSSE3 assembly implementation to improve quantization speed Change-Id: If0ca8b25b4800d4336e6cbc97194cd9b01c5b5a3	2015-04-01 15:28:01 -07:00
Yaowu Xu	fff4654d36	Merge "Simplify bsize calculation"	2015-04-01 15:06:55 -07:00
Jingning Han	cf4447339e	Merge "Optimize quantization simd implementation"	2015-04-01 14:55:18 -07:00
Jingning Han	a4364e5146	Merge "Simplify effective src_diff address computation"	2015-04-01 14:55:03 -07:00
Jingning Han	7acb2a8795	Merge "Refactor block_yrd function for RTC coding mode"	2015-04-01 14:54:24 -07:00
Yaowu Xu	ba91b54d7c	Simplify bsize calculation Change-Id: Ibc514684def9914c66f04cb7931f773e2b79c168	2015-04-01 12:15:06 -07:00
Jingning Han	19da916716	Simplify effective src_diff address computation Remove redundant offset calculation for effective src_diff address. Change-Id: I4aab241a36abcef7fd8adf74aed5e12b8b88e0ef	2015-04-01 12:07:47 -07:00
Jingning Han	f2cf3c06a0	Reduce required xmm number by one in block_error_fp Use 6 xmms instead of 8. Change-Id: If976ad85d09191d2fb0565399d690f2869dbbcc7	2015-04-01 12:07:35 -07:00
Jingning Han	1470529f62	Refactor block_yrd function for RTC coding mode This commit separates Hadamard transform/quantization operations from rate and distortion computation in block_yrd. This allows one to skip SATD computation when all transform blocks are quantized to zero. It also uses a new block error function that skips repeated computation of sum of squared residuals. It reduces the CPU cycles spent on block error calculation in block_yrd by 40%. Change-Id: I726acb2454b44af1c3bd95385abecac209959b10	2015-04-01 12:00:43 -07:00
Jingning Han	eed1badedd	Optimize quantization simd implementation This commit allows the quantizer to compare the AC coefficients to the quantization step size to determine if further multiplication operations are needed. It makes the quantization process 20% faster without coding statistics change. Change-Id: I735aaf6a9c0874c82175bb565b20e131464db64a	2015-04-01 11:47:09 -07:00
Yunqing Wang	a0043c6d30	Enhance the transform skipping decision-making in non-rd mode For large partition blocks(block_size > 32x32), the variance calculation is modified so that every 8x8 block's variance is stored during the calculation, which is used in the following transform skipping test. Also, the variance for every tx block is calculated. The skipping test checks all tx blocks in the partition, and sets the skip flag only if all tx blocks are skippable. If the skip flag of Y plane is 1, a quick evaluation is done on UV planes. If the current partition block is skippable in YUV planes, the mode search checks fewer inter modes and doesn't check intra modes. The rtc set borg test(at speed 6) showed that: Overall psnr: -0.527%; Avg psnr: -0.510%; ssim: -0.573%. Average single-thread speedup on rtc set was 3.5%. For 720p clips, more speedups were seen. gipsrecmotion: 13% gipsrestat: 12% vidyo: 5 - 9% dark: 15% niklas: 6% Change-Id: I8d8ebec0cb305f1de016516400bf007c3042666e	2015-04-01 09:43:40 -07:00
Yunqing Wang	fc98114761	Merge "Rename vbp thresholds"	2015-03-31 16:33:30 -07:00
Yunqing Wang	c28ff1a9de	Rename vbp thresholds Code refactoring Change-Id: I410fcce1bc6d95c62c474445f4c97ea8469f1e79	2015-03-31 15:14:44 -07:00
Jingning Han	502ac72233	Merge "Tuning SATD rate calculation for speed"	2015-03-31 14:24:26 -07:00
Jingning Han	1c39c5b96f	Merge "Use aligned copy in 8x8 Hadamard transform SSE2"	2015-03-31 12:16:47 -07:00
Jingning Han	fa4289522e	Merge "Allow block skip coding option in RTC mode"	2015-03-31 12:16:36 -07:00
Jingning Han	1638d7dc96	Merge "Fix 8x8 Hadamard SSE2 implementation"	2015-03-31 12:16:27 -07:00
Alex Converse	9670d766ab	Merge "VP9E_GET_ACTIVE_MAP API function."	2015-03-31 11:52:56 -07:00
Jingning Han	531468a07a	Tuning SATD rate calculation for speed This commit allows the encoder to check the eob per transform block to decide how to compute the SATD rate cost. If the entire block is quantized to zero, there is no need to add anything; if only the DC coefficient is non-zero, add its absolute value; otherwise, sum over the block. This reduces the CPU cycles spent on vp9_satd_sse2 to one third. Change-Id: I0d56044b793b286efc0875fafc0b8bf2d2047e32	2015-03-31 11:02:20 -07:00
hui su	d4f2f1dd5b	Merge "Move vp9_coef_con_tree to common/"	2015-03-31 10:51:10 -07:00
Jingning Han	014fa45298	Use aligned copy in 8x8 Hadamard transform SSE2 This reduces the 8x8 Hadamard transform cycles by 20%. Change-Id: If34c5e02f3afa42244c6efabe121f7cf5d2df41b	2015-03-31 10:21:52 -07:00
Jingning Han	db5ec37edc	Merge "Enable 16x16 Hadamard transform in SATD based mode decision"	2015-03-31 09:55:41 -07:00
Jingning Han	8c5670bb6f	Merge "Use SATD based mode decision for block sizes below 16x16"	2015-03-31 09:47:47 -07:00
Jingning Han	ebe1be9186	Allow block skip coding option in RTC mode When the estimated rate-distortion cost of skip coding mode is lower than that of sending quantized coefficients, allow the encoder to drop these coefficients. This improves the compression performance of speed -6 by 0.268% and makes the encoding speed slightly faster. Change-Id: Idff2d7ba59f27ead33dd5a0e9f68746ed3c2ab68	2015-03-31 09:32:53 -07:00
hui su	302e24cb3e	Move vp9_coef_con_tree to common/ This tree should be defined in common/, as it is needed for both encoder and decoder. Change-Id: I4f5cbc80025cf2ced14182c98f7c82dc7d0f87db	2015-03-31 09:20:46 -07:00
Jingning Han	9b99eb2e12	Merge "Reuse inter prediction pixel block for Hadamard transform"	2015-03-30 16:09:38 -07:00
Jingning Han	34a996ac1e	Fix 8x8 Hadamard SSE2 implementation This commit fixes the SSE2 version 8x8 Hadamard transform alignment and makes it consistent with the C version. Change-Id: I1304e5f97e0e5ef2d798fe38081609c39f5bfe74	2015-03-30 15:54:08 -07:00
Jingning Han	26d3d3af6a	Enable 16x16 Hadamard transform in SATD based mode decision This commit replaces the 16x16 2D-DCT transform with Hadamard transform for RTC coding mode. It reduces the CPU cycles cost on 16x16 transform by 5X. Overall it makes the speed -6 encoding speed 1.5% faster without compromise on compression performance. Change-Id: If6c993831dc4c678d841edc804ff395ed37f2a1b	2015-03-30 15:43:31 -07:00
Jingning Han	f0ac5aaa08	Merge "Hadamard transform based coding mode decision process"	2015-03-30 15:43:15 -07:00
Jingning Han	b4b5af6acd	Use SATD based mode decision for block sizes below 16x16 This commit makes the encoder to select between SATD/variance as metric for mode decision. It also allows to account chroma component costs for mode decision as well. The overall encoding time increase as compared to variance based mode selection is about 15% for speed -6. The compression performance is on average 2.2% better than variance based approach, with about 5% compression performance gains for hard clips (e.g., jimredvga, nikas720p, and mmmoving) at lower bit-rate range. Change-Id: I4d04a31d36f4fcb3f5f491dacd6e7fe44cb9d815	2015-03-30 15:20:07 -07:00
Jingning Han	8a927a1b7a	Reuse inter prediction pixel block for Hadamard transform It saves one unnecessary motion compensated prediction constructed by using 8-tap filter. Change-Id: I101215131e6f38621d5935885f94cc74de6a5377	2015-03-30 15:04:33 -07:00
Jingning Han	8c411f74e0	Hadamard transform based coding mode decision process This commit uses Hadamard transform based rate-distortion cost estimate for rtc coding mode decision. It improves the compression performance of speed -6 for many hard clips at lower bit-rates. For example, 5.5% for jimredvga, 6.7% for mmmoving, 6.1% for niklas720p. This will introduce extra encoding cycle costs at this point. Change-Id: Iaf70634fa2417a705ee29f2456175b981db3d375	2015-03-30 14:46:05 -07:00
Alex Converse	bf7def9a43	Merge "Simplify skip check."	2015-03-30 11:31:45 -07:00
Marco	fa20a60f0d	Speed 5: use non-rd mode for key frame coding. Metrics on RTC set go down by ~1.5% on average. Key frame encoding time goes down by factor of ~5. Change-Id: Ia83acc55848613870e5ac6efe7f3d904d877febb	2015-03-27 16:19:26 -07:00
Adrian Grange	ad18b2b641	Remove 8-bit array in HBD Creating both 8- and 16-bit arrays and then only using one of them is wasteful. Change-Id: Ic5b397c283efaff7bcfff2d2413838ba3e065561	2015-03-25 15:37:03 -07:00
Adrian Grange	65df3d138a	Replace heap with stack memory allocation Replaced the dynamic memory allocation of the second_pred buffer with an allocation on the stack. Change-Id: I2716c46b71e8587714ca5733a99eca2c68419b23	2015-03-25 15:36:43 -07:00
Adrian Grange	8d8d7bfde5	Fix use of scaling in joint motion search To enable us to use the scale-invariant motion estimation code during mode selection, each of the reference buffers is scaled to match the size of the frame being encoded. This fix ensures that a unit scaling factor is used in this case rather than the one calculated assuming that the reference frame is not scaled. Change-Id: Id9a5c85dad402f3a7cc7ea9f30f204edad080ebf	2015-03-25 15:35:29 -07:00
paulwilkins	ab788c5380	Merge "Enable group adaptive max q by default."	2015-03-24 15:00:12 -07:00
Alex Converse	4dcb839607	VP9E_GET_ACTIVE_MAP API function. This is useful when aq mode 3 (cyclic refresh) reactivates segments for refresh. Change-Id: I3ad1d9410b899ede393d82bb8db14e2da4d84eca	2015-03-24 11:19:47 -07:00
Yaowu Xu	c77d4dcb35	Merge "vp9_pred_mv(): misc fixes and optimizations"	2015-03-24 10:36:51 -07:00
Alex Converse	02697e35dc	Merge "A tiny cyclic refresh / active map fix."	2015-03-24 09:43:24 -07:00
paulwilkins	8ea7bafdaa	Merge "Revised rd adjustment for variance."	2015-03-24 03:12:56 -07:00
paulwilkins	c0b71cf82f	Merge "Experimental rd bias based on source vs recon variance."	2015-03-24 03:12:41 -07:00
Alex Converse	31f1563a92	A tiny cyclic refresh / active map fix. Change-Id: I198727461455c8c198a0c892d02ed3cb1673aa50	2015-03-23 18:51:00 -07:00
hkuang	cd1d40ff5d	Merge "Safely free all the frame buffers after all the workers finish the work."	2015-03-23 16:50:15 -07:00
Alex Converse	b7605a9d70	Simplify skip check. SEG_LVL_SKIP implies skip. This is enforced by skip = write_skip(). Change-Id: I61c79581c9c53deae36685c2bcf388cb4d8827d3	2015-03-23 10:53:31 -07:00
paulwilkins	691ec45b4e	Enable group adaptive max q by default. Set the GF group adaptive max Q compile flag to 1 by default. This change has a quite big visual impact in some clips and also contributes to tighter rate control. For short test clips that have consistent content the impact is quite small on metrics but for more varied long form clips there is a drop in overall psnr but a sharp rise in average psnr caused by greater expenditure on some easier sections and tighter rate clipping in hard sections. In chunked encodes some of the effect will already be present due to the independent rate control in each chunk but this change takes the control down to a smaller scale. yt hd +10.67%, -3.77%, -1.56%; yt +9.654%, -3.6%, -1.82%; std hd +0.25%, -0.85%, -0.42%; derf +0.25%, -1.1%, -0.87% Change-Id: Ibbc39b800d99d053939f4c6712d715124082843e	2015-03-23 15:57:09 +00:00
Yaowu Xu	9fd8abc541	vp9_pred_mv(): misc fixes and optimizations 1. skip near if it is same as nearest 2. correct rounding for converting mv to fullpel position 3. update pred_mv_sad after new mv search. Overall .1%~.25% compression gains on rtc set for speed 5, 6, 7, 8. Change-Id: Ic300ca53f7da18073771f1bb993c58cde9deee89	2015-03-20 17:17:04 -07:00
Alex Converse	6d6ef8eb3c	Don't apply active map on key frames. This allows applications to be KF oblivious. Change-Id: Ic02712eae6ad8d6b3eaec26548299d24ca0d5cc0	2015-03-20 14:57:24 -07:00
Alex Converse	e032fc7b9e	Set loop filter level to zero on inactive segment. Change-Id: I6022a79351882a72a219aee13563bf21bcd70383	2015-03-20 14:43:06 -07:00
paulwilkins	7e234b9228	Revised rd adjustment for variance. Revised adjustment for rd based on source complexity. Two cases: 1) Bias against low variance intra predictors when the actual source variance is higher. 2) When the source variance is very low, give a slight bias against predictors that might introduce false texture or features. The impact on metrics of this change across the test sets is small and mixed. derf -0.073%, -0.049%, -0.291%; std hd -0.093%, -0.1%, -0.557%; yt +0.186%, +0.04%, -0.074%; ythd +0.625%, +0.563%, +0.584%. Medium to strong psycho-visual improvements in some problem clips. This feature and intra weight on GF group length now turned on by default. Change-Id: Idefc8b633a7b7bc56c42dbe19f6b2f872d73851e	2015-03-20 11:59:39 +00:00
paulwilkins	9a1ce7be7d	Experimental rd bias based on source vs recon variance. This experiment biases the rd decision based on the impact a mode decision has on the relative spatial complexity of the reconstruction vs the source. The aim is to better retain a semblance of texture even if it is slightly misaligned / wrong, rather than use a simple rd measure that tends to favor use of a flat predictor if a perfect match can't be found. This improves the appearance of texture and visual quality on specific test clips but is hidden under a flag and currently off by default pending visual quality testing on a wider Yt set. Change-Id: Idf6e754a8949bf39ed9d314c6f2daaa20c888aad	2015-03-20 11:57:36 +00:00
Adrian Grange	12d946df89	Restore first ref frame pointer to the correct value The joint_motion_search function alternates prediction between two reference frames. In order to reuse existing code, a pointer to the appropriate reference frame is written into xd->plane[0].pre[0], that the motion estimation code assumes points to the reference frame. If this first reference frame was scaled then the pointer was incorrectly being reset to point to the unscaled reference frame rather than the scaled version. Change-Id: I76f73a8d8f4f15c1f3a5e7e08a35140cdb7886ab	2015-03-19 16:17:31 -07:00
Adrian Grange	53c9ebe609	Move joint_motion_search & delete function prototype Change-Id: I7fb3a78ed0e0bc940d8b4a57c470302f8369782f	2015-03-19 14:28:52 -07:00
hkuang	b88dac8938	Safely free all the frame buffers after all the workers finish the work. Issue: 978 Change-Id: Ia7aa809095008f6819a44d7ecb0329def79b1117	2015-03-19 12:21:00 -07:00
Jingning Han	067fc49996	Merge "Speed up non-rd mode decision search"	2015-03-19 09:18:10 -07:00
Jingning Han	411bbce470	Merge "Fix an ioc warning in vp9_pick_inter_mode"	2015-03-19 09:17:25 -07:00
Marco	fc2da4c5ba	Merge "Adjustments to aq-mode=3."	2015-03-19 09:01:17 -07:00
James Zern	6f23d40582	Merge "vp9_resize_plane: quiet some static analysis warnings"	2015-03-18 19:39:48 -07:00
James Zern	c664f16182	Merge changes Ie5a24275,Ib72946a8,I532b882b * changes: vp9_fdct8x8_quant_ssse3: quiet a static analysis warning vp9_fdct8x8_quant_sse2: quiet a static analysis warning vp9_mv_pred: quiet a static analysis warning	2015-03-18 19:38:49 -07:00
Alex Converse	748843712f	Merge "Fix external resize memory issues."	2015-03-18 16:04:30 -07:00
James Zern	c4367b9b51	vp9_resize_plane: quiet some static analysis warnings document resolution assumptions with a few asserts Change-Id: Ia4ab738fd3e0a1ba0ed30a57facd2658c2c1fd60	2015-03-18 14:34:30 -07:00
James Zern	388add965f	vp9_fdct8x8_quant_ssse3: quiet a static analysis warning add an assert to validate 'in' array size Change-Id: Ie5a24275c066d9dd59714f6104510abbd4850dc5	2015-03-18 14:33:43 -07:00
James Zern	198b039e2a	vp9_fdct8x8_quant_sse2: quiet a static analysis warning add an assert to validate 'in' array size Change-Id: Ib72946a86f34e1ce8a69954e8e3e4fe1a0f18a91	2015-03-18 14:33:04 -07:00
James Zern	428369293d	vp9_mv_pred: quiet a static analysis warning add an assert to validate pred_mv array size Change-Id: I532b882b71e2baff3ac76e07ed133ec5a11bd0fc	2015-03-18 14:31:58 -07:00
Marco	71e6ed7bd1	Adjustments to aq-mode=3. Factor in segment#2 and skip blocks into the postencode estimated bits, and increase somewhat the aggressiveness of the refresh. PSNR/SSIM Metrics on RTC set go up by ~0.8/0.5%. Change-Id: I5d4e7cb00a3aefb25d18c88b6b24118b72dc5d51	2015-03-18 12:06:16 -07:00
Jingning Han	83cbe22623	Speed up non-rd mode decision search This commit makes the encoder explicitly calculate the SAD associated with the LAST_FRAME motion vector and compare it to that of the GOLDEN_FRAME given by integral projection motion estimation. It skips the expensive sub-pixel motion search over GOLDEN_FRAME when the LAST_FRAME can provide fairly good motion compensated prediction quality. For dark720p speed -6 single thread goes from 33304 b/f, 40.070 dB, 18156 ms -> 33319 b/f, 40.061 dB, 17611 ms Change-Id: I01bc94b9b598075567a392111046b97a9bc30efe	2015-03-18 12:04:58 -07:00
Adrian Grange	83288c7af8	Order header files alphabetically Change-Id: I3e275544bff478849c1b5f3dcd5de950ee330d14	2015-03-18 11:18:08 -07:00
Jingning Han	4640a0c480	Merge "Fix the C version of column vector projection"	2015-03-17 22:53:49 -07:00
Jingning Han	c932584f0f	Fix the C version of column vector projection Make the C and SSE2 versions consistent. Change-Id: I03c405d22a36bd1a97480efb96dc5af230667424	2015-03-17 18:50:53 -07:00
Marco	e52109158a	Update to variance partition. Use force_split to constrain the partition selection. This is used because in the top-down approach to variance partition, a block size may be selected even though one of its subblocks may have high variance. In this patch the selection of the 64x64 block size will only be allowed if the variances of all the 32x32 subblocks are also below the threshold. Still testing, but some visual improvement for areas near slow moving boundary can be seen. Metrics for RTC set increase by ~0.5%. Change-Id: Iab3e7b19bf70f534236f7a43fd873895a2bb261d	2015-03-17 17:02:47 -07:00
Yunqing Wang	45e8e4a01f	Merge "Refactor set vbp thresholds function"	2015-03-17 16:05:53 -07:00
Yunqing Wang	c0423abf00	Refactor set vbp thresholds function Code refactoring. Change-Id: I73b6fcc0444155ee46c1efa5253c1d608c6439cb	2015-03-17 12:23:32 -07:00
Adrian Grange	ed6824e449	Remove unused ZBIN_BOOST macros Change-Id: I5169155b20ea3676a6ce58ec77d6aeba07db29d9	2015-03-17 11:53:58 -07:00
Jingning Han	ee41141466	Fix an ioc warning in vp9_pick_inter_mode Shut off all the metric checks for golden reference frame, if we decide that it is unlikely to be selected for reference. Change-Id: Ie457cc1fd43935584403b4982659aed80fb9909c	2015-03-17 10:13:44 -07:00
Yaowu Xu	de3097aa23	Merge "Remove duplicate clamping"	2015-03-16 16:56:10 -07:00
Jingning Han	adaffcc010	Merge "Remove ineffective newmv skip checking from vp9_pick_inter_mode"	2015-03-16 16:43:43 -07:00
Jingning Han	4e8daaf960	Merge "Simplify prediction filter search in rtc coding mode"	2015-03-16 16:43:26 -07:00
Jingning Han	82231beced	Merge "Refactor column integral projection computation"	2015-03-16 16:43:11 -07:00
Yaowu Xu	3119c24658	Merge "change the order of inter modes evaluated"	2015-03-16 16:14:34 -07:00
Alex Converse	6126afe62e	Fix external resize memory issues. These were uncovered by the chromoting perftest. Change-Id: Ia5a90fd1718ff757c1484decf3861295260e6722	2015-03-16 15:56:26 -07:00
Yaowu Xu	4611f24797	Remove duplicate clamping The mvs are clamped in the vp9_find_best_ref_mvs() already. Change-Id: I9bea5e35aef6007466fe7fca4bc2dc5c17e74222	2015-03-16 15:19:37 -07:00
Jingning Han	c852200f51	Remove ineffective newmv skip checking from vp9_pick_inter_mode Change-Id: I41ee684cf113a7b5edf280183e51cb08b2e93cc4	2015-03-16 15:06:27 -07:00
Jingning Han	981bb84882	Simplify prediction filter search in rtc coding mode Reduce unnecessary fetch from MB_MODE_INFO. Change-Id: Iff89b76d5e2774c00a564e902913a633fa2e1ea9	2015-03-16 14:54:00 -07:00
Yaowu Xu	f2d682fc10	change the order of inter modes evaluated Change-Id: I10c1ad23b110cf92cb026e895039c215c47abfd0	2015-03-16 12:49:30 -07:00
Jingning Han	2cfddec332	Refactor column integral projection computation Move the scaling factor outside column projection. This avoids repeated calculation of the same scaling factor. Profiling shows that the percentage of vp9_int_pro_col_sse2 of overall cycles goes from 2.29% down to 1.88%. Change-Id: I5ac4e324ab2d7f33ba2de66dd2a12e04e04dfd66	2015-03-16 12:07:15 -07:00
Jingning Han	09e0b38a86	Merge "Fix indent in choose_partitioning"	2015-03-16 11:52:12 -07:00
Jingning Han	7cf383d17f	Fix indent in choose_partitioning Change-Id: I4039f8ac75a9cfcc4d07abd0619d1379bb10fe51	2015-03-16 11:01:00 -07:00
Yaowu Xu	51d529a578	vp9_pick_inter_mode(): minor optimizations 1. remove duplicate initialization to mbmi->interp_filter. 2. move mv clamping into ref_frame loop instead of mode checking loop. 3. move the check if last frame is same as golden frame earlier to avoid initialization of Golden reference related variables. Change-Id: Idf2d05e19e94a24f69cc289687869fc71d2ff289	2015-03-16 10:08:02 -07:00
Jingning Han	1f9b2b77ad	Merge "Fix choose_partitioning threshold setup for speed -5"	2015-03-15 09:04:07 -07:00
Jingning Han	b03cf9317a	Fix 1-step refinement search table Change-Id: I32f0bcb40c6e7ba63bfae487739ededd0b6b2dde	2015-03-14 10:52:11 -07:00
Jingning Han	1f00a9b9d5	Fix choose_partitioning threshold setup for speed -5 The compression performance of speed -5 is on average 12.6% better than speed -6. At lower bit-rates, the gains are typically 20% or more. For 2-thread encoding, the speed -5 takes about 1.6x time of speed -6. Change-Id: If7a73464a24d33e8f49b9533b51ec51c8da7fc80	2015-03-13 17:01:56 -07:00
Marco	87999b1c2e	Merge "Fix crash with vp9 denoiser on."	2015-03-13 14:31:40 -07:00
Jingning Han	6cceed09cf	Merge "Use sdx4df to do 1-step refinement"	2015-03-13 12:57:49 -07:00
Marco	e38066a74d	Fix crash with vp9 denoiser on. Crash occurred on very first key frame, because denoiser temporal function was being entered. Updated denoiser unittest to set cpu_used from first frame, and verified that the fix resolves the crash. Change-Id: I3be1124b52846fbbe7248d2c3d6136e086c80bc1	2015-03-13 11:10:02 -07:00
Marco	deaf661f45	Merge "Lower bitrate threshold below which cyclic refresh is turned off."	2015-03-13 10:31:35 -07:00
Alex Converse	f8df916931	Merge "Reconcile active_map and cyclic refresh"	2015-03-13 10:20:15 -07:00
Jingning Han	688c99a706	Merge "Reset src buffer only once in vp9_int_pro_motion_estimation"	2015-03-13 09:56:00 -07:00
Jingning Han	1b3499ae8b	Merge "Reduce the number of full block SAD calls"	2015-03-13 09:55:52 -07:00
Jingning Han	cce7020f2c	Use sdx4df to do 1-step refinement Change-Id: Ie0c3ef3ae3aedf049b1a296de607730b79c12672	2015-03-13 09:53:15 -07:00
Marco	62a3f53997	Lower bitrate threshold below which cyclic refresh is turned off. Change-Id: Ib54ab11adf8178eec74f65388a89c8f912c7869a	2015-03-13 09:42:45 -07:00
paulwilkins	b6749aa3a7	Merge "Shorten GF/arf interval in hard scenes."	2015-03-13 08:45:52 -07:00
Jingning Han	ba29125f7b	Reset src buffer only once in vp9_int_pro_motion_estimation Change-Id: I5c96b6a25f9df60da65b7af7c92a921b611746e3	2015-03-12 18:50:53 -07:00
Yaowu Xu	1aa75c65cc	Merge "vp9_pick_inter_mode(): Use single loop to evaluate inter modes"	2015-03-12 18:43:23 -07:00
Jingning Han	427cdf0a41	Reduce the number of full block SAD calls This commit uses a 6-point 1-step refine motion search in the integral projection based full pixel motion estimation, to replace the current 9-point search. It reduces runtime cost of speed -6 on some noisy clips, e.g., dark720p single thread 33314 b/f, 40.076 dB, 18231 ms -> 33307 b/f, 40.067 dB, 17768 ms The compression performance for rtc set remains unchanged. Change-Id: I194ea5a9ce52e5a10baeee36338633adc22f764c	2015-03-12 18:30:57 -07:00
Yunqing Wang	769e6567e9	Merge "Minorly modify model_rd_for_sb_y function"	2015-03-12 17:16:48 -07:00
Jingning Han	7a9d8f1efe	Merge "Fix fdct8x8_quant ssse3 overflow issue"	2015-03-12 16:43:09 -07:00
Alex Converse	1bfacd3529	Reconcile active_map and cyclic refresh Change-Id: Id7f8654aeeb20caa402bc822521b1d72c658f4f9	2015-03-12 16:19:49 -07:00
Yaowu Xu	2b368097c8	vp9_pick_inter_mode(): Use single loop to evaluate inter modes This commit changes to use single loop to evaluate all inter modes. There is no impact on compression quality and speed, but allow future experiment with the order of modes evaluated. Change-Id: I71696ce1014cbe127e25e98710d835987f5ecc09	2015-03-12 16:14:29 -07:00
Yunqing Wang	5d677c97eb	Minorly modify model_rd_for_sb_y function Added a skip_dc check. If skip_dc = 1, we could eliminate calling of vp9_model_rd_from_var_lapndz(). This gave slight PSNR & SSIM gain(<0.1%), and no speed change. Change-Id: If5ca733366148c86b98e196a00cc890f50e9a3e5	2015-03-12 14:04:14 -07:00
Jingning Han	fcb96b3afd	Fix fdct8x8_quant ssse3 overflow issue This resolves webm issue 968. Change-Id: Ieb363129b1e135a561141c68211d413226aba754	2015-03-12 12:43:19 -07:00
Deb Mukherjee	791bf5657f	Merge "Some rate control adjustments to control overshoot"	2015-03-12 11:10:59 -07:00
Jingning Han	1ff15fbffe	Merge "Prevent integer overflow in choose_partitioning"	2015-03-12 09:24:02 -07:00
Jingning Han	90ea10ec91	Merge "Remove unnecessary speed feature checking"	2015-03-12 09:23:51 -07:00
Jingning Han	594890a534	Merge "Apply fast motion search to golden reference frame"	2015-03-12 09:23:41 -07:00
Jingning Han	8fdddd5c01	Merge "Refactor to remove GLOBAL_MOTION"	2015-03-12 09:23:31 -07:00
Marco	0adc58037a	Merge "Fix visual studio build failure."	2015-03-11 17:19:47 -07:00
Jingning Han	238b6be24b	Prevent integer overflow in choose_partitioning Re-arrange the multiplication and right shift operations to avoid integer overflow in choose_partitioning. Change-Id: Ib4005cafb410a67c1960486471d75b6ebe38c4e0	2015-03-11 16:31:42 -07:00
Marco	a291b0b4a3	Fix visual studio build failure. Change-Id: Ifeb14f945d0f0300eb7b21b38e5720ac1c11a6cf	2015-03-11 16:12:39 -07:00
Jingning Han	313c28f8b8	Remove unnecessary speed feature checking This commit removes the pred_mv_sad comparison from rtc motion search, given that a stronger comparison has been done at the mode search level to eliminate unlikely selected reference frames. Change-Id: I49b8d24b2174303066fd8eff2102c0648f2869df	2015-03-11 16:11:40 -07:00
Adrian Grange	39d20c6ac3	Merge "Clamp rate correction factor after scaling it"	2015-03-11 16:09:49 -07:00
Jingning Han	54eda13f8d	Apply fast motion search to golden reference frame This commit enables the rtc coding mode to run integral projection based motion search for golden reference frame. It improves the speed -6 compression performance by 1.1% on average, 3.46% for jimred_vga, 6.46% for tacomascmvvga, and 0.5% for vidyo clips. The speed -6 is about 6% slower. Change-Id: I0fe402ad2edf0149d0349ad304ab9b2abdf0c804	2015-03-11 16:03:49 -07:00
Jingning Han	1ca4d51b2e	Refactor to remove GLOBAL_MOTION Make the vp9_int_pro_motion_estimation() function return zero motion vector if high bit depth is turned on, instead of removing it from compiled codes. Change-Id: Ia48f010eb590b2d517d5678c394110b326a1a95e	2015-03-11 15:53:15 -07:00
Yaowu Xu	dc902fedb2	Merge "Separate rd_thresh adaption by ref_frame"	2015-03-11 10:41:20 -07:00
Adrian Grange	42a89eb8cc	Clamp rate correction factor after scaling it Added clamp on the rate correction factor after it has been scaled. Change-Id: I5d4b46a101987b43c5bcfd7e0bd1b7b4d53640a4	2015-03-11 09:08:15 -07:00
paulwilkins	b29c48b03c	Shorten GF/arf interval in hard scenes. This patch accounts in the first pass stats for blocks that while not coded as intra, are complex and have an intra error / best error ratio below a threshold. The modification shortens the GF arf interval for a particular class of content that contains a lot of blocks matching the above criteria. (In one short problem test sequence the average interval dropped from about 14-15 to 10-11) The change results in small net gains in metrics results for the Yt(~0.2%) and yt-hd (~0.5%) sets and is approximately neutral for the other test sets. The change is currently shielded by a flag and off by default pending verification that it does not cause other regressions in tests on a wider YT test set. Change-Id: I6b803daa6a4ac09a6f428fb3a18be1ecedd974b7	2015-03-11 14:15:23 +00:00
Yaowu Xu	d549aa3b17	Separate rd_thresh adaption by ref_frame Only update the rd_thresh factors for modes sharing same reference frame. This helps overall compression of 6 and 7 by .13% and .19% respectively without any noticeable speed difference. Change-Id: Idb3a3879512c5d7d0880034516079949290690c5	2015-03-10 19:06:52 -07:00
Deb Mukherjee	0308e2ee6d	Some rate control adjustments to control overshoot Some rate control adjustments to control overshoot in the constrained quality mode. Change-Id: I8907b9a883642d779009d0a138adfa6ba67e7f41	2015-03-10 17:25:10 -07:00
Marco	340260585c	Merge "Modify update golden reference update under aq-mode=3 mode."	2015-03-10 11:48:10 -07:00
Marco	fb31aa09e2	Modify update golden reference update under aq-mode=3 mode. For non-SVC 1 pass CBR: make the GF update interval a multiple of the cyclic refresh period, and use encoding stats to prevent GF update at certain times. Change-Id: I4c44cacc2f70f1d27391a47644837e1eaa065017	2015-03-10 10:54:00 -07:00
Yaowu Xu	12943e722d	Merge "Enable using Golden reference in choose_partition()"	2015-03-10 10:48:52 -07:00
paulwilkins	4b01a2d350	Merge "Allow q adjustment for VPX_CQ and VPX_CBR."	2015-03-10 10:45:02 -07:00
Adrian Grange	78df712216	Fix vp9_compute_qdelta_by_rate loop behavior The return value from vp9_compute_qdelta_by_rate, which is a delta value for the quantizer, could never be 0 if (qindex == rc->worst_quality). This occurs because target_index was setup unconditionally in the loop and yet the loop counter stopped at (rc->worst_quality - 1). Change-Id: I6b59cd9b5811ff33357e71cd7d814c5e53d291f2	2015-03-10 09:14:54 -07:00
Yaowu Xu	059a473b35	Enable using Golden reference in choose_partition() Choose_partition uses only the last frame as reference frame in making partition decision, this commit adds the check on how well Golden frame with (0,0) predicts the current block, and uses GF(0,0) as basis for partition decision if it produces better prediction. The commit improves rtc speed 6 and 7 encoding by 0.14% and 0.19% respectively. Change-Id: I156acf925bd6e0b586d48155d1940d27270a3915	2015-03-10 08:57:28 -07:00
Alex Converse	066ed601a5	Merge "Don't waste time partitioning skip superblocks."	2015-03-09 13:02:16 -07:00
Jingning Han	9708f9d66a	Merge "Skip golden ref frame check when it is same as last ref frame"	2015-03-09 12:27:19 -07:00
Jingning Han	6245a91e0b	Skip golden ref frame check when it is same as last ref frame When golden reference frame is refreshed, the next frame has both its last and golden reference frames point to the same reference frame in real-time coding mode. Experiments suggest that using two separate reference frames for frames right after golden refresh frame does not provide further compression performance advantage. This commit hence retains the current encoder implementation and shuts off the mode search over golden reference frame in this case. It makes the encoder run slightly faster at no coding performance change. Change-Id: I1561f7799253a10e675d05c63c1749fe9e85b472	2015-03-09 11:14:55 -07:00
Alex Converse	06b59299c8	Don't waste time partitioning skip superblocks. Force 64x64 partitioning when a whole superblock is SEGMENT_LVL_SKIP. This drops encode times of screens mostly at rest by 20%. Change-Id: Ieba554b0b8a0c1679aae784a8bd11f038ab942c3	2015-03-09 11:02:05 -07:00
paulwilkins	2cff9c4efe	Allow q adjustment for VPX_CQ and VPX_CBR. Adjustment previously only enabled in VBR mode. This patch allows adjustment of min and max q for CBR and adjustment of max q only for CQ mode. Change-Id: Id5e583f3d50453cd544fc57249acacd946457482	2015-03-09 17:13:55 +00:00
Yunqing Wang	969dd8f128	Merge "vp9_ethread: fix me consts initialization to support aq_mode=3 encoding"	2015-03-09 09:42:12 -07:00
Jingning Han	d2b6a4cc80	Merge "Move pred_mv assign outside integral projection motion search"	2015-03-09 09:34:26 -07:00
Yunqing Wang	c4fb2d7cc7	Merge "Modify the setting of transform skip flags in non-rd mode"	2015-03-09 08:35:57 -07:00
Yunqing Wang	6e0ec0b2d9	vp9_ethread: fix me consts initialization to support aq_mode=3 encoding While turning on "--aq_mode=3", the quantizers are updated by each thread. Fixed the me consts initialization function to make sure that the correct thread data are updated. Change-Id: Ied27bb7bae76fc3fa2cda4f8c35ac0b46271bef4	2015-03-06 16:31:46 -08:00
Yunqing Wang	268f260d64	Modify the setting of transform skip flags in non-rd mode While searching for the best mode in non-rd case, SSE of a partition block is calculated and the transform size is set. This patch rewrites the skip checking conditions based on transform size instead of partition size to be more precise. Small gains were seen in rtc set borg test (speed 6). AVG PSNR: 0.087%, overall PSNR: 0.073%, SSIM: 0.146%. No noticeable speed change. Change-Id: I5603ca5339c784dfa02263f4005988ccd8c32f6e	2015-03-06 09:22:00 -08:00
Yaowu Xu	0f37601fd7	Merge changes I1b972c94,I9c897d32 * changes: Prevent invalid memory access Use correct bsize for uv	2015-03-06 07:27:59 -08:00
Yaowu Xu	8cbeb7cf36	Prevent invalid memory access Change-Id: I1b972c945274254d896d772d859840b2f8211b4f	2015-03-05 14:57:11 -08:00
Alex Converse	feda5d244c	Merge changes I219c287b,I6adee670 * changes: Call encoder control before running ethread test. Don't copy thread data for the main thread.	2015-03-05 14:43:42 -08:00
Alex Converse	b21e361f8d	Merge "Fix misleading indentation."	2015-03-05 14:43:38 -08:00
Alex Converse	ad01d275e9	Merge "Don't inline cost_coeffs."	2015-03-05 13:54:44 -08:00
Adrian Grange	6e3be5c3b6	Merge "Fix valgrind memcpy memory overlaps warning"	2015-03-05 12:52:57 -08:00
Alex Converse	2eb113d00a	Don't inline cost_coeffs. It was tiny when it was originally marked INLINE. Forcing this function to be inlined prevents the compiler from inlining its much smaller callers. No measurable speed impact; libvpx.a is 28320 bytes smaller. Change-Id: I6bf4c917157d15cbadb3cd3e20a9e82d35dc7d6f	2015-03-05 12:39:02 -08:00
Alex Converse	56cc37c642	Fix misleading indentation. Change-Id: Ic82b039a3d42f9aa01b85a3a69facfaa84b43a53	2015-03-05 12:10:56 -08:00
Alex Converse	71d5a59c6d	Don't copy thread data for the main thread. Change-Id: I6adee6704cacfeae0ed0b217a91095457d1be74a	2015-03-05 12:10:56 -08:00
Jingning Han	fda0410822	Move pred_mv assign outside integral projection motion search Change-Id: I040b066fdce08e2f05115a22ea808715aa147779	2015-03-05 11:44:10 -08:00
Jingning Han	87bf5203af	Merge "Move integral projection motion search to vp9_mcomp.c"	2015-03-05 09:25:16 -08:00
Yaowu Xu	b573fef76d	Use correct bsize for uv Change-Id: I9c897d32af6c3a956bb6f424a74c12737727038a	2015-03-05 08:20:35 -08:00
Adrian Grange	4b546583c4	Merge "Small rationalization of code in vp9_first_pass"	2015-03-04 12:49:58 -08:00
Adrian Grange	a34a042615	Merge "Make encoder buffer allocation dynamic"	2015-03-04 10:54:10 -08:00
Adrian Grange	fed9e1fee9	Small rationalization of code in vp9_first_pass Change-Id: I87cc0e038171c60a957298827e312fead500f7fb	2015-03-04 10:49:03 -08:00
Jingning Han	50c06052e9	Merge "Use SAD value to set chroma cost flag"	2015-03-04 10:47:56 -08:00
Jingning Han	2deecdd5cb	Move integral projection motion search to vp9_mcomp.c Make it a general purpose fast motion estimation function, to be used in the mode search process. Change-Id: Ib354cb0e664dc61c30c0b2314297835ee75b157a	2015-03-04 10:30:15 -08:00
Jingning Han	7d8061a44a	Use SAD value to set chroma cost flag This saves an extra 64x64 variance calculation and replaces two 32x32 variance functions with sad functions. The compression performance change is unnoticeable. Change-Id: I6d33868695664ec73b56c42945162ae61c484856	2015-03-04 09:46:39 -08:00
Jingning Han	0fe8304d0b	Merge "Properly handle the boundary blocks for integral projection search"	2015-03-04 09:01:33 -08:00
Adrian Grange	3807dd82ab	Make encoder buffer allocation dynamic Frame buffers are now allocated dynamically on-demand. Entries in the reference frame map, cm->ref_frame_map, may now be set to -1 (INVALID_IDX) to indicate that there is not a valid reference buffer in that "slot". All slots in the reference frame map are now initialized to the empty state (-1) and each buffer is initialized to have a reference count of 0. Change-Id: Id1afe98de98db4ae8b2dfefed7889c3b28c68582	2015-03-04 07:58:32 -08:00
Deb Mukherjee	87d1a488ed	Merge "dc quantizer fix for 32x32 transforms"	2015-03-03 23:23:44 -08:00
Jingning Han	540318d3f8	Merge "Scale the normalization factor depending on the block size"	2015-03-03 19:04:34 -08:00
Jingning Han	e5fe165840	Properly handle the boundary blocks for integral projection search Use rectangular block size for integral projection motion estimation if the 64x64 block has over half of the block outside the frame. This avoids the issue that the motion information of these blocks is dominated by the extended pixels, instead of the pixels of interest. Change-Id: I22f4d2bb7f6a20db9b3f5e2e5463a7f4b9d1b737	2015-03-03 16:15:12 -08:00
Deb Mukherjee	6910e92d04	dc quantizer fix for 32x32 transforms The rounding factor needs to be scaled down by a factor of 2. Also, the quantized and dequantized coefficients are memset to 0 when dc quantizer is used. Change-Id: Ifa68bab02addbf1b83d249c5b4cbd5cda796b1cf	2015-03-03 15:58:27 -08:00
Adrian Grange	852f62fde5	Fix valgrind memcpy memory overlaps warning Change-Id: Id0bb162b48b891c5c849f0411ef2ac0aa4bbe261	2015-03-03 15:06:34 -08:00
Jingning Han	a521008201	Scale the normalization factor depending on the block size Change-Id: I0a26994bf65ea224e496b09af2ce71e1a4210433	2015-03-03 11:29:46 -08:00
Yaowu Xu	47ac3ea0bb	Adapt color sensitivity threshold to luma signal energy Instead of using only a fixed threshold, this commit adapts the threshold for color sensitivity decision to luma signal energy: chroma channel's sse is at least 1/6 of that in luma for color sensitivity flag to be set to active. This recoups a large portion of the speed loss due to accounting for chroma component costs in RTC mode decision. Change-Id: Ie01f747f6037dba6a1d1ed3e10b71a0ef1abc42c	2015-03-03 11:15:13 -08:00
Jingning Han	1790d45252	Use variance metric for integral projection vector match This commit replaces the SAD with variance as metric for the integral projection vector match. It improves the search accuracy in the presence of slight light change. The average speed -6 compression performance for rtc set is improved by 1.7%. No speed changes are observed for the test clips. Change-Id: I71c1d27e42de2aa429fb3564e6549bba1c7d6d4d	2015-03-01 10:42:56 -08:00
Jingning Han	f4e0eb17e8	Merge "Fix source frame border extension"	2015-02-27 18:19:18 -08:00
Jingning Han	fe85fabbac	Fix source frame border extension This commit fixes an issue in source frame border extension. The issue causes certain frame resolutions such as 640x480 to have a portion of the right/bottom extension filled by zeros, which misleads motion search and degrades transform coding performance when large block size is used. This fix improves the speed 2 compression performance of a few yt sequences, typically ranging from 1% - 2%, up to 5% at median to low bit-rate. Change-Id: Id6b09a5695d9e7651c6dfbc2c6a72288b08af7fb	2015-02-27 15:48:01 -08:00
Adrian Grange	94bba48525	Merge "Fix calc_highbd_psnr"	2015-02-27 15:42:08 -08:00
Alex Converse	2b2fc812f1	Merge "Make SVC compatible with external resize."	2015-02-27 14:37:48 -08:00
Adrian Grange	54293ee3c7	Fix calc_highbd_psnr Should use the crop dimensions of the frame rather than the extended size. Change-Id: I49ed041a46ff0753d43e074020857b7ff2f95e17	2015-02-27 14:05:02 -08:00
Marco	2b0ed0842f	Merge "Fix arithmetic overflow warnings."	2015-02-27 11:53:57 -08:00
Jingning Han	89ee460ee4	Merge "Refactor integral projection based motion estimation"	2015-02-27 09:49:30 -08:00
Marco	c3f7bb16b4	Fix arithmetic overflow warnings. Change-Id: Ib85b5bc135aa0907a76b8c74faafe577e27d014f	2015-02-26 15:27:21 -08:00
Jingning Han	73a00d3219	Refactor integral projection based motion estimation Support variable block size integral projection based motion estimation. Change-Id: Iee6d65e44df4480aa13fb7b84b9c91914b89caa1	2015-02-26 14:48:59 -08:00
Yaowu Xu	754bbcfdc8	Fix the encoder to support profile change Change-Id: Iefb928ad1174e274409facfb44f80265ff0f7683	2015-02-26 11:41:01 -08:00
Yaowu Xu	387bb8bed7	Correct parameter order in a function call Change-Id: Ibd87db1c4371edcbe193d39df2fdc07d3842c21a	2015-02-26 11:39:57 -08:00
paulwilkins	e2b4ef1313	Merge "Account for rate error in GF group Q calculation."	2015-02-26 08:20:08 -08:00
Alex Converse	6ea83fdfcb	Make SVC compatible with external resize. Fixes https://code.google.com/p/webm/issues/detail?id=943 Change-Id: I6177bf6ab6b31a22d2652732f579b8aed3f28887	2015-02-25 14:05:51 -08:00
Jingning Han	3e1d14a6ce	Merge "Motion compensated reference refinement"	2015-02-25 12:33:09 -08:00
Jingning Han	4c5a4efc38	Merge "Re-distribute hierarchical vector match pattern"	2015-02-25 10:33:25 -08:00
Jingning Han	b7050c0be3	Motion compensated reference refinement This commit applies one-step refinement search to the resulting motion vector of the integral projection based motion estimation, per 64x64 block. It improves the coding performance of speed -6. pedestrian 1080p 500 kbps 51735 b/f, 36.794 dB, 16044 ms -> 51382 b/f, 36.793 dB, 16282 ms cloud 1080p 500 kbps 24081 b/f, 37.988 dB, 14016 ms -> 23597 b/f, 38.076 dB, 12774 ms vidyo1 720p 1000 kbps 16552 b/f, 40.514 dB, 8279 ms -> 16553 b/f, 40.543 dB, 8510 ms The rtc set compression performance is improved by 0.5%. Change-Id: I3d09bea2caf58b2a4f3b38aa26fffafcbe9a2c17	2015-02-25 10:32:09 -08:00
Yunqing Wang	419ff1352e	Merge "Fix ssse3 quantize_fp functions while skip=1"	2015-02-25 10:10:10 -08:00
Jingning Han	0f57d0a682	Merge "Fix fwd transform sse2 build issue on older gcc version"	2015-02-25 09:32:00 -08:00
Jingning Han	e47033319d	Fix fwd transform sse2 build issue on older gcc version Change-Id: I3e0e53d129552babf29e6c5d047483733983973c	2015-02-24 23:25:21 -08:00
Jingning Han	f87e315e1e	Re-distribute hierarchical vector match pattern This commit modifies the hierarchical vector match pattern. It avoids repeated SAD computation at same points. The function vp9_vector_sad_sse2 is called 12 times per 64x64 block, instead of 15 times as before. The effective coverage remains the same. Change-Id: I91ad9d27d40db8963c907d02af84e10702136994	2015-02-24 11:48:38 -08:00
Yunqing Wang	58e0159c80	Fix ssse3 quantize_fp functions while skip=1 In ssse3 functions, DEFINE_ARGS macro hard codes qcoeff and dqcoeff to r3 and r4. If skip is 1, qcoeff and dqcoeff need to be loaded from the stack, which doesn't work because of the above definitions. Currently, skip=1 case is not used in the encoder. This patch fixed the issue, so it can be turned on later. Change-Id: I998d696b1a7a85dca2b3bcee790b21c21e039147	2015-02-24 10:37:05 -08:00
paulwilkins	8d7f53f04c	Account for rate error in GF group Q calculation. When GF group adaptive maxQ is enabled this patch accounts somewhat for accumulated error in the rate control. This improves accuracy quite a bit on many clips especially when there is overshoot. Examples when the overshoot and undershoot command line parameters are set to 100: Hall @ 1200 overshoot is reduced from 67% to 24%. Akiyo @ 400 undershoot is reduced from 28% to 15%. Setting a lower value for undershoot or overshoot still reduces the error further. Impact on metrics is mixed with some gains in average psnr but generally a little lower (e.g. 0.5%) on overall and ssim. The GF group adaptation is still off by default in this patch. Compared with the head, enabling this mode now gives big average psnr gains on the YT sets (e.g. YT_HD >11.2%), a drop in overall PSNR (YT-HD 3.9%) and a smaller drop or neutral for SSIM. Change-Id: If4b32cd0740d3fb941317b374f9c2951954eee90	2015-02-23 10:57:27 +00:00
Marco	c9f660d895	Merge "Remove a few unnecessary multiplications in denoiser."	2015-02-20 14:42:02 -08:00
Marco	8f84fbe756	Remove a few unnecessary multiplications in denoiser. Change-Id: I3edbb7cc67203fbbf32c6fd4a08015ca9d9ed53e	2015-02-20 11:55:11 -08:00
Hangyu Kuang	8724d31d12	Move dequant table from VP9_COMMON to VP9_COMP as decoder does not need it any more. This reduces VP9_COMMON size from 25776 bytes to 17584 bytes(~31%). Change-Id: Ic5daea732ccefb6d512b048af7983f0efe08589b	2015-02-20 11:12:42 -08:00
Marco	a1b402e71c	Merge "Adjustments to cyclic refresh (aq-mode=3)."	2015-02-20 09:55:05 -08:00
Jingning Han	6728655422	Merge "Add high bit depth support to rtc sub8x8 block coding"	2015-02-20 09:35:18 -08:00
Marco	0187f4b411	Adjustments to cyclic refresh (aq-mode=3). Target higher delta-qp for big blocks with zero motion, and for segment#1: avoid 64x64 partition size and force 8x8 tx size. Metrics on RTC set mostly positive: SSIM up by ~4%, PSNR by ~1.5%. Doesn't seem to be any change in speed. Change-Id: I1f68fa3c4f62dab3b90cc58041f05ebb048ae5ac	2015-02-20 08:47:59 -08:00
Jingning Han	6f4245894a	Add high bit depth support to rtc sub8x8 block coding This commit adds proper buffer handle to support high bit depth in rtc sub8x8 block coding. Change-Id: Ibaf8a2160194121aec9ca68b8094817fed9ccaea	2015-02-20 08:36:33 -08:00
Adrian Grange	f03627347e	Merge "Fix control string in firstpass stats fprintf"	2015-02-19 16:36:43 -08:00
Yunqing Wang	5e57729601	Merge "Improve skip_txfm thresholds in the non-rd mode selection"	2015-02-19 15:31:02 -08:00
Adrian Grange	2ae314fe3a	Fix control string in firstpass stats fprintf 20 items in the control string but only 19 arguments. Change-Id: I51dab9aa1c58c653b52395005a9cb41f09feb484	2015-02-19 15:18:30 -08:00
Jingning Han	216b171d63	Merge "Integral projection based motion estimation"	2015-02-19 15:08:11 -08:00
Yunqing Wang	81fc5bf81c	Improve skip_txfm thresholds in the non-rd mode selection Modified the thresholds of deciding whether or not to skip the transforms in model_rd_for_sb_y(). Used zbin[] instead of dequant[] to be more precise. Also, modified the checking conditions. Rtc set borg test results (at speed 6) showed: average PSNR gain: 0.138%, overall PSNR gain: 0.158%, and SSIM gain: 0.177%. The data rate test was modified slightly as suggested by Marco. Change-Id: Ieaf633ab77f4838cb3c45cf69065b29d55f8ae6c	2015-02-19 14:30:46 -08:00
Jingning Han	ed2dc59c1b	Integral projection based motion estimation This commit introduces a new block match motion estimation using integral projection measurement. The 2-D block and the nearby region are projected onto the horizontal and vertical 1-D vectors, respectively. It then runs vector match, instead of block match, over the two separate 1-D vectors to locate the motion compensated reference block. This process is run per 64x64 block to align the reference before choosing partitioning in speed 6. The overall CPU cycle cost due to this additional 64x64 block match (SSE2 version) is around 2% at low bit-rate rtc speed 6. When strong motion activities exist in the video sequence, it substantially improves the partition selection accuracy, thereby achieving better compression performance and lower CPU cycles. The experiments were tested in RTC speed -6 setting: cloud 1080p 500 kbps 17006 b/f, 37.086 dB, 5386 ms -> 16669 b/f, 37.970 dB, 5085 ms (>0.9dB gain and 6% faster) pedestrian_area 1080p 500 kbps 53537 b/f, 36.771 dB, 18706 ms -> 51897 b/f, 36.792 dB, 18585 ms (4% bit-rate savings) blue_sky 1080p 500 kbps 70214 b/f, 33.600 dB, 13979 ms -> 53885 b/f, 33.645 dB, 10878 ms (30% bit-rate savings, 25% faster) jimred 400 kbps 13380 b/f, 36.014 dB, 5723 ms -> 13377 b/f, 36.087 dB, 5831 ms (2% bit-rate savings, 2% slower) Change-Id: Iffdb6ea5b16b77016bfa3dd3904d284168ae649c	2015-02-19 13:47:19 -08:00
Jingning Han	83559e7357	Fix a check condition in nonrd_pick_partition Change-Id: Ic92fb4b16948f745c218351b24fdafecf9abce3a	2015-02-19 09:54:55 -08:00
Yaowu Xu	c5718a7aa3	Merge "Fix an encoder/decode mismatch bug"	2015-02-13 16:40:41 -08:00
Yaowu Xu	4bc7f4828f	Fix an encoder/decode mismatch bug This commit prevents the encoder from updating last_frame_type when a frame is dropped in the encoder. Prior to this fix, if there was a dropped frame immediately after a key frame, the decoder would have the value of last_frame_type as key frame, different from the encoder, as the dropped frame in the encoder would have updated the value to an inter frame. This leads to different probability updates in encoder and decoder, and thereby to an encoder/decoder mismatch. This fixes issue #941 Change-Id: I27115224b138bec43ae3916c016574f5740822b0	2015-02-13 15:45:47 -08:00
Marco	b1940bf5fe	Replace some operations with shift in encoder_breakout. Replaced a divide by 9 with 8, so some very small difference, but otherwise no change in behavior. Change-Id: I1079ae3c41e0789ff0bc6fa9940a238b6bca0f5b	2015-02-13 10:45:19 -08:00
Jingning Han	e69c79e19a	Merge "Fix ioc issue in block_rd_txfm"	2015-02-12 15:07:41 -08:00
Jingning Han	5041aa0fbe	Fix ioc issue in block_rd_txfm Force 64-bit precision in the intermediate steps. Change-Id: I666113d9adcef8975da201d5aa1a13b783d09594	2015-02-12 12:51:39 -08:00
Marco	cc7d981de1	Merge "Add skin detection."	2015-02-12 11:12:27 -08:00
Jingning Han	f4c29ae9ea	Merge "Update partition rate cost in rtc speed 5"	2015-02-12 09:14:49 -08:00
Jingning Han	ee83243daa	Merge "Add mode cost to sub8x8 block mode decision in rtc coding"	2015-02-12 09:14:29 -08:00
Marco	56435bb7b6	Add skin detection. Simple skin detection, from vp8; works reasonably well on most of the RTC clips, but could miss sometimes. Added debug flag to write out skin map over source input. Change-Id: I2caea7592f1c459047aac46627eeb24a94946464	2015-02-11 17:47:17 -08:00
Adrian Grange	053625e4cd	Add cast to convert double to int Change-Id: I7f63c2940256a5dadf9a29a853809290dd9e98ed	2015-02-11 15:59:48 -08:00
Jingning Han	e665c8f2c9	Add mode cost to sub8x8 block mode decision in rtc coding This commit allows the encoder to properly account for the mode cost in sub8x8 non-RD mode decision. Change-Id: I2951960d20e37ed08e372ee0c7044935b2b9b899	2015-02-11 14:43:02 -08:00
Jingning Han	c9725813db	Merge "Account for inter prediction filter rate cost in rtc mode selection"	2015-02-11 14:42:44 -08:00
Jingning Han	532cb435f8	Merge "Add ref frame rate cost to non-RD mode decision"	2015-02-11 14:36:48 -08:00
Jingning Han	7a4e0b2265	Update partition rate cost in rtc speed 5 The block partition rate cost should be updated when recursive partition search is needed. Change-Id: I7bc5ad1fc2cbd3577dee7f7e8da111a2742bdeb9	2015-02-11 12:48:29 -08:00
Jingning Han	41b7f76db1	Account for inter prediction filter rate cost in rtc mode selection Add the rate cost on inter prediction filter type to the overall rate-distortion cost in vp9_pick_mode_inter. Change-Id: I72c34017adf5220cadb3962694ee5404469fc673	2015-02-11 12:17:29 -08:00
Jingning Han	4ce70e8847	Add ref frame rate cost to non-RD mode decision This commit adds a heuristic rate cost of reference frame to the non-RD mode decision. It improves the compression performance of speed -6 by 0.31% and speed -5 by 0.69%. Change-Id: If7f3b45519d49b2cb640bcb7316a254efc8be446	2015-02-11 11:08:10 -08:00
Yaowu Xu	ee5d79995e	Move computation up to frame level This avoids redoing the same calculation repeatedly, and also allows easier adjustments for further experiments. This commit shall have no effect on quality/compression. Change-Id: I4460acf5c808ff5518da18d21e002c5da58af857	2015-02-10 15:41:52 -08:00
Adrian Grange	2d924161c7	Merge "Auto-adaptive encoder frame resizing logic"	2015-02-10 12:16:55 -08:00
Jingning Han	f0eea5be2a	Merge "Fix block partition size in fill_mode_info_sb"	2015-02-10 10:49:03 -08:00
Adrian Grange	23ebacdb81	Auto-adaptive encoder frame resizing logic Note: This feature is still in development. Add an option for the encoder to decide the resolution at which to encode each frame. Each KF/GF/ARF group is tested to see if it would be better encoded at a lower resolution. At present, each KF/GF/ARF is coded first at full-size and if the coded size exceeds a threshold (twice target data rate) at the maximum active Q then the entire group is encoded at lower resolution. This feature is enabled in vpxenc by setting: --resize-allowed=1 In addition, if the vpxenc command line also specifies valid frame dimensions using: --resize-width=XXXX & --resize_height=YYYY then all frames will be encoded at this resolution. Change-Id: I13f341e0a82512f9e84e144e0f3b5aed8a65402b	2015-02-10 09:59:32 -08:00
Yunqing Wang	84b813aa42	Merge "Make encoder and decoder share common thread function"	2015-02-10 09:06:41 -08:00
Yunqing Wang	d3a37731c2	Merge "Rename loopfilter_thread files to thread_common files"	2015-02-10 09:06:23 -08:00
Jingning Han	ebb4c9e8e7	Fix block partition size in fill_mode_info_sb This commit fixes the sub block partition size used in fill_mode_info_sb. Previous implementation effectively disabled the rectangular block sizes. This commit resolved this issue. Change-Id: Ic1c383ab0a9a2e7d59e85b388093f1f1f94d1e7f	2015-02-10 08:39:32 -08:00
Yunqing Wang	07eb8c8da3	Merge "Fix high bit depth assembly function bugs"	2015-02-09 15:30:36 -08:00
Yunqing Wang	4ae092c660	Make encoder and decoder share common thread function Moved vp9_accumulate_frame_counts to vp9_thread_common.c to eliminate the duplicate code. Change-Id: I9cf506d729603c8bf1494b4c86a3b7d47af1917a	2015-02-06 11:45:51 -08:00
Jingning Han	ba933b90c6	Merge "Re-arrange inter mode search order in RTC coding flow"	2015-02-06 10:11:33 -08:00
Yunqing Wang	41063137c3	Rename loopfilter_thread files to thread_common files Renames the files to allow more common thread code to be moved to vp9/common. Change-Id: I7386e64e221086e3cdc087e79812f993c423413b	2015-02-06 10:03:31 -08:00
Yaowu Xu	8b5e665098	Merge "Replace repeated check with single variable"	2015-02-06 09:17:59 -08:00
Jingning Han	b2762a8853	Re-arrange inter mode search order in RTC coding flow This commit makes the ZEROMV mode first in the search order to ensure that the zero mv is always checked in the RTC coding mode. It improves the average speed -6 compression performance by 0.3% in both PSNR and SSIM at no visible speed change. Change-Id: I465a7e59f4e20cd84fee3f02ced6f98036945949	2015-02-06 08:52:52 -08:00
Yunqing Wang	789ae447f8	Fix high bit depth assembly function bugs The high bit depth build failed while building for 32bit target. The bugs were in vp9_highbd_subpel_variance.asm and vp9_highbd_sad4d_sse2.asm functions. This patch fixed the bugs, and made 32bit build work. Change-Id: Idc8e5e1b7965bb70d4afba140c6583c5d9666b75	2015-02-05 11:24:03 -08:00
Yaowu Xu	c905c42ad8	Remove unnecessary initialization loop_filter_level is always reset in loop_filter_frame() later in encoder. Change-Id: I608e03d905a6b23e7d5025ca747e4784c665007e	2015-02-04 13:56:16 -08:00
Yaowu Xu	581aee001e	Move tx_mode decision logic into select_tx_mode() Change-Id: I7f8f78c33eb3f33344b029a27bda320f4d68c577	2015-02-04 13:54:49 -08:00
Yaowu Xu	19451e6d67	Replace repeated check with single variable Change-Id: I2f6a669bf7c6d9796388ad3f3fa3fc942635c215	2015-02-04 12:59:14 -08:00
Yaowu Xu	a844a778c7	Merge "Adjust partitioning threshold based rtc speed"	2015-02-04 12:52:03 -08:00
Yaowu Xu	3bc0c6576f	Merge "Move calls to avoid unnecessary operations"	2015-02-04 12:51:16 -08:00
Yaowu Xu	bdfb5f986e	Adjust partitioning threshold based rtc speed On rtc set: speed 7 quality improves about 0.5% speed 8 quality improves about 1.0% Encoding time for speed 7 changes from 67804ms to 65889ms Encoding time for speed 8 changes from 58659ms to 56808ms Change-Id: Iabcfb53012fc1b9f3326cdbc167e5758b8c7ad30	2015-02-04 11:28:39 -08:00
Jingning Han	1b9082ec6b	Unify luma and chroma inter predictors in choose_partitioning Change-Id: I8bfc80f4fffb0892e93d3326394a52d1ee3c0f37	2015-02-04 10:02:57 -08:00
Jingning Han	4ccfc7d517	Save an extra call for setup_pred_plane function Reuse the yv12_mb array to fetch the buffer pointers/strides corresponding to the current reference frame. Change-Id: I5276b7494158b2cccef15213be2dc189e9036851	2015-02-04 09:47:14 -08:00
Jingning Han	0c6d3a03e1	Account for chroma component costs in RTC mode decision This commit allows the encoder to account for additional chroma plane costs in the mode decision process, if the current block potentially contains significant color change. It improves the visual quality at very low bit-rates. The compression performance of dark720p is improved by 12.39% in speed 6. For jimred at 150 kbps, the PSNR of V component (red) increased by 0.2 dB, at the expense of about 5% increase in encoding time. Note that for sequences where the chroma components are fairly consistent, the encoding time increase is negligible. On average the rtc set compression performance is improved by 1.172% in PSNR and 1.920% in SSIM. Change-Id: Ia55b24ef23a25304f7ec9958fbf07fd6e658505c	2015-02-04 09:45:14 -08:00
Johann	3a5d40608e	Merge "Remove unnecessary pointer check"	2015-02-03 17:12:56 -08:00
Yaowu Xu	02537ebbe4	Move calls to avoid unnecessary operations Change-Id: I236f7f75ab9a4511d1b52a6a67299b0e844a103e	2015-02-03 17:01:37 -08:00
Yaowu Xu	cb411108a3	Merge "adjust rtc setting and threshold"	2015-02-03 15:13:52 -08:00
Jim Bankoski	d7783cae95	Merge "make low bitrates a lot less blocky"	2015-02-03 13:25:06 -08:00
Johann	ba18609502	Remove unnecessary pointer check The original implementation had the following comment: // Ignore mv costing if mvsadcost is NULL However the current implementation does not allow for this. If x exists then nmvsadcost must not be null. This removes the only warning from -Wpointer-bool-conversion https://code.google.com/p/webm/issues/detail?id=894 Change-Id: I1a2cee340d7972d41e1bbbe1ec8dfbe917667085	2015-02-03 13:03:46 -08:00
Jingning Han	894f0fbd3b	Merge "Assign 2nd ref frame in choose_partitioning"	2015-02-03 12:25:18 -08:00
Jingning Han	ca9c352fc3	Assign 2nd ref frame in choose_partitioning Avoid the use of uninitialized second reference frame for fetching reference block. Change-Id: I9983a0daea829700b3270dc8bf2bcc6d6ea36652	2015-02-03 11:17:51 -08:00
Jim Bankoski	9f1cf2c8cf	make low bitrates a lot less blocky Remove loop filter skip at speed 7+ because of bad visual artifacts and up the postprocessing. Change-Id: Ibdd0bac71aaee232d2bb2e14462733c51517768d	2015-02-03 06:45:56 -08:00
Yaowu Xu	65a1a3e85d	adjust rtc setting and threshold 1. Adjusted the threshold for coef update computation based on counts of tx used, avoid coef update computation when count is low (<20) 2. Move sf->lpf_pick = LPF_PICK_MINIMAL_LPF to speed 8. Change-Id: I02b44309e40fcdbf135c7934ae067a3f42502d30	2015-02-02 17:43:46 -08:00
Alex Converse	a79db92c07	Merge "Allow larger encoder configurations."	2015-02-02 12:05:56 -08:00
Yaowu Xu	80e729f601	Merge "Optimize coef update"	2015-02-01 20:08:29 -08:00
hkuang	be6aeadaf4	Try again to merge branch 'frame-parallel' into master branch. In frame parallel decode, libvpx decoder decodes several frames on all cpus in parallel fashion. If not being flushed, it will only return a frame when all the cpus are busy. If getting flushed, it will return all the frames in the decoder. Compared with current serial decode mode, in which libvpx decoder is idle between decode calls, libvpx decoder is busy between decode calls. Current frame parallel decode will only speed up the decoding for frame parallel encoded videos. For non frame parallel encoded videos, frame parallel decode is slower than serial decode due to lack of loopfilter worker thread. There are still some known issues that need to be addressed. For example: decoding frame parallel videos with segmentation enabled is sometimes incorrect. * frame-parallel: Add error handling for frame parallel decode and unit test for that. Fix a bug in frame parallel decode and add a unit test for that. Add two test vectors to test frame parallel decode. Add key frame seeking to webmdec and webm_video_source. Implement frame parallel decode for VP9. Increase the thread test range to cover 5, 6, 7, 8 threads. Fix a bug in adding frame parallel unit test. Add VP9 frame-parallel unit test. Manually pick "Make the api behavior conform to api spec." from master branch. Move vp9_dec_build_inter_predictors_* to decoder folder. Add segmentation map array for current and last frame segmentation. Include the right header for VP9 worker thread. Move vp9_thread.* to common. ctrl_get_reference does not need user_priv. Seperate the frame buffers from VP9 encoder/decoder structure. Revert "Revert "Revert "Revert 3 patches from Hangyu to get Chrome to build:""" Conflicts: test/codec_factory.h test/decode_test_driver.cc test/decode_test_driver.h test/invalid_file_test.cc test/test-data.sha1 test/test.mk test/test_vectors.cc vp8/vp8_dx_iface.c vp9/common/vp9_alloccommon.c vp9/common/vp9_entropymode.c vp9/common/vp9_loopfilter_thread.c vp9/common/vp9_loopfilter_thread.h vp9/common/vp9_mvref_common.c vp9/common/vp9_onyxc_int.h vp9/common/vp9_reconinter.c vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decodeframe.h vp9/decoder/vp9_decodemv.c vp9/decoder/vp9_decoder.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_encoder.c vp9/encoder/vp9_pickmode.c vp9/encoder/vp9_rdopt.c vp9/vp9_cx_iface.c vp9/vp9_dx_iface.c This reverts commit `a18da9760a`. Change-Id: I361442ffec1586d036ea2e0ee97ce4f077585f02	2015-01-30 21:00:13 -08:00
Jingning Han	f1ab5c1021	Merge "Format fixes in vp9_rd_pick_inter_mode_sb/sub8x8"	2015-01-30 15:49:14 -08:00
Yaowu Xu	45971abd1d	Optimize coef update 1. move the check of search method of USE_TX_8X8 up one level to avoid operations of build_tree_distributions() 2. count tx used and avoid computation for coef update when one size is not used at all. Change-Id: Ia3e54a2588aa531c41377a1bfaa64385d04a592c	2015-01-30 10:16:40 -08:00
Yunqing Wang	3b3e299650	Merge "Fix issues in 32bit PIC enabled build"	2015-01-29 16:41:25 -08:00
Alex Converse	797a2556eb	Allow larger encoder configurations. Allow changing colorspace in the encoder and increasing frame size. Change-Id: I8e7c3b891af29ce420a15beb4f6f9c250245b2bb	2015-01-29 15:07:40 -08:00
Paul Wilkins	68340a3470	Merge "Change to update of rate control factors."	2015-01-29 13:50:52 -08:00
Marco	a80dd52b6e	Merge "Fix to vp9 denoiser."	2015-01-29 09:10:30 -08:00
Paul Wilkins	f752da8ce2	Change to update of rate control factors. Remove damping parameter and use the damping formula introduced by Yaowu Xu in all cases. Change-Id: I18db7e0d0f262d5140102f259ab07821d374d285	2015-01-28 15:44:53 -08:00
Yaowu Xu	ff99a3c750	Simplify update_coef_probs() 1. reduce the size of temporary arrays on stack 2. avoid build_tree_distribution for tx size that is not used at all. Change-Id: I0f8d7124e16a3789d3c15ad24cf02c1c12789e2c	2015-01-28 15:12:42 -08:00
Marco	c0923d4d3a	Fix to vp9 denoiser. Prevent from using wrong mv for denoiser motion compensation. Change-Id: Ifa0f9daabdbdab0900d3c17304059fe0d15de914	2015-01-28 12:07:27 -08:00
Frank Galligan	d1e6b8231a	Merge "Add vp9_sad32x32x4d_neon Neon intrinsic function."	2015-01-28 10:35:50 -08:00
Frank Galligan	eb12d880ab	Merge "Add vp9_sad16x16x4d_neon Neon intrinsic function."	2015-01-27 23:01:44 -08:00
Frank Galligan	80a3a07929	Merge "Add vp9_sad64x64x4d_neon Neon intrinsic function."	2015-01-27 23:01:15 -08:00
Yunqing Wang	10d5e09c87	Fix issues in 32bit PIC enabled build This patch was to fix issue 924: https://code.google.com/p/webm/issues/detail?id=924 The SECTION_RODATA macro was modified to support macho32 format. The sub-pixel functions were modified to pass in 2 more parameters to handle the global offsets for PIC build. Change-Id: I3bfcd336bcae945edf300bca4ab40376a2628cd4	2015-01-27 22:20:21 -08:00
Yaowu Xu	fe2439703d	Merge "move clear_system_state() call before using double"	2015-01-27 12:42:13 -08:00
Frank Galligan	e3167f7fbf	Add vp9_sad32x32x4d_neon Neon intrinsic function. On Nexus 7 speed -6 saw ~18% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. BUG=https://code.google.com/p/webm/issues/detail?id=908 Change-Id: I70ccdea0326750552ed946fb004507d6efe02d5c	2015-01-27 08:54:00 -08:00
Frank Galligan	9f574d0316	Add vp9_sad16x16x4d_neon Neon intrinsic function. On Nexus 7 speed -6 saw ~15% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. BUG=https://code.google.com/p/webm/issues/detail?id=908 Change-Id: I4b2006b644c488f42bf06d8a22ef0e6120a96bf9	2015-01-27 08:42:17 -08:00
Frank Galligan	54fa956715	Add vp9_sad64x64x4d_neon Neon intrinsic function. On Nexus 7 speed -6 saw ~30% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. BUG=https://code.google.com/p/webm/issues/detail?id=908 Change-Id: Id12af7d1883243c23e6692e898aea82299633d58	2015-01-27 08:33:40 -08:00
Marco	1c4a84c6e9	Merge "aq-mode=3: Update to allow for refresh on modes other than zero-mv."	2015-01-26 19:47:13 -08:00

5518 Commits