generic-library/vpx

Author	SHA1	Message	Date
Johann	dee70d355f	Merge "Move variance functions to vpx_dsp"	2015-05-26 23:02:11 +00:00
Johann	c3bdffb0a5	Move variance functions to vpx_dsp subpel functions will be moved in another patch. Change-Id: Idb2e049bad0b9b32ac42cc7731cd6903de2826ce	2015-05-26 12:01:52 -07:00
Jingning Han	96dba4902c	Fix integral projection motion search for frame resize This commit fixes the integral projection motion search crash when frame resize is used. It fixes issue 994. Change-Id: Ieeb52619121d7444f7d6b3d0cf09415f990d1506	2015-05-22 15:40:45 -07:00
James Zern	700b7fd0a9	vp9/encoder: make some functions static silences missing prototype warnings Change-Id: I3338fcaa67b5dcdf6bf237e8b374db3befd18753	2015-05-15 10:43:47 -07:00
Johann	1d7ccd5325	Relocate memory operations for common code With the sad functions, and hopefully the variance functions soon, moving to the vpx_dsp location, place the defines used in the reference C code in a common location. Change-Id: I4c8ce7778eb38a0a3ee674d2f1c488eda01cfeca	2015-05-13 11:41:15 -07:00
James Zern	fd3658b0e4	replace DECLARE_ALIGNED_ARRAY w/DECLARE_ALIGNED this macro was used inconsistently and only differs in behavior from DECLARE_ALIGNED when an alignment attribute is unavailable. this macro is used with calls to assembly, while generic c-code doesn't rely on it, so in a c-only build without an alignment attribute the code will function as expected. Change-Id: Ie9d06d4028c0de17c63b3a27e6c1b0491cc4ea79	2015-05-07 11:55:08 -07:00
paulwilkins	aecb1770d5	Merge "Image size restriction to rd auto partition search."	2015-05-07 14:12:14 +00:00
paulwilkins	af76953448	Merge "Remove CONSTRAIN_NEIGHBORING_MIN_MAX."	2015-05-05 09:32:11 +00:00
paulwilkins	4a7dcf8eb2	Image size restriction to rd auto partition search. Impose a limit on the rd auto partition search based on the image format. Smaller formats require that the search includes includes a smaller minimum block size. This change is intended to mitigate the visual impact of ringing in some problem clips, for smaller image formats. Change-Id: Ie039e5f599ee079bbef5d272f3e40e2e27d8f97b	2015-05-01 16:16:02 +01:00
paulwilkins	287b0c6da9	Remove CONSTRAIN_NEIGHBORING_MIN_MAX. Remove one of the auto partition size cases. This case can behaves badly in some types of animated content and was only used for the rd encode path. A subsequent patch will add additional checks to help further improve visual quality. Change-Id: I0ebd8da3d45ab8501afa45d7959ced8c2d60ee4e	2015-05-01 15:15:16 +01:00
Yunqing Wang	a257e469e1	Adjust the vbp early termination threshold slightly Calculated cpi->vbp_threshold_sad from this frame's dequant value. The encoding quality and speed didn't change much. Borg test result: PSNR: -0.002%, SSIM: -0.003%. Change-Id: I97c9826986f39582f29910d637d08a69c90afdee	2015-04-30 08:51:02 -07:00
James Zern	f58011ada5	vpx_mem: remove vpx_memset vestigial. replace instances with memset() which they already were being defined to. Change-Id: Ie030cfaaa3e890dd92cf1a995fcb1927ba175201	2015-04-28 20:00:59 -07:00
James Zern	f274c2199b	vpx_mem: remove vpx_memcpy vestigial. replace instances with memcpy() which they already were being defined to. Change-Id: Icfd1b0bc5d95b70efab91b9ae777ace1e81d2d7c	2015-04-28 19:59:41 -07:00
Yaowu Xu	b3e411e481	Add validation of UV partition size For color sampling format other than 420, valid partion size in Y may not work for UV plane. This commit adds validation of UV partition size before select the partition choice. This fixes a crash for real time encoding of 422 input. Change-Id: I1fe3282accfd58625e8b5e6a4c8d2c84199751b6	2015-04-24 12:34:18 -07:00
Scott LaVarnway	8b17f7f4eb	Revert "Remove mi_grid_* structures." (see I3a05cf1610679fed26e0b2eadd315a9ae91afdd6) For the test clip used, the decoder performance improved by ~2%. This is also an intermediate step towards adding back the mode_info streams. Change-Id: Idddc4a3f46e4180fbebddc156c4bbf177d5c2e0d	2015-04-21 11:16:45 -07:00
Marco Paniconi	f76ccce5bc	Revert "Revert "Force_split on 16x16 blocks in variance partition."" This reverts commit `004b9d83e3` Change-Id: I2f2d0bdb9368c2c07f1d29a69cd461267a3a8743	2015-04-16 17:52:13 -07:00
Yunqing Wang	63c5bf2b9c	Fix Tsan errors This patch fixed 2 reported Tsan errors while running VP9 real-time encoder. Change-Id: Ib0278fe802852862c3ce87c4a500e544d7089f67	2015-04-15 12:33:39 -07:00
Yunqing Wang	004b9d83e3	Revert "Force_split on 16x16 blocks in variance partition." This reverts commit `eb8c667570`. The patch caused mismatch while using multi-threads. Change-Id: Icd646340af25b5d91e32f03ed3ea212e00e3e0be	2015-04-14 15:19:31 -07:00
Marco	eb8c667570	Force_split on 16x16 blocks in variance partition. Force split on 16x16 block (to 8x8) based on the minmax over the 8x8 sub-blocks. Also increase variance threshold for 32x32, and add exit condiiton in choose_partition (with very safe threshold) based on sad used to select reference frame. Some visual improvement near moving boundaries. Average gain in psnr/ssim: ~0.6%, some clips go up ~1 or 2%. Encoding time increase (due to more 8x8 blocks) from ~1-4%, depending on clip. Change-Id: I4759bb181251ac41517cd45e326ce2997dadb577	2015-04-13 12:05:07 -07:00
Jingning Han	2404332c1b	Merge "Remove get_nonrd_var_based_fixed_partition function"	2015-04-09 14:45:19 -07:00
Jingning Han	208aa6158b	Remove get_nonrd_var_based_fixed_partition function This function has been replaced by other approaches and is not in use now. Change-Id: I387f45b5607d202539e482468ccc70e6c0f9341f	2015-04-09 09:49:55 -07:00
Yunqing Wang	12cb30d4bd	Merge "Set vbp thresholds for aq3 boosted blocks"	2015-04-02 18:22:08 -07:00
Yunqing Wang	cae03a7ef5	Set vbp thresholds for aq3 boosted blocks The vbp thresholds are set seperately for boosted/non-boosted superblocks according to their segment_id. This way we don't have to force the boosted blocks to split to 32x32. Speed 6 RTC set borg test result showed some quality gains. Overall PSNR: +0.199%; Avg PSNR: +0.245%; SSIM: +0.802%. No speed change was observed. Change-Id: I37c6643a3e2da59c4b7dc10ebe05abc8abf4026a	2015-04-02 15:48:32 -07:00
Marco	77ea408983	Code cleanup: put (8x8/4x4)fill_variance into separate function. Code cleanup, no change in behavior. Change-Id: I043b889f8f0b3afb49de0da00873bc3499ebda24	2015-04-02 13:37:35 -07:00
Yaowu Xu	ba91b54d7c	Simplify bsize calculation Change-Id: Ibc514684def9914c66f04cb7931f773e2b79c168	2015-04-01 12:15:06 -07:00
Yunqing Wang	fc98114761	Merge "Rename vbp thresholds"	2015-03-31 16:33:30 -07:00
Yunqing Wang	c28ff1a9de	Rename vbp thresholds Code refactoring Change-Id: I410fcce1bc6d95c62c474445f4c97ea8469f1e79	2015-03-31 15:14:44 -07:00
Alex Converse	02697e35dc	Merge "A tiny cyclic refresh / active map fix."	2015-03-24 09:43:24 -07:00
paulwilkins	c0b71cf82f	Merge "Experimental rd bias based on source vs recon variance."	2015-03-24 03:12:41 -07:00
Alex Converse	31f1563a92	A tiny cyclic refresh / active map fix. Change-Id: I198727461455c8c198a0c892d02ed3cb1673aa50	2015-03-23 18:51:00 -07:00
paulwilkins	9a1ce7be7d	Experimental rd bias based on source vs recon variance. This experiment biases the rd decision based on the impact a mode decision has on the relative spatial complexity of the reconstruction vs the source. The aim is to better retain a semblance of texture even if it is slightly misaligned / wrong, rather than use a simple rd measure that tends to favor use of a flat predictor if a perfect match can't be found. This improves the appearance of texture and visual quality on specific test clips but is hidden under a flag and currently off by default pending visual quality testing on a wider Yt set. Change-Id: Idf6e754a8949bf39ed9d314c6f2daaa20c888aad	2015-03-20 11:57:36 +00:00
Marco	71e6ed7bd1	Adjustments to aq-mode=3. Factor in segment#2 and skip blocks into the postencode estimated bits, and increase somewhat the aggressiveness of the refresh. PSNR/SSIM Metrics on RTC set go up by ~0.8/0.5%. Change-Id: I5d4e7cb00a3aefb25d18c88b6b24118b72dc5d51	2015-03-18 12:06:16 -07:00
Marco	e52109158a	Update to variance partition. Use force_split to constrain the partition selection. This is used because in the top-down approach to variance partition, a block size may be selected even though one of its subblocks may have high variance. In this patch the selection of the 64x64 block size will only be allowed if the variance of all the 32x32 subblocks are also below the threshold. Stil testing, but some visual improvement for areas near slow moving boundary can be seen. Metrics for RTC set increase by about ~0.5%. Change-Id: Iab3e7b19bf70f534236f7a43fd873895a2bb261d	2015-03-17 17:02:47 -07:00
Yunqing Wang	45e8e4a01f	Merge "Refactor set vbp thresholds function"	2015-03-17 16:05:53 -07:00
Yunqing Wang	c0423abf00	Refactor set vbp thresholds function Code refactoring. Change-Id: I73b6fcc0444155ee46c1efa5253c1d608c6439cb	2015-03-17 12:23:32 -07:00
Adrian Grange	ed6824e449	Remove unused ZBIN_BOOST macros Change-Id: I5169155b20ea3676a6ce58ec77d6aeba07db29d9	2015-03-17 11:53:58 -07:00
Jingning Han	7cf383d17f	Fix indent in choose_partitioning Change-Id: I4039f8ac75a9cfcc4d07abd0619d1379bb10fe51	2015-03-16 11:01:00 -07:00
Jingning Han	1f00a9b9d5	Fix choose_partitioning threshold setup for speed -5 The compression performance of speed -5 is on average 12.6% better than speed -6. At lower bit-rates, the gains are typically 20% or more. For 2-thread encoding, the speed -5 takes about 1.6x time of speed -6. Change-Id: If7a73464a24d33e8f49b9533b51ec51c8da7fc80	2015-03-13 17:01:56 -07:00
Marco	e38066a74d	Fix crash with vp9 denoiser on. Crash occured on very first key frame, because denoiser temporal function was beng entered. Updated denoiser unittest to set cpu_used from first frame, and verified fix fixes the crash. Change-Id: I3be1124b52846fbbe7248d2c3d6136e086c80bc1	2015-03-13 11:10:02 -07:00
Alex Converse	1bfacd3529	Reconcile active_map and cyclic refresh Change-Id: Id7f8654aeeb20caa402bc822521b1d72c658f4f9	2015-03-12 16:19:49 -07:00
Jingning Han	238b6be24b	Prevent integer overflow in choose_partitioning Re-arrange the multiplication and right shift operations to avoid integer overflow in choose_partitioning. Change-Id: Ib4005cafb410a67c1960486471d75b6ebe38c4e0	2015-03-11 16:31:42 -07:00
Jingning Han	1ca4d51b2e	Refactor to remove GLOBAL_MOTION Make the vp9_int_pro_motion_estimation() function return zero motion vector if high bit depth is turned on, instead of removing it from compiled codes. Change-Id: Ia48f010eb590b2d517d5678c394110b326a1a95e	2015-03-11 15:53:15 -07:00
Yaowu Xu	059a473b35	Enable using Golden reference in choose_partition() Choose_partition uses only the last frame as reference frame in making partition decision, this commit adds the check on how well Golden frame with (0,0) predicts the current block, and uses GF(0,0) as basis for partition decision if it produces better prediction. The commit improves rtc speed 6 and 7 encoding by 0.14% and 0.19% respectively. Change-Id: I156acf925bd6e0b586d48155d1940d27270a3915	2015-03-10 08:57:28 -07:00
Alex Converse	06b59299c8	Don't waste time partitioning skip superblocks. Force 64x64 partitioning when a whole superblock is SEGMENT_LVL_SKIP. This drops encode times of screens mostly at rest by 20%. Change-Id: Ieba554b0b8a0c1679aae784a8bd11f038ab942c3	2015-03-09 11:02:05 -07:00
Yunqing Wang	969dd8f128	Merge "vp9_ethread: fix me consts initialization to support aq_mode=3 encoding"	2015-03-09 09:42:12 -07:00
Jingning Han	d2b6a4cc80	Merge "Move pred_mv assign outside integral projection motion search"	2015-03-09 09:34:26 -07:00
Yunqing Wang	6e0ec0b2d9	vp9_ethread: fix me consts initialization to support aq_mode=3 encoding While turning on "--aq_mode=3", the quantizers are updated by each thread. Fixed the me consts initialization function to make sure that the correct thread data are updated. Change-Id: Ied27bb7bae76fc3fa2cda4f8c35ac0b46271bef4	2015-03-06 16:31:46 -08:00
Yaowu Xu	0f37601fd7	Merge changes I1b972c94,I9c897d32 * changes: Prevent invalid memory access Use correct bsize for uv	2015-03-06 07:27:59 -08:00
Yaowu Xu	8cbeb7cf36	Prevent invalid memory access Change-Id: I1b972c945274254d896d772d859840b2f8211b4f	2015-03-05 14:57:11 -08:00
Jingning Han	fda0410822	Move pred_mv assign outside integral projection motion search Change-Id: I040b066fdce08e2f05115a22ea808715aa147779	2015-03-05 11:44:10 -08:00
Jingning Han	87bf5203af	Merge "Move integral projection motion search to vp9_mcomp.c"	2015-03-05 09:25:16 -08:00
Yaowu Xu	b573fef76d	Use correct bsize for uv Change-Id: I9c897d32af6c3a956bb6f424a74c12737727038a	2015-03-05 08:20:35 -08:00
Adrian Grange	a34a042615	Merge "Make encoder buffer allocation dynamic"	2015-03-04 10:54:10 -08:00
Jingning Han	50c06052e9	Merge "Use SAD value to set chroma cost flag"	2015-03-04 10:47:56 -08:00
Jingning Han	2deecdd5cb	Move integral projection motion search to vp9_mcomp.c Make it a general purpose fast motion estimation function, to be used in the mode search process. Change-Id: Ib354cb0e664dc61c30c0b2314297835ee75b157a	2015-03-04 10:30:15 -08:00
Jingning Han	7d8061a44a	Use SAD value to set chroma cost flag This saves an extra 64x64 variance calculation and replaces two 32x32 variance functions with sad functions. The compression performance change is unnoticeable. Change-Id: I6d33868695664ec73b56c42945162ae61c484856	2015-03-04 09:46:39 -08:00
Jingning Han	0fe8304d0b	Merge "Properly handle the boundary blocks for integral projection search"	2015-03-04 09:01:33 -08:00
Adrian Grange	3807dd82ab	Make encoder buffer allocation dynamic Frame buffers are now allocated dynamically on-demand. Entries in the reference frame map, cm->ref_frame_map, may now be set to -1 (INVALID_IDX) to indicate that there is not a valid reference buffer in that "slot". All slots in the reference frame map are now initialized to the empty state (-1) and each buffer is initialized to have a reference count of 0. Change-Id: Id1afe98de98db4ae8b2dfefed7889c3b28c68582	2015-03-04 07:58:32 -08:00
Deb Mukherjee	87d1a488ed	Merge "dc quantizer fix for 32x32 transforms"	2015-03-03 23:23:44 -08:00
Jingning Han	e5fe165840	Properly handle the boundary blocks for integral projection search Use rectangular block size for integral projection motion estimation if the the 64x64 block has over half block outside the frame. This avoids the issue that the motion information of these blocks is dominated by the extended pixels, instead of the pixels of interest. Change-Id: I22f4d2bb7f6a20db9b3f5e2e5463a7f4b9d1b737	2015-03-03 16:15:12 -08:00
Deb Mukherjee	6910e92d04	dc quantizer fix for 32x32 transforms The rounding factor needs to be scaled down by a factor of 2. Also, the quantized and dequantized coefficients are memset to 0 when dc quantizer is used. Change-Id: Ifa68bab02addbf1b83d249c5b4cbd5cda796b1cf	2015-03-03 15:58:27 -08:00
Yaowu Xu	47ac3ea0bb	Adapt color sensitiviy threshold to luma signal energy Instead using only a fixed threshold, this commit adapts the threshold for color sensitivity decision to luma signal energy: chroma channel's sse is at least 1/6 of that in luma for color sensitivity flag to be set to active. This recoups a large portion of the speed loss due to accounting for chroma component costs in RTC mode decision. Change-Id: Ie01f747f6037dba6a1d1ed3e10b71a0ef1abc42c	2015-03-03 11:15:13 -08:00
Jingning Han	1790d45252	Use variance metric for integral projection vector match This commit replaces the SAD with variance as metric for the integral projection vector match. It improves the search accuracy in the presence of slight light change. The average speed -6 compression performance for rtc set is improved by 1.7%. No speed changes are observed for the test clips. Change-Id: I71c1d27e42de2aa429fb3564e6549bba1c7d6d4d	2015-03-01 10:42:56 -08:00
Jingning Han	73a00d3219	Refactor integral projection based motion estimation Support variable block size integral projection based motion estimation. Change-Id: Iee6d65e44df4480aa13fb7b84b9c91914b89caa1	2015-02-26 14:48:59 -08:00
Jingning Han	3e1d14a6ce	Merge "Motion compensated reference refinement"	2015-02-25 12:33:09 -08:00
Jingning Han	4c5a4efc38	Merge "Re-distribute hierarchical vector match pattern"	2015-02-25 10:33:25 -08:00
Jingning Han	b7050c0be3	Motion compensated reference refinement This commit applies one-step refinement search to the resulting motion vector of the integral projectiion based motion estimation, per 64x64 block. It improves the coding performance of speed -6. pedestrian 1080p 500 kbps 51735 b/f, 36.794 dB, 16044 ms -> 51382 b/f, 36.793 dB, 16282 ms cloud 1080p 500 kbps 24081 b/f, 37.988 dB, 14016 ms -> 23597 b/f, 38.076 dB, 12774 ms vidyo1 720p 1000 kbps 16552 b/f, 40.514 dB, 8279 ms -> 16553 b/f, 40.543 dB, 8510 ms The rtc set compression performance is improved by 0.5%. Change-Id: I3d09bea2caf58b2a4f3b38aa26fffafcbe9a2c17	2015-02-25 10:32:09 -08:00
Jingning Han	f87e315e1e	Re-distribute hierarchical vector match pattern This commit modifies the hierarchical vector match patter. It avoids repeated SAD computation at same points. The function vp9_vector_sad_sse2 is called 12 times per 64x64 block, instead of 15 times as before. The effective coverage remains the same. Change-Id: I91ad9d27d40db8963c907d02af84e10702136994	2015-02-24 11:48:38 -08:00
Marco	0187f4b411	Adjustments to cyclic refresh (aq-mode=3). Target higher delta-qp for big blocks with zero motion, and for segment#1: avoid 64x64 partition size and force 8x8 tx size. Metrics on RTC set mostly positive: SSIM up by ~4%, PSRN by ~1.5%. Doesn't seem to be any change in speed. Change-Id: I1f68fa3c4f62dab3b90cc58041f05ebb048ae5ac	2015-02-20 08:47:59 -08:00
Jingning Han	216b171d63	Merge "Integral projection based motion estimation"	2015-02-19 15:08:11 -08:00
Jingning Han	ed2dc59c1b	Integral projection based motion estimation This commit introduces a new block match motion estimation using integral projection measurement. The 2-D block and the nearby region is projected onto the horizontal and vertical 1-D vectors, respectively. It then runs vector match, instead of block match, over the two separate 1-D vectors to locate the motion compensated reference block. This process is run per 64x64 block to align the reference before choosing partitioning in speed 6. The overall CPU cycle cost due to this additional 64x64 block match (SSE2 version) takes around 2% at low bit-rate rtc speed 6. When strong motion activities exist in the video sequence, it substantially improves the partition selection accuracy, thereby achieving better compression performance and lower CPU cycles. The experiments were tested in RTC speed -6 setting: cloud 1080p 500 kbps 17006 b/f, 37.086 dB, 5386 ms -> 16669 b/f, 37.970 dB, 5085 ms (>0.9dB gain and 6% faster) pedestrian_area 1080p 500 kbps 53537 b/f, 36.771 dB, 18706 ms -> 51897 b/f, 36.792 dB, 18585 ms (4% bit-rate savings) blue_sky 1080p 500 kbps 70214 b/f, 33.600 dB, 13979 ms -> 53885 b/f, 33.645 dB, 10878 ms (30% bit-rate savings, 25% faster) jimred 400 kbps 13380 b/f, 36.014 dB, 5723 ms -> 13377 b/f, 36.087 dB, 5831 ms (2% bit-rate savings, 2% slower) Change-Id: Iffdb6ea5b16b77016bfa3dd3904d284168ae649c	2015-02-19 13:47:19 -08:00
Jingning Han	83559e7357	Fix a check condition in nonrd_pick_partition Change-Id: Ic92fb4b16948f745c218351b24fdafecf9abce3a	2015-02-19 09:54:55 -08:00
Yaowu Xu	ee5d79995e	Move computation up to frame level This is to avoid redo the same calculation repeatly, and also allow easier adjustments for further experiments. This commit shall have no effect on quality/compression. Change-Id: I4460acf5c808ff5518da18d21e002c5da58af857	2015-02-10 15:41:52 -08:00
Jingning Han	ebb4c9e8e7	Fix block partition size in fill_mode_info_sb This commit fixes the sub block partition size used in fill_mode_info_sb. Previous implementation effectively disabled the rectangular block sizes. This commit resolved this issue. Change-Id: Ic1c383ab0a9a2e7d59e85b388093f1f1f94d1e7f	2015-02-10 08:39:32 -08:00
Yaowu Xu	8b5e665098	Merge "Replace repeated check with single variable"	2015-02-06 09:17:59 -08:00
Yaowu Xu	c905c42ad8	Remove unnecessary initialization loop_filter_level is always reset in loop_filter_frame() later in encoder. Change-Id: I608e03d905a6b23e7d5025ca747e4784c665007e	2015-02-04 13:56:16 -08:00
Yaowu Xu	581aee001e	Move tx_mode decision logic into select_tx_mode() Change-Id: I7f8f78c33eb3f33344b029a27bda320f4d68c577	2015-02-04 13:54:49 -08:00
Yaowu Xu	19451e6d67	Replace repeated check with single variable Change-Id: I2f6a669bf7c6d9796388ad3f3fa3fc942635c215	2015-02-04 12:59:14 -08:00
Yaowu Xu	a844a778c7	Merge "Adjust partitioning threshold based rtc speed"	2015-02-04 12:52:03 -08:00
Yaowu Xu	3bc0c6576f	Merge "Move calls to avoid unnecessary operations"	2015-02-04 12:51:16 -08:00
Yaowu Xu	bdfb5f986e	Adjust partitioning threshold based rtc speed On rtc set: speed 7 quality improves about 0.5% speed 8 quality improves about 1.0% Encoding time for speed 7 changes from 67804ms to 65889ms Encoding time for speed 8 changes from 58659ms to 56808ms Change-Id: Iabcfb53012fc1b9f3326cdbc167e5758b8c7ad30	2015-02-04 11:28:39 -08:00
Jingning Han	1b9082ec6b	Unify luma and chroma inter predictors in choose_partitioning Change-Id: I8bfc80f4fffb0892e93d3326394a52d1ee3c0f37	2015-02-04 10:02:57 -08:00
Jingning Han	0c6d3a03e1	Account for chroma component costs in RTC mode decision This commit allows the encoder to account for additional chroma plane costs in the mode decision process, if the current block potentially contains significant color change. It improves the visual quality at very low bit-rates. The compression performance of dark720p is improved by 12.39% in speed 6. For jimred at 150 kbps, the PSNR of V component (red) increased by 0.2 dB, at the expense of about 5% increase in encoding time. Note that for sequences where the chroma components are fairly consistent, the encoding time increase is negligible. On average the rtc set compression performance is improved by 1.172% in PSNR and 1.920% in SSIM. Change-Id: Ia55b24ef23a25304f7ec9958fbf07fd6e658505c	2015-02-04 09:45:14 -08:00
Yaowu Xu	02537ebbe4	Move calls to avoid unnecessary operations Change-Id: I236f7f75ab9a4511d1b52a6a67299b0e844a103e	2015-02-03 17:01:37 -08:00
Jingning Han	ca9c352fc3	Assign 2nd ref frame in choose_partitioning Avoid the use of uninitialized second reference frame for fetching reference block. Change-Id: I9983a0daea829700b3270dc8bf2bcc6d6ea36652	2015-02-03 11:17:51 -08:00
Yaowu Xu	45971abd1d	Optimize coef update 1. move the check of search method of USE_TX_8X8 up one level to avoid operations of build_tree_distributions() 2. count tx used and avoid computaton for coef udpate when one size is not used at all. Change-Id: Ia3e54a2588aa531c41377a1bfaa64385d04a592c	2015-01-30 10:16:40 -08:00
Yaowu Xu	fe2439703d	Merge "move clear_system_state() call before using double"	2015-01-27 12:42:13 -08:00
Marco	1c4a84c6e9	Merge "aq-mode=3: Update to allow for refresh on modes other than zero-mv."	2015-01-26 19:47:13 -08:00
Yaowu Xu	645b7cdf03	move clear_system_state() call before using double Floating point is used in vp9_convert_qindex_to_q(), so sometime unit test ActiveMapTest would cause run time error without properly call to clear_system_state to reset register status. Change-Id: I181e9395148c44a6ca8b97d6e109bd4a152143c6	2015-01-26 18:41:50 -08:00
Marco	3f1af6e85e	aq-mode=3: Update to allow for refresh on modes other than zero-mv. Add distortion threshold condition to refresh state of a coding block, and allow for qp adjustment also for some intra modes and non-zero motion modes. Also some code cleanup (remove unused variables/code). Change-Id: I735fa2b28bc64f60e0323976b82510577b074203	2015-01-26 16:44:25 -08:00
Yaowu Xu	6d16f6c14c	Fix MSVC warnings on conversion from int64 to int Change-Id: I7e96509ffa36899fcd2935749927a1e8aac8d025	2015-01-26 10:54:06 -08:00
Marco	0dccb6277c	Modify variance partition selection for low resolutions. For low spatial resolutions: bias partittion selection to smaller block sizes, and base the variance computation on 4x4 down-sampling. Also move the threshold computations into the choose_partitioning, so they are computed once for each sb block. On low-res clips (RTC_derf) PSNR/SSIMetrics increase by about 4-5%. No change for resolutions above CIF. Change-Id: I93f8ff742c8044786977bb6e31dcf8efda6dd1b0	2015-01-22 15:16:55 -08:00
Deb Mukherjee	e7570493b8	Moves inter mode count updates to update_stats This makes the inter_mode counts update consistent with other symbols. Also, forward updates should work corerctly now. Change-Id: Id98be26fd08875162e644bb8f1de6f0918f85396	2015-01-06 16:40:45 -08:00
Paul Wilkins	a88e4e64b1	Merge "Deleted unused #define"	2015-01-06 04:18:20 -08:00
Jingning Han	5486db185c	Add bsize check condition in nonrd_use_partition Check if block size is below 8x8 for rectangular block coding. It is added to support 4x8 and 8x4 block coding for RTC mode. Change-Id: I760b328f45b98ae48adc45ed5a39fb643cd8aebd	2015-01-02 10:12:37 -08:00
Jingning Han	dad89d5ca1	Enable sub8x8 inter block search for RTC coding mode This commit enables sub8x8 inter block coding for RTC mode. The use of sub8x8 blocks can be turned on by allowing choose_partitioning function to select 4x4/4x8/8x4 block sizes. Change-Id: Ifbf1fb3888fe4c094fc85158ac3aa89867d8494a	2014-12-24 17:40:31 -08:00
Jingning Han	d0f2377027	Revert "Revert "Removal of legacy zbin_extra / zbin_oq_value."" This reverts commit `9946ee23e0`. Fix the ssse3 asm function. Change-Id: I07f77a63aa98087626e45c4e87aa5dcafc0b0b07	2014-12-22 10:09:25 -08:00
Paul Wilkins	9946ee23e0	Revert "Removal of legacy zbin_extra / zbin_oq_value." This reverts commit `e9b586e21b`. Change-Id: I5b36e6727da6c05278d97e2c37b80c109f79bed4	2014-12-19 15:02:58 +00:00
Paul Wilkins	e9b586e21b	Removal of legacy zbin_extra / zbin_oq_value. zbin extra / zbin_oq_value was widely passed around, hence removal touches a lot of code. Change-Id: Idc94359735b60c38a160e4385ae09d5ca8b6b8e5	2014-12-18 16:49:11 +00:00
Paul Wilkins	60e9b731cf	Remove mode dependent zbin boost. Initial patch to remove get_zbin_mode_boost() and cpi->zbin_mode_boost. For now sets a dummy value of 0 for zbin extra pending a further clean up patch. Change-Id: I64a1e1eca2d39baa8ffb0871b515a0be05c9a6af	2014-12-18 16:45:52 +00:00
Paul Wilkins	b76312124d	Deleted unused #define FAST_MOTION_MV_THRESH no longer referenced. Change-Id: Idee6ee5a59ba330904c42b20c9ec35b6fc16f7a2	2014-12-17 14:59:22 +00:00
Peter de Rivaz	e3d19bfc63	Fix for crash in highbitdepth rt mode Change 72141 introduced a new use of vp9_avg_4x4. This call needs to switch to using vp9_highbd_avg_4x4 when performing high bitdepth encodes. Change-Id: I6a8ba4b62f8a75d0a917b365a55245e2f0438ea1	2014-12-16 10:55:49 +00:00
James Zern	4d40a046da	Merge "vp9: move encoder-only member from common"	2014-12-12 14:28:55 -08:00
Marco	7f59cff53d	Merge "Allow for 4x4 prediction blocks for key frame, speed 6."	2014-12-12 14:27:31 -08:00
James Zern	72ece1308b	vp9: move encoder-only member from common allow_comp_inter_inter VP9_COMMON -> VP9_COMP Change-Id: I6d9dc25d1cdd7e2ab62f5be69cd9fa883d21dbb6	2014-12-12 11:17:44 -08:00
Jingning Han	3e0793b80b	Merge "Fix PICK_MODE_CONTEXT index in non-RD coding mode"	2014-12-12 09:16:01 -08:00
Jingning Han	e2c2a65695	Fix PICK_MODE_CONTEXT index in non-RD coding mode This commit fixes a bug in the PICK_MODE_CONTEXT index for horizontal partition case. The compression performance change is less than 0.01% level, since most blocks are selected to use square block size in RTC coding mode. Change-Id: I67effc18ae8795fccdd82a55f4efc609fa5cb3e1	2014-12-11 17:21:24 -08:00
Marco	7e99cd2a9b	Allow for 4x4 prediction blocks for key frame, speed 6. For key frame under variance source partition: 4x4 prediction blocks may be selected when variance of 8x8 block is very high (threshold is set fairly high for now). Testing on some RTC clips shows this helps to reduce some ringing artifacts on key frame. Encoded key frame size increases about ~10%. Key frame PSNR increases about ~0.1-0.2dB. Change-Id: I56e203fac32ea6ef69897fb3ea269c59cb50d174	2014-12-11 15:36:16 -08:00
Jingning Han	811c74cdfa	Merge "Replace division with bit shift in choose_partitioning"	2014-12-11 13:30:03 -08:00
Jingning Han	d9892e846f	Merge "Refactor choose_partitioning computing scheme"	2014-12-11 11:14:07 -08:00
Jingning Han	d5c396a902	Replace division with bit shift in choose_partitioning This commit explicitly uses the bit shift operation instead of division for computing block variance. Change-Id: Id19c0ff27dd1d1ae4aceee6657e1aad0d406bd74	2014-12-11 11:06:57 -08:00
Jingning Han	377d2f027a	Refactor choose_partitioning computing scheme This commit refactors the choose_partitioning function. It removes redundant memset calls and makes the encoder to calculate variance value per block only when it is needed. It reduces the average runtime cost of choose_partitioning by 60%. Overall it reduces speed -6 runtime by 2-5%. Change-Id: I951922c50d901d0fff77a3bafc45992179bacef9	2014-12-11 09:33:40 -08:00
Paul Wilkins	65cfb808d0	Merge "Substantial restructuring of AQ mode 2."	2014-12-10 10:44:27 -08:00
Jingning Han	e728678c50	Refactor update_state_rt Update the frame motion vector only if previous frame motion vector is needed for next frame reference motion vector. Change-Id: Ica50f9d7b46ad4f815bba0d9e30f5546df29546f	2014-12-09 15:35:49 -08:00
Jingning Han	225cdef665	Make RTC coding flow support sub8x8 in key frame coding This commit enables the use of sub8x8 blocks in RTC key frame encoding. It requires the block size to be preset and will decide the coding mode and encode the bit-stream. Change-Id: I35aaf8ee2d4d6085432410c7963f339f85a2c19b	2014-12-09 11:34:58 -08:00
Jingning Han	4bacaab46d	Cosmetic naming change Rename set_modeinfo_offsets as set_mode_info_offsets, to be more consistent with naming convention. Change-Id: I68ca1f36c4a78127d9439a50c1506a2afd07927d	2014-12-09 10:32:04 -08:00
Jingning Han	f051a7beab	Take out redundant setting of mode_info from set_block_size The later encoding process will take the top-left block's mode_info for pre-determined block size. Change-Id: I76a90f9ce7f3b2dbc2975b52442114e461c465b5	2014-12-09 10:27:18 -08:00
Paul Wilkins	e68c8dcfd2	Substantial restructuring of AQ mode 2. The restructure moves the decision into the rd pick modes loop and makes a decision based at the 16x16 block level instead of only the 64x64 level. This gives finer granularity and better visual results on the clips I have tested. Metrics results are worse than the old AQ2 especially for PSNR and this mode now falls between AQ0 and AQ1 in terms of visual impact and metrics results. Further tuning of this to follow. It should be noted that if there are multiple iterations of the recode loop the segment for a MB could change in each loop if the previous loop causes a change in the complexity / variance bin of the block. Also where a block gets a delta Q this will alter the rd multiplier for this block in subsequent recode iterations and frames where the segmentation is applied. Change-Id: I20256c125daa14734c16f7cc9aefab656ab808f7	2014-12-09 15:10:52 +00:00
Jingning Han	1395ded2a7	Remove unused rd cost calculation from nonrd_use_partition The per block rd cost calculation is not needed when partition size is preset. Change-Id: Ie5575248bbffb584e908aa13097f697ace6ec747	2014-12-08 18:45:19 -08:00
James Zern	616b3a810f	vp9 asserts: fix compile warning string literal to int within an assert Change-Id: I76a173f96b9add5bf27c3f5ad5d72c6f30e51629	2014-12-05 16:20:42 -08:00
hkuang	eaa6deee5b	Merge "Merge set_prev_mi function into encoder function."	2014-12-05 15:12:50 -08:00
Jingning Han	6ae829088f	Merge "Remove redundant vp9_zero in choose_partitioning"	2014-12-05 11:47:58 -08:00
Jingning Han	62c7356098	Merge "Use hybrid RD and non-RD coding flow for key frame coding"	2014-12-05 11:25:19 -08:00
Jingning Han	9d88b30854	Remove redundant vp9_zero in choose_partitioning It makes the overall speed -6 about 2% faster with no compression performance change. Change-Id: I680a967b421caa2c5a5cdb821311c4726a2df45a	2014-12-05 10:39:39 -08:00
Jingning Han	07711e9b27	Use hybrid RD and non-RD coding flow for key frame coding When block size is below 16x16, the encoder swap from non-RD to RD mode for key frame coding. This largely brough back the key frame compression performance. For vidyo1 at 1000 kbps, the key frame coding statistics are changed 9978F, 34.183 dB, 36807 us -> 9838F, 35.020 dB, 61677 us As compared to the full RD case 7187F, 34.930 dB, 214470 us The overall rtc set coding performance (single key frame setting) is improved by 1.5%. Change-Id: I78a4ecf025d7b24ec911e85be94e01da05e77878	2014-12-05 09:35:27 -08:00
Yunqing Wang	a3a4a34c60	Merge "vp9_ethread: the tile-based multi-threaded encoder"	2014-12-05 08:23:49 -08:00
hkuang	62de07c8c6	Merge set_prev_mi function into encoder function. Change-Id: Ifcf2efbb232ea4cabcdebbe77e0820d121e4a6da	2014-12-04 14:44:23 -08:00
Yunqing Wang	eba9c762a1	vp9_ethread: the tile-based multi-threaded encoder Currently, VP9 supports column-tile encoding, which allows a frame to be encoded in multiple column tiles independently. The number of column tiles are set by encoder option "--tile-columns". This provides a way to encode a frame in parallel. Based on previous set of patches, this patch implemented the tile- based multi-threaded encoder. Each thread processes one or more tiles. Usage: For HD clips: --tile-columns=2 --threads=1/2/3/4 While using 4 threads, tests showed that the encoder achieved 2.3X - 2.5X speedup at good-quality speed 3, and 2X speedup at realtime speed 5. Change-Id: Ied987f8f2618b1283a8643ad255e88341733c9d4	2014-12-04 11:21:34 -08:00
Jingning Han	17176cd452	Fix indent in source_var_based_partition_search_method Change-Id: I6e5e0571d6967b9b992966336715e35bb97f187e	2014-12-03 12:37:36 -08:00
Marco	8fd3f9a2fb	Enable non-rd mode coding on key frame, for speed 6. For key frame at speed 6: enable the non-rd mode selection in speed setting and use the (non-rd) variance_based partition. Adjust some logic/thresholds in variance partition selection for key frame only (no change to delta frames), mainly to bias to selecting smaller prediction blocks, and also set max tx size of 16x16. Loss in key frame quality (~0.6-0.7dB) compared to rd coding, but speeds up key frame encoding by at least 6x. Average PNSR/SSIM metrics over RTC clips go down by ~1-2% for speed 6. Change-Id: Ie4845e0127e876337b9c105aa37e93b286193405	2014-12-03 09:18:08 -08:00
Peter de Rivaz	7e40a55ef9	Added high bitdepth sse2 transform functions Also removes some spurious changes in common/vp9_blockd.h which was introduced by a rebase issue between nextgen and master branches. Change-Id: If359f0e9a71bca9c2ba685a87a355873536bb282 (cherry picked from commit `005d80cd05`) (cherry picked from commit `08d2f54800`) (cherry picked from commit `4230c2306c`)	2014-12-02 11:16:24 -08:00
Yunqing Wang	0993bef7e9	vp9_ethread: calculate and save the tok starting address for tiles Each tile's tok starting address is calculated before the encoding process. These addresses are stored so that the same calculation won't be done again in packing bit stream. Change-Id: I0a3be0301f002260c19a850303f2f73ebc47aa50	2014-11-25 17:19:35 -08:00
Yunqing Wang	edbd61e136	vp9_ethread: modify VP9_COMP structure This patch modified struct VP9_COMP. Created a struct ThreadData to include data that need to be copied for each thread. In multiple thread case, one thread processes one tile. all threads share one copy of VP9_COMP, (refer to VP9_COMP cpi in the code) but each thread has its own copy of ThreadData, (refer to ThreadData td in the code). Therefore, within the scope of encode_tiles(), both cpi and td need to be passed as function parameters. In single thread case, the FRAME_COUNTS pointer in ThreadData points to "counts" in VP9_COMMON. Change-Id: Ib37908b2d8e2c0f4f9c18f38017df5ce60e8b13e	2014-11-24 17:57:38 -08:00
Jingning Han	2fbdfd2c66	Key frame non-RD mode decision process This commit makes a non-RD coding mode decision process for key frame coding. It can be optionally turned on in speed -6 and above. Change-Id: I0847258b392877a0210b4768bef88ebc9ad009b5	2014-11-24 09:04:28 -08:00
Paul Wilkins	f5209d7e01	Remove rate component adjustment for AQ1 In AQ1 a rate adjustment was applied for blocks coded with a deltaq. This tends to skew the partition selection and cause rate overshoot. For example, consider a 64x64 super block where some but not all sub blocks are in a low q segment and some are in a high q segment. The choice of Q when considering large partition and transform sizes is defined by the lowest sub block segment id (currently this implies the lowest Q). If some parts of the larger partition are very hard this will cause a high rate component. The correct behavior here is for the rd code to discard the large partition choice and break down to sub blocks where some have low and some have high Q. However the rate correction factor above mask the high cost of coding at a larger partition size. Change-Id: Ie077edd0b1b43c094898f481df772ea280b35960	2014-11-21 08:51:58 -08:00
Paul Wilkins	d031237999	Add variance restriction to AQ2. Add an additional restriction to bit/complexity based segmentation based on spatial variance. Only lower Q when both the number of bits spent in the initial encoding pass and the spatial complexity are below a threshold. This will prevent the low Q segments being used just because there is a surfeit of bits. Small metrics gains especially opsnr. derf ~0.2% std-hd ~0.3% Change-Id: I6a8496d466d673f9b0e2b2ca6304ea7b6d8e1cce	2014-11-20 16:23:35 -08:00
Paul Wilkins	6a760d483d	Initial AQ1 restructuring. This is the first of a series of patches to restructure and improve AQ mode 1 (variance based AQ). Change-Id: Idcf693131a3ea2459dcfd957a54a65b971fa4a2a	2014-11-20 15:50:15 -08:00
Yunqing Wang	54ba65a63e	Merge "vp9_ethread: move max/min partition size to mb struct"	2014-11-20 14:00:37 -08:00
Yunqing Wang	ad7586a9e1	vp9_ethread: move max/min partition size to mb struct The max_partition_size and max_partition_size are set at the beginning while setting speed features, and then adjusted at SB level. Moving them to mb struct ensures there is a local copy for each thread. Change-Id: I7dd08dc918d9f772fcd718bbd6533e0787720ad4	2014-11-20 09:24:50 -08:00
Yunqing Wang	70c9d2983b	Revert "vp9_ethread: include a pointer to mb in VP9_COMP" This reverts commit `6906d218dd`. Another way will be used to handle mb struct. Change-Id: Ic1111a46b2b1ee00f8f9e3fcd4cf3eb6030b2dc4	2014-11-20 08:31:12 -08:00
Yunqing Wang	d0b547c676	vp9_ethread: combine encoder counts in separate struct Several frame counters in encoder are updated at SB level. Combine those counters and put them in a separate struct, which allows us to allocate one copy for each thread. Change-Id: I00366296a13c0ada4d8fa12f5e07728388b6cab7	2014-11-14 16:09:22 -08:00
Yunqing Wang	6906d218dd	vp9_ethread: include a pointer to mb in VP9_COMP Modified VP9_COMP struct to include MACROBLOCK *mb. This change makes it feasible in multi-thread case to allocate a mb for each thread. Change-Id: I624d6d1aa9c132362200753e5d90b581b1738d6e	2014-11-14 12:31:06 -08:00
Yunqing Wang	807885b5e0	Merge "vp9_ethread: modify the cyclic refresh struct"	2014-11-13 18:35:01 -08:00
Yunqing Wang	8ee605f188	vp9_ethread: modify the cyclic refresh struct Two members in struct CYCLIC_REFRESH int64_t projected_rate_sb; int64_t projected_dist_sb; are updated at the superblock level, which makes them shared data in the multi-thread situation, and requires extra work to handle them. However, those values are updated and used immediately, and therefore can be removed. This patch cleaned up the code and removed the two members. Change-Id: I2c6ee4552bf49fb63ce590cdb47f9723974fffb1	2014-11-13 15:05:46 -08:00
Jingning Han	6efafda738	Merge "Refactor nonrd_use_partition coding process"	2014-11-13 13:58:21 -08:00
Jingning Han	754b05a4de	Refactor nonrd_use_partition coding process This commit integrates the non-RD mode decision process and the encoding process into a single recursion scheme. Change-Id: I6a7e72a0b84d567554801ebbe01ec75d54c1f77d	2014-11-06 17:00:48 -08:00
Jingning Han	10da059b52	Remove unused is_background function Change-Id: Ia540eac5f066ae95280c2f898370eddf0110c279	2014-11-05 21:19:23 -08:00
Jingning Han	caaf63b2c4	Rework cut-off decisions in cyclic refresh aq mode This commit removes the cyclic aq mode dependency on in_static_area and reworks the corresponding cut-off thresholds. It improves the compression performance of speed -5 by 1.47% in PSNR and 2.07% in SSIM, and the compression performance of speed -6 by 3.10% in PSNR and 5.25% in SSIM. Speed wise, about 1% faster in both settings at high bit-rates. Change-Id: I1ffc775afdc047964448d9dff5751491ba4ff4a9	2014-11-05 21:17:09 -08:00
hkuang	55577431ae	Bind motion vectors with frame buffer structure. This will save a lot of memory for decoder due to removing of prev_mi, but prev_mi is still needed in encoder. So this will increase a little bit memory for encoder. Change-Id: I24b2f1a423ebffa55a9bd2fcee1077dac995b2ed	2014-10-31 17:01:08 -07:00
Jingning Han	1cffea9fb7	Merge "Rework pred pixel buffer system in non-RD coding mode"	2014-10-31 08:55:24 -07:00
Jingning Han	7bea8c59f9	Rework pred pixel buffer system in non-RD coding mode This commit makes the inter prediction buffer system to support hybrid partition search. It reduces the runtime of speed -5 by about 3%. No compression performance change. vidyo1 720p 1000 kbps 11831 ms -> 11497 ms nik 720p 1000 kbps 10919 ms -> 10645 ms Change-Id: I5b2da747c6395c253cd074d3907f5402e1840c36	2014-10-30 11:08:35 -07:00
Jingning Han	07436abb86	Use zero motion vector in choose_partitioning The zero motion vector was effectively used in the subsampled pixel based variance calculation. This commit makes it directly use zero mv to generate prediction. Change-Id: Ica83dc843e9f8da2f89c3ef451e50f16214c0def	2014-10-27 19:38:43 -07:00
Jingning Han	d56b3eb0cf	Refactor encoder tile data structure Make the common tile info as one element in the encoder tile data struct. Change-Id: I8c474b4ba67ee3e2c86ab164f353ff71ea9992be	2014-10-27 19:37:13 -07:00
Jingning Han	192010d218	Refactor rtc coding mode to support tile encoding Use per tile threshold in the prediction mode search process. Change-Id: I6c74ee5a3b069bb4281002dfe51310911a0756c0	2014-10-27 09:53:46 -07:00
Jingning Han	eee201c221	Tile based adaptive mode search in RD loop Make the spatially adaptive mode search in rate-distortion optimization loop inter tile independent. Experiments suggest that this does not significantly change the coding staticstics. Single tile, speed 3: pedestrian_area 1080p 1500 kbps 59192 b/f, 40.611 dB, 101689 ms blue_sky 1080p 1500 kbps 58505 b/f, 36.347 dB, 62458 ms mobile_cal 720p 1000 kbps 13335 b/f, 35.646 dB, 45655 ms as compared to 4 column tiles, speed 3: pedestrian_area 1080p 1500 kbps 59329 b/f, 40.597 dB, 101917 ms blue_sky 1080p 1500 kbps 58712 b/f, 36.320 dB, 62693 ms mobile_cal 720p 1000 kbps 13191 b/f, 35.485 dB, 45319 ms Change-Id: I35c6e1e0a859fece8f4145dec28623cbc6a12325	2014-10-24 10:00:27 -07:00
Jingning Han	be212d4db3	Refactor rate distortion cost structure in non-RD coding mode This commit refactors the rate distortion structure used in the non-RD coding mode and saves a few RDCOST calculations. Change-Id: I62c3416c300d2c5372f21b96d93a6b633a34ab3a	2014-10-21 17:17:11 -07:00
Jingning Han	1ed1dde06d	Remove unused copy_partitioning Change-Id: I75a2a3772ed17e73180eb4f263cc838cae4927b0	2014-10-21 09:47:58 -07:00
Jingning Han	f2c21cfa1e	Merge "Remove deprecated constrain_copy_partitioning function"	2014-10-21 09:44:11 -07:00
Jingning Han	55ec7ebca7	Merge "Remove unused sb_has_motion function in vp9_encodeframe.c"	2014-10-21 09:43:55 -07:00
Jingning Han	072d844aff	Merge "Remove deprecated use_lastframe_partitioning feature"	2014-10-21 09:43:45 -07:00
Jingning Han	61ff08ef61	Merge "Hybrid partition search for rtc coding mode"	2014-10-21 09:43:35 -07:00
Paul Wilkins	889c53a507	Merge "Resolve compiler warning."	2014-10-21 07:02:13 -07:00
Jingning Han	abb2fbb10e	Remove deprecated constrain_copy_partitioning function Its functionality has been replaced with choose_partitioning and threshold based control on split mode check. Change-Id: Ic9bb321df06b524f5c38ea5874dc6f6a8f93c5e3	2014-10-20 17:08:21 -07:00
Jingning Han	ef53898c48	Remove unused sb_has_motion function in vp9_encodeframe.c Change-Id: I035fb6aa5c10741b065e27befb097d8087e3c62f	2014-10-20 17:08:11 -07:00
Jingning Han	e62ce79e1a	Remove deprecated use_lastframe_partitioning feature This speed feature has been deprecated in both yt and rtc coding modes. This commit removes the related operations. Change-Id: I079c79c6adafe45581af2ebf8b98faebcface1ce	2014-10-20 17:03:38 -07:00
Jingning Han	9f128b3ed9	Hybrid partition search for rtc coding mode This commit re-designs the recursive partition search scheme in rtc speed -5. It first checks if the current block is under cyclic refresh mode. If so, apply recursive partition search. Otherwise, perform sub-sampled pixel based partition selection. When the pre-selection finds the partition size should be 32x32 or above, use the partition size directly. Otherwise, apply partition search at nearby levels around the preset partition size. It is enabled in speed -5. The compression performance of rtc speed -5 is improved by 9.4%. Speed wise, the run-time goes slower from 1% to 10%. nik_720p, 1000 kbps 33220 b/f, 38.977 dB, 10109 ms -> 33200 b/f, 39.119 dB, 10210 ms vidyo1_720p, 1000 kbps 16536 b/f, 40.495 dB, 10119 ms -> 16536 b/f, 40.827 dB, 11287 ms Change-Id: I65adba352e3adc03bae50854ddaea1b421653c6c	2014-10-20 13:02:12 -07:00
Yunqing Wang	67c866750c	Merge "Remove the dependency in token storing locations"	2014-10-20 08:26:46 -07:00
Paul Wilkins	9626a0cb62	Resolve compiler warning. conversion from 'const int64_t' to 'int', possible loss of data. Change-Id: I471a73bba5d448d9be0ef9cbf1590fa73aa74be1	2014-10-20 12:08:33 +01:00
Debargha Mukherjee	6202c75f84	Merge "Add highbitdepth function for vp9_avg_8x8"	2014-10-18 14:37:10 -07:00
Yaowu Xu	06e65269c7	Merge "Remove unused VAR_BASED_FIXED_PARTITION flag"	2014-10-18 13:31:47 -07:00
Yaowu Xu	7bf475926b	Merge "Use rate/distortion thresholds to control non-RD partition search"	2014-10-18 13:31:41 -07:00
Peter de Rivaz	73ae6e495c	Add highbitdepth function for vp9_avg_8x8 Cherry-picked from https://gerrit.chromium.org/gerrit/#/c/71914/ (`a92f987a6b`) on highbitdepth branch. Change-Id: I6903e4e4cb57d90590725c8a1c64c23da7ae65e8	2014-10-17 17:04:37 -07:00
Yunqing Wang	7c4992c466	Remove the dependency in token storing locations Currently, the tokens for a tile are stored immediately after its preceding tile, which causes a dependency. This is unnecessary since we always allocate enough memory for tokens. Removing the dependency allows token writing done in parallel. This patch doesn't change encoding result. Change-Id: I7365a6e5e2c2833eb14377c37e1503c9d0f26543	2014-10-17 14:25:33 -07:00
Jingning Han	3bc94cd2eb	Merge "Add init and reset functions for RD_COST struct"	2014-10-17 11:15:19 -07:00
Jingning Han	e1111fba7e	Remove unused VAR_BASED_FIXED_PARTITION flag Change-Id: I4ce19b7cb1c45fed86e81ee785e787630020fb4f	2014-10-17 09:02:25 -07:00
Marco	48ea5b7190	Merge "Some updates for Speed 6/VAR_BASED_PARTITION."	2014-10-15 15:57:21 -07:00
Jingning Han	e2612fbd70	Add init and reset functions for RD_COST struct Change-Id: I2902de7051a883fd22e27a655209233733969cfd	2014-10-15 15:02:06 -07:00
Jingning Han	5e766ccee0	Use rate/distortion thresholds to control non-RD partition search Compare the estimated rate and distortion to the thresholds scaled according to the operating block size and determine if further split partition search will be run. The compression performance of speed -5 is changed by -0.074%. The encoding speed is 10% - 15% faster. vidyo1 720p 16545 b/f, 40.492 dB, 11475 ms -> 16535 b/f, 40.486 dB, 10100 ms nik720p 16624 b/f, 36.310 dB, 10071 ms -> 16617 b/f, 36.313 dB, 8346 ms Change-Id: Ic9197ab5761279ae55d2fb7813b2af0e0db497b8	2014-10-15 13:40:33 -07:00
Marco	09ea74f194	Some updates for Speed 6/VAR_BASED_PARTITION. Reduce the intra_cost_penalty for non-rd mode, and some updates to VAR_BASED_PARTITION. Visual tests show some improvement at Speed 6, for RTC clips. Change-Id: If9090daf7aed14906a32d931a538ab544bbca606	2014-10-15 12:06:48 -07:00
Jingning Han	89b8c7a513	Replace copy_partitioning use case with choose_partitioning This commit replaces the use of copy_partitioning with choose_partitioning based on the sse of subsamped pixels, which provides significantly better coding performance and runs at similar speed, as compared to copy_partitioning. It improves rtc speed 5 coding performance by 3%. Change-Id: I52d3682a12dce0147f5e52383a594fc242ca3228	2014-10-15 11:37:20 -07:00
Jingning Han	811cef97c9	Refactor rate distortion cost structure This commit makes a struct that contains rate value, distortion value, and the rate-distortion cost. The goal is to provide a better interface for rate-distortion related operation. It is first used in rd_pick_partition and saves a few RDCOST calculations. Change-Id: I1a6ab7b35282d3c80195af59b6810e577544691f	2014-10-13 14:27:16 -07:00
Yaowu Xu	bdea0055b2	Merge "vp9/choose_partitioning: add missing clear_system_state"	2014-10-10 09:16:19 -07:00
James Zern	a3e1a9291a	vp9/choose_partitioning: add missing clear_system_state set_vt_partitioning does double math Change-Id: I8e9d73d5c89b937a5326abf04164d24d9d88c5ef	2014-10-10 08:14:46 -07:00
Deb Mukherjee	9a29fdbae7	Merge "Rename highbitdepth functions to use highbd prefix"	2014-10-09 15:39:56 -07:00
Deb Mukherjee	1929c9b391	Rename highbitdepth functions to use highbd prefix Uses highbd_ prefix convention consistently. Change-Id: I58f7f799a7ff8e32701bcd71c955bcf1cdd4581e	2014-10-09 14:40:40 -07:00
Jingning Han	112789d4f2	Merge "Remove sub8x8 block index from rd_pick_partition argument"	2014-10-09 11:16:11 -07:00
Jingning Han	6a0d291fce	Remove sub8x8 block index from rd_pick_partition argument This parameter is deprecated. Its function is replaced with other explicit condition check. Change-Id: I61337e350ba8ca9eb50382db8b4d4acbf45cb7eb	2014-10-09 09:20:16 -07:00
James Zern	cec763bd97	set_vt_partitioning: fix type conversion warning double -> int64 + make threshold_multiplier an int Change-Id: I6d3607fdf13d670f57c9d9b04a80acb2be1346a0	2014-10-09 11:41:36 +02:00
Jim Bankoski	20254d1daa	Merge "experimental : partition using 1/8 x 1/8 image"	2014-10-08 09:04:26 -07:00
Jim Bankoski	0ce51d823f	experimental : partition using 1/8 x 1/8 image The concept: There's too much noise in source pixels for variance and at low bitrate the reconstructed looks nothing like the source so we have problems getting good partitionings with either. This skirts the issue by using a box blur scaled down version for variance calculations. To compare against source_var_ moved keyframe to be rd based like source_var. Change-Id: Ie3babdbfadae324b7b5a76bea192893af27f0624	2014-10-07 16:36:14 -07:00
Jingning Han	7ee58985bd	Replace mi_width_log2() with mi_width_log2_lookup table Change-Id: If0ea98aa139d14d40cd924114e18396aff36b5a5	2014-10-07 12:45:25 -07:00
Jingning Han	b66f7016c1	Take out repeated block width/height lookup functions The functions b_width_log2 and b_height_log2 only do direct table fetch. This commit unifies such use cases by using the table directly and removes these functions. Change-Id: I3103fc6ba959c1182886a2799d21b8b77c8a7b6b	2014-10-07 12:33:07 -07:00
Yaowu Xu	5966acc1be	Merge "Properly initialize segmentID in nonrd coding path"	2014-10-06 07:57:36 -07:00
Yaowu Xu	0065b73481	Properly initialize segmentID in nonrd coding path This commit adds proper initialization of segment id for variance AQ mode in non-rd coding path. It fixes the enc/dec mismatch issue of rt=7 with --aq-mode=1, as reported in issue #816 Change-Id: I02fa41b96345bf2e66077d5ea553f85ba800f7bb	2014-10-03 15:01:53 -07:00
Jingning Han	ef62233396	Fix indent in encode_rd_sb_row Change-Id: Icbcfe7b56d88474f4398b4c5b52f6719d551ab4a	2014-10-03 11:57:36 -07:00
Jingning Han	bb260d9076	Rework partition search skip scheme This commit enables the encoder to skip split partition search if the bigger block size has all non-zero quantized coefficients in low frequency area and the total rate cost is below a certain threshold. It logarithmatically scales the rate threshold according to the current block size. For speed 3, the compression performance loss: derf -0.093% stdhd -0.066% Local experiments show 4% - 20% encoding speed-up for speed 3. blue_sky_1080p, 1500 kbps 51051 b/f, 35.891 dB, 67236 ms -> 50554 b/f, 35.857 dB, 59270 ms (12% speed-up) old_town_cross_720p, 1500 kbps 14431 b/f, 36.249 dB, 57687 ms -> 14108 b/f, 36.172 dB, 46586 ms (19% speed-up) pedestrian_area_1080p, 1500 kbps 50812 b/f, 40.124 dB, 100439 ms -> 50755 b/f, 40.118 dB, 96549 ms (4% speed-up) mobile_calendar_720p, 1000 kbps 10352 b/f, 35.055 dB, 51837 ms -> 10172 b/f, 35.003 dB, 44076 ms (15% speed-up) Change-Id: I412e34db49060775b3b89ba1738522317c3239c8	2014-10-03 11:54:30 -07:00
Yunqing Wang	b1b6fd85db	Merge "Skip the partition search for still frames"	2014-09-30 11:59:05 -07:00
Yunqing Wang	c8d01b1eaf	Merge "Refactor encode_rd_sb_row function"	2014-09-30 11:58:39 -07:00
Yunqing Wang	1fcbf6ed56	Skip the partition search for still frames This patch re-enabled the feature in Pengchong's patch (commit `1286126073`). Originally, it was turned on while use_lastframe_partitioning > 0(not used anymore). Now it was added as a feature, and turned on while speed >= 2. As described in the original patch, this feature helps speed up the slideshows in YouTube. Change-Id: I1b0f18d65da1ee1c8d1e117dabba910c5207c471	2014-09-26 09:03:52 -07:00
Deb Mukherjee	993d10a217	Adds various high bit-depth encode functions Change-Id: I6f67b171022bbc8199c6d674190b57f6bab1b62f	2014-09-25 01:50:36 -07:00

... 2 3 4 5 6 ...

1227 Commits