generic-library/vpx

Author	SHA1	Message	Date
Alex Converse	90c9ede8e6	Limit cyclic refresh revisiting blocks at the same quantizer. For screen content don't refresh a block at a quantizer higher than it was last coded at. Previously at realtime speeds the encoder had a tendency to recode a block from GOLDEN with a higher Q than it was last coded at. Change-Id: Iacd561806c769dcce1a81b9827ffc70090f5ba18	2015-06-19 15:23:02 -07:00
Jingning Han	8e8bc5f28b	Add dynamic range comment to vp9_int_pro_col Change-Id: If14d9f874bd0bf2c5a455982088fd70591f5ea5a	2015-06-19 09:43:57 -07:00
Yaowu Xu	d8428ae35d	Fix a msvc compiler warning Change-Id: Ida8a04370895ed14bd118324ec2577da926e4648	2015-06-19 09:04:29 -07:00
James Zern	c5d779d266	Merge changes I2552d810,I51952c0a,Ib82e4247,I9c8d16cb * changes: vp9_mcomp: make search_step_table static vp9_encodeframe: delete auto_partition_range() vp9_mcomp: don't mark setup_center_error() inline vp9_encoder: hide adjust_image_stat()	2015-06-19 03:31:38 +00:00
Marco	d77f51ba9e	Add dynamic resize logic for 1 pass CBR. Decision to scale down/up is based on buffer state and average QP over previous time window. Limit the total amount of down-scaling to be at most one scale down for now. Reset certain quantities after resize (buffer level, cyclic refresh, rate correction factor). Feature is enabled via the setting rc_resize_allowed = 1. Change-Id: I9b1a53024e1e1e953fb8a1e1f75d21d160280dc7	2015-06-18 17:13:37 -07:00
Jingning Han	d1398e9f13	Merge "Add dynamic range comment to vp9_satd"	2015-06-18 19:36:53 +00:00
Jingning Han	4f1f510f16	Add dynamic range comment to vp9_satd Change-Id: I75873846e6fdafbe7597a1bd0192115d2d1e9987	2015-06-18 09:18:22 -07:00
Parag Salasakar	b6ea0c4c57	Merge "mips msa vp9 fdct 32x32 optimization"	2015-06-18 04:30:53 +00:00
Jingning Han	7f6cddb58f	Take out assertion for block_yrd in rtc coding flow The internal behavior of block_yrd in high bit depth settings differs from the 8-bit one. This makes the assertion condition false for high bit depth. Change-Id: I15dc02e7162d27cabe78c451941d769d488b1174	2015-06-17 08:51:16 -07:00
James Zern	0d51a97ae9	Merge "Fix integer overflow issue in rtc coding flow intra mode search"	2015-06-17 05:29:32 +00:00
Jingning Han	bc7074508a	Fix integer overflow issue in rtc coding flow intra mode search The overflow issue affects a variable that is only used in inter mode. This commit fixes the ioc warning triggered in the intra mode. It does not affect the compression performance. Change-Id: I593d1b5650599de07f3e68176dd1442c6cb7bdbc	2015-06-16 19:31:24 -07:00
Parag Salasakar	d9fedf7832	mips msa vp9 fdct 32x32 optimization average improvement ~4x-6x Change-Id: Ibcac3ef8ed5e207cf8c121e696570e6b63d3c0f4	2015-06-17 07:58:34 +05:30
Parag Salasakar	fa53008fb7	Merge "mips msa vp9 fdct 16x16 optimization"	2015-06-17 01:21:59 +00:00
Marco	8914ab696d	Remove duplicate calls for set_frame_size in 1 pass mode. set_frame_size() is being called twice, once before entering encode_frame_to_data_rate(), and once again in that function. No need to call it twice for one-pass mode. Change-Id: I5fabaf0a90482d4f42cd89ef7ae1402c31aec600	2015-06-16 09:57:13 -07:00
Scott LaVarnway	5fe0e55ca4	Merge "Eliminated frame_type check in get_partition_probs()"	2015-06-16 13:40:23 +00:00
Scott LaVarnway	b2658ec321	Eliminated frame_type check in get_partition_probs() Moved the frame_type check to the tile level and stored the prob ptr in MACROBLOCKD. Change-Id: I10b5a4abd58213dc7610e3ade1a1583c01526842	2015-06-16 05:37:54 -07:00
Parag Salasakar	89b4b315aa	mips msa vp9 fdct 16x16 optimization average improvement ~4x-6x Change-Id: Id3b2243e5b3c7844c90c4231a5e75fa69911362c	2015-06-16 12:49:34 +05:30
James Zern	3edd293dae	vp9_pred_common: inline vp9_get_tx_size_context + drop 'vp9_' prefix Change-Id: If3f3ec32d03026af78b8fcd82749e587a3f43059	2015-06-15 18:41:22 -07:00
James Zern	e6add6499f	vp9_pred_common: inline vp9_get_segment_id + drop 'vp9_' prefix Change-Id: Id5a3c8d416dbdf93d9f4f1bde662f7b2c2290168	2015-06-15 18:41:14 -07:00
Yunqing Wang	e820ca6973	Merge "vp9_ethread: create enough threads while using SVC"	2015-06-15 23:03:32 +00:00
James Zern	a6d126709a	Merge changes I19588f9e,I6dc338a6 * changes: vp9_encodeframe: make coord_lookup[] static vp9_resize: make vp9_filteredinterp_*[] static	2015-06-15 23:03:28 +00:00
Yunqing Wang	c98273c9e7	vp9_ethread: create enough threads while using SVC This patch modified the thread creating code. When use_svc is true, the number of threads created is decided by the highest resolution. This resolved WebM issue 1018. Change-Id: I367227b14d1f8b08bbdad3635b232a3a37bbba26	2015-06-15 14:30:54 -07:00
Marco	24b3ede251	Remove redundant second declaration in svc_layercontext.c Change-Id: Ia3b1c1db54204fd92a56b7f698a9f26d27ee572a	2015-06-15 14:06:43 -07:00
James Zern	5214bd52c8	vp9_encodeframe: make coord_lookup[] static Change-Id: I19588f9e674c8635b6e58e4633120be736d256a6	2015-06-12 19:47:46 -07:00
James Zern	5168baea10	vp9_resize: make vp9_filteredinterp_*[] static + drop the vp9_ prefix Change-Id: I6dc338a69265dcaa8c6fe071e5757312bf92efca	2015-06-12 19:47:45 -07:00
James Zern	aaa49f0485	vp9_mcomp: make search_step_table static Change-Id: I2552d8101cf49ed951782ab69adce407579700fc	2015-06-12 18:11:54 -07:00
James Zern	31509af247	vp9_encodeframe: delete auto_partition_range() unused since: `1f00a9b` Fix choose_partitioning threshold setup for speed -5 Change-Id: I51952c0a1be3e6e0aa36ff2ffcfbbea60a505960	2015-06-12 17:57:37 -07:00
James Zern	7ea431df98	vp9_mcomp: don't mark setup_center_error() inline this function is a bit too involved for the hint; avoids a -Winline warning Change-Id: Ib82e424764aa78b37ddb94116e2b009a6de31d35	2015-06-12 17:56:33 -07:00
James Zern	471302a07b	vp9_encoder: hide adjust_image_stat() this function is only needed with CONFIG_INTERNAL_STATS Change-Id: I9c8d16cb9069dd8370f8b30329933c0d97f6d0aa	2015-06-12 17:55:08 -07:00
Jingning Han	176c291d9c	Fix potential overflow issue in hadamard_16x16() This commit fixes a potential integer overflow issue in function hadamard_16x16. It adds corresponding dynamic range comment. Change-Id: Iec22f3be345fb920ec79178e016378e2f65b20be	2015-06-12 10:56:18 -07:00
Jingning Han	4f52d49f1e	Add dynamic range comment to hadamard_8x8() Add comment to assist SIMD optimization. Change-Id: I300d5a848e6e9947e451de2a871a88940703fc9f	2015-06-12 10:39:49 -07:00
Yunqing Wang	254a4c033c	Merge "Allocate tile data adaptively to accommodate the frame size increase"	2015-06-12 15:49:40 +00:00
Scott LaVarnway	0fbc277746	Merge "inline vp9_get_segdata()"	2015-06-11 19:48:19 +00:00
Yunqing Wang	2c838ede68	Allocate tile data adaptively to accommodate the frame size increase If the frame size increases, the tile data buffer needs to be re-allocated according to the number of tiles existing in current frame. This patch makes the multi-tile encoding work in spatial SVC usage case, and partially solved WebM issue 1018. Change-Id: I1ad6f33058cf5ce6f60ed5024455a709ca80c5ad	2015-06-11 11:30:18 -07:00
Scott LaVarnway	cca866f578	inline vp9_get_segdata() and change name. Change-Id: I706645cf9d9dc04f1b3b6ac80df80edb7f101854	2015-06-11 09:52:00 -07:00
Marco	2aa67ce20f	Move adjustment of some CR parameters to existing function. Refactor/no change in behavior. Change-Id: Idb3c55b1304feaf689b90403f79bc96dba26f060	2015-06-11 08:31:03 -07:00
Scott LaVarnway	a49c701529	Merge "inline vp9_segfeature_active()"	2015-06-11 12:29:45 +00:00
Scott LaVarnway	42c0b1b1f1	inline vp9_segfeature_active() and changed name. Change-Id: Ie023ca66cc2c823032f58d4faeb53fd1863c94f3	2015-06-11 04:20:55 -07:00
Paul Wilkins	59114915bc	Merge "Changes to active maxq calculation in two pass."	2015-06-10 13:33:53 +00:00
Scott LaVarnway	97880c3324	Merge "Reducing size of MODE_INFO struct"	2015-06-10 13:15:19 +00:00
Marco	997ac14c6a	Adjust some parameters for cyclic refresh for low bitrates. Reduce motion threshold and boost factor for second segment, for low bitrates, at low resolutions for now. This is to reduce the rate fluctuation/frame dropping that occurs at these low bitrates. Change-Id: Ia66c3be41831882fca8c1e4fe104f5ea8fbe7142	2015-06-09 15:10:03 -07:00
Paul Wilkins	faf8c63b0f	Changes to active maxq calculation in two pass. Some initial experiments into discounting dead zone formatting bars and intra skip blocks (common in some types of animation and graphics) in the calculation of the active max Q for each ARF/GF group. TODO: check for vertical formatting bars and validate the horizontal bar at the bottom edge of the image. As expected, this change, as it stands, does not make much difference for the natural videos in the std-hd and derf sets. However, for the yt and yt hd set there is a significant rise in the average PSNR with overall PSNR and SSIM remaining neutral. The mean rise for the YT-HD test set was > 6%. This is mainly because the change allows Q to drop further on titles and other graphics sections where spending a small number of extra bits gives a sharp rise in PSNR. Change-Id: I3f878ae91fc1854312d7ecf9fa792c17bc1aa6b7	2015-06-09 15:31:24 +01:00
Paul Wilkins	4a28da5843	Enable more split modes for animated content. For content that is identified as likely to contain some animation or graphics content, increase the availability of split modes for good quality speeds 1-3. On a problem test animation clip this improves metrics results by about 0.25 dB and makes a noticeable difference visually. It also causes a small drop in file size (~0.5%) but a rise in encode time of about 5-6% at speed 2. For more normal content it should have no effect. Change-Id: Ic4cd9a8de065af9f9402f4477a17442aebf0e439	2015-06-09 14:50:44 +01:00
Paul Wilkins	b19b16cfa1	Merge "Animation and dead zone detection."	2015-06-08 14:26:07 +00:00
Johann	a4dad3e961	Merge "Duplicate reference variance code"	2015-06-05 16:54:33 +00:00
Marco	8710cceb45	Fix to spatial svc: set reference_frame masking. For real-time mode: keep reference_frame masking off for spatial svc. Change-Id: I15e123c06f67ea040172b8d4042a672f3525b9d8	2015-06-05 08:25:33 -07:00
Marco	8f7e7663ad	Bugfix in setting layer framerate. Index for ts_rate_decimator should be temporal layer (tl) index. Change-Id: I0320b7f7ae987ef64fdfe7c45099e7978a8fef17	2015-06-04 13:12:09 -07:00
Scott LaVarnway	baaaa57533	Reducing size of MODE_INFO struct Reduced size from 124 bytes to 104 bytes. For decode only builds, it is reduced to 68 bytes. Change-Id: If9e6b92285459425fa086ab5a743d0a598a69de3	2015-06-04 07:32:16 -07:00
Johann Koenig	c005792951	Merge "Make vp9 subpixel match vp8"	2015-06-04 06:16:13 +00:00
Johann	eb88b172fe	Make vp9 subpixel match vp8 The only difference between the two was that the vp9 function allowed for every step in the bilinear filter (16 steps) while vp8 only allowed for half of those. Since all the call sites in vp9 (<< 1) the input, it only ever used the same steps as vp8. This will allow moving the subpel variance to vpx_dsp with the rest of the variance functions. Change-Id: I6fa2509350a2dc610c46b3e15bde98a15a084b75	2015-06-03 22:10:51 -07:00
Marco	a8c5ab2ca6	Remove ABI check for 1 pass CBR SVC. Remove the ABI check for the controls needed for SVC 1 pass CBR mode. Bump up the ABI version. Change-Id: I35b79ee010e14af83c6d1e801d574deaaa2fc7eb	2015-06-03 17:43:22 -07:00
Paul Wilkins	668e804504	Animation and dead zone detection. Adds code to detect dead zone bars at the top and bottom of reformatted letterbox video (note that the code only looks at the top of the image and assumes any dead zone is symmetrical). Use of this to adapt rate control etc. will follow in a subsequent patch. Also counts other blocks (excluding the dead zone) that have no intra signal. The presence of a significant number of such blocks can be used to identify that the frame may be artificial (e.g. animation, screen capture, graphics). This patch contains plumbing only and does not use the signal. Change-Id: I59bc93529cd4065416cef773e405fda3ae006a20	2015-06-04 01:01:20 +01:00
Johann	ce2ca9f777	Duplicate reference variance code Some places are using the unoptimized variance function. This was never intended and does not fit into the optimization framework. Change-Id: Id96238407aad03b0ffd4a46cd183555a026daedc	2015-06-03 13:28:59 -07:00
Marco	c139b81a13	Vidyo patch: Rate control for SVC, 1 pass CBR mode. -Make Rate control work for SVC 1 pass CBR mode. -Added temporal layering mode. -Fixed bug in non-rd variance partition. -Modified/updated the sample encoders (vp9_spatial_svc_encoder, vpx_temporal_svc_encoder). -Added datarate unittest(s) for 1 pass CBR SVC. Change-Id: Ie94b1b68a56ea1267b5087c625e5df04def2ee48	2015-06-02 07:54:13 -07:00
paulwilkins	dbd3760712	Merge "Fast feedback of bits on undershoot."	2015-06-01 18:15:10 +00:00
Marco	26ab314176	For non-rd pickmode: remove VAR_PARTITION condition. Keep the logic, transform size based on cyclic refresh and bsize, (that was conditioned on VAR_PARTITION conditions) the same for all speeds in non-rd mode (speeds >= 5). No change to speeds >=6. Small improvement for speed 5, ~0.5/1.5% gain for avg psnr/ssim. Change-Id: If9c5657f3d30efd3c7f147166bba7cb69ea55114	2015-05-28 17:29:47 -07:00
Minghai Shang	45db29784d	Merge "[svc] Disable tiles for spatial svc case"	2015-05-28 22:13:54 +00:00
Scott LaVarnway	bbea7c95d8	Merge "Re-worked header files"	2015-05-28 19:56:39 +00:00
hkuang	5317185eb0	Merge "Add error handling when running out of free frame buffers."	2015-05-28 17:41:01 +00:00
hkuang	131cab7c27	Add error handling when running out of free frame buffers. Change-Id: If28b59b9521204a6e3aecedcf75932d76a752567	2015-05-27 14:20:58 -07:00
Marco	a49fff632c	Non-rd variance partition: Adjust thresholds for 1080p. Increase the 32x32 split threshold, to allow for more 32x32 at expense of 16x16. Visually looks somewhat better. Change-Id: Ia1439c3a0dc2d7933468b88bd59266fcd9f03505	2015-05-27 12:30:35 -07:00
Marco	109a2edf90	Merge "Refactor set_vbp_thresholds."	2015-05-27 19:10:28 +00:00
Minghai Shang	30181c46d8	Merge "[svc] Make size of empty frame to be 16x16 all the time"	2015-05-27 17:49:00 +00:00
Marco	f76d42a98a	Refactor set_vbp_thresholds. Break out the setting of the block variance split thresholds, since they are locally modified, e.g., based on local/segment qp. No change in performance. Change-Id: I0a3238e6dab05140657539fc4bd27ac5ff7a554e	2015-05-27 09:25:18 -07:00
Minghai Shang	15353216c5	[svc] Make size of empty frame to be 16x16 all the time Change-Id: Ibab09aa0e8c69cf5efea2f0ec035e5da9cc894b0	2015-05-26 16:04:36 -07:00
Johann	dee70d355f	Merge "Move variance functions to vpx_dsp"	2015-05-26 23:02:11 +00:00
Johann	c3bdffb0a5	Move variance functions to vpx_dsp subpel functions will be moved in another patch. Change-Id: Idb2e049bad0b9b32ac42cc7731cd6903de2826ce	2015-05-26 12:01:52 -07:00
Minghai Shang	9ae5fb706e	Merge "[svc] Turn on frame_parallel_decoding_mode"	2015-05-26 17:50:45 +00:00
Jingning Han	96dba4902c	Fix integral projection motion search for frame resize This commit fixes the integral projection motion search crash when frame resize is used. It fixes issue 994. Change-Id: Ieeb52619121d7444f7d6b3d0cf09415f990d1506	2015-05-22 15:40:45 -07:00
Scott LaVarnway	b962646fc5	Re-worked header files Various header/test files had to be re-worked in order to build "Remove cm parameter from vp9_decode_block_tokens()". This patch reverts the "Remove cm" part and only contains the re-worked header files. Change-Id: I520958a88d1991fee988a3c784d0eac40e117a32	2015-05-22 11:19:51 -07:00
Minghai Shang	9843e7c635	[svc] Disable tiles for spatial svc case Change-Id: I8655a6760ab61947c09f337ddd9f4c1baf803a56	2015-05-20 14:31:49 -07:00
Minghai Shang	e2c6a633fb	[svc] Turn on frame_parallel_decoding_mode Change-Id: I33b0384ee87f83950e03be6c999bc5f193055fd3	2015-05-20 10:56:48 -07:00
paulwilkins	883fdd45cf	Fast feedback of bits on undershoot. This patch provides a partial rapid feedback of bits resulting from extreme undershoot. Some improvement on some problem animated material but in its current form only a small impact on the metrics results of our standard test sets. Change-Id: Ie03036ea8123bc2553437cb8c8c9e7a9fc5dac5d	2015-05-20 16:47:34 +01:00
paulwilkins	ade9693a30	Fix issues with mixed ARF and GF groups. This patch addresses two issues that can occur when the encoder chooses to use a mixture of ARF and GF groups. The first issue relates to a failure to reset the "ARF active" flag correctly when transitioning from coding ARF groups to coding GF groups. This caused some golden frames to be encoded with an incorrect bit rate target as if they were ARF overlay frames. The second issue relates to the encoding of a single short GF group just before a key frame. Where the last group before a key frame is an ARF group we expect the final frame before the key frame to be a low data rate overlay frame. However, when the last group is a GF group, the final frame before the key frame should be a normal frame with a normal bit allocation. This issue had the potential to cause a single poorly coded frame just before a key frame. If that key frame were a forced key frame rather than a real scene cut, this might cause pulsing. Change-Id: Idf1eb5eaf63a231495a74de7899236e1ead9fb00	2015-05-20 16:46:44 +01:00
James Zern	a989c66b84	rename vp9_dct_impl_sse2.c to vp9_dct_sse2_impl.h this file shouldn't be built directly, it is included in vp9_dct_sse2.c to create a non-high-bitdepth and a high-bitdepth version silences missing prototype warnings for the unused FDCT* functions Change-Id: Ide6ff8c24ab31bdb0f833260505ae33660a1ad5b	2015-05-15 17:01:19 -07:00
James Zern	587a71f1d6	rename vp9_dct32x32_sse2.c to vp9_dct32x32_sse2_impl.h this file shouldn't be built directly, it is included in vp9_dct_sse2.c to create a non-high-bitdepth and a high-bitdepth version silences missing prototype warnings for the unused FDCT32x32* functions Change-Id: I0e38f16dae5ea1728de184ee2c89287d48675c51	2015-05-15 16:59:52 -07:00
James Zern	4ec47249bc	rename vp9_dct32x32_avx2.c to vp9_dct32x32_avx2_impl.h this file shouldn't be built directly, it is included in vp9_dct_avx2.c to create a non-high-bitdepth and a high-bitdepth version silences missing prototype warnings for the unused FDCT32x32* functions Change-Id: I4c19935c0e035b393be513bde735e9a78064a494	2015-05-15 16:47:51 -07:00
James Zern	985f19bc6b	Merge changes from topic 'missing-proto' * changes: vp9_subexp.h: add a missing prototype vp9: add some missing includes vp9 intrinsics: add vp9_rtcd include vp9: correct some function signatures vp9_variance_sse2: sync function signatures vp9/encoder: make some functions static vp9_dct_sse2: make some functions static vp9_decodeframe.c: make a function static	2015-05-15 23:08:15 +00:00
Marco	e88de49faa	Change tx_size_search_method setting for non-rd speed 5. Use the same setting as in speed >=6. This will use same logic for tx_size selection as in speed >=6, which limits the transform size and reduces ringing artifact. Also metrics go up on average with this change: ~2% for PSNR, ~10% for SSIM. Change-Id: Ia2d50db236ae1cc72f742bfa6c9ec5ea50ff0e0a	2015-05-15 11:12:47 -07:00
James Zern	ca5a54113f	vp9_subexp.h: add a missing prototype + include the .h in the .c silences missing prototype warnings Change-Id: Ia87366dccb4bf4e9f2ffa5d3ab51ac6ca5488c91	2015-05-15 10:43:48 -07:00
James Zern	97db651ce0	vp9: add some missing includes mostly: <file>.c should include <file>.h silences missing prototype warnings Change-Id: Ic05ec32c6f7b2224b78825904d96d73aacad6000	2015-05-15 10:43:47 -07:00
James Zern	330fba41e2	vp9 intrinsics: add vp9_rtcd include silences a missing declaration warning Change-Id: I59a34e1a1377cf3529b678d7ec0122bd43ab1bf1	2015-05-15 10:43:47 -07:00
James Zern	18b60af27c	vp9: correct some function signatures silences missing prototype warnings Change-Id: Idaf68d83d2cb03847f3ee002c4d00c2ac79da604	2015-05-15 10:43:47 -07:00
James Zern	43d5cc7fe1	vp9_variance_sse2: sync function signatures + include vp9_rtcd.h silences missing prototype warnings Change-Id: I77902f07a454029baad4fe5fe6fc37c65644e6f7	2015-05-15 10:43:47 -07:00
James Zern	700b7fd0a9	vp9/encoder: make some functions static silences missing prototype warnings Change-Id: I3338fcaa67b5dcdf6bf237e8b374db3befd18753	2015-05-15 10:43:47 -07:00
James Zern	8515e62e6b	vp9_dct_sse2: make some functions static silences missing prototype warnings Change-Id: I773b6a6b5bd7c57db18c3b17c519534f80e131de	2015-05-15 10:43:47 -07:00
paulwilkins	4f569e8485	Merge "Revert "Skip the last frame update for some frame repeats.""	2015-05-15 09:17:19 +00:00
paulwilkins	eb8faf1c89	Revert "Skip the last frame update for some frame repeats." Testing on another rate control patch reveals that in some situations, where the encoder is flipping in and out of arf mode, we get an encoder decoder mismatch. Whilst it is still not clear why, skipping the last buffer update seems to trigger the problem. Until I can establish why, or if there is another underlying cause, I am reverting this change. This reverts commit `e5112b3ae3`. Change-Id: I315c5200414de89458015823344b7367e9dd75ba	2015-05-14 17:21:44 +01:00
Johann	cafae5b544	Merge "Relocate memory operations for common code"	2015-05-13 19:47:24 +00:00
Johann	1d7ccd5325	Relocate memory operations for common code With the sad functions, and hopefully the variance functions soon, moving to the vpx_dsp location, place the defines used in the reference C code in a common location. Change-Id: I4c8ce7778eb38a0a3ee674d2f1c488eda01cfeca	2015-05-13 11:41:15 -07:00
Yunqing Wang	f72af26305	Merge "Remove unneeded variable declaration"	2015-05-12 23:33:31 +00:00
Yaowu Xu	a8015e217e	Merge "Protect new metric computation with use_highbitdepth flag"	2015-05-12 23:20:35 +00:00
Yaowu Xu	3f42d10805	Protect new metric computation with use_highbitdepth flag The computation of new metrics is not supported yet in highbitdepth mode. This commit adds protection to make sure the computation is done only when highbitdepth is not on. This protection shall be revised when support of highbitdepth computation is added. This resolves the encoder crash when configured with both --enable-internal-stats --enable-vp9-highbitdepth Change-Id: Id9f4bcc4fa26d9ca0e9eabade83f3f88a5b212e6	2015-05-12 15:12:05 -07:00
Yunqing Wang	8ba2d2d5a0	Remove unneeded variable declaration This patch fixed the following warning: src\third_party\libvpx\source\libvpx\vp9\encoder\vp9_pickmode.c(1607) : warning C6246: Local declaration of 'this_mode' hides declaration of the same name in outer scope. Change-Id: I1d93c4a47a13cb13089fec5bd61e8b58e6cd8d58	2015-05-12 15:01:40 -07:00
Adrian Grange	65b768fdf9	Recompute tile params on frame resize When the frame size changes we must recompute details of the tile dimensions. Change-Id: Ie519bd6da47b5cd43933c0bcfc0f2429bcb01986	2015-05-11 15:45:26 -07:00
Marco	913862be8c	Fix rate control issue with layers and aq-mode=3. When aq-mode=3 is enabled, only for base layer frames should the qp of the frame incorporate the segment delta-qp. This was causing more rate mismatch for the enhancement layer frames when running temporal layers with aq-mode=3 on. Change-Id: I1c5e69d1ef8a51188af8696753c17fd8f67699b3	2015-05-11 10:04:18 -07:00
paulwilkins	e5112b3ae3	Skip the last frame update for some frame repeats. Where a frame appears to be a repeat of an earlier frame or frame buffer, but the first pass code does not anticipate this (usually because it is matching the GF or ARF buffer not the last frame buffer), do not update the last frame buffer. This helps ensure that the content of the last frame buffer is kept "different" where possible, and not updated to match the GF or ARF. This is particularly helpful in some animated sequences where there are groups of repeating frames. Here it has quite a big impact. However, in most of our standard test clips it has little or no impact. Change-Id: I77332ee1a69f9ffc0c6080bfeb811c43fd8828e6	2015-05-08 17:51:26 +01:00
James Zern	fd3658b0e4	replace DECLARE_ALIGNED_ARRAY w/DECLARE_ALIGNED this macro was used inconsistently and only differs in behavior from DECLARE_ALIGNED when an alignment attribute is unavailable. this macro is used with calls to assembly, while generic c-code doesn't rely on it, so in a c-only build without an alignment attribute the code will function as expected. Change-Id: Ie9d06d4028c0de17c63b3a27e6c1b0491cc4ea79	2015-05-07 11:55:08 -07:00
Johann	76a08210b6	Merge "Move shared SAD code to vpx_dsp"	2015-05-07 18:33:06 +00:00
Marco	97307af21a	Merge "Remove EIGHTTAP_SHARP filter check for non-rd mode."	2015-05-07 15:40:11 +00:00
paulwilkins	aecb1770d5	Merge "Image size restriction to rd auto partition search."	2015-05-07 14:12:14 +00:00
Marco	76fe5dfc67	Remove EIGHTTAP_SHARP filter check for non-rd mode. Using EIGHTTAP and EIGHTTAP_SMOOTH seems sufficient. Hard to see any visual gain from allowing EIGHTTAP_SHARP, and it is rarely selected. PSNR/SSIM metrics go up by ~0.18/0.14%. Change-Id: I96fa0d98f9321b913e3ebcd464d4ff3c63018791	2015-05-06 17:08:34 -07:00
Johann	d5d9289800	Move shared SAD code to vpx_dsp Create a new component, vpx_dsp, for code that can be shared between codecs. Move the SAD code into the component. This reduces the size of vpxenc/dec by 36k on x86_64 builds. Change-Id: I73f837ddaecac6b350bf757af0cfe19c4ab9327a	2015-05-06 16:58:20 -07:00
Yunqing Wang	36eabb1c3c	Add intra mode early termination in non-rd mode Added the intra mode early termination in order to speed up the mode search in non-rd case since we started to include more intra modes in the search list. Borg tests(rtc set) showed a 0.048% PSNR gain and 0.061 SSIM gain. No speed change. Change-Id: I6f255fe534dc50b736e6a66a726ad458eb9b4443	2015-05-05 16:31:36 -07:00
paulwilkins	af76953448	Merge "Remove CONSTRAIN_NEIGHBORING_MIN_MAX."	2015-05-05 09:32:11 +00:00
paulwilkins	4cd65e4f19	Merge "Adjust ARF min and max interval."	2015-05-05 09:31:38 +00:00
Marco	b9a72d3c4d	Allow for H and V intra modes for non-rd mode. For non-rd mode (speed >=5): use mask based on prediction block size, and (for non-screen content mode) allow for checking horiz and vert intra modes for block sizes < 16x16. Avg psnr/ssim metrics go up by about 0.2%. Only allowing H/V intra on block sizes below 16x16 for now, to keep encoding time increase very small, and also when allowing H/V on 16x16 blocks, metrics went down on a few clips which need to be further examined. Change-Id: I8ae0bc8cb2a964f9709612c76c5661acaab1381e	2015-05-04 09:48:41 -07:00
Yunqing Wang	d31256cd38	Merge "Reduce intra_cost_penalty for BLOCK_8X8"	2015-05-01 18:29:38 +00:00
Yunqing Wang	57fefd5f9a	Merge "Adjust the vbp early termination threshold slightly"	2015-05-01 18:29:25 +00:00
paulwilkins	4a7dcf8eb2	Image size restriction to rd auto partition search. Impose a limit on the rd auto partition search based on the image format. Smaller formats require that the search includes a smaller minimum block size. This change is intended to mitigate the visual impact of ringing in some problem clips, for smaller image formats. Change-Id: Ie039e5f599ee079bbef5d272f3e40e2e27d8f97b	2015-05-01 16:16:02 +01:00
paulwilkins	287b0c6da9	Remove CONSTRAIN_NEIGHBORING_MIN_MAX. Remove one of the auto partition size cases. This case can behave badly in some types of animated content and was only used for the rd encode path. A subsequent patch will add additional checks to help further improve visual quality. Change-Id: I0ebd8da3d45ab8501afa45d7959ced8c2d60ee4e	2015-05-01 15:15:16 +01:00
paulwilkins	e0786c280e	Adjust ARF min and max interval. Previously limit on max interval set to 0.5 seconds. Though this helped some low frame rate material it appears to be a bit too aggressive for some 24 and 25 fps content. This patch relaxes the limit to 0.75 seconds. The patch also adds a new minimum interval variable to replace the current hard wired value. This allows us to impose a limit on the maximum number of primary arfs per second for high frame rate (e.g. 50 & 60fps) content. This is to address concerns regarding playback performance on some platforms if there is a high base frame rate and very frequent arfs. Change-Id: I373e8b6b2a8ef522eced6c6d2cceb234ff763fcf	2015-05-01 15:11:49 +01:00
Yunqing Wang	4907c29904	Reduce intra_cost_penalty for BLOCK_8X8 This patch reduced the BLOCK_8X8's intra_cost_penalty, which allows 8x8 blocks to conduct intra mode search. Borg test result(rtc set): 0.077% PSNR gain, 0.228% SSIM gain. No speed changes. Change-Id: Icfe90c4f6969de24bda8ecacbd3da50330bf22b2	2015-04-30 11:03:06 -07:00
Yunqing Wang	fd90ce2711	Merge "Improve golden frame refreshing in non-rd mode"	2015-04-30 15:57:55 +00:00
Yunqing Wang	a257e469e1	Adjust the vbp early termination threshold slightly Calculated cpi->vbp_threshold_sad from this frame's dequant value. The encoding quality and speed didn't change much. Borg test result: PSNR: -0.002%, SSIM: -0.003%. Change-Id: I97c9826986f39582f29910d637d08a69c90afdee	2015-04-30 08:51:02 -07:00
Yunqing Wang	d31698b0e0	Improve golden frame refreshing in non-rd mode The default golden frame interval was doubled. After encoding a frame, the background motion was measured. If the motion was high, the current frame was set as the golden frame. Currently, the changes were applied only while aq-mode 3 was on. Borg tests(rtc set) showed a 0.226% PSNR gain and 0.312% SSIM gain. No speed changes. Change-Id: Id1e2793cc5be37e8a9bacec1380af6f36182f9b1	2015-04-29 16:43:43 -07:00
James Zern	f58011ada5	vpx_mem: remove vpx_memset vestigial. replace instances with memset() which they already were being defined to. Change-Id: Ie030cfaaa3e890dd92cf1a995fcb1927ba175201	2015-04-28 20:00:59 -07:00
James Zern	f274c2199b	vpx_mem: remove vpx_memcpy vestigial. replace instances with memcpy() which they already were being defined to. Change-Id: Icfd1b0bc5d95b70efab91b9ae777ace1e81d2d7c	2015-04-28 19:59:41 -07:00
James Zern	fbd3b89488	vpx_mem: remove vpx_memmove vestigial. replace instances with memmove() which they already were being defined to. Change-Id: If396d3f9e3cf79c0ee5d7429615ef3d6b2a34afa	2015-04-28 19:59:40 -07:00
Yaowu Xu	b3e411e481	Add validation of UV partition size For color sampling formats other than 420, a valid partition size in Y may not work for the UV plane. This commit adds validation of UV partition size before selecting the partition choice. This fixes a crash for real time encoding of 422 input. Change-Id: I1fe3282accfd58625e8b5e6a4c8d2c84199751b6	2015-04-24 12:34:18 -07:00
Jim Bankoski	a6e9ae9066	Adds worst frame metrics for a bunch of metrics. Change-Id: Ieaccc36ed1bee024bb644a9cfaafdaaa65d31772	2015-04-22 06:45:56 -07:00
paulwilkins	e07b141da0	Merge "Modified test for auto key frame detection."	2015-04-22 02:29:17 -07:00
paulwilkins	5d8877a944	Merge "Limit arf interval for low fpf clips."	2015-04-22 02:25:38 -07:00
Jim Bankoski	3b35e962e2	Merge "Adds a new temporal consistency metric to libvpx."	2015-04-21 16:11:11 -07:00
Scott LaVarnway	8b17f7f4eb	Revert "Remove mi_grid_* structures." (see I3a05cf1610679fed26e0b2eadd315a9ae91afdd6) For the test clip used, the decoder performance improved by ~2%. This is also an intermediate step towards adding back the mode_info streams. Change-Id: Idddc4a3f46e4180fbebddc156c4bbf177d5c2e0d	2015-04-21 11:16:45 -07:00
Jim Bankoski	ee87e20d53	Adds a new temporal consistency metric to libvpx. Change-Id: Id61699ebf57ae4f8af96a468740c852b2f45f8e1	2015-04-21 10:05:37 -07:00
paulwilkins	3606b78108	Modified test for auto key frame detection. The existing test was triggering a lot of false positives on some types of animated material with very plain backgrounds. These were triggering code designed to catch key frames in letter box format clips. This patch tightens up the criteria and imposes a minimum requirement on the % blocks coded intra in the first pass and the ratio between the % coded intra and the modified inter % after discounting neutral (flat) blocks that are coded equally well either way. On a particular problem animation clip this change eliminated a large number of false positives including some cases where the old code selected kf several times in a row. Marginal false negatives are less damaging typically to compression and in the problem clip there are now a couple of cases where "visual" scene cuts are ignored because of well correlated content across the scene cut. Replaced some magic numbers related to this with #defines and added explanatory comments. Change-Id: Ia3d304ac60eb7e4323e3817eaf83b4752cd63ecf	2015-04-21 12:50:11 +01:00
Yaowu Xu	b423a6b212	Resolve configuration conflict Between --enable-internal-stats and --enable-vp9-highbitdepth Change-Id: I36b741554e835033e69883270b6b0e5374a1aafa	2015-04-20 16:44:12 -07:00
Yaowu Xu	305492c375	Move declaration before statement Change-Id: Ib64786fcc0d6dc11c4e66f5b7f3e93b2a4fcb664	2015-04-20 09:50:59 -07:00
Jim Bankoski	03829f2fea	Merge "Adds a blockiness metric to internal stats."	2015-04-17 16:06:26 -07:00
Jim Bankoski	3d2f037a44	Merge "adds psnrhvs to internal stats."	2015-04-17 16:06:10 -07:00
Jim Bankoski	f2cbee9a04	Merge "Adds a fastssim metric to VPX internal stats."	2015-04-17 16:05:53 -07:00
Jim Bankoski	1777413a2a	Adds a blockiness metric to internal stats. Change-Id: Iedceeb020492050063acf3fd2326f96c29db9ae5	2015-04-17 11:13:18 -07:00
Jim Bankoski	9757c1aded	adds psnrhvs to internal stats. PSNR HVS is a human visual system weighted version of SNR that's gained some popularity from academia and apparently better matches MOS testing. This code is borrowed from the Daala Project but uses our FDCT code. Change-Id: Idd10fbc93129f7f4734946f6009f87d0f44cd2d7	2015-04-17 10:29:27 -07:00
Jim Bankoski	3f7f194304	Adds a fastssim metric to VPX internal stats. This code appeared in the Daala project first and was originally committed by Nathan Egge. Change-Id: Iadce416a091929c51b46637ebdec984cddcaf18c	2015-04-17 10:23:24 -07:00
Jingning Han	73bce9ec7e	Merge "Remove unnecessary backup token stream pointer"	2015-04-17 09:13:53 -07:00
Marco Paniconi	f76ccce5bc	Revert "Revert "Force_split on 16x16 blocks in variance partition."" This reverts commit `004b9d83e3` Change-Id: I2f2d0bdb9368c2c07f1d29a69cd461267a3a8743	2015-04-16 17:52:13 -07:00
Jingning Han	645c70f852	Remove unnecessary backup token stream pointer When the tokenization is not taking effect, the tokenization pointer remains unchanged. No need to re-assign the backup pointer value. Change-Id: I58fe1f6285aa3b4a88ceb864c11d5de8ac6235dd	2015-04-16 16:44:44 -07:00
Minghai Shang	29b5cf6a9d	Merge "[svc] Fix syntax error when encoding multiple tiles."	2015-04-16 13:43:44 -07:00
Minghai Shang	4aa9255efa	[svc] Fix syntax error when encoding multiple tiles. Change-Id: Ia77b551415f3b3386e22a6c805f244f2d13fe3e3	2015-04-16 12:56:30 -07:00
paulwilkins	effd974b16	Limit arf interval for low fpf clips. This patch limits the maximum arf interval length to approximately half a second. In some low fps animations in particular the existing code was selecting an overly long interval which was hurting visual quality. For a sample problem test clip (360P animation , 15fps, ~200Kbit/s) this change also improved metrics by >0.5 db. There may be some clips where this hurts metrics a little, but the worst case impact visually is likely to be less than having an interval that is much too long. On more normal material at 24 fps or higher, the impact is likely to be nil/minimal. Change-Id: Id8b57413931a670c861213ea91d7cc596375a297	2015-04-16 11:50:37 +01:00
Yunqing Wang	14e7203e7b	Merge "Fix Tsan errors"	2015-04-15 15:34:03 -07:00
Yunqing Wang	63c5bf2b9c	Fix Tsan errors This patch fixed 2 reported Tsan errors while running VP9 real-time encoder. Change-Id: Ib0278fe802852862c3ce87c4a500e544d7089f67	2015-04-15 12:33:39 -07:00
Johann	14ef4aeafb	Reorganize *_rtcd() calling conventions Change-Id: Ib1e17d8aae9b713b87f560ab5e49952ee2bfdcc2	2015-04-15 11:12:05 -04:00
Yunqing Wang	004b9d83e3	Revert "Force_split on 16x16 blocks in variance partition." This reverts commit `eb8c667570`. The patch caused mismatch while using multi-threads. Change-Id: Icd646340af25b5d91e32f03ed3ea212e00e3e0be	2015-04-14 15:19:31 -07:00
Marco	eb8c667570	Force_split on 16x16 blocks in variance partition. Force split on 16x16 block (to 8x8) based on the minmax over the 8x8 sub-blocks. Also increase variance threshold for 32x32, and add exit condition in choose_partition (with very safe threshold) based on sad used to select reference frame. Some visual improvement near moving boundaries. Average gain in psnr/ssim: ~0.6%, some clips go up ~1 or 2%. Encoding time increase (due to more 8x8 blocks) from ~1-4%, depending on clip. Change-Id: I4759bb181251ac41517cd45e326ce2997dadb577	2015-04-13 12:05:07 -07:00
Jingning Han	2404332c1b	Merge "Remove get_nonrd_var_based_fixed_partition function"	2015-04-09 14:45:19 -07:00
Jingning Han	4565812032	Merge "Compute prediction filter type cost only when needed"	2015-04-09 14:45:11 -07:00
Jingning Han	93d9c50419	Merge "SSSE3 assembly implementation of 8x8 Hadamard transform"	2015-04-09 11:16:11 -07:00
Jingning Han	208aa6158b	Remove get_nonrd_var_based_fixed_partition function This function has been replaced by other approaches and is not in use now. Change-Id: I387f45b5607d202539e482468ccc70e6c0f9341f	2015-04-09 09:49:55 -07:00
Debargha Mukherjee	59681be0a0	Merge "Improve accuracy of rate control in CQ mode"	2015-04-08 10:48:17 -07:00
James Zern	2ed0cf06f9	Merge "vp9_full_search_sadx[38]: align sad arrays"	2015-04-07 20:57:21 -07:00
Yaowu Xu	c88ce84bb5	Merge "Optimize the checking for transform skipping"	2015-04-07 16:29:51 -07:00
Yaowu Xu	90517b5e85	Merge "move ref_frame_cost computations into a function"	2015-04-07 16:29:45 -07:00
Debargha Mukherjee	60bd744c88	Improve accuracy of rate control in CQ mode Modifies a special handling that improves rate control accuracy in the constrained quality mode, when the undershoot and overshoot limits are set tighter. Change-Id: If62103f0ef3ed1cac92807400678c93da50cf046	2015-04-07 16:29:21 -07:00
James Zern	e1ff83f4b0	vp9_full_search_sadx[38]: align sad arrays the sse4 code expects 16-byte aligned arrays; vp8 already had a similar change applied: `b2aa401` Align SAD output array to be 16-byte aligned Change-Id: I5e902035e5a87e23309e151113f3c0d4a8372226	2015-04-07 14:34:06 -07:00
Jingning Han	927693a991	Merge "Enable Hadamard transform based cost estimate for all block sizes"	2015-04-07 12:51:27 -07:00
Jingning Han	6de407b638	Merge "Account for eob cost in the RTC mode decision process"	2015-04-07 12:50:30 -07:00
Jingning Han	25206e7b7f	Compute prediction filter type cost only when needed Skip redundant prediction filter type cost in filter search loop, if the rate value will be reset in Hadamard transform based rate distortion estimate. Change-Id: Ie5221f4bc8da9461c449df367251aeeac52c6e5d	2015-04-07 12:41:46 -07:00
Yaowu Xu	0bb897211d	Optimize the checking for transform skipping If U is not skippable, then do not perform the check on V. Change-Id: Iba5e8362bd42390197f373c44388a426a4404549	2015-04-06 17:54:05 -07:00
Jingning Han	7f629dfca4	SSSE3 assembly implementation of 8x8 Hadamard transform It uses about 10% less CPU cycles than the SSE2 intrinsic implementation. Change-Id: I91017c0c068679a214b98cdd4cff3a6facfb7499	2015-04-04 09:59:37 -07:00
Jingning Han	9922e4344a	Enable Hadamard transform based cost estimate for all block sizes This commit turns on the Hadamard transform based rate distortion estimate for all block sizes in RTC coding mode. It conditionally skips the rate distortion estimation if all zero block flag is set on. No significant encoding speed change is observed. The compression performance of speed -6 is improved by 1.7% over using it only for block sizes of 32x32 and below. Change-Id: I768145e6f05c737b05b5b5f1ee674e929532cafb	2015-04-04 09:58:45 -07:00
Yunqing Wang	b2baaa215b	Merge "Fix the scaling factor in UV skipping test"	2015-04-03 17:09:59 -07:00
Yunqing Wang	1a1114d21c	Fix the scaling factor in UV skipping test The threshold scaling factor was calculated wrong using partition size "bsize". Thank Yaowu for pointing it out. It was fixed and no speed change was seen. Change-Id: If7a5564456f0f68d6957df3bd2d1876bbb8dfd27	2015-04-03 16:07:43 -07:00
Jingning Han	30e9c091c0	Merge "Tune SSSE3 assembly implementation to improve quantization speed"	2015-04-03 11:24:28 -07:00
Jingning Han	60e01c6530	Account for eob cost in the RTC mode decision process This commit accounts for the transform block end of coefficient flag cost in the RTC mode decision process. This allows a more precise rate estimate. It also turns on the model to block sizes up to 32x32. The test sequences show about 3% - 5% speed penalty for speed -6. The average compression performance improvement for speed -6 is 1.58% in PSNR. The compression gains for hard clips like jimredvga, mmmoving, and tacomascmv at low bit-rate range are 1.8%, 2.1%, and 3.2%, respectively. Change-Id: Ic2ae211888e25a93979eac56b274c6e5ebcc21fb	2015-04-03 10:31:51 -07:00
Yunqing Wang	12cb30d4bd	Merge "Set vbp thresholds for aq3 boosted blocks"	2015-04-02 18:22:08 -07:00
Yaowu Xu	718feb0f69	move ref_frame_cost computations into a function Change-Id: Iebf2ad2b1db7e2874788fda8d55e67f4cb1149f1	2015-04-02 18:10:55 -07:00
Marco	f85f79f630	Merge "Code cleanup: put (8x8/4x4)fill_variance into separate function."	2015-04-02 17:33:01 -07:00
Yunqing Wang	cae03a7ef5	Set vbp thresholds for aq3 boosted blocks The vbp thresholds are set separately for boosted/non-boosted superblocks according to their segment_id. This way we don't have to force the boosted blocks to split to 32x32. Speed 6 RTC set borg test result showed some quality gains. Overall PSNR: +0.199%; Avg PSNR: +0.245%; SSIM: +0.802%. No speed change was observed. Change-Id: I37c6643a3e2da59c4b7dc10ebe05abc8abf4026a	2015-04-02 15:48:32 -07:00
Marco	77ea408983	Code cleanup: put (8x8/4x4)fill_variance into separate function. Code cleanup, no change in behavior. Change-Id: I043b889f8f0b3afb49de0da00873bc3499ebda24	2015-04-02 13:37:35 -07:00
Marco	6eb05c9ed0	Small fix to segment check in pickmode. Change-Id: Id5fd82a504def2523292466fbaad5dade9424c72	2015-04-02 09:55:13 -07:00
Jingning Han	2149f214d5	Merge "Reduce required xmm number by one in block_error_fp"	2015-04-01 15:46:22 -07:00
Jingning Han	657cabe0f7	Tune SSSE3 assembly implementation to improve quantization speed Change-Id: If0ca8b25b4800d4336e6cbc97194cd9b01c5b5a3	2015-04-01 15:28:01 -07:00
Yaowu Xu	fff4654d36	Merge "Simplify bsize calculation"	2015-04-01 15:06:55 -07:00
Jingning Han	cf4447339e	Merge "Optimize quantization simd implementation"	2015-04-01 14:55:18 -07:00
Jingning Han	a4364e5146	Merge "Simplify effective src_diff address computation"	2015-04-01 14:55:03 -07:00
Jingning Han	7acb2a8795	Merge "Refactor block_yrd function for RTC coding mode"	2015-04-01 14:54:24 -07:00
Yaowu Xu	ba91b54d7c	Simplify bsize calculation Change-Id: Ibc514684def9914c66f04cb7931f773e2b79c168	2015-04-01 12:15:06 -07:00
Jingning Han	19da916716	Simplify effective src_diff address computation Remove redundant offset calculation for effective src_diff address. Change-Id: I4aab241a36abcef7fd8adf74aed5e12b8b88e0ef	2015-04-01 12:07:47 -07:00
Jingning Han	f2cf3c06a0	Reduce required xmm number by one in block_error_fp Use 6 xmms instead of 8. Change-Id: If976ad85d09191d2fb0565399d690f2869dbbcc7	2015-04-01 12:07:35 -07:00
Jingning Han	1470529f62	Refactor block_yrd function for RTC coding mode This commit separates Hadamard transform/quantization operations from rate and distortion computation in block_yrd. This allows one to skip SATD computation when all transform blocks are quantized to zero. It also uses a new block error function that skips repeated computation of sum of squared residuals. It reduces the CPU cycles spent on block error calculation in block_yrd by 40%. Change-Id: I726acb2454b44af1c3bd95385abecac209959b10	2015-04-01 12:00:43 -07:00
Jingning Han	eed1badedd	Optimize quantization simd implementation This commit allows the quantizer to compare the AC coefficients to the quantization step size to determine if further multiplication operations are needed. It makes the quantization process 20% faster without coding statistics change. Change-Id: I735aaf6a9c0874c82175bb565b20e131464db64a	2015-04-01 11:47:09 -07:00
Yunqing Wang	a0043c6d30	Enhance the transform skipping decision-making in non-rd mode For large partition blocks(block_size > 32x32), the variance calculation is modified so that every 8x8 block's variance is stored during the calculation, which is used in the following transform skipping test. Also, the variance for every tx block is calculated. The skipping test checks all tx blocks in the partition, and sets the skip flag only if all tx blocks are skippable. If the skip flag of Y plane is 1, a quick evaluation is done on UV planes. If the current partition block is skippable in YUV planes, the mode search checks fewer inter modes and doesn't check intra modes. The rtc set borg test(at speed 6) showed that: Overall psnr: -0.527%; Avg psnr: -0.510%; ssim: -0.573%. Average single-thread speedup on rtc set was 3.5%. For 720p clips, more speedups were seen. gipsrecmotion: 13% gipsrestat: 12% vidyo: 5 - 9% dark: 15% niklas: 6% Change-Id: I8d8ebec0cb305f1de016516400bf007c3042666e	2015-04-01 09:43:40 -07:00
Yunqing Wang	fc98114761	Merge "Rename vbp thresholds"	2015-03-31 16:33:30 -07:00
Yunqing Wang	c28ff1a9de	Rename vbp thresholds Code refactoring Change-Id: I410fcce1bc6d95c62c474445f4c97ea8469f1e79	2015-03-31 15:14:44 -07:00
Jingning Han	502ac72233	Merge "Tuning SATD rate calculation for speed"	2015-03-31 14:24:26 -07:00
Jingning Han	1c39c5b96f	Merge "Use aligned copy in 8x8 Hadamard transform SSE2"	2015-03-31 12:16:47 -07:00
Jingning Han	fa4289522e	Merge "Allow block skip coding option in RTC mode"	2015-03-31 12:16:36 -07:00
Jingning Han	1638d7dc96	Merge "Fix 8x8 Hadamard SSE2 implementation"	2015-03-31 12:16:27 -07:00
Alex Converse	9670d766ab	Merge "VP9E_GET_ACTIVE_MAP API function."	2015-03-31 11:52:56 -07:00
Jingning Han	531468a07a	Tuning SATD rate calculation for speed This commit allows the encoder to check the eob per transform block to decide how to compute the SATD rate cost. If the entire block is quantized to zero, there is no need to add anything; if only the DC coefficient is non-zero, add its absolute value; otherwise, sum over the block. This reduces the CPU cycles spent on vp9_satd_sse2 to one third. Change-Id: I0d56044b793b286efc0875fafc0b8bf2d2047e32	2015-03-31 11:02:20 -07:00
hui su	d4f2f1dd5b	Merge "Move vp9_coef_con_tree to common/"	2015-03-31 10:51:10 -07:00
Jingning Han	014fa45298	Use aligned copy in 8x8 Hadamard transform SSE2 This reduces the 8x8 Hadamard transform cycles by 20%. Change-Id: If34c5e02f3afa42244c6efabe121f7cf5d2df41b	2015-03-31 10:21:52 -07:00
Jingning Han	db5ec37edc	Merge "Enable 16x16 Hadamard transform in SATD based mode decision"	2015-03-31 09:55:41 -07:00
Jingning Han	8c5670bb6f	Merge "Use SATD based mode decision for block sizes below 16x16"	2015-03-31 09:47:47 -07:00
Jingning Han	ebe1be9186	Allow block skip coding option in RTC mode When the estimated rate-distortion cost of skip coding mode is lower than that of sending quantized coefficients, allow the encoder to drop these coefficients. This improves the compression performance of speed -6 by 0.268% and makes the encoding speed slightly faster. Change-Id: Idff2d7ba59f27ead33dd5a0e9f68746ed3c2ab68	2015-03-31 09:32:53 -07:00
hui su	302e24cb3e	Move vp9_coef_con_tree to common/ This tree should be defined in common/, as it is needed for both encoder and decoder. Change-Id: I4f5cbc80025cf2ced14182c98f7c82dc7d0f87db	2015-03-31 09:20:46 -07:00
Jingning Han	9b99eb2e12	Merge "Reuse inter prediction pixel block for Hadamard transform"	2015-03-30 16:09:38 -07:00
Jingning Han	34a996ac1e	Fix 8x8 Hadamard SSE2 implementation This commit fixes the SSE2 version 8x8 Hadamard transform alignment and makes it consistent with the C version. Change-Id: I1304e5f97e0e5ef2d798fe38081609c39f5bfe74	2015-03-30 15:54:08 -07:00
Jingning Han	26d3d3af6a	Enable 16x16 Hadamard transform in SATD based mode decision This commit replaces the 16x16 2D-DCT transform with Hadamard transform for RTC coding mode. It reduces the CPU cycles cost on 16x16 transform by 5X. Overall it makes the speed -6 encoding speed 1.5% faster without compromise on compression performance. Change-Id: If6c993831dc4c678d841edc804ff395ed37f2a1b	2015-03-30 15:43:31 -07:00
Jingning Han	f0ac5aaa08	Merge "Hadamard transform based coding mode decision process"	2015-03-30 15:43:15 -07:00
Jingning Han	b4b5af6acd	Use SATD based mode decision for block sizes below 16x16 This commit makes the encoder to select between SATD/variance as metric for mode decision. It also allows to account chroma component costs for mode decision as well. The overall encoding time increase as compared to variance based mode selection is about 15% for speed -6. The compression performance is on average 2.2% better than variance based approach, with about 5% compression performance gains for hard clips (e.g., jimredvga, nikas720p, and mmmoving) at lower bit-rate range. Change-Id: I4d04a31d36f4fcb3f5f491dacd6e7fe44cb9d815	2015-03-30 15:20:07 -07:00
Jingning Han	8a927a1b7a	Reuse inter prediction pixel block for Hadamard transform It saves one unnecessary motion compensated prediction constructed by using 8-tap filter. Change-Id: I101215131e6f38621d5935885f94cc74de6a5377	2015-03-30 15:04:33 -07:00
Jingning Han	8c411f74e0	Hadamard transform based coding mode decision process This commit uses Hadamard transform based rate-distortion cost estimate for rtc coding mode decision. It improves the compression performance of speed -6 for many hard clips at lower bit-rates. For example, 5.5% for jimredvga, 6.7% for mmmoving, 6.1% for niklas720p. This will introduce extra encoding cycle costs at this point. Change-Id: Iaf70634fa2417a705ee29f2456175b981db3d375	2015-03-30 14:46:05 -07:00
Alex Converse	bf7def9a43	Merge "Simplify skip check."	2015-03-30 11:31:45 -07:00
Marco	fa20a60f0d	Speed 5: use non-rd mode for key frame coding. Metrics on RTC set go down by ~1.5% on average. Key frame encoding time goes down by factor of ~5. Change-Id: Ia83acc55848613870e5ac6efe7f3d904d877febb	2015-03-27 16:19:26 -07:00
Adrian Grange	ad18b2b641	Remove 8-bit array in HBD Creating both 8- and 16-bit arrays and then only using one of them is wasteful. Change-Id: Ic5b397c283efaff7bcfff2d2413838ba3e065561	2015-03-25 15:37:03 -07:00
Adrian Grange	65df3d138a	Replace heap with stack memory allocation Replaced the dynamic memory allocation of the second_pred buffer with an allocation on the stack. Change-Id: I2716c46b71e8587714ca5733a99eca2c68419b23	2015-03-25 15:36:43 -07:00
Adrian Grange	8d8d7bfde5	Fix use of scaling in joint motion search To enable us to use the scale-invariant motion estimation code during mode selection, each of the reference buffers is scaled to match the size of the frame being encoded. This fix ensures that a unit scaling factor is used in this case rather than the one calculated assuming that the reference frame is not scaled. Change-Id: Id9a5c85dad402f3a7cc7ea9f30f204edad080ebf	2015-03-25 15:35:29 -07:00
paulwilkins	ab788c5380	Merge "Enable group adaptive max q by default."	2015-03-24 15:00:12 -07:00
Alex Converse	4dcb839607	VP9E_GET_ACTIVE_MAP API function. This is useful when aq mode 3 (cyclic refresh) reactivates segments for refresh. Change-Id: I3ad1d9410b899ede393d82bb8db14e2da4d84eca	2015-03-24 11:19:47 -07:00
Yaowu Xu	c77d4dcb35	Merge "vp9_pred_mv(): misc fixes and optimizations"	2015-03-24 10:36:51 -07:00
Alex Converse	02697e35dc	Merge "A tiny cyclic refresh / active map fix."	2015-03-24 09:43:24 -07:00
paulwilkins	8ea7bafdaa	Merge "Revised rd adjustment for variance."	2015-03-24 03:12:56 -07:00
paulwilkins	c0b71cf82f	Merge "Experimental rd bias based on source vs recon variance."	2015-03-24 03:12:41 -07:00
Alex Converse	31f1563a92	A tiny cyclic refresh / active map fix. Change-Id: I198727461455c8c198a0c892d02ed3cb1673aa50	2015-03-23 18:51:00 -07:00
hkuang	cd1d40ff5d	Merge "Safely free all the frame buffers after all the workers finish the work."	2015-03-23 16:50:15 -07:00
Alex Converse	b7605a9d70	Simplify skip check. SEG_LVL_SKIP implies skip. This is enforced by skip = write_skip(). Change-Id: I61c79581c9c53deae36685c2bcf388cb4d8827d3	2015-03-23 10:53:31 -07:00
paulwilkins	691ec45b4e	Enable group adaptive max q by default. Set the GF group adaptive max Q compile flag to 1 by default. This change has a quite big visual impact in some clips and also contributes to tighter rate control. For short test clips that have consistent content the impact is quite small on metrics but for more varied long form clips there is a drop in overall psnr but a sharp rise in average psnr caused by greater expenditure on some easier sections and tighter rate clipping in hard sections. In chunked encodes some of the effect will already be present due to the independent rate control in each chunk but this change takes the control down to a smaller scale. yt hd +10.67%, - 3.77%, -1.56% yt +9.654%, - 3.6%, - 1.82% std hd +0.25%, -0.85%, -0.42% derf +0.25%, - 1.1%. - 0.87% Change-Id: Ibbc39b800d99d053939f4c6712d715124082843e	2015-03-23 15:57:09 +00:00
Yaowu Xu	9fd8abc541	vp9_pred_mv(): misc fixes and optimizations 1. skip near if it is same as nearest 2. correct rounding for converting mv to fullpel position 3. update pred_mv_sad after new mv search. Overall .1%~.25% compression gains on rtc set for speed 5, 6, 7, 8. Change-Id: Ic300ca53f7da18073771f1bb993c58cde9deee89	2015-03-20 17:17:04 -07:00
Alex Converse	6d6ef8eb3c	Don't apply active map on key frames. This allows applications to be KF oblivious. Change-Id: Ic02712eae6ad8d6b3eaec26548299d24ca0d5cc0	2015-03-20 14:57:24 -07:00
Alex Converse	e032fc7b9e	Set loop filter level to zero on inactive segment. Change-Id: I6022a79351882a72a219aee13563bf21bcd70383	2015-03-20 14:43:06 -07:00
paulwilkins	7e234b9228	Revised rd adjustment for variance. Revised adjustment for rd based on source complexity. Two cases: 1) Bias against low variance intra predictors when the actual source variance is higher. 2) When the source variance is very low to give a slight bias against predictors that might introduce false texture or features. The impact on metrics of this change across the test sets is small and mixed. derf -0.073%, -0.049%, -0.291% std hd -0.093%, -0.1%, -0.557% yt +0.186%, +0.04%, - 0.074% ythd +0.625%, + 0.563%, +0.584% Medium to strong psycho-visual improvements in some problem clips. This feature and intra weight on GF group length now turned on by default. Change-Id: Idefc8b633a7b7bc56c42dbe19f6b2f872d73851e	2015-03-20 11:59:39 +00:00
paulwilkins	9a1ce7be7d	Experimental rd bias based on source vs recon variance. This experiment biases the rd decision based on the impact a mode decision has on the relative spatial complexity of the reconstruction vs the source. The aim is to better retain a semblance of texture even if it is slightly misaligned / wrong, rather than use a simple rd measure that tends to favor use of a flat predictor if a perfect match can't be found. This improves the appearance of texture and visual quality on specific test clips but is hidden under a flag and currently off by default pending visual quality testing on a wider Yt set. Change-Id: Idf6e754a8949bf39ed9d314c6f2daaa20c888aad	2015-03-20 11:57:36 +00:00
Adrian Grange	12d946df89	Restore first ref frame pointer to the correct value The joint_motion_search function alternates prediction between two reference frames. In order to reuse existing code, a pointer to the appropriate reference frame is written into xd->plane[0].pre[0], that the motion estimation code assumes points to the reference frame. If this first reference frame was scaled then the pointer was incorrectly being reset to point to the unscaled reference frame rather than the scaled version. Change-Id: I76f73a8d8f4f15c1f3a5e7e08a35140cdb7886ab	2015-03-19 16:17:31 -07:00
Adrian Grange	53c9ebe609	Move joint_motion_search & delete function prototype Change-Id: I7fb3a78ed0e0bc940d8b4a57c470302f8369782f	2015-03-19 14:28:52 -07:00
hkuang	b88dac8938	Safely free all the frame buffers after all the workers finish the work. Issue: 978 Change-Id: Ia7aa809095008f6819a44d7ecb0329def79b1117	2015-03-19 12:21:00 -07:00
Jingning Han	067fc49996	Merge "Speed up non-rd mode decision search"	2015-03-19 09:18:10 -07:00
Jingning Han	411bbce470	Merge "Fix an ioc warning in vp9_pick_inter_mode"	2015-03-19 09:17:25 -07:00
Marco	fc2da4c5ba	Merge "Adjustments to aq-mode=3."	2015-03-19 09:01:17 -07:00
James Zern	6f23d40582	Merge "vp9_resize_plane: quiet some static analysis warnings"	2015-03-18 19:39:48 -07:00
James Zern	c664f16182	Merge changes Ie5a24275,Ib72946a8,I532b882b * changes: vp9_fdct8x8_quant_ssse3: quiet a static analysis warning vp9_fdct8x8_quant_sse2: quiet a static analysis warning vp9_mv_pred: quiet a static analysis warning	2015-03-18 19:38:49 -07:00
Alex Converse	748843712f	Merge "Fix external resize memory issues."	2015-03-18 16:04:30 -07:00
James Zern	c4367b9b51	vp9_resize_plane: quiet some static analysis warnings document resolution assumptions with a few asserts Change-Id: Ia4ab738fd3e0a1ba0ed30a57facd2658c2c1fd60	2015-03-18 14:34:30 -07:00
James Zern	388add965f	vp9_fdct8x8_quant_ssse3: quiet a static analysis warning add an assert to validate 'in' array size Change-Id: Ie5a24275c066d9dd59714f6104510abbd4850dc5	2015-03-18 14:33:43 -07:00
James Zern	198b039e2a	vp9_fdct8x8_quant_sse2: quiet a static analysis warning add an assert to validate 'in' array size Change-Id: Ib72946a86f34e1ce8a69954e8e3e4fe1a0f18a91	2015-03-18 14:33:04 -07:00
James Zern	428369293d	vp9_mv_pred: quiet a static analysis warning add an assert to validate pred_mv array size Change-Id: I532b882b71e2baff3ac76e07ed133ec5a11bd0fc	2015-03-18 14:31:58 -07:00
Marco	71e6ed7bd1	Adjustments to aq-mode=3. Factor in segment#2 and skip blocks into the postencode estimated bits, and increase somewhat the aggressiveness of the refresh. PSNR/SSIM Metrics on RTC set go up by ~0.8/0.5%. Change-Id: I5d4e7cb00a3aefb25d18c88b6b24118b72dc5d51	2015-03-18 12:06:16 -07:00
Jingning Han	83cbe22623	Speed up non-rd mode decision search This commit makes the encoder to explicitly calculate the SAD associated with the LAST_FRAME motion vector and compare it to that of the GOLDEN_FRAME given by integral projection motion estimation. It skips the expensive sub-pixel motion search over GOLDEN_FRAME when the LAST_FRAME can provide fairly good motion compensated prediction quality. For dark720p speed -6 single thread goes from 33304 b/f, 40.070 dB, 18156 ms -> 33319 b/f, 40.061 dB, 17611 ms Change-Id: I01bc94b9b598075567a392111046b97a9bc30efe	2015-03-18 12:04:58 -07:00
Adrian Grange	83288c7af8	Order header files alphabetically Change-Id: I3e275544bff478849c1b5f3dcd5de950ee330d14	2015-03-18 11:18:08 -07:00
Jingning Han	4640a0c480	Merge "Fix the C version of column vector projection"	2015-03-17 22:53:49 -07:00
Jingning Han	c932584f0f	Fix the C version of column vector projection Make the C and SSE2 versions consistent. Change-Id: I03c405d22a36bd1a97480efb96dc5af230667424	2015-03-17 18:50:53 -07:00
Marco	e52109158a	Update to variance partition. Use force_split to constrain the partition selection. This is used because in the top-down approach to variance partition, a block size may be selected even though one of its subblocks may have high variance. In this patch the selection of the 64x64 block size will only be allowed if the variance of all the 32x32 subblocks are also below the threshold. Still testing, but some visual improvement for areas near slow moving boundary can be seen. Metrics for RTC set increase by about ~0.5%. Change-Id: Iab3e7b19bf70f534236f7a43fd873895a2bb261d	2015-03-17 17:02:47 -07:00
Yunqing Wang	45e8e4a01f	Merge "Refactor set vbp thresholds function"	2015-03-17 16:05:53 -07:00
Yunqing Wang	c0423abf00	Refactor set vbp thresholds function Code refactoring. Change-Id: I73b6fcc0444155ee46c1efa5253c1d608c6439cb	2015-03-17 12:23:32 -07:00
Adrian Grange	ed6824e449	Remove unused ZBIN_BOOST macros Change-Id: I5169155b20ea3676a6ce58ec77d6aeba07db29d9	2015-03-17 11:53:58 -07:00
Jingning Han	ee41141466	Fix an ioc warning in vp9_pick_inter_mode Shut off all the metric checks for golden reference frame, if we decide that it is unlikely to be selected for reference. Change-Id: Ie457cc1fd43935584403b4982659aed80fb9909c	2015-03-17 10:13:44 -07:00
Yaowu Xu	de3097aa23	Merge "Remove duplicate clamping"	2015-03-16 16:56:10 -07:00
Jingning Han	adaffcc010	Merge "Remove ineffective newmv skip checking from vp9_pick_inter_mode"	2015-03-16 16:43:43 -07:00

5518 Commits