generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	26d3d3af6a	Enable 16x16 Hadamard transform in SATD based mode decision This commit replaces the 16x16 2D-DCT transform with Hadamard transform for RTC coding mode. It reduces the CPU cycles cost on 16x16 transform by 5X. Overall it makes the speed -6 encoding speed 1.5% faster without compromise on compression performance. Change-Id: If6c993831dc4c678d841edc804ff395ed37f2a1b	2015-03-30 15:43:31 -07:00
Jingning Han	f0ac5aaa08	Merge "Hadamard transform based coding mode decision process"	2015-03-30 15:43:15 -07:00
Jingning Han	b4b5af6acd	Use SATD based mode decision for block sizes below 16x16 This commit makes the encoder to select between SATD/variance as metric for mode decision. It also allows to account chroma component costs for mode decision as well. The overall encoding time increase as compared to variance based mode selection is about 15% for speed -6. The compression performance is on average 2.2% better than variance based approach, with about 5% compression performance gains for hard clips (e.g., jimredvga, nikas720p, and mmmoving) at lower bit-rate range. Change-Id: I4d04a31d36f4fcb3f5f491dacd6e7fe44cb9d815	2015-03-30 15:20:07 -07:00
Jingning Han	8a927a1b7a	Reuse inter prediction pixel block for Hadamard transform It saves one unnecessary motion compensated prediction constructed by using 8-tap filter. Change-Id: I101215131e6f38621d5935885f94cc74de6a5377	2015-03-30 15:04:33 -07:00
Jingning Han	8c411f74e0	Hadamard transform based coding mode decision process This commit uses Hadamard transform based rate-distortion cost estimate for rtc coding mode decision. It improves the compression performance of speed -6 for many hard clips at lower bit-rates. For example, 5.5% for jimredvga, 6.7% for mmmoving, 6.1% for niklas720p. This will introduce extra encoding cycle costs at this point. Change-Id: Iaf70634fa2417a705ee29f2456175b981db3d375	2015-03-30 14:46:05 -07:00
Alex Converse	bf7def9a43	Merge "Simplify skip check."	2015-03-30 11:31:45 -07:00
Marco	fa20a60f0d	Speed 5: use non-rd mode for key frame coding. Metrics on RTC set go down by ~1.5% on average. Key frame encoding time goes down by factor of ~5. Change-Id: Ia83acc55848613870e5ac6efe7f3d904d877febb	2015-03-27 16:19:26 -07:00
Adrian Grange	ad18b2b641	Remove 8-bit array in HBD Creating both 8- and 16-bit arrays and then only using one of them is wasteful. Change-Id: Ic5b397c283efaff7bcfff2d2413838ba3e065561	2015-03-25 15:37:03 -07:00
Adrian Grange	65df3d138a	Replace heap with stack memory allocation Replaced the dynamic memory allocation of the second_pred buffer with an allocation on the stack. Change-Id: I2716c46b71e8587714ca5733a99eca2c68419b23	2015-03-25 15:36:43 -07:00
Adrian Grange	8d8d7bfde5	Fix use of scaling in joint motion search To enable us to the scale-invariant motion estimation code during mode selection, each of the reference buffers is scaled to match the size of the frame being encoded. This fix ensures that a unit scaling factor is used in this case rather than the one calculated assuming that the reference frame is not scaled. Change-Id: Id9a5c85dad402f3a7cc7ea9f30f204edad080ebf	2015-03-25 15:35:29 -07:00
paulwilkins	ab788c5380	Merge "Enable group adaptive max q by default."	2015-03-24 15:00:12 -07:00
Alex Converse	4dcb839607	VP9E_GET_ACTIVE_MAP API function. This is useful when aq mode 3 (cyclic refresh) reactivates segments for refresh. Change-Id: I3ad1d9410b899ede393d82bb8db14e2da4d84eca	2015-03-24 11:19:47 -07:00
Yaowu Xu	c77d4dcb35	Merge "vp9_pred_mv(): misc fixes and optimizations"	2015-03-24 10:36:51 -07:00
Alex Converse	02697e35dc	Merge "A tiny cyclic refresh / active map fix."	2015-03-24 09:43:24 -07:00
paulwilkins	8ea7bafdaa	Merge "Revised rd adjustment for variance."	2015-03-24 03:12:56 -07:00
paulwilkins	c0b71cf82f	Merge "Experimental rd bias based on source vs recon variance."	2015-03-24 03:12:41 -07:00
Alex Converse	31f1563a92	A tiny cyclic refresh / active map fix. Change-Id: I198727461455c8c198a0c892d02ed3cb1673aa50	2015-03-23 18:51:00 -07:00
hkuang	cd1d40ff5d	Merge "Safely free all the frame buffers after all the workers finish the work."	2015-03-23 16:50:15 -07:00
Alex Converse	b7605a9d70	Simplify skip check. SEG_LVL_SKIP implies skip. This is enforced by skip = write_skip(). Change-Id: I61c79581c9c53deae36685c2bcf388cb4d8827d3	2015-03-23 10:53:31 -07:00
paulwilkins	691ec45b4e	Enable group adaptive max q by default. Set the GF group adaptive max Q compile flag to 1 by default. This change has a quite big visual impact in some clips and also contributes to tighter rate control. For short test clips that have consistent content the impact is quite small on metrics but for more varied long form clips there is a drop in overal psnr but a sharp rise in average psnr caused by greater expenditure on some easier sections and tighter rate clipping in hard sections. In chunck'ed encodes some of the effect will already be present due to the independent rate control in each chunk but this change takes the control down to a smaller scale. yt hd +10.67%, - 3.77%, -1.56% yt +9.654%, - 3.6%, - 1.82% std hd +0.25%, -0.85%, -0.42% derf +0.25%, - 1.1%. - 0.87% Change-Id: Ibbc39b800d99d053939f4c6712d715124082843e	2015-03-23 15:57:09 +00:00
Yaowu Xu	9fd8abc541	vp9_pred_mv(): misc fixes and optimizations 1. skip near if it is same as nearest 2. correct rounding for converting mv to fullpel position 3. update pred_mv_sad after new mv search. Overall .1%~.25% compression gains on rtc set for speed 5, 6, 7, 8. Change-Id: Ic300ca53f7da18073771f1bb993c58cde9deee89	2015-03-20 17:17:04 -07:00
Alex Converse	6d6ef8eb3c	Don't apply active map on key frames. This allows applciations to be KF oblivious. Change-Id: Ic02712eae6ad8d6b3eaec26548299d24ca0d5cc0	2015-03-20 14:57:24 -07:00
Alex Converse	e032fc7b9e	Set loop filter level to zero on inactive segment. Change-Id: I6022a79351882a72a219aee13563bf21bcd70383	2015-03-20 14:43:06 -07:00
paulwilkins	7e234b9228	Revised rd adjustment for variance. Revised adjustment for rd based on source complexity. Two cases: 1) Bias against low variance intra predictors when the actual source variance is higher. 2) When the source variance is very low to give a slight bias against predictors that might introduce false texture or features. The impact on metrics of this change across the test sets is small and mixed. derf -0.073%, -0.049%, -0.291% std hd -0.093%, -0.1%, -0.557% yt +0.186%, +0.04%, - 0.074% ythd +0.625%, + 0.563%, +0.584% Medium to strong psycho-visual improvements in some problem clips. This feature and intra weight on GF group length now turned on by default. Change-Id: Idefc8b633a7b7bc56c42dbe19f6b2f872d73851e	2015-03-20 11:59:39 +00:00
paulwilkins	9a1ce7be7d	Experimental rd bias based on source vs recon variance. This experiment biases the rd decision based on the impact a mode decision has on the relative spatial complexity of the reconstruction vs the source. The aim is to better retain a semblance of texture even if it is slightly misaligned / wrong, rather than use a simple rd measure that tends to favor use of a flat predictor if a perfect match can't be found. This improves the appearance of texture and visual quality on specific test clips but is hidden under a flag and currently off by default pending visual quality testing on a wider Yt set. Change-Id: Idf6e754a8949bf39ed9d314c6f2daaa20c888aad	2015-03-20 11:57:36 +00:00
Adrian Grange	12d946df89	Restore first ref frame pointer to the correct value The joint_motion_search function alternates prediction between two reference frames. In order to reuse existing code, a pointer to the appropriate reference frame is written into xd->plane[0].pre[0], that the motion estimation code assumes points to the reference frame. If this first reference frame was scaled then the pointer was incorrectly being reset to point to the unscaled reference frame rather than the scaled version. Change-Id: I76f73a8d8f4f15c1f3a5e7e08a35140cdb7886ab	2015-03-19 16:17:31 -07:00
Adrian Grange	53c9ebe609	Move joint_motion_search & delete function prototype Change-Id: I7fb3a78ed0e0bc940d8b4a57c470302f8369782f	2015-03-19 14:28:52 -07:00
hkuang	b88dac8938	Safely free all the frame buffers after all the workers finish the work. Issue: 978 Change-Id: Ia7aa809095008f6819a44d7ecb0329def79b1117	2015-03-19 12:21:00 -07:00
Jingning Han	067fc49996	Merge "Speed up non-rd mode decision search"	2015-03-19 09:18:10 -07:00
Jingning Han	411bbce470	Merge "Fix an ioc warning in vp9_pick_inter_mode"	2015-03-19 09:17:25 -07:00
Marco	fc2da4c5ba	Merge "Adjustments to aq-mode=3."	2015-03-19 09:01:17 -07:00
James Zern	6f23d40582	Merge "vp9_resize_plane: quiet some static analysis warnings"	2015-03-18 19:39:48 -07:00
James Zern	c664f16182	Merge changes Ie5a24275,Ib72946a8,I532b882b * changes: vp9_fdct8x8_quant_ssse3: quiet a static analysis warning vp9_fdct8x8_quant_sse2: quiet a static analysis warning vp9_mv_pred: quiet a static analysis warning	2015-03-18 19:38:49 -07:00
Alex Converse	748843712f	Merge "Fix external resize memory issues."	2015-03-18 16:04:30 -07:00
James Zern	c4367b9b51	vp9_resize_plane: quiet some static analysis warnings document resolution assumptions with a few asserts Change-Id: Ia4ab738fd3e0a1ba0ed30a57facd2658c2c1fd60	2015-03-18 14:34:30 -07:00
James Zern	388add965f	vp9_fdct8x8_quant_ssse3: quiet a static analysis warning add an assert to validate 'in' array size Change-Id: Ie5a24275c066d9dd59714f6104510abbd4850dc5	2015-03-18 14:33:43 -07:00
James Zern	198b039e2a	vp9_fdct8x8_quant_sse2: quiet a static analysis warning add an assert to validate 'in' array size Change-Id: Ib72946a86f34e1ce8a69954e8e3e4fe1a0f18a91	2015-03-18 14:33:04 -07:00
James Zern	428369293d	vp9_mv_pred: quiet a static analysis warning add an assert to validate pred_mv array size Change-Id: I532b882b71e2baff3ac76e07ed133ec5a11bd0fc	2015-03-18 14:31:58 -07:00
Marco	71e6ed7bd1	Adjustments to aq-mode=3. Factor in segment#2 and skip blocks into the postencode estimated bits, and increase somewhat the aggressiveness of the refresh. PSNR/SSIM Metrics on RTC set go up by ~0.8/0.5%. Change-Id: I5d4e7cb00a3aefb25d18c88b6b24118b72dc5d51	2015-03-18 12:06:16 -07:00
Jingning Han	83cbe22623	Speed up non-rd mode decision search This commit makes the encoder to explicitly calculate the SAD associated with the LAST_FRAME motion vector and compare it to that of the GOLDEN_FRAME given by integral projection motion estimation. It skips the expensive sub-pixel motion search over GOLDEN_FRAME when the LAST_FRAME can provide fairly good motion compensated prediction quality. For dark720p speed -6 single thread goes from 33304 b/f, 40.070 dB, 18156 ms -> 33319 b/f, 40.061 dB, 17611 ms Change-Id: I01bc94b9b598075567a392111046b97a9bc30efe	2015-03-18 12:04:58 -07:00
Adrian Grange	83288c7af8	Order header files alphabetically Change-Id: I3e275544bff478849c1b5f3dcd5de950ee330d14	2015-03-18 11:18:08 -07:00
Jingning Han	4640a0c480	Merge "Fix the C version of column vector projection"	2015-03-17 22:53:49 -07:00
Jingning Han	c932584f0f	Fix the C version of column vector projection Make the C and SSE2 versions consistent. Change-Id: I03c405d22a36bd1a97480efb96dc5af230667424	2015-03-17 18:50:53 -07:00
Marco	e52109158a	Update to variance partition. Use force_split to constrain the partition selection. This is used because in the top-down approach to variance partition, a block size may be selected even though one of its subblocks may have high variance. In this patch the selection of the 64x64 block size will only be allowed if the variance of all the 32x32 subblocks are also below the threshold. Stil testing, but some visual improvement for areas near slow moving boundary can be seen. Metrics for RTC set increase by about ~0.5%. Change-Id: Iab3e7b19bf70f534236f7a43fd873895a2bb261d	2015-03-17 17:02:47 -07:00
Yunqing Wang	45e8e4a01f	Merge "Refactor set vbp thresholds function"	2015-03-17 16:05:53 -07:00
Yunqing Wang	c0423abf00	Refactor set vbp thresholds function Code refactoring. Change-Id: I73b6fcc0444155ee46c1efa5253c1d608c6439cb	2015-03-17 12:23:32 -07:00
Adrian Grange	ed6824e449	Remove unused ZBIN_BOOST macros Change-Id: I5169155b20ea3676a6ce58ec77d6aeba07db29d9	2015-03-17 11:53:58 -07:00
Jingning Han	ee41141466	Fix an ioc warning in vp9_pick_inter_mode Shut off all the metric checks for golden reference frame, if we decide that it is unlikely to be selected for reference. Change-Id: Ie457cc1fd43935584403b4982659aed80fb9909c	2015-03-17 10:13:44 -07:00
Yaowu Xu	de3097aa23	Merge "Remove duplicate clamping"	2015-03-16 16:56:10 -07:00
Jingning Han	adaffcc010	Merge "Remove ineffective newmv skip checking from vp9_pick_inter_mode"	2015-03-16 16:43:43 -07:00
Jingning Han	4e8daaf960	Merge "Simplify prediction filter search in rtc coding mode"	2015-03-16 16:43:26 -07:00
Jingning Han	82231beced	Merge "Refactor column integral projection computation"	2015-03-16 16:43:11 -07:00
Yaowu Xu	3119c24658	Merge "change the order of inter modes evaluated"	2015-03-16 16:14:34 -07:00
Alex Converse	6126afe62e	Fix external resize memory issues. These were uncovered by the chromoting perftest. Change-Id: Ia5a90fd1718ff757c1484decf3861295260e6722	2015-03-16 15:56:26 -07:00
Yaowu Xu	4611f24797	Remove duplicate clamping The mvs are clamped in the vp9_find_best_ref_mvs() already. Change-Id: I9bea5e35aef6007466fe7fca4bc2dc5c17e74222	2015-03-16 15:19:37 -07:00
Jingning Han	c852200f51	Remove ineffective newmv skip checking from vp9_pick_inter_mode Change-Id: I41ee684cf113a7b5edf280183e51cb08b2e93cc4	2015-03-16 15:06:27 -07:00
Jingning Han	981bb84882	Simplify prediction filter search in rtc coding mode Reduce unnecessary fetch from MB_MODE_INFO. Change-Id: Iff89b76d5e2774c00a564e902913a633fa2e1ea9	2015-03-16 14:54:00 -07:00
Yaowu Xu	f2d682fc10	change the order of inter modes evaluated Change-Id: I10c1ad23b110cf92cb026e895039c215c47abfd0	2015-03-16 12:49:30 -07:00
Jingning Han	2cfddec332	Refactor column integral projection computation Move the scaling factor outside column projection. This avoids repeated calculation of the same scaling factor. Profiling shows that the percentage of vp9_int_pro_col_sse2 of overall cycles goes from 2.29% down to 1.88%. Change-Id: I5ac4e324ab2d7f33ba2de66dd2a12e04e04dfd66	2015-03-16 12:07:15 -07:00
Jingning Han	09e0b38a86	Merge "Fix indent in choose_partitioning"	2015-03-16 11:52:12 -07:00
Jingning Han	7cf383d17f	Fix indent in choose_partitioning Change-Id: I4039f8ac75a9cfcc4d07abd0619d1379bb10fe51	2015-03-16 11:01:00 -07:00
Yaowu Xu	51d529a578	vp9_pick_inter_mode(): minor optimizations 1. remove duplicate initialization to mbmi->interp_filter. 2. move mv clamping into ref_frame loop instead of mode checking loop. 3. move the check if last frame is same as golden frame earlier to avoid initialization of Golden reference related variables. Change-Id: Idf2d05e19e94a24f69cc289687869fc71d2ff289	2015-03-16 10:08:02 -07:00
Jingning Han	1f9b2b77ad	Merge "Fix choose_partitioning threshold setup for speed -5"	2015-03-15 09:04:07 -07:00
Jingning Han	b03cf9317a	Fix 1-step refinement search table Change-Id: I32f0bcb40c6e7ba63bfae487739ededd0b6b2dde	2015-03-14 10:52:11 -07:00
Jingning Han	1f00a9b9d5	Fix choose_partitioning threshold setup for speed -5 The compression performance of speed -5 is on average 12.6% better than speed -6. At lower bit-rates, the gains are typically 20% or more. For 2-thread encoding, the speed -5 takes about 1.6x time of speed -6. Change-Id: If7a73464a24d33e8f49b9533b51ec51c8da7fc80	2015-03-13 17:01:56 -07:00
Marco	87999b1c2e	Merge "Fix crash with vp9 denoiser on."	2015-03-13 14:31:40 -07:00
Jingning Han	6cceed09cf	Merge "Use sdx4df to do 1-step refinement"	2015-03-13 12:57:49 -07:00
Marco	e38066a74d	Fix crash with vp9 denoiser on. Crash occured on very first key frame, because denoiser temporal function was beng entered. Updated denoiser unittest to set cpu_used from first frame, and verified fix fixes the crash. Change-Id: I3be1124b52846fbbe7248d2c3d6136e086c80bc1	2015-03-13 11:10:02 -07:00
Marco	deaf661f45	Merge "Lower bitrate threshold below which cyclic refresh is turned off."	2015-03-13 10:31:35 -07:00
Alex Converse	f8df916931	Merge "Reconcile active_map and cyclic refresh"	2015-03-13 10:20:15 -07:00
Jingning Han	688c99a706	Merge "Reset src buffer only once in vp9_int_pro_motion_estimation"	2015-03-13 09:56:00 -07:00
Jingning Han	1b3499ae8b	Merge "Reduce the number of full block SAD calls"	2015-03-13 09:55:52 -07:00
Jingning Han	cce7020f2c	Use sdx4df to do 1-step refinement Change-Id: Ie0c3ef3ae3aedf049b1a296de607730b79c12672	2015-03-13 09:53:15 -07:00
Marco	62a3f53997	Lower bitrate threshold below which cyclic refresh is turned off. Change-Id: Ib54ab11adf8178eec74f65388a89c8f912c7869a	2015-03-13 09:42:45 -07:00
paulwilkins	b6749aa3a7	Merge "Shorten GF/arf interval in hard scenes."	2015-03-13 08:45:52 -07:00
Jingning Han	ba29125f7b	Reset src buffer only once in vp9_int_pro_motion_estimation Change-Id: I5c96b6a25f9df60da65b7af7c92a921b611746e3	2015-03-12 18:50:53 -07:00
Yaowu Xu	1aa75c65cc	Merge "vp9_pick_inter_mode(): Use single loop to evaluate inter modes"	2015-03-12 18:43:23 -07:00
Jingning Han	427cdf0a41	Reduce the number of full block SAD calls This commit uses a 6-point 1-step refine motion search in the integral projection based full pixel motion estimation, to replace the current 9-point search. It reduces runtime cost of speed -6 on some noisy clips, e.g., dark720p single thread 33314 b/f, 40.076 dB, 18231 ms -> 33307 b/f, 40.067 dB, 17768 ms The compression performance for rtc set remains unchanged. Change-Id: I194ea5a9ce52e5a10baeee36338633adc22f764c	2015-03-12 18:30:57 -07:00
Yunqing Wang	769e6567e9	Merge "Minorly modify model_rd_for_sb_y function"	2015-03-12 17:16:48 -07:00
Jingning Han	7a9d8f1efe	Merge "Fix fdct8x8_quant ssse3 overflow issue"	2015-03-12 16:43:09 -07:00
Alex Converse	1bfacd3529	Reconcile active_map and cyclic refresh Change-Id: Id7f8654aeeb20caa402bc822521b1d72c658f4f9	2015-03-12 16:19:49 -07:00
Yaowu Xu	2b368097c8	vp9_pick_inter_mode(): Use single loop to evaluate inter modes This commit changes to use single loop to evaluate all inter modes. There is no impact on compression quality and speed, but allow future experiment with the order of modes evaluated. Change-Id: I71696ce1014cbe127e25e98710d835987f5ecc09	2015-03-12 16:14:29 -07:00
Yunqing Wang	5d677c97eb	Minorly modify model_rd_for_sb_y function Added a skip_dc check. If skip_dc = 1, we could eliminate calling of vp9_model_rd_from_var_lapndz(). This gave slight PSNR & SSIM gain(<0.1%), and no speed change. Change-Id: If5ca733366148c86b98e196a00cc890f50e9a3e5	2015-03-12 14:04:14 -07:00
Jingning Han	fcb96b3afd	Fix fdct8x8_quant ssse3 overflow issue This resolves webm issue 968. Change-Id: Ieb363129b1e135a561141c68211d413226aba754	2015-03-12 12:43:19 -07:00
Deb Mukherjee	791bf5657f	Merge "Some rate control adjustments to control overshoot"	2015-03-12 11:10:59 -07:00
Jingning Han	1ff15fbffe	Merge "Prevent integer overflow in choose_partitioning"	2015-03-12 09:24:02 -07:00
Jingning Han	90ea10ec91	Merge "Remove unnecessary speed feature checking"	2015-03-12 09:23:51 -07:00
Jingning Han	594890a534	Merge "Apply fast motion search to golden reference frame"	2015-03-12 09:23:41 -07:00
Jingning Han	8fdddd5c01	Merge "Refactor to remove GLOBAL_MOTION"	2015-03-12 09:23:31 -07:00
Marco	0adc58037a	Merge "Fix visual studio build failure."	2015-03-11 17:19:47 -07:00
Jingning Han	238b6be24b	Prevent integer overflow in choose_partitioning Re-arrange the multiplication and right shift operations to avoid integer overflow in choose_partitioning. Change-Id: Ib4005cafb410a67c1960486471d75b6ebe38c4e0	2015-03-11 16:31:42 -07:00
Marco	a291b0b4a3	Fix visual studio build failure. Change-Id: Ifeb14f945d0f0300eb7b21b38e5720ac1c11a6cf	2015-03-11 16:12:39 -07:00
Jingning Han	313c28f8b8	Remove unnecessary speed feature checking This commit removes the pred_mv_sad comparison from rtc motion search, given that a stronger comparison has been done at the mode search level to eliminate unlikely selected reference frames. Change-Id: I49b8d24b2174303066fd8eff2102c0648f2869df	2015-03-11 16:11:40 -07:00
Adrian Grange	39d20c6ac3	Merge "Clamp rate correction factor after scaling it"	2015-03-11 16:09:49 -07:00
Jingning Han	54eda13f8d	Apply fast motion search to golden reference frame This commit enables the rtc coding mode to run integral projection based motion search for golden reference frame. It improves the speed -6 compression performance by 1.1% on average, 3.46% for jimred_vga, 6.46% for tacomascmvvga, and 0.5% for vidyo clips. The speed -6 is about 6% slower. Change-Id: I0fe402ad2edf0149d0349ad304ab9b2abdf0c804	2015-03-11 16:03:49 -07:00
Jingning Han	1ca4d51b2e	Refactor to remove GLOBAL_MOTION Make the vp9_int_pro_motion_estimation() function return zero motion vector if high bit depth is turned on, instead of removing it from compiled codes. Change-Id: Ia48f010eb590b2d517d5678c394110b326a1a95e	2015-03-11 15:53:15 -07:00
Yaowu Xu	dc902fedb2	Merge "Separate rd_thresh adaption by ref_frame"	2015-03-11 10:41:20 -07:00
Adrian Grange	42a89eb8cc	Clamp rate correction factor after scaling it Added clamp on the rate correction factor after it has been scaled. Change-Id: I5d4b46a101987b43c5bcfd7e0bd1b7b4d53640a4	2015-03-11 09:08:15 -07:00
paulwilkins	b29c48b03c	Shorten GF/arf interval in hard scenes. This patch accounts in the first pass stats for blocks that while not coded as intra, are complex and have an intra error / best error ratio below a threshold. The modification shortens the GF arf interval for a particular class of content that contains a lot of blocks matching the above criteria. (In one short problem test sequence the average interval dropped from about 14-15 to 10-11) The change results in small net gains in metrics results for the Yt(~0.2%) and yt-hd (~0.5%) sets and is approximately neutral for the other test sets. The change is currently shielded by a flag and off by default pending verification that it does not cause other regressions in tests on a wider YT test set. Change-Id: I6b803daa6a4ac09a6f428fb3a18be1ecedd974b7	2015-03-11 14:15:23 +00:00
Yaowu Xu	d549aa3b17	Separate rd_thresh adaption by ref_frame Only update the rd_thresh factors for modes sharing same reference frame. This helps overall compression of 6 and 7 by .13% and .19% respectively without any noticeable speed difference. Change-Id: Idb3a3879512c5d7d0880034516079949290690c5	2015-03-10 19:06:52 -07:00
Deb Mukherjee	0308e2ee6d	Some rate control adjustments to control overshoot Some rate control adjustments to control overshoot in the constrained quality mode. Change-Id: I8907b9a883642d779009d0a138adfa6ba67e7f41	2015-03-10 17:25:10 -07:00
Marco	340260585c	Merge "Modify update golden reference update under aq-mode=3 mode."	2015-03-10 11:48:10 -07:00
Marco	fb31aa09e2	Modify update golden reference update under aq-mode=3 mode. For non-SVC 1 pass CBR: make the GF update interval a multiple of the cyclic refresh period, and use encoding stats to prevent GF update at certain times. Change-Id: I4c44cacc2f70f1d27391a47644837e1eaa065017	2015-03-10 10:54:00 -07:00
Yaowu Xu	12943e722d	Merge "Enable using Golden reference in choose_partition()"	2015-03-10 10:48:52 -07:00
paulwilkins	4b01a2d350	Merge "Allow q adjustment for VPX_CQ and VPX_CBR."	2015-03-10 10:45:02 -07:00
Adrian Grange	78df712216	Fix vp9_compute_qdelta_by_rate loop behavior The return value from vp9_compute_qdelta_by_rate, which is a delta value for the quantizer, could never be 0 if (qindex == rc->worst_quality). This occurs because target_index was setup unconditionally in the loop and yet the loop counter stopped at (rc->worst_quality - 1). Change-Id: I6b59cd9b5811ff33357e71cd7d814c5e53d291f2	2015-03-10 09:14:54 -07:00
Yaowu Xu	059a473b35	Enable using Golden reference in choose_partition() Choose_partition uses only the last frame as reference frame in making partition decision, this commit adds the check on how well Golden frame with (0,0) predicts the current block, and uses GF(0,0) as basis for partition decision if it produces better prediction. The commit improves rtc speed 6 and 7 encoding by 0.14% and 0.19% respectively. Change-Id: I156acf925bd6e0b586d48155d1940d27270a3915	2015-03-10 08:57:28 -07:00
Alex Converse	066ed601a5	Merge "Don't waste time partitioning skip superblocks."	2015-03-09 13:02:16 -07:00
Jingning Han	9708f9d66a	Merge "Skip golden ref frame check when it is same as last ref frame"	2015-03-09 12:27:19 -07:00
Jingning Han	6245a91e0b	Skip golden ref frame check when it is same as last ref frame When golden reference frame is refreshed, the next frame has both its last and golden reference frames point to the same reference frame in real-time coding mode. Experiments suggest that using two separate reference frames for frames right after golden refresh frame does not provide further compression performance advantage. This commit hence retains the current encoder implementation and shuts off the mode search over golden reference frame in this case. It makes the encoder run slightly faster at no coding performance change. Change-Id: I1561f7799253a10e675d05c63c1749fe9e85b472	2015-03-09 11:14:55 -07:00
Alex Converse	06b59299c8	Don't waste time partitioning skip superblocks. Force 64x64 partitioning when a whole superblock is SEGMENT_LVL_SKIP. This drops encode times of screens mostly at rest by 20%. Change-Id: Ieba554b0b8a0c1679aae784a8bd11f038ab942c3	2015-03-09 11:02:05 -07:00
paulwilkins	2cff9c4efe	Allow q adjustment for VPX_CQ and VPX_CBR. Adjustment previously only enabled in VBR mode. This patch allows adjustment of min and max q for CBR and adjustment of max q only for CQ mode. Change-Id: Id5e583f3d50453cd544fc57249acacd946457482	2015-03-09 17:13:55 +00:00
Yunqing Wang	969dd8f128	Merge "vp9_ethread: fix me consts initialization to support aq_mode=3 encoding"	2015-03-09 09:42:12 -07:00
Jingning Han	d2b6a4cc80	Merge "Move pred_mv assign outside integral projection motion search"	2015-03-09 09:34:26 -07:00
Yunqing Wang	c4fb2d7cc7	Merge "Modify the setting of transform skip flags in non-rd mode"	2015-03-09 08:35:57 -07:00
Yunqing Wang	6e0ec0b2d9	vp9_ethread: fix me consts initialization to support aq_mode=3 encoding While turning on "--aq_mode=3", the quantizers are updated by each thread. Fixed the me consts initialization function to make sure that the correct thread data are updated. Change-Id: Ied27bb7bae76fc3fa2cda4f8c35ac0b46271bef4	2015-03-06 16:31:46 -08:00
Yunqing Wang	268f260d64	Modify the setting of transform skip flags in non-rd mode While searching for the best mode in non-rd case, SSE of a partition block is calculated and the transform size is set. This patch rewrites the skip checking conditions based on transform size instead of partition size to be more precise. Small gains were seen in rtc set borg test (speed 6). AVG PSNR: 0.087%, overall PSNR: 0.073%, SSIM: 0.146%. No noticeable speed change. Change-Id: I5603ca5339c784dfa02263f4005988ccd8c32f6e	2015-03-06 09:22:00 -08:00
Yaowu Xu	0f37601fd7	Merge changes I1b972c94,I9c897d32 * changes: Prevent invalid memory access Use correct bsize for uv	2015-03-06 07:27:59 -08:00
Yaowu Xu	8cbeb7cf36	Prevent invalid memory access Change-Id: I1b972c945274254d896d772d859840b2f8211b4f	2015-03-05 14:57:11 -08:00
Alex Converse	feda5d244c	Merge changes I219c287b,I6adee670 * changes: Call encoder control before running ethread test. Don't copy thread data for the main thread.	2015-03-05 14:43:42 -08:00
Alex Converse	b21e361f8d	Merge "Fix misleading indentation."	2015-03-05 14:43:38 -08:00
Alex Converse	ad01d275e9	Merge "Don't inline cost_coeffs."	2015-03-05 13:54:44 -08:00
Adrian Grange	6e3be5c3b6	Merge "Fix valgrind memcpy memory overlaps warning"	2015-03-05 12:52:57 -08:00
Alex Converse	2eb113d00a	Don't inline cost_coeffs. It was tiny when it was orginally marked INLINE. Forcing this function to be inlined prevents the compiler from inlining its much smaller callers. No measurable speed impact, 28320 byte smaller libvpx.a Change-Id: I6bf4c917157d15cbadb3cd3e20a9e82d35dc7d6f	2015-03-05 12:39:02 -08:00
Alex Converse	56cc37c642	Fix misleading indentation. Change-Id: Ic82b039a3d42f9aa01b85a3a69facfaa84b43a53	2015-03-05 12:10:56 -08:00
Alex Converse	71d5a59c6d	Don't copy thread data for the main thread. Change-Id: I6adee6704cacfeae0ed0b217a91095457d1be74a	2015-03-05 12:10:56 -08:00
Jingning Han	fda0410822	Move pred_mv assign outside integral projection motion search Change-Id: I040b066fdce08e2f05115a22ea808715aa147779	2015-03-05 11:44:10 -08:00
Jingning Han	87bf5203af	Merge "Move integral projection motion search to vp9_mcomp.c"	2015-03-05 09:25:16 -08:00
Yaowu Xu	b573fef76d	Use correct bsize for uv Change-Id: I9c897d32af6c3a956bb6f424a74c12737727038a	2015-03-05 08:20:35 -08:00
Adrian Grange	4b546583c4	Merge "Small rationalization of code in vp9_first_pass"	2015-03-04 12:49:58 -08:00
Adrian Grange	a34a042615	Merge "Make encoder buffer allocation dynamic"	2015-03-04 10:54:10 -08:00
Adrian Grange	fed9e1fee9	Small rationalization of code in vp9_first_pass Change-Id: I87cc0e038171c60a957298827e312fead500f7fb	2015-03-04 10:49:03 -08:00
Jingning Han	50c06052e9	Merge "Use SAD value to set chroma cost flag"	2015-03-04 10:47:56 -08:00
Jingning Han	2deecdd5cb	Move integral projection motion search to vp9_mcomp.c Make it a general purpose fast motion estimation function, to be used in the mode search process. Change-Id: Ib354cb0e664dc61c30c0b2314297835ee75b157a	2015-03-04 10:30:15 -08:00
Jingning Han	7d8061a44a	Use SAD value to set chroma cost flag This saves an extra 64x64 variance calculation and replaces two 32x32 variance functions with sad functions. The compression performance change is unnoticeable. Change-Id: I6d33868695664ec73b56c42945162ae61c484856	2015-03-04 09:46:39 -08:00
Jingning Han	0fe8304d0b	Merge "Properly handle the boundary blocks for integral projection search"	2015-03-04 09:01:33 -08:00
Adrian Grange	3807dd82ab	Make encoder buffer allocation dynamic Frame buffers are now allocated dynamically on-demand. Entries in the reference frame map, cm->ref_frame_map, may now be set to -1 (INVALID_IDX) to indicate that there is not a valid reference buffer in that "slot". All slots in the reference frame map are now initialized to the empty state (-1) and each buffer is initialized to have a reference count of 0. Change-Id: Id1afe98de98db4ae8b2dfefed7889c3b28c68582	2015-03-04 07:58:32 -08:00
Deb Mukherjee	87d1a488ed	Merge "dc quantizer fix for 32x32 transforms"	2015-03-03 23:23:44 -08:00
Jingning Han	540318d3f8	Merge "Scale the normalization factor depending on the block size"	2015-03-03 19:04:34 -08:00
Jingning Han	e5fe165840	Properly handle the boundary blocks for integral projection search Use rectangular block size for integral projection motion estimation if the the 64x64 block has over half block outside the frame. This avoids the issue that the motion information of these blocks is dominated by the extended pixels, instead of the pixels of interest. Change-Id: I22f4d2bb7f6a20db9b3f5e2e5463a7f4b9d1b737	2015-03-03 16:15:12 -08:00
Deb Mukherjee	6910e92d04	dc quantizer fix for 32x32 transforms The rounding factor needs to be scaled down by a factor of 2. Also, the quantized and dequantized coefficients are memset to 0 when dc quantizer is used. Change-Id: Ifa68bab02addbf1b83d249c5b4cbd5cda796b1cf	2015-03-03 15:58:27 -08:00
Adrian Grange	852f62fde5	Fix valgrind memcpy memory overlaps warning Change-Id: Id0bb162b48b891c5c849f0411ef2ac0aa4bbe261	2015-03-03 15:06:34 -08:00
Jingning Han	a521008201	Scale the normalization factor depending on the block size Change-Id: I0a26994bf65ea224e496b09af2ce71e1a4210433	2015-03-03 11:29:46 -08:00
Yaowu Xu	47ac3ea0bb	Adapt color sensitiviy threshold to luma signal energy Instead using only a fixed threshold, this commit adapts the threshold for color sensitivity decision to luma signal energy: chroma channel's sse is at least 1/6 of that in luma for color sensitivity flag to be set to active. This recoups a large portion of the speed loss due to accounting for chroma component costs in RTC mode decision. Change-Id: Ie01f747f6037dba6a1d1ed3e10b71a0ef1abc42c	2015-03-03 11:15:13 -08:00
Jingning Han	1790d45252	Use variance metric for integral projection vector match This commit replaces the SAD with variance as metric for the integral projection vector match. It improves the search accuracy in the presence of slight light change. The average speed -6 compression performance for rtc set is improved by 1.7%. No speed changes are observed for the test clips. Change-Id: I71c1d27e42de2aa429fb3564e6549bba1c7d6d4d	2015-03-01 10:42:56 -08:00
Jingning Han	f4e0eb17e8	Merge "Fix source frame border extension"	2015-02-27 18:19:18 -08:00
Jingning Han	fe85fabbac	Fix source frame border extension This commit fixes an issue in source frame border extension. It causes certain frame resolution such as 640x480 to have a portion of the right/bottom extension filled by zeros, which misleads motion search and degrades transform coding performance when large block size is used. This fix improves the speed 2 compression performance of a few yt sequence, typically ranging from 1% - 2%, up to 5% at median to low bit-rate. Change-Id: Id6b09a5695d9e7651c6dfbc2c6a72288b08af7fb	2015-02-27 15:48:01 -08:00
Adrian Grange	94bba48525	Merge "Fix calc_highbd_psnr"	2015-02-27 15:42:08 -08:00
Alex Converse	2b2fc812f1	Merge "Make SVC compatible with external resize."	2015-02-27 14:37:48 -08:00
Adrian Grange	54293ee3c7	Fix calc_highbd_psnr Should use the crop dimensions of the frame rather than the extended size. Change-Id: I49ed041a46ff0753d43e074020857b7ff2f95e17	2015-02-27 14:05:02 -08:00
Marco	2b0ed0842f	Merge "Fix arithmetic overflow warnings."	2015-02-27 11:53:57 -08:00
Jingning Han	89ee460ee4	Merge "Refactor integral projection based motion estimation"	2015-02-27 09:49:30 -08:00
Marco	c3f7bb16b4	Fix arithmetic overflow warnings. Change-Id: Ib85b5bc135aa0907a76b8c74faafe577e27d014f	2015-02-26 15:27:21 -08:00
Jingning Han	73a00d3219	Refactor integral projection based motion estimation Support variable block size integral projection based motion estimation. Change-Id: Iee6d65e44df4480aa13fb7b84b9c91914b89caa1	2015-02-26 14:48:59 -08:00
Yaowu Xu	754bbcfdc8	Fix the encoder to support profile change Change-Id: Iefb928ad1174e274409facfb44f80265ff0f7683	2015-02-26 11:41:01 -08:00
Yaowu Xu	387bb8bed7	Correct parameter order in a function call Change-Id: Ibd87db1c4371edcbe193d39df2fdc07d3842c21a	2015-02-26 11:39:57 -08:00
paulwilkins	e2b4ef1313	Merge "Account for rate error in GF group Q calculation."	2015-02-26 08:20:08 -08:00
Alex Converse	6ea83fdfcb	Make SVC compatible with external resize. Fixes https://code.google.com/p/webm/issues/detail?id=943 Change-Id: I6177bf6ab6b31a22d2652732f579b8aed3f28887	2015-02-25 14:05:51 -08:00
Jingning Han	3e1d14a6ce	Merge "Motion compensated reference refinement"	2015-02-25 12:33:09 -08:00
Jingning Han	4c5a4efc38	Merge "Re-distribute hierarchical vector match pattern"	2015-02-25 10:33:25 -08:00
Jingning Han	b7050c0be3	Motion compensated reference refinement This commit applies one-step refinement search to the resulting motion vector of the integral projectiion based motion estimation, per 64x64 block. It improves the coding performance of speed -6. pedestrian 1080p 500 kbps 51735 b/f, 36.794 dB, 16044 ms -> 51382 b/f, 36.793 dB, 16282 ms cloud 1080p 500 kbps 24081 b/f, 37.988 dB, 14016 ms -> 23597 b/f, 38.076 dB, 12774 ms vidyo1 720p 1000 kbps 16552 b/f, 40.514 dB, 8279 ms -> 16553 b/f, 40.543 dB, 8510 ms The rtc set compression performance is improved by 0.5%. Change-Id: I3d09bea2caf58b2a4f3b38aa26fffafcbe9a2c17	2015-02-25 10:32:09 -08:00
Yunqing Wang	419ff1352e	Merge "Fix ssse3 quantize_fp functions while skip=1"	2015-02-25 10:10:10 -08:00
Jingning Han	0f57d0a682	Merge "Fix fwd transform sse2 build issue on older gcc version"	2015-02-25 09:32:00 -08:00
Jingning Han	e47033319d	Fix fwd transform sse2 build issue on older gcc version Change-Id: I3e0e53d129552babf29e6c5d047483733983973c	2015-02-24 23:25:21 -08:00
Jingning Han	f87e315e1e	Re-distribute hierarchical vector match pattern This commit modifies the hierarchical vector match patter. It avoids repeated SAD computation at same points. The function vp9_vector_sad_sse2 is called 12 times per 64x64 block, instead of 15 times as before. The effective coverage remains the same. Change-Id: I91ad9d27d40db8963c907d02af84e10702136994	2015-02-24 11:48:38 -08:00
Yunqing Wang	58e0159c80	Fix ssse3 quantize_fp functions while skip=1 In ssse3 functions, DEFINE_ARGS macro hard codes qcoeff and dqcoeff to r3 and r4. If skip is 1, qcoeff and dqcoeff need to be loaded from the stack, which doesn't work because of the above definitions. Currently, skip=1 case is not used in the encoder. This patch fixed the issue, so it can be turned on later. Change-Id: I998d696b1a7a85dca2b3bcee790b21c21e039147	2015-02-24 10:37:05 -08:00
paulwilkins	8d7f53f04c	Account for rate error in GF group Q calculation. When GF group adaptive maxQ is enabled this patch accounts somewhat for accumulated error in the rate control. This improves accuracy quite a bit on many clips especially when there is overshoot. Examples when the overshoot and undershoot command line parameters are set to 100: Hall @ 1200 overshoot is reduced from 67-24%. Akiyo @ 400 undershoot is reduced from 28%-15%. Setting a lower value for undershoot or overshoot still reduces the error further. Impact on metrics is mixed with some gains in average psnr but generally a little lower (e.g. 0.5%) on overall and ssim. The GF group adaptation is still off by default in this patch. Compared to with the head, enabling this mode now gives big average psnr gains on the YT sets (e.g. YT_HD >11.2%), a drop in overall PSNR (YT-HD 3.9%) and a smaller drop or neutral for SSIM. Change-Id: If4b32cd0740d3fb941317b374f9c2951954eee90	2015-02-23 10:57:27 +00:00
Marco	c9f660d895	Merge "Remove a few unneccessary multiplications in denoiser."	2015-02-20 14:42:02 -08:00
Marco	8f84fbe756	Remove a few unneccessary multiplications in denoiser. Change-Id: I3edbb7cc67203fbbf32c6fd4a08015ca9d9ed53e	2015-02-20 11:55:11 -08:00
Hangyu Kuang	8724d31d12	Move dequant table from VP9_COMMON to VP9_COMP as decoder does not need it any more. This reduces VP9_COMMON size from 25776 bytes to 17584 bytes(~31%). Change-Id: Ic5daea732ccefb6d512b048af7983f0efe08589b	2015-02-20 11:12:42 -08:00
Marco	a1b402e71c	Merge "Adjustments to cyclic refresh (aq-mode=3)."	2015-02-20 09:55:05 -08:00
Jingning Han	6728655422	Merge "Add high bit depth support to rtc sub8x8 block coding"	2015-02-20 09:35:18 -08:00
Marco	0187f4b411	Adjustments to cyclic refresh (aq-mode=3). Target higher delta-qp for big blocks with zero motion, and for segment#1: avoid 64x64 partition size and force 8x8 tx size. Metrics on RTC set mostly positive: SSIM up by ~4%, PSRN by ~1.5%. Doesn't seem to be any change in speed. Change-Id: I1f68fa3c4f62dab3b90cc58041f05ebb048ae5ac	2015-02-20 08:47:59 -08:00
Jingning Han	6f4245894a	Add high bit depth support to rtc sub8x8 block coding This commit adds proper buffer handle to support high bit depth in rtc sub8x8 block coding. Change-Id: Ibaf8a2160194121aec9ca68b8094817fed9ccaea	2015-02-20 08:36:33 -08:00
Adrian Grange	f03627347e	Merge "Fix control string in firstpass stats fprintf"	2015-02-19 16:36:43 -08:00
Yunqing Wang	5e57729601	Merge "Improve skip_txfm thresholds in the non-rd mode selection"	2015-02-19 15:31:02 -08:00
Adrian Grange	2ae314fe3a	Fix control string in firstpass stats fprintf 20 items in the control string but only 19 arguments. Change-Id: I51dab9aa1c58c653b52395005a9cb41f09feb484	2015-02-19 15:18:30 -08:00
Jingning Han	216b171d63	Merge "Integral projection based motion estimation"	2015-02-19 15:08:11 -08:00
Yunqing Wang	81fc5bf81c	Improve skip_txfm thresholds in the non-rd mode selection Modified the thresholds of deciding whether or not to skip the transforms in model_rd_for_sb_y(). Used zbin[] instead of dequant[] to be more precise. Also, modified the checking coditions. Rtc set borg test results (at speed 6) showed: average PSNR gain: 0.138%, overall PSNR gain: 0.158%, and SSIM gain: 0.177%. The data rate test was modified slightly as suggested by Marco. Change-Id: Ieaf633ab77f4838cb3c45cf69065b29d55f8ae6c	2015-02-19 14:30:46 -08:00
Jingning Han	ed2dc59c1b	Integral projection based motion estimation This commit introduces a new block match motion estimation using integral projection measurement. The 2-D block and the nearby region is projected onto the horizontal and vertical 1-D vectors, respectively. It then runs vector match, instead of block match, over the two separate 1-D vectors to locate the motion compensated reference block. This process is run per 64x64 block to align the reference before choosing partitioning in speed 6. The overall CPU cycle cost due to this additional 64x64 block match (SSE2 version) takes around 2% at low bit-rate rtc speed 6. When strong motion activities exist in the video sequence, it substantially improves the partition selection accuracy, thereby achieving better compression performance and lower CPU cycles. The experiments were tested in RTC speed -6 setting: cloud 1080p 500 kbps 17006 b/f, 37.086 dB, 5386 ms -> 16669 b/f, 37.970 dB, 5085 ms (>0.9dB gain and 6% faster) pedestrian_area 1080p 500 kbps 53537 b/f, 36.771 dB, 18706 ms -> 51897 b/f, 36.792 dB, 18585 ms (4% bit-rate savings) blue_sky 1080p 500 kbps 70214 b/f, 33.600 dB, 13979 ms -> 53885 b/f, 33.645 dB, 10878 ms (30% bit-rate savings, 25% faster) jimred 400 kbps 13380 b/f, 36.014 dB, 5723 ms -> 13377 b/f, 36.087 dB, 5831 ms (2% bit-rate savings, 2% slower) Change-Id: Iffdb6ea5b16b77016bfa3dd3904d284168ae649c	2015-02-19 13:47:19 -08:00
Jingning Han	83559e7357	Fix a check condition in nonrd_pick_partition Change-Id: Ic92fb4b16948f745c218351b24fdafecf9abce3a	2015-02-19 09:54:55 -08:00
Yaowu Xu	c5718a7aa3	Merge "Fix an encoder/decode mismatch bug"	2015-02-13 16:40:41 -08:00
Yaowu Xu	4bc7f4828f	Fix an encoder/decode mismatch bug This commit prevent the encoder to update last_frame_type when a frame is dropped in the encoder. Prior to this fix, if there is a dropped frame immediatedly after a key frame, decoder would have the value of last_frame_type as key frame, different from encoder as the dropped frame in encoder would have updated the value to an inter frame. This leads to different probability update in encoder and decoder, thereby encoder/decoder mismatch. This fixes issue #941 Change-Id: I27115224b138bec43ae3916c016574f5740822b0	2015-02-13 15:45:47 -08:00
Marco	b1940bf5fe	Replace some operations with shift in encoder_breakout. Replaced a divide by 9 with 8, so some very small difference, but otherwise no change in behavior. Change-Id: I1079ae3c41e0789ff0bc6fa9940a238b6bca0f5b	2015-02-13 10:45:19 -08:00
Jingning Han	e69c79e19a	Merge "Fix ioc issue in block_rd_txfm"	2015-02-12 15:07:41 -08:00
Jingning Han	5041aa0fbe	Fix ioc issue in block_rd_txfm Force 64-bit precision in the intermediate steps. Change-Id: I666113d9adcef8975da201d5aa1a13b783d09594	2015-02-12 12:51:39 -08:00
Marco	cc7d981de1	Merge "Add skin detection."	2015-02-12 11:12:27 -08:00
Jingning Han	f4c29ae9ea	Merge "Update partition rate cost in rtc speed 5"	2015-02-12 09:14:49 -08:00
Jingning Han	ee83243daa	Merge "Add mode cost to sub8x8 block mode decision in rtc coding"	2015-02-12 09:14:29 -08:00
Marco	56435bb7b6	Add skin detection. Simple skin detection, from vp8; works reasonable on most of the RTC clips, but could miss sometimes. Added debug flag to write out skin map over source input. Change-Id: I2caea7592f1c459047aac46627eeb24a94946464	2015-02-11 17:47:17 -08:00
Adrian Grange	053625e4cd	Add cast to convert double to int Change-Id: I7f63c2940256a5dadf9a29a853809290dd9e98ed	2015-02-11 15:59:48 -08:00
Jingning Han	e665c8f2c9	Add mode cost to sub8x8 block mode decision in rtc coding This commit allows the encoder to properly account for the mode cost in sub8x8 non-RD mode decision. Change-Id: I2951960d20e37ed08e372ee0c7044935b2b9b899	2015-02-11 14:43:02 -08:00
Jingning Han	c9725813db	Merge "Account for inter prediction filter rate cost in rtc mode selection"	2015-02-11 14:42:44 -08:00
Jingning Han	532cb435f8	Merge "Add ref frame rate cost to non-RD mode decision"	2015-02-11 14:36:48 -08:00
Jingning Han	7a4e0b2265	Update partition rate cost in rtc speed 5 The block partition rate cost should be updated when recursive partition search is needed. Change-Id: I7bc5ad1fc2cbd3577dee7f7e8da111a2742bdeb9	2015-02-11 12:48:29 -08:00
Jingning Han	41b7f76db1	Account for inter prediction filter rate cost in rtc mode selection Add the rate cost on inter prediction filter type to the overall rate-distortion cost in vp9_pick_mode_inter. Change-Id: I72c34017adf5220cadb3962694ee5404469fc673	2015-02-11 12:17:29 -08:00
Jingning Han	4ce70e8847	Add ref frame rate cost to non-RD mode decision This commit adds a heuristic rate cost of reference frame to the non-RD mode decision. It improves the compression performance of speed -6 by 0.31% and speed -5 by 0.69%. Change-Id: If7f3b45519d49b2cb640bcb7316a254efc8be446	2015-02-11 11:08:10 -08:00
Yaowu Xu	ee5d79995e	Move computation up to frame level This is to avoid redo the same calculation repeatly, and also allow easier adjustments for further experiments. This commit shall have no effect on quality/compression. Change-Id: I4460acf5c808ff5518da18d21e002c5da58af857	2015-02-10 15:41:52 -08:00
Adrian Grange	2d924161c7	Merge "Auto-adaptive encoder frame resizing logic"	2015-02-10 12:16:55 -08:00
Jingning Han	f0eea5be2a	Merge "Fix block partition size in fill_mode_info_sb"	2015-02-10 10:49:03 -08:00
Adrian Grange	23ebacdb81	Auto-adaptive encoder frame resizing logic Note: This feature is still in development. Add an option for the encoder to decide the resolution at which to encode each frame. Each KF/GF/ARF goup is tested to see if it would be better encoded at a lower resolution. At present, each KF/GF/ARF is coded first at full-size and if the coded size exceeds a threshold (twice target data rate) at the maximum active Q then the entire group is encoded at lower resolution. This feature is enabled in vpxenc by setting: --resize-allowed=1 In addition, if the vpxenc command line also specifies valid frame dimensions using: --resize-width=XXXX & --resize_height=YYYY then all frames will be encoded at this resolution. Change-Id: I13f341e0a82512f9e84e144e0f3b5aed8a65402b	2015-02-10 09:59:32 -08:00
Yunqing Wang	84b813aa42	Merge "Make encoder and decoder share common thread function"	2015-02-10 09:06:41 -08:00
Yunqing Wang	d3a37731c2	Merge "Rename loopfilter_thread files to thread_common files"	2015-02-10 09:06:23 -08:00
Jingning Han	ebb4c9e8e7	Fix block partition size in fill_mode_info_sb This commit fixes the sub block partition size used in fill_mode_info_sb. Previous implementation effectively disabled the rectangular block sizes. This commit resolved this issue. Change-Id: Ic1c383ab0a9a2e7d59e85b388093f1f1f94d1e7f	2015-02-10 08:39:32 -08:00
Yunqing Wang	07eb8c8da3	Merge "Fix high bit depth assembly function bugs"	2015-02-09 15:30:36 -08:00
Yunqing Wang	4ae092c660	Make encoder and decoder share common thread function Moved vp9_accumulate_frame_counts to vp9_thread_common.c to eliminate the duplicate code. Change-Id: I9cf506d729603c8bf1494b4c86a3b7d47af1917a	2015-02-06 11:45:51 -08:00
Jingning Han	ba933b90c6	Merge "Re-arrange inter mode search order in RTC coding flow"	2015-02-06 10:11:33 -08:00
Yunqing Wang	41063137c3	Rename loopfilter_thread files to thread_common files Renames the files to allow more common thread code to be moved to vp9/common. Change-Id: I7386e64e221086e3cdc087e79812f993c423413b	2015-02-06 10:03:31 -08:00
Yaowu Xu	8b5e665098	Merge "Replace repeated check with single variable"	2015-02-06 09:17:59 -08:00
Jingning Han	b2762a8853	Re-arrange inter mode search order in RTC coding flow This commit makes the ZEROMV mode first in the search order to ensure that the zero mv is always checked in the RTC coding mode. It improves the average speed -6 compression performance by 0.3% in both PSNR and SSIM at no visible speed change. Change-Id: I465a7e59f4e20cd84fee3f02ced6f98036945949	2015-02-06 08:52:52 -08:00
Yunqing Wang	789ae447f8	Fix high bit depth assembly function bugs The high bit depth build failed while building for 32bit target. The bugs were in vp9_highbd_subpel_variance.asm and vp9_highbd_sad4d_sse2.asm functions. This patch fixed the bugs, and made 32bit build work. Change-Id: Idc8e5e1b7965bb70d4afba140c6583c5d9666b75	2015-02-05 11:24:03 -08:00
Yaowu Xu	c905c42ad8	Remove unnecessary initialization loop_filter_level is always reset in loop_filter_frame() later in encoder. Change-Id: I608e03d905a6b23e7d5025ca747e4784c665007e	2015-02-04 13:56:16 -08:00
Yaowu Xu	581aee001e	Move tx_mode decision logic into select_tx_mode() Change-Id: I7f8f78c33eb3f33344b029a27bda320f4d68c577	2015-02-04 13:54:49 -08:00
Yaowu Xu	19451e6d67	Replace repeated check with single variable Change-Id: I2f6a669bf7c6d9796388ad3f3fa3fc942635c215	2015-02-04 12:59:14 -08:00
Yaowu Xu	a844a778c7	Merge "Adjust partitioning threshold based rtc speed"	2015-02-04 12:52:03 -08:00
Yaowu Xu	3bc0c6576f	Merge "Move calls to avoid unnecessary operations"	2015-02-04 12:51:16 -08:00
Yaowu Xu	bdfb5f986e	Adjust partitioning threshold based rtc speed On rtc set: speed 7 quality improves about 0.5% speed 8 quality improves about 1.0% Encoding time for speed 7 changes from 67804ms to 65889ms Encoding time for speed 8 changes from 58659ms to 56808ms Change-Id: Iabcfb53012fc1b9f3326cdbc167e5758b8c7ad30	2015-02-04 11:28:39 -08:00
Jingning Han	1b9082ec6b	Unify luma and chroma inter predictors in choose_partitioning Change-Id: I8bfc80f4fffb0892e93d3326394a52d1ee3c0f37	2015-02-04 10:02:57 -08:00
Jingning Han	4ccfc7d517	Save an extra call for setup_pred_plane function Reuse the yv12_mb array to fetch the buffer pointers/strides corresponding to the current reference frame. Change-Id: I5276b7494158b2cccef15213be2dc189e9036851	2015-02-04 09:47:14 -08:00
Jingning Han	0c6d3a03e1	Account for chroma component costs in RTC mode decision This commit allows the encoder to account for additional chroma plane costs in the mode decision process, if the current block potentially contains significant color change. It improves the visual quality at very low bit-rates. The compression performance of dark720p is improved by 12.39% in speed 6. For jimred at 150 kbps, the PSNR of V component (red) increased by 0.2 dB, at the expense of about 5% increase in encoding time. Note that for sequences where the chroma components are fairly consistent, the encoding time increase is negligible. On average the rtc set compression performance is improved by 1.172% in PSNR and 1.920% in SSIM. Change-Id: Ia55b24ef23a25304f7ec9958fbf07fd6e658505c	2015-02-04 09:45:14 -08:00
Johann	3a5d40608e	Merge "Remove unnecessary pointer check"	2015-02-03 17:12:56 -08:00
Yaowu Xu	02537ebbe4	Move calls to avoid unnecessary operations Change-Id: I236f7f75ab9a4511d1b52a6a67299b0e844a103e	2015-02-03 17:01:37 -08:00
Yaowu Xu	cb411108a3	Merge "adjust rtc setting and threshold"	2015-02-03 15:13:52 -08:00
Jim Bankoski	d7783cae95	Merge "make low bitrates a lot less blocky"	2015-02-03 13:25:06 -08:00
Johann	ba18609502	Remove unnecessary pointer check The original implementation had the following comment: // Ignore mv costing if mvsadcost is NULL However the current implementation does not allow for this. If x exists then nmvsadcost must not be null. This removes the only warning from -Wpointer-bool-conversion https://code.google.com/p/webm/issues/detail?id=894 Change-Id: I1a2cee340d7972d41e1bbbe1ec8dfbe917667085	2015-02-03 13:03:46 -08:00
Jingning Han	894f0fbd3b	Merge "Assign 2nd ref frame in choose_partitioning"	2015-02-03 12:25:18 -08:00
Jingning Han	ca9c352fc3	Assign 2nd ref frame in choose_partitioning Avoid the use of uninitialized second reference frame for fetching reference block. Change-Id: I9983a0daea829700b3270dc8bf2bcc6d6ea36652	2015-02-03 11:17:51 -08:00
Jim Bankoski	9f1cf2c8cf	make low bitrates a lot less blocky Remove loop filter skip at speed 7+ because of bad visual artifacts and up the postprocessing. Change-Id: Ibdd0bac71aaee232d2bb2e14462733c51517768d	2015-02-03 06:45:56 -08:00
Yaowu Xu	65a1a3e85d	adjust rtc setting and threshold 1. Adjusted the threshold for coef update computation based on counts of tx used, avoid coef update computation when count is low (<20) 2. Move sf->lpf_pick = LPF_PICK_MINIMAL_LPF to speed 8. Change-Id: I02b44309e40fcdbf135c7934ae067a3f42502d30	2015-02-02 17:43:46 -08:00
Alex Converse	a79db92c07	Merge "Allow larger encoder configurations."	2015-02-02 12:05:56 -08:00
Yaowu Xu	80e729f601	Merge "Optimize coef update"	2015-02-01 20:08:29 -08:00
hkuang	be6aeadaf4	Try again to merge branch 'frame-parallel' into master branch. In frame parallel decode, libvpx decoder decodes several frames on all cpus in parallel fashion. If not being flushed, it will only return frame when all the cpus are busy. If getting flushed, it will return all the frames in the decoder. Compare with current serial decode mode in which libvpx decoder is idle between decode calls, libvpx decoder is busy between decode calls. Current frame parallel decode will only speed up the decoding for frame parallel encoded videos. For non frame parallel encoded videos, frame parallel decode is slower than serial decode due to lack of loopfilter worker thread. There are still some known issues that need to be addressed. For example: decode frame parallel videos with segmentation enabled is not right sometimes. * frame-parallel: Add error handling for frame parallel decode and unit test for that. Fix a bug in frame parallel decode and add a unit test for that. Add two test vectors to test frame parallel decode. Add key frame seeking to webmdec and webm_video_source. Implement frame parallel decode for VP9. Increase the thread test range to cover 5, 6, 7, 8 threads. Fix a bug in adding frame parallel unit test. Add VP9 frame-parallel unit test. Manually pick "Make the api behavior conform to api spec." from master branch. Move vp9_dec_build_inter_predictors_* to decoder folder. Add segmentation map array for current and last frame segmentation. Include the right header for VP9 worker thread. Move vp9_thread.* to common. ctrl_get_reference does not need user_priv. Seperate the frame buffers from VP9 encoder/decoder structure. Revert "Revert "Revert "Revert 3 patches from Hangyu to get Chrome to build:""" Conflicts: test/codec_factory.h test/decode_test_driver.cc test/decode_test_driver.h test/invalid_file_test.cc test/test-data.sha1 test/test.mk test/test_vectors.cc vp8/vp8_dx_iface.c vp9/common/vp9_alloccommon.c vp9/common/vp9_entropymode.c vp9/common/vp9_loopfilter_thread.c vp9/common/vp9_loopfilter_thread.h vp9/common/vp9_mvref_common.c vp9/common/vp9_onyxc_int.h vp9/common/vp9_reconinter.c vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decodeframe.h vp9/decoder/vp9_decodemv.c vp9/decoder/vp9_decoder.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_encoder.c vp9/encoder/vp9_pickmode.c vp9/encoder/vp9_rdopt.c vp9/vp9_cx_iface.c vp9/vp9_dx_iface.c This reverts commit `a18da9760a`. Change-Id: I361442ffec1586d036ea2e0ee97ce4f077585f02	2015-01-30 21:00:13 -08:00
Jingning Han	f1ab5c1021	Merge "Format fixes in vp9_rd_pick_inter_mode_sb/sub8x8"	2015-01-30 15:49:14 -08:00
Yaowu Xu	45971abd1d	Optimize coef update 1. move the check of search method of USE_TX_8X8 up one level to avoid operations of build_tree_distributions() 2. count tx used and avoid computaton for coef udpate when one size is not used at all. Change-Id: Ia3e54a2588aa531c41377a1bfaa64385d04a592c	2015-01-30 10:16:40 -08:00
Yunqing Wang	3b3e299650	Merge "Fix issues in 32bit PIC enabled build"	2015-01-29 16:41:25 -08:00
Alex Converse	797a2556eb	Allow larger encoder configurations. Allow changing colorspace in the encoder and increasing frame size. Change-Id: I8e7c3b891af29ce420a15beb4f6f9c250245b2bb	2015-01-29 15:07:40 -08:00
Paul Wilkins	68340a3470	Merge "Change to update of rate control factors."	2015-01-29 13:50:52 -08:00
Marco	a80dd52b6e	Merge "Fix to vp9 denoiser."	2015-01-29 09:10:30 -08:00
Paul Wilkins	f752da8ce2	Change to update of rate control factors. Remove damping parameter and use the damping formula introduced by Yaowu Xu in all cases. Change-Id: I18db7e0d0f262d5140102f259ab07821d374d285	2015-01-28 15:44:53 -08:00
Yaowu Xu	ff99a3c750	Simplify update_coef_probs() 1. reduce the size of temporaray arrays on stack 2. avoid build_tree_distribution for tx size that is not used at all. Change-Id: I0f8d7124e16a3789d3c15ad24cf02c1c12789e2c	2015-01-28 15:12:42 -08:00
Marco	c0923d4d3a	Fix to vp9 denoiser. Prevent from using wrong mv for denoiser motion compensation. Change-Id: Ifa0f9daabdbdab0900d3c17304059fe0d15de914	2015-01-28 12:07:27 -08:00
Frank Galligan	d1e6b8231a	Merge "Add vp9_sad32x32x4d_neon Neon intrinsic function."	2015-01-28 10:35:50 -08:00
Frank Galligan	eb12d880ab	Merge "Add vp9_sad16x16x4d_neon Neon intrinsic function."	2015-01-27 23:01:44 -08:00
Frank Galligan	80a3a07929	Merge "Add vp9_sad64x64x4d_neon Neon intrinsic function."	2015-01-27 23:01:15 -08:00
Yunqing Wang	10d5e09c87	Fix issues in 32bit PIC enabled build This patch was to fix issue 924: https://code.google.com/p/webm/issues/detail?id=924 The SECTION_RODATA macro was modified to support macho32 format. The sub-pixel functions were modified to pass in 2 more parameters to handle the global offsets for PIC build. Change-Id: I3bfcd336bcae945edf300bca4ab40376a2628cd4	2015-01-27 22:20:21 -08:00
Yaowu Xu	fe2439703d	Merge "move clear_system_state() call before using double"	2015-01-27 12:42:13 -08:00
Frank Galligan	e3167f7fbf	Add vp9_sad32x32x4d_neon Neon intrinsic function. On Nexus 7 speed -6 saw ~18% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. BUG=https://code.google.com/p/webm/issues/detail?id=908 Change-Id: I70ccdea0326750552ed946fb004507d6efe02d5c	2015-01-27 08:54:00 -08:00
Frank Galligan	9f574d0316	Add vp9_sad16x16x4d_neon Neon intrinsic function. On Nexus 7 speed -6 saw ~15% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. BUG=https://code.google.com/p/webm/issues/detail?id=908 Change-Id: I4b2006b644c488f42bf06d8a22ef0e6120a96bf9	2015-01-27 08:42:17 -08:00
Frank Galligan	54fa956715	Add vp9_sad64x64x4d_neon Neon intrinsic function. On Nexus 7 speed -6 saw ~30% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. BUG=https://code.google.com/p/webm/issues/detail?id=908 Change-Id: Id12af7d1883243c23e6692e898aea82299633d58	2015-01-27 08:33:40 -08:00
Marco	1c4a84c6e9	Merge "aq-mode=3: Update to allow for refresh on modes other than zero-mv."	2015-01-26 19:47:13 -08:00
Yaowu Xu	645b7cdf03	move clear_system_state() call before using double Floating point is used in vp9_convert_qindex_to_q(), so sometime unit test ActiveMapTest would cause run time error without properly call to clear_system_state to reset register status. Change-Id: I181e9395148c44a6ca8b97d6e109bd4a152143c6	2015-01-26 18:41:50 -08:00
Paul Wilkins	d231ce4fde	Merge "Adjust active maxq for GF groups."	2015-01-26 18:19:09 -08:00
Yaowu Xu	d987dc4fdb	Merge "Fix MSVC warnings on conversion from int64 to int"	2015-01-26 16:52:30 -08:00
Marco	3f1af6e85e	aq-mode=3: Update to allow for refresh on modes other than zero-mv. Add distortion threshold condition to refresh state of a coding block, and allow for qp adjustment also for some intra modes and non-zero motion modes. Also some code cleanup (remove unused variables/code). Change-Id: I735fa2b28bc64f60e0323976b82510577b074203	2015-01-26 16:44:25 -08:00
Paul Wilkins	fd070220ff	Adjust active maxq for GF groups. Currently disabled by default: enabled using #define GROUP_ADAPTIVE_MAXQ In this patch the active max Q is adjusted for each GF group based on the vbr bit allocation and raw first pass group error. This will tend to give a lower q for easy sections and a higher value for very hard sections. As such it is expected to improve quality in some of the easier sections where quality issues have been reported. This change tends to hurt overall psnr but help average psnr. SSIM also shows a small gain. Average results for derf, yt, std-hd and yt-hd test sets were as follows (%change for average psnr, overal psnr and ssim):- derf +0.291, - 0.252, -0.021 yt +6.466, -1.436, +0.552 std-hd +0.490, +0.014, +0.380 yt-hd +5.565, - 1.573, +0.099 Change-Id: Icc015499cebbf2a45054a05e8e31f3dfb43f944a	2015-01-26 14:55:36 -08:00
Yaowu Xu	6d16f6c14c	Fix MSVC warnings on conversion from int64 to int Change-Id: I7e96509ffa36899fcd2935749927a1e8aac8d025	2015-01-26 10:54:06 -08:00
Frank Galligan	9f6eba419a	Add Neon intrinsic vp9_fdct8x8_quant_neon On Nexus 7 speed -5 got ~2%, -6 got ~15%, -7 and -8 got ~30% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. Change-Id: I83246d63b96674d170098a572fa4fe28a05aaf51	2015-01-24 22:49:50 -08:00
Jingning Han	9bdc0ae2b2	Format fixes in vp9_rd_pick_inter_mode_sb/sub8x8 Add parentheses to bit operations. Change-Id: I095d601f0631d055adc4b3a8fde70c9cbae9e749	2015-01-23 11:48:58 -08:00
Adrian Grange	0e2e2c2652	Merge "Remove elevate_newmv_thresh from SPEED_FEATURES (unused)"	2015-01-23 09:57:03 -08:00
Johann	a18da9760a	Revert "Merge branch 'frame-parallel' to enable frame parallel decode in master branch." This reverts commit `bde04ce503` Change-Id: I053dae04c761b04a36dc239558503905a14d2470	2015-01-23 08:42:02 -08:00
hkuang	bde04ce503	Merge branch 'frame-parallel' to enable frame parallel decode in master branch. In frame parallel decode, libvpx decoder decodes several frames on all cpus in parallel fashion. If not being flushed, it will only return frame when all the cpus are busy. If getting flushed, it will return all the frames in the decoder. Compare with current serial decode mode in which libvpx decoder is idle between decode calls, libvpx decoder is busy between decode calls. VP9 frame parallel decode is >30% faster than serial decode with tile parallel threading which will makes devices play 1080P VP9 videos more easily. * frame-parallel: Add error handling for frame parallel decode and unit test for that. Fix a bug in frame parallel decode and add a unit test for that. Add two test vectors to test frame parallel decode. Add key frame seeking to webmdec and webm_video_source. Implement frame parallel decode for VP9. Increase the thread test range to cover 5, 6, 7, 8 threads. Fix a bug in adding frame parallel unit test. Add VP9 frame-parallel unit test. Manually pick "Make the api behavior conform to api spec." from master branch. Move vp9_dec_build_inter_predictors_* to decoder folder. Add segmentation map array for current and last frame segmentation. Include the right header for VP9 worker thread. Move vp9_thread.* to common. ctrl_get_reference does not need user_priv. Seperate the frame buffers from VP9 encoder/decoder structure. Revert "Revert "Revert "Revert 3 patches from Hangyu to get Chrome to build:""" Conflicts: test/codec_factory.h test/decode_test_driver.cc test/decode_test_driver.h test/invalid_file_test.cc test/test-data.sha1 test/test.mk test/test_vectors.cc vp8/vp8_dx_iface.c vp9/common/vp9_alloccommon.c vp9/common/vp9_entropymode.c vp9/common/vp9_loopfilter_thread.c vp9/common/vp9_loopfilter_thread.h vp9/common/vp9_mvref_common.c vp9/common/vp9_onyxc_int.h vp9/common/vp9_reconinter.c vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decodeframe.h vp9/decoder/vp9_decodemv.c vp9/decoder/vp9_decoder.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_encoder.c vp9/encoder/vp9_pickmode.c vp9/encoder/vp9_rdopt.c vp9/vp9_cx_iface.c vp9/vp9_dx_iface.c Change-Id: Ib92eb35851c172d0624970e312ed515054e5ca64	2015-01-22 18:18:53 -08:00
Adrian Grange	527e073163	Remove elevate_newmv_thresh from SPEED_FEATURES (unused) Change-Id: I78ef7f89586a329787f6bc4c58ec83af210989a3	2015-01-22 16:12:50 -08:00
Marco	0dccb6277c	Modify variance partition selection for low resolutions. For low spatial resolutions: bias partittion selection to smaller block sizes, and base the variance computation on 4x4 down-sampling. Also move the threshold computations into the choose_partitioning, so they are computed once for each sb block. On low-res clips (RTC_derf) PSNR/SSIMetrics increase by about 4-5%. No change for resolutions above CIF. Change-Id: I93f8ff742c8044786977bb6e31dcf8efda6dd1b0	2015-01-22 15:16:55 -08:00
Paul Wilkins	cf3202132f	Merge "Bug when last group before forced key frame is short."	2015-01-22 08:28:19 -08:00
Paul Wilkins	0bff1efc2b	Bug when last group before forced key frame is short. Just before a forced key frame we often get a foreshortened arf/gf group. In such a case, we do not want to update rc->last_boosted_qindex, which is used to define the Q range for the forced key frame itself. This gives a small average metrics gain for the YT and YT-HD sets (eg. YT SSIM +0.141%). Change-Id: Ie06698bc4f249e87183b8f8fb27ff8f3fde216d9	2015-01-21 15:25:57 -08:00
JackyChen	cd0830f452	Merge "Fix compile error in Chromium building."	2015-01-21 14:52:32 -08:00
JackyChen	25a19b48ff	Fix compile error in Chromium building. The comparison of address in the condition is not necessary, since they will constantly be non-null. Change-Id: Id0b0075283f5af65215d5761a8160a4cb2a15c9b	2015-01-21 12:59:25 -08:00
Alex Converse	910ca857df	Allow external resize via vpx_codec_enc_config_set Change-Id: I3d324e2baa4de2d266c5f7ca7b635b62372e90a7	2015-01-21 11:33:06 -08:00
Frank Galligan	469ff48d7b	Merge "Add Neon intrinsics for vp9_avg_8x8_neon"	2015-01-20 14:38:39 -08:00
Yunqing Wang	7b232717af	Merge "vp9_ethread: add parallel loopfilter"	2015-01-20 09:27:08 -08:00
Frank Galligan	cc2da09d42	Fix variance Neon intrinsics > 32x32 The 16 bit sum vector was overflowing. Change-Id: I0fdf38e832ee99457ec8680a92691a6175ff8c3f	2015-01-17 10:31:48 -08:00
Yunqing Wang	e76eaf05b1	vp9_ethread: add parallel loopfilter 1. Added row-based loopfilter in encoder; 2. Moved common multi-threaded loopfilter functions from decoder to common; 3. Merged multi-threaded loopfilter code, and made encoder/ decoder call same function to reduce code duplication. Encoder tests showed that 1% - 2% speedup was seen for good-quality 2-pass mode(at speed 3); 1% - 3% speedup using 2 threads and 4% - 6% speedup using 4 threads were seen for real-time mode(at speed 7). Change-Id: I8a4ac51c2ad9bab9fa7b864e90743931c53ec1c4	2015-01-16 17:19:27 -08:00
Jingning Han	0220255fa0	Merge "Fix frame buffer swap in denoiser"	2015-01-16 16:58:37 -08:00
Jingning Han	dfda5cebc7	Fix frame buffer swap in denoiser This commit fixes a bug in denoiser reference frame buffer swap, which disables frame buffer update. Change-Id: I39a9427180fd18f9692602064ad821f7af4714c0	2015-01-16 12:29:58 -08:00
Minghai Shang	220bc3a013	[two pass temporal svc]Fix crash issue in transcoder app caused by last fix. Change-Id: I78ecc8ec3fa3ba5f69bb23813e68a5255d0534e1	2015-01-15 16:59:54 -08:00
Frank Galligan	6e7e1cf32f	Add Neon intrinsics for vp9_avg_8x8_neon On Nexus 7 speed -5, -6, -7, and -8 saw about a 1% increase in perf for 480p. Speeds -5, -6, -7, and -8 saw about a 1.5% increase in perf for 720p. Tested on Nexus 7, built with ndk r10d, gcc 4.9. Change-Id: Ibf17ebfd952a6aec941719bd8306df8ec4574bee	2015-01-15 15:32:40 -08:00
Yunqing Wang	99b99831e4	Align thread data in vp9_ethread On some platforms, such as 32bit Windows and 32bit Mac, the allocated memory isn't aligned automatically. The thread data is aligned to ensure the correct access in SIMD code. Change-Id: I1108c145fe982ddbd3d9324952758297120e4806	2015-01-14 15:51:56 -08:00
Yaowu Xu	829a01dbb7	Merge "Add encoder control for setting color space"	2015-01-14 14:14:34 -08:00
Frank Galligan	c7d6c0c5a8	Merge "Switch remaining Neon variance functions to shifts"	2015-01-14 12:17:42 -08:00
Yaowu Xu	e94b415c34	Add encoder control for setting color space This commit adds encoder side control for vp9 to set color space info in the output compressed bitstream. It also amends the "vp9_encoder_params_get_to_decoder" test to verify the correct color space information is passed from the encoder end to decoder end. Change-Id: Ibf5fba2edcb2a8dc37557f6fae5c7816efa52650	2015-01-14 10:17:14 -08:00
Frank Galligan	ec1d8387e1	Add 64x64 sub_pel_variance Neon function On Nexus 7 speed -5, -6, -7, and -8 saw about a 15% increase in perf for 480p. Speeds -5, -6, -7, and -8 saw about a 10% increase in perf for 720p. Tested on Nexus 7, built with ndk r10d, gcc 4.9. Change-Id: I2fa5315845e3021c9a6e2ea47e52e68b398d8334	2015-01-14 08:36:24 -08:00
Frank Galligan	588f74f8a6	Switch remaining Neon variance functions to shifts Saves 5 instructions on 8x8 and 16x16 and 8 instructions on 32x32, when compiled with 4.9. Change-Id: Id3da613a36a9d27d8c5169c59ba45d247c920c6c	2015-01-14 07:22:49 -08:00
Frank Galligan	bd3dbc588c	Merge "Add 64x variance Neon functions"	2015-01-13 22:38:58 -08:00
Minghai Shang	a14415d171	[twopass temporal svc] Fix decoding error on seek. Don't put small empty frame in front of a key frame. We will put key frame flag in webm container if there's a visible key frame. But there will be decoding error when we seek to here if we put the small empty frame, which will be inter frame, in front of it. Change-Id: Id50c2c1fd31da0405ff6faa7375cc2f49c55402d	2015-01-13 15:44:22 -08:00
Frank Galligan	74d40cd507	Add 64x variance Neon functions Add optimized Neon functions of: vp9_variance32x64 vp9_variance64x32 vp9_variance64x64 On Nexus 7 speed -5 and -6 saw about a 4% increase in perf. Speeds -7 and -8 saw about a 6% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. Change-Id: I5a81f13c9897eb927fa39662530f5524a0f768fa	2015-01-13 15:08:13 -08:00
Yaowu Xu	6f6fbf9175	Merge "Added plumbing for setting color space"	2015-01-13 09:20:13 -08:00
Yaowu Xu	fe3f21099f	Merge "Fix comments and color format"	2015-01-11 14:01:36 -08:00
Yaowu Xu	ce52b0f8d3	Added plumbing for setting color space Change-Id: If64052cc6e404abc8a64a889f42930d14fad21d3	2015-01-09 10:54:25 -08:00
Yaowu Xu	ecbca31a1d	Fix comments and color format Replaced "color space" with "color format" in comments where color sampling format is concerned, so to differentiate from the concept defined in COLOR_SPACE. Change-Id: I8c935034c166b24307a99352dab1686531276bb8	2015-01-09 10:36:43 -08:00
Paul Wilkins	ccffe318ff	Merge "Use 64 bit to accumulate frame sse."	2015-01-09 06:05:11 -08:00
Jingning Han	ae537c151b	Merge "Refactor mc reference block fetch in denoiser"	2015-01-08 17:56:53 -08:00
James Zern	44b55dada8	Merge "vp9: fix -Wclobbered (longjmp + local variables)"	2015-01-08 15:53:02 -08:00
Jingning Han	a0be730eae	Refactor mc reference block fetch in denoiser This commit refactors the motion compensated reference block fetch process in denoiser. It skips the stage that generates motion compensated reference block if denoiser decides to use copy block mode. For high motion clips, this could speed up the denoising process by about 10%. Change-Id: I8ef4fa5fe766a8c4529119b9ec01faefb3d4ef53	2015-01-08 12:43:08 -08:00
Jingning Han	e3f0b19f3f	Use lookup table to find pixel numbers in block This could save one multiplication in each threshold funtion called by the denoiser per block. Change-Id: I35f437e09999f0a087180878ef7805f0d86e5819	2015-01-08 12:32:28 -08:00
Jingning Han	e535ad5067	Merge "Refactor denoiser frame buffer update"	2015-01-08 11:16:14 -08:00
Jingning Han	97dc782635	Merge "Initalize zeromv_sse and newmv_sse in vp9_pick_inter_mode"	2015-01-08 10:55:03 -08:00
Jingning Han	f1866a5792	Merge "Use vp9_convolve_copy in denoiser output"	2015-01-08 09:59:10 -08:00
Jingning Han	ea061a885d	Refactor denoiser frame buffer update Use frame buffer pointer swap instead of memcpy when possible. These two CLs make the denoiser when running on vidyo1 720p at speed -6 over 10% faster. Change-Id: I64fe8a2422cafca6787a50c7f4dfb961191c0a9d	2015-01-07 18:33:13 -08:00
Jingning Han	29a5deb40c	Use vp9_convolve_copy in denoiser output Replace copy_block with vp9_convolve_copy for speed performance improvement. Change-Id: I3a08c4d01dff2253b6ee573efd02f65ccdc1b5a5	2015-01-07 18:23:17 -08:00
Zoe Liu	4cf636a60e	Removed redundant local variables in the forward hybrid transforms. Change-Id: I60f7ccbbc8dc624134e325bdce6042bc183075b6	2015-01-07 16:38:29 -08:00
Jingning Han	08055b639a	Merge "Always check and free denoiser buffer memory space"	2015-01-07 15:54:06 -08:00
Jingning Han	e42b3ee765	Initalize zeromv_sse and newmv_sse in vp9_pick_inter_mode These two parameters are used to control the denoiser cut-off thresholds. They should be properly initialized when starting mode search of a given block. Change-Id: Iba8a25487026a0dbe0d350c347d7e4e4e237b637	2015-01-07 15:32:41 -08:00
Jingning Han	b208439b5a	Merge "Fix best ref frame rd cost update in sub8x8 non-RD mode search"	2015-01-07 14:06:55 -08:00
Jingning Han	3e41563f33	Merge "Format fix in vp9_pick_inter_mode_sub8x8"	2015-01-07 14:06:06 -08:00
Jingning Han	802b798f67	Fix best ref frame rd cost update in sub8x8 non-RD mode search This fixes the issue that sub8x8 inter blocks always end up with GOLDEN_FRAME. Change-Id: Id0c25cbb9c2003f43b4dff8fb1572512c246e077	2015-01-07 12:00:02 -08:00
Jingning Han	c3fd9bbdaf	Format fix in vp9_pick_inter_mode_sub8x8 Replace ref_frame++ with ++ref_frame. Change-Id: Ic39793081156c314bf1b85d5ab76def97f3bff52	2015-01-07 11:50:36 -08:00
Jingning Han	59f29f5e3f	Merge "Fix denoiser chroma component initialization"	2015-01-07 11:30:15 -08:00
Jingning Han	9a0e694182	Merge "Skip duplicate denoiser frame buffer allocation"	2015-01-07 11:30:07 -08:00
Jingning Han	ce08006951	Always check and free denoiser buffer memory space The vp9_denoiser_free() function will internally check if the buffer pointers are NULL. This commit makes the encoder always call vp9_denoiser_free() after finishing encoding. It protects the case where noise_sensitivity_level is changed during encoding process and happen to be turned off towards the end of sequence, which could result memory space allocated to denoiser not being released. Change-Id: Ie20dc2f2e6e5fb6333fbab3356bc153978a6a0f8	2015-01-07 08:50:13 -08:00
Jingning Han	2fb9b635bb	Fix denoiser chroma component initialization Use the correct frame size and stride value for chroma components when setting the initial values. These control parameters are assigned when the denoiser buffer was allocated and initialized. Change-Id: Ia6318194c7738aff540bcbd34f77f0dac46221a1	2015-01-07 08:49:59 -08:00
Jingning Han	27582e573b	Skip duplicate denoiser frame buffer allocation Allocate the frame buffer allocation for denoiser once during the encoder initialization. This avoids allocating frame buffer multiple times and overwriting the buffer pointer without proper releasing. Change-Id: I9b3baa6283449d86fd164534d344c036bb035700	2015-01-07 08:49:04 -08:00
Paul Wilkins	a3c1a9b419	Use 64 bit to accumulate frame sse. When testing frame sse to choose a loop filter value and when checking ambient error in kf Q selection, use 64 bit values for accumulating the sse, to avoid risk of overflow for large image formats. Change-Id: I03765d16c843d0ade61a45b0cd46312472697e57	2015-01-07 14:13:16 +00:00
Deb Mukherjee	e7570493b8	Moves inter mode count updates to update_stats This makes the inter_mode counts update consistent with other symbols. Also, forward updates should work corerctly now. Change-Id: Id98be26fd08875162e644bb8f1de6f0918f85396	2015-01-06 16:40:45 -08:00
Yaowu Xu	0979dbb37b	Merge "Fix compiler warnigns for msvc2013"	2015-01-06 08:01:47 -08:00
Paul Wilkins	a88e4e64b1	Merge "Deleted unused #define"	2015-01-06 04:18:20 -08:00
Yaowu Xu	364b92dc88	Fix compiler warnigns for msvc2013 Change-Id: I1e32bf8f6872a6fb7e9cabe86483e94805e2f790	2015-01-05 17:31:19 -08:00
Jingning Han	21c0306187	Fix denoised video output function This commit fixes the buffer alignment control in denoised video output function. The encoder is now able to properly store the denoised input video into provided file when enabled. Change-Id: I258e272c8d4a9b52592e16d6d09976c6f5c21728	2015-01-03 21:39:32 -08:00
Jingning Han	2fe1bfa5ad	Merge "Remove redundant local variable for segment_id"	2015-01-02 14:48:27 -08:00
Jingning Han	5516fdd8d0	Remove redundant local variable for segment_id Use mbmi->segment_id directly in vp9_pick_inter_mode. The value is set outside this function, hence no need to assign it again. Change-Id: I3d63cdd2e4fadf62ccdefada638b00d979eb3741	2015-01-02 12:25:14 -08:00
Jingning Han	0d2d3321af	Merge "Add bsize check condition in nonrd_use_partition"	2015-01-02 11:50:57 -08:00
Jingning Han	5486db185c	Add bsize check condition in nonrd_use_partition Check if block size is below 8x8 for rectangular block coding. It is added to support 4x8 and 8x4 block coding for RTC mode. Change-Id: I760b328f45b98ae48adc45ed5a39fb643cd8aebd	2015-01-02 10:12:37 -08:00
Jingning Han	59cfaa538e	Merge "Use less tmp motion vectors in vp9_pick_inter_mode_sub8x8"	2015-01-02 10:00:45 -08:00
Jingning Han	5c31fd5c6d	Merge "Enable sub8x8 inter block search for RTC coding mode"	2015-01-02 10:00:35 -08:00
Jingning Han	2baccb18a0	Use less tmp motion vectors in vp9_pick_inter_mode_sub8x8 This commit simplifies the reference motion vector part for sub8x8 block coding in RTC mode and reduces the required local variables. Change-Id: I470d1482092563b68af22404dc1f497e7457b0a8	2014-12-30 13:16:12 -08:00
Jingning Han	f5d574c566	Merge "Set ref frame scaling factor in RTC inter mode decision"	2014-12-29 14:20:22 -08:00
Jingning Han	dad89d5ca1	Enable sub8x8 inter block search for RTC coding mode This commit enables sub8x8 inter block coding for RTC mode. The use of sub8x8 blocks can be turned on by allowing choose_partitioning function to select 4x4/4x8/8x4 block sizes. Change-Id: Ifbf1fb3888fe4c094fc85158ac3aa89867d8494a	2014-12-24 17:40:31 -08:00
Jim Bankoski	b3c66f8a2f	WIP: Remove giant value cost table Change-Id: Iabe8a8868a747626c24bb13f1796f4c7827af367	2014-12-23 15:06:17 -08:00
Jingning Han	eb1795f643	Set ref frame scaling factor in RTC inter mode decision Properly set the corresponding scaling factor of the reference frame in the non-RD mode decision process. This allows the mode search process to account for the scaled reference frame when selecting coding mode. Change-Id: I9d41bff6931c98e5a82b413e37ac5e6e14b93b23	2014-12-23 09:33:58 -08:00
James Zern	59d63e610a	vp9: fix -Wclobbered (longjmp + local variables) Local variables used at the setjmp() site need to be marked volatile. Relevant excerpt from the 'man longjmp': =============== The values of automatic variables are unspecified after a call to longjmp() if they meet all the following criteria: · they are local to the function that made the corresponding setjmp(3) call; · their values are changed between the calls to setjmp(3) and longjmp(); and · they are not declared as volatile. =============== Change-Id: I093e6eeeedbf5f781d202248ca701ba2c29d3064	2014-12-23 11:44:11 -05:00
Jim Bankoski	4e04fa6dea	Merge "make vp9_coef_encodings const"	2014-12-22 15:05:25 -08:00
Jim Bankoski	fc954c7c03	Merge "remove static initializers for partition tree"	2014-12-22 13:49:57 -08:00
Jim Bankoski	d6d431c476	Merge "Revert "Revert "Removal of legacy zbin_extra / zbin_oq_value."""	2014-12-22 13:43:56 -08:00
Jim Bankoski	fba0ead543	Merge "Tokenization without huge tables."	2014-12-22 13:36:38 -08:00
Jim Bankoski	a5f7d78a06	make vp9_coef_encodings const Change-Id: I28a3d342a4a4b23e02a0f47bb8037c4403f71d61	2014-12-22 13:35:56 -08:00
Jingning Han	d0f2377027	Revert "Revert "Removal of legacy zbin_extra / zbin_oq_value."" This reverts commit `9946ee23e0`. Fix the ssse3 asm function. Change-Id: I07f77a63aa98087626e45c4e87aa5dcafc0b0b07	2014-12-22 10:09:25 -08:00
Jim Bankoski	4b8c6d96ec	Tokenization without huge tables. Change-Id: Iff528c4b7528cc70320343b3a7ce07a92b024dfd	2014-12-22 08:42:52 -08:00
Jim Bankoski	17ee87b46c	convert extra bit cat structure to const statics Change-Id: Idb257e78dab2339ab1f41c3c82e537bc23e90b65	2014-12-22 06:57:50 -08:00
Jim Bankoski	3d94b9bf24	Merge "resolve visual studio warnings around initializers"	2014-12-19 15:18:04 -08:00
Jim Bankoski	dd4275e498	resolve visual studio warnings around initializers Change-Id: Id2ad4fb24242f7ca8fa7a152f0889fded4113613	2014-12-19 12:38:25 -08:00
Jingning Han	1b5d612b5d	Merge "Add a guard on intra mode skip control for RTC mode"	2014-12-19 11:03:00 -08:00
Jingning Han	9c93307c10	Merge "Remove ARF mode entries from THR_MODES array in non-RD mode"	2014-12-19 11:02:51 -08:00
Jingning Han	cb01baa0fa	Merge "Rework mode search threshold update for RTC coding mode"	2014-12-19 11:02:40 -08:00
Jingning Han	a8e6d4d041	Merge "Properly store the tx_size of selected intra mode"	2014-12-19 11:02:37 -08:00
Paul Wilkins	9946ee23e0	Revert "Removal of legacy zbin_extra / zbin_oq_value." This reverts commit `e9b586e21b`. Change-Id: I5b36e6727da6c05278d97e2c37b80c109f79bed4	2014-12-19 15:02:58 +00:00
Paul Wilkins	8ac3f9adaa	Merge "Removal of legacy zbin_extra / zbin_oq_value."	2014-12-19 03:37:02 -08:00
James Zern	b32ba09d35	Merge "make vp9 encoder static initializers thread safe"	2014-12-18 18:48:30 -08:00
Jim Bankoski	cd60930814	make vp9 encoder static initializers thread safe Change-Id: If2d0888d13ebe52bc7c3b16f16319408a86ab6de	2014-12-18 15:50:46 -08:00
Jingning Han	6ec0ef6691	Add a guard on intra mode skip control for RTC mode This commit adds a guard condition to the intra mode test skip control in RTC coding mode. If all inter modes are skipped, force the encoder to check intra mode. It avoids situations where the encoder processes without properly assigning required mode information. Change-Id: Ibb349fee997d6584ce901d08b06e8df3ca9c01b1	2014-12-18 12:00:27 -08:00
Paul Wilkins	e9b586e21b	Removal of legacy zbin_extra / zbin_oq_value. zbin extra / zbin_oq_value was widely passed around, hence removal touches a lot of code. Change-Id: Idc94359735b60c38a160e4385ae09d5ca8b6b8e5	2014-12-18 16:49:11 +00:00
Paul Wilkins	60e9b731cf	Remove mode dependent zbin boost. Initial patch to remove get_zbin_mode_boost() and cpi->zbin_mode_boost. For now sets a dummy value of 0 for zbin extra pending a further clean up patch. Change-Id: I64a1e1eca2d39baa8ffb0871b515a0be05c9a6af	2014-12-18 16:45:52 +00:00
Paul Wilkins	2e39817f5e	Merge "Improve motion detection for low complexity regions."	2014-12-18 08:38:21 -08:00
Jingning Han	dd0602e01c	Remove ARF mode entries from THR_MODES array in non-RD mode The alternate reference frame is disabled in non-RD mode. No need to keep the related entries in the THR_MODES array. Change-Id: I53386f4bb1c6284f582801f27246c5edf55bc24b	2014-12-17 17:13:15 -08:00
Jingning Han	455514a683	Rework mode search threshold update for RTC coding mode In RTC coding mode, the alternate reference frame modes and compound inter prediction modes are disabled. This commit reworks the related mode search threshold update process to skip interacting with these coding modes. It provides about 1.5% speed-up for speed -6 on average. vidyo1 16551 b/f, 40.451 dB, 6261 ms -> 16550 b/f, 40.459 dB, 6190 ms nik720p 33316 b/f, 38.795 dB, 6335 ms -> 33310 b/f, 38.798 dB, 6237 ms mmmoving 33265 b/f, 41.055 dB, 7176 ms -> 33267 b/f, 41.064 dB, 7084 ms dark720 33329 b/f, 39.729 dB, 11235 ms -> 33331 b/f, 39.733 dB, 10731 ms Change-Id: If2a4090a371cd28f579be219c013b972d7d9b97f	2014-12-17 15:56:01 -08:00
Yaowu Xu	a16f075375	Corrected value range of --cpu-used for vp9 This commit removes undefined value options of cpu-used for VP9 and changed vpxenc prompt to reflect the usable range of [-8,8] Change-Id: Ib80fef3dbb6ec9aabac45ed13e8ab6fbaf94f55e	2014-12-17 15:18:01 -08:00
Jim Bankoski	fd96deb06c	remove static initializers for partition tree Could have problem with 2 encoders. Change-Id: I92d326933c00fee688f77b54acf467ca5a8516bc see: https://code.google.com/p/webm/issues/detail?id=900&thanks=900&ts=1418843841	2014-12-17 11:41:06 -08:00
Jingning Han	56a8bc54a6	Properly store the tx_size of selected intra mode Use a temporary variable to store the transform size associated with the best intra mode and restore the mode_info if the overall best mode is intra mode. Change-Id: I2606e0061ad32f91b095462902b1eb734b128eea	2014-12-17 09:25:14 -08:00
Jingning Han	cc8a11d8a1	Merge "Set second ref frame to be NONE in key frame coding"	2014-12-17 09:24:39 -08:00
Paul Wilkins	b76312124d	Deleted unused #define FAST_MOTION_MV_THRESH no longer referenced. Change-Id: Idee6ee5a59ba330904c42b20c9ec35b6fc16f7a2	2014-12-17 14:59:22 +00:00
Jingning Han	200d93545e	Merge "Fix intra mode update process in vp9_pick_inter_mode"	2014-12-16 17:04:04 -08:00
Jingning Han	01613aa753	Set second ref frame to be NONE in key frame coding This commit explicitly set the second reference frame type to be NONE in key frame coding mode. This fixes a subtle dependency of reference motion vector used by next inter frame on mode_info reset before key frame coding. Change-Id: I5ff0359753fdc9992b0bfe889490f7a32d7d5f6a	2014-12-16 15:49:58 -08:00
Frank Galligan	5fdd0f1fe0	Merge "Revert "Revert "Add support for setting byte alignment."""	2014-12-16 15:14:17 -08:00
Jingning Han	581c8dbd33	Merge "Initialize best_tx_size with invalid value"	2014-12-16 10:01:03 -08:00
Jingning Han	b47f9c5802	Merge "Use right shift to replace division in vp9_pick_inter_mode"	2014-12-16 09:26:51 -08:00
Paul Wilkins	b6c75c5a8d	Improve motion detection for low complexity regions. Where there is very subtle motion, especially when combined with low spatial complexity, the codec sometimes fails to quickly pick up the ambient motion field. Once it has been established though the field propagates well using Nearest and Near MV. This patch looks specifically at the case where the Nearest and Near have not been established as non zero vectors and in this case discounts the cost of searching for a new vector in the rd code. This will almost certainly have some implications in terms of encode speed but it should be possible to mitigate the impact in a subsequent using first pass stats and the local spatial complexity. Average results for test sets approximately neutral. Change-Id: I44a29e20f11f7ab10f8c93ffbdc50183d9801524	2014-12-16 17:22:54 +00:00
Peter de Rivaz	e3d19bfc63	Fix for crash in highbitdepth rt mode Change 72141 introduced a new use of vp9_avg_4x4. This call needs to switch to using vp9_highbd_avg_4x4 when performing high bitdepth encodes. Change-Id: I6a8ba4b62f8a75d0a917b365a55245e2f0438ea1	2014-12-16 10:55:49 +00:00
Jingning Han	df3e3ab6ff	Fix intra mode update process in vp9_pick_inter_mode When multiple intra modes are tested, the previous mode info update process may overwrite the selected best intra mode and make the final selection use an inter mode. This commit fixes this issue by moving the mode_info reset outside the intra mode search loop. Change-Id: I15ed4288a6b3cb0832104a5e6d5d9a25cd1a5b2b	2014-12-15 17:52:09 -08:00
Jingning Han	c2c7596fc7	Initialize best_tx_size with invalid value If vp9_pick_inter_mode works properly, it should at least check one coding mode and hence get best_tx_size assigned a valid value. There is no need to initialize best_tx_size with a legitimate value before starting the mode search. Change-Id: Ic0496cd89672ea9c2c512a9bd1da952190af9cba	2014-12-15 12:58:34 -08:00
Jingning Han	83e2c62aba	Use right shift to replace division in vp9_pick_inter_mode Make the variable reduction_fac log2 based and explicitly use right shift when computing intra_cost_penalty. Change-Id: I208f1fb879a02debb3b3fc64f9fd06260dcf1c86	2014-12-15 12:48:07 -08:00
Frank Galligan	c4f7079ad4	Revert "Revert "Add support for setting byte alignment."" This reverts commit `91471d6aad`. Fixes the compile issues if post_proc is enabled. Change-Id: Ib40a15ce2c194f9b5adfa65a17ab01ddf60f5a59	2014-12-15 12:20:37 -08:00
Jingning Han	eefe869291	Simplify rate-distortion modeling function Use left shift to replace one multiplication. The computation outcome remains identical. Change-Id: I1e1737af0a245de0d2a2bde10f0c171477199fc1	2014-12-15 11:51:16 -08:00
Paul Wilkins	91471d6aad	Revert "Add support for setting byte alignment." Fails to compile. Bad calls to vp9_alloc_frame_buffer and vp9_realloc_frame_buffer in postproc.c This reverts commit `399823b6f5`. Change-Id: I29f0e173f8e185d3a303cfdb17813e1eccb51e3a	2014-12-15 11:54:13 +00:00
Frank Galligan	9c2601eb68	Merge "Add support for setting byte alignment."	2014-12-12 15:47:11 -08:00
James Zern	4d40a046da	Merge "vp9: move encoder-only member from common"	2014-12-12 14:28:55 -08:00
Marco	7f59cff53d	Merge "Allow for 4x4 prediction blocks for key frame, speed 6."	2014-12-12 14:27:31 -08:00
Frank Galligan	399823b6f5	Add support for setting byte alignment. Add support for setting byte alignment on the Y, U, and V plane of the reference buffers. The byte alignment must be a power of 2, from 32 to 1024. A value of 0 sets legacy alignment. Change-Id: I7c1399622f7aa68e123646369216b32047dda73d	2014-12-12 13:34:36 -08:00
James Zern	72ece1308b	vp9: move encoder-only member from common allow_comp_inter_inter VP9_COMMON -> VP9_COMP Change-Id: I6d9dc25d1cdd7e2ab62f5be69cd9fa883d21dbb6	2014-12-12 11:17:44 -08:00
Jingning Han	3e0793b80b	Merge "Fix PICK_MODE_CONTEXT index in non-RD coding mode"	2014-12-12 09:16:01 -08:00
Jingning Han	e2c2a65695	Fix PICK_MODE_CONTEXT index in non-RD coding mode This commit fixes a bug in the PICK_MODE_CONTEXT index for horizontal partition case. The compression performance change is less than 0.01% level, since most blocks are selected to use square block size in RTC coding mode. Change-Id: I67effc18ae8795fccdd82a55f4efc609fa5cb3e1	2014-12-11 17:21:24 -08:00
Marco	7e99cd2a9b	Allow for 4x4 prediction blocks for key frame, speed 6. For key frame under variance source partition: 4x4 prediction blocks may be selected when variance of 8x8 block is very high (threshold is set fairly high for now). Testing on some RTC clips shows this helps to reduce some ringing artifacts on key frame. Encoded key frame size increases about ~10%. Key frame PSNR increases about ~0.1-0.2dB. Change-Id: I56e203fac32ea6ef69897fb3ea269c59cb50d174	2014-12-11 15:36:16 -08:00
Jingning Han	811c74cdfa	Merge "Replace division with bit shift in choose_partitioning"	2014-12-11 13:30:03 -08:00
Jingning Han	d9892e846f	Merge "Refactor choose_partitioning computing scheme"	2014-12-11 11:14:07 -08:00
Jingning Han	d5c396a902	Replace division with bit shift in choose_partitioning This commit explicitly uses the bit shift operation instead of division for computing block variance. Change-Id: Id19c0ff27dd1d1ae4aceee6657e1aad0d406bd74	2014-12-11 11:06:57 -08:00
Jingning Han	377d2f027a	Refactor choose_partitioning computing scheme This commit refactors the choose_partitioning function. It removes redundant memset calls and makes the encoder to calculate variance value per block only when it is needed. It reduces the average runtime cost of choose_partitioning by 60%. Overall it reduces speed -6 runtime by 2-5%. Change-Id: I951922c50d901d0fff77a3bafc45992179bacef9	2014-12-11 09:33:40 -08:00
Paul Wilkins	65cfb808d0	Merge "Substantial restructuring of AQ mode 2."	2014-12-10 10:44:27 -08:00
Jingning Han	ad19724f1a	Merge "Use use_prev_frame_mvs flag for ref mv search branch"	2014-12-10 09:25:12 -08:00
Jingning Han	6fc289b9c0	Merge "Refactor update_state_rt"	2014-12-10 09:25:05 -08:00
Jingning Han	8bd88a3c83	Merge "Make RTC coding flow support sub8x8 in key frame coding"	2014-12-10 09:24:56 -08:00
Jingning Han	4cda7a1a9a	Merge "Cosmetic naming change"	2014-12-10 09:05:34 -08:00
Jingning Han	fb3cc0ed57	Merge "Take out redundant setting of mode_info from set_block_size"	2014-12-10 09:05:26 -08:00
Jingning Han	161f636809	Merge "Remove unused rd cost calculation from nonrd_use_partition"	2014-12-10 09:05:18 -08:00
Jingning Han	0cac834b5a	Use use_prev_frame_mvs flag for ref mv search branch Replace error_resilient flag with use_prev_frame_mvs in vp9_pick_inter_mode reference motion vector search selection. This effectively turns off the simplified ref mv search in the settings of frame resizing, even if error-resilient mode is off. Change-Id: I7fed814ee7bc0cb419a03b846e0fc2de46ba7686	2014-12-09 18:18:40 -08:00
Jingning Han	e728678c50	Refactor update_state_rt Update the frame motion vector only if previous frame motion vector is needed for next frame reference motion vector. Change-Id: Ica50f9d7b46ad4f815bba0d9e30f5546df29546f	2014-12-09 15:35:49 -08:00
Jingning Han	225cdef665	Make RTC coding flow support sub8x8 in key frame coding This commit enables the use of sub8x8 blocks in RTC key frame encoding. It requires the block size to be preset and will decide the coding mode and encode the bit-stream. Change-Id: I35aaf8ee2d4d6085432410c7963f339f85a2c19b	2014-12-09 11:34:58 -08:00
Jingning Han	4bacaab46d	Cosmetic naming change Rename set_modeinfo_offsets as set_mode_info_offsets, to be more consistent with naming convention. Change-Id: I68ca1f36c4a78127d9439a50c1506a2afd07927d	2014-12-09 10:32:04 -08:00
Jingning Han	f051a7beab	Take out redundant setting of mode_info from set_block_size The later encoding process will take the top-left block's mode_info for pre-determined block size. Change-Id: I76a90f9ce7f3b2dbc2975b52442114e461c465b5	2014-12-09 10:27:18 -08:00
Paul Wilkins	e68c8dcfd2	Substantial restructuring of AQ mode 2. The restructure moves the decision into the rd pick modes loop and makes a decision based at the 16x16 block level instead of only the 64x64 level. This gives finer granularity and better visual results on the clips I have tested. Metrics results are worse than the old AQ2 especially for PSNR and this mode now falls between AQ0 and AQ1 in terms of visual impact and metrics results. Further tuning of this to follow. It should be noted that if there are multiple iterations of the recode loop the segment for a MB could change in each loop if the previous loop causes a change in the complexity / variance bin of the block. Also where a block gets a delta Q this will alter the rd multiplier for this block in subsequent recode iterations and frames where the segmentation is applied. Change-Id: I20256c125daa14734c16f7cc9aefab656ab808f7	2014-12-09 15:10:52 +00:00
Jingning Han	1395ded2a7	Remove unused rd cost calculation from nonrd_use_partition The per block rd cost calculation is not needed when partition size is preset. Change-Id: Ie5575248bbffb584e908aa13097f697ace6ec747	2014-12-08 18:45:19 -08:00
James Zern	c38d0490b3	Merge "Changes to assembler for NASM on mac."	2014-12-08 12:55:06 -08:00
hkuang	f925e5ce0f	Merge "Improve the performance by caching the left_mi and right_mi in macroblockd."	2014-12-08 10:24:17 -08:00
Paul Wilkins	127f65531b	Merge "Use average mb energy from first pass in AQ2 test."	2014-12-08 09:01:39 -08:00
Frank Galligan	0f8e8330eb	Merge "Fix potential integer overflow."	2014-12-07 21:37:39 -08:00
James Zern	da464c483f	Merge "vp9 asserts: fix compile warning"	2014-12-05 21:09:42 -08:00
James Zern	3db785facc	Merge "vp9: fix frame-parallel encoding"	2014-12-05 19:00:48 -08:00
Deb Mukherjee	0d367474d0	Merge "Some internal-stats, vp9-highbitdepth bug fixes"	2014-12-05 17:49:52 -08:00
James Zern	6db81fd629	vp9: fix frame-parallel encoding the flag in the header wasn't being set based on the encoder configuration in non-intra only mode broken since: `fbc2fbf` Adding oxcf temp variable. Change-Id: Ib4cff9901889824bc4e68d7f0f6deb1e41df2f53	2014-12-05 17:44:46 -08:00
Jingning Han	bd6bfb93b0	Merge "Remove redundant rdcost reset"	2014-12-05 17:35:07 -08:00
Jingning Han	296afb9440	Merge "Fix a motion search skip condition in vp9_pick_inter_mode"	2014-12-05 17:35:04 -08:00
Jingning Han	3d8d1e374e	Merge "Remove redundant MB_MODE_INFO reset from vp9_pick_mode_inter"	2014-12-05 16:59:50 -08:00
hkuang	382f86f945	Improve the performance by caching the left_mi and right_mi in macroblockd. This improve the deocde performance by ~2% on Nexus 7 2013. Change-Id: Ie9c4ba0371a149eb7fddc687a6a291c17298d6c3	2014-12-05 16:25:42 -08:00
James Zern	616b3a810f	vp9 asserts: fix compile warning string literal to int within an assert Change-Id: I76a173f96b9add5bf27c3f5ad5d72c6f30e51629	2014-12-05 16:20:42 -08:00
Jingning Han	17bedc54f5	Remove redundant rdcost reset The initial reset of this_rdc in vp9_pick_inter_mode is not needed, since it will be re-assign when used. Change-Id: Ic0e12d741cbab292fc214c1eabb48b129af7839b	2014-12-05 16:06:17 -08:00
Jingning Han	eadffb2d6e	Fix a motion search skip condition in vp9_pick_inter_mode Compare the current best mode rate-distortion cost with the skip threshold to decide if performing motion search. Change-Id: Ia071824f8dd3b7db485f424692a485a2da6a1a9f	2014-12-05 15:58:36 -08:00
Jingning Han	732d57c2b5	Remove redundant MB_MODE_INFO reset from vp9_pick_mode_inter Change-Id: I0222f7abc61202f4a83b117bbfb042ada6304562	2014-12-05 15:51:11 -08:00
hkuang	eaa6deee5b	Merge "Merge set_prev_mi function into encoder function."	2014-12-05 15:12:50 -08:00
Deb Mukherjee	37448d3e1f	Some internal-stats, vp9-highbitdepth bug fixes Change-Id: I0363d98f6f6558a43276aec48f27dca37c93f5ad	2014-12-05 13:40:50 -08:00
Jingning Han	6ae829088f	Merge "Remove redundant vp9_zero in choose_partitioning"	2014-12-05 11:47:58 -08:00
Jingning Han	69a9dc5cd3	Merge "Enable conditional skip path in rd_pick_intra_sby_mode"	2014-12-05 11:25:30 -08:00
Jingning Han	62c7356098	Merge "Use hybrid RD and non-RD coding flow for key frame coding"	2014-12-05 11:25:19 -08:00
Jingning Han	9d88b30854	Remove redundant vp9_zero in choose_partitioning It makes the overall speed -6 about 2% faster with no compression performance change. Change-Id: I680a967b421caa2c5a5cdb821311c4726a2df45a	2014-12-05 10:39:39 -08:00
Jingning Han	74ded4863e	Enable conditional skip path in rd_pick_intra_sby_mode These speed-up features for key frame coding are only turned on in the settings of hybrid non-RD and RD mode decision. It provides about 20% speed-up to the hybrid key frame coding at the expense of certain compression performance loss. For vidyo1, the key frame coding statistics are changed 9838F, 35.020 dB, 61677 us -> 9920F, 34.834 dB, 47556 us Overall rtc set compression performance is down by -0.257%. Change-Id: I0025447fda26bb7855e982955642b5f55d71b51f	2014-12-05 09:36:09 -08:00
Jingning Han	07711e9b27	Use hybrid RD and non-RD coding flow for key frame coding When block size is below 16x16, the encoder swap from non-RD to RD mode for key frame coding. This largely brough back the key frame compression performance. For vidyo1 at 1000 kbps, the key frame coding statistics are changed 9978F, 34.183 dB, 36807 us -> 9838F, 35.020 dB, 61677 us As compared to the full RD case 7187F, 34.930 dB, 214470 us The overall rtc set coding performance (single key frame setting) is improved by 1.5%. Change-Id: I78a4ecf025d7b24ec911e85be94e01da05e77878	2014-12-05 09:35:27 -08:00
Yunqing Wang	a3a4a34c60	Merge "vp9_ethread: the tile-based multi-threaded encoder"	2014-12-05 08:23:49 -08:00
Frank Galligan	4c4d7261e4	Fix potential integer overflow. ioc found a potential integer overflow in the rate control. This is related to https://code.google.com/p/webm/issues/detail?id=821 Change-Id: Ib6c4acd6e964972f932fce7490592eb134f2b7ea	2014-12-05 08:02:12 -08:00
Paul Wilkins	bb6e47c1c9	Merge "Increase strength of AQ1."	2014-12-05 04:11:43 -08:00
Debargha Mukherjee	15cf55b3ca	Merge "Use the RTC optimizations when in high bitdepth mode."	2014-12-04 19:22:27 -08:00
Debargha Mukherjee	4bfde1071e	Merge "Corrected the renaming of CONFIG_VP9_HIGH ro CONFIG_VP9_HIGHBITDEPTH."	2014-12-04 15:52:35 -08:00
Peter de Rivaz	a306bd8274	Use the RTC optimizations when in high bitdepth mode. Change 72193 made the encoder behave differently when configured with and without high bitdepth. This change means the same algorithm is used for both. Change-Id: I707a44a94afca773a9e0c2f7ebeeea83030257c5	2014-12-04 15:48:42 -08:00
hkuang	62de07c8c6	Merge set_prev_mi function into encoder function. Change-Id: Ifcf2efbb232ea4cabcdebbe77e0820d121e4a6da	2014-12-04 14:44:23 -08:00
Yunqing Wang	eba9c762a1	vp9_ethread: the tile-based multi-threaded encoder Currently, VP9 supports column-tile encoding, which allows a frame to be encoded in multiple column tiles independently. The number of column tiles are set by encoder option "--tile-columns". This provides a way to encode a frame in parallel. Based on previous set of patches, this patch implemented the tile- based multi-threaded encoder. Each thread processes one or more tiles. Usage: For HD clips: --tile-columns=2 --threads=1/2/3/4 While using 4 threads, tests showed that the encoder achieved 2.3X - 2.5X speedup at good-quality speed 3, and 2X speedup at realtime speed 5. Change-Id: Ied987f8f2618b1283a8643ad255e88341733c9d4	2014-12-04 11:21:34 -08:00
Deb Mukherjee	4f860dba78	Merge "Fixes a missing highbitdepth convolve call bug"	2014-12-04 11:19:59 -08:00
Adrian Grange	9065da983f	Merge "Free motion vector array before re-allocating"	2014-12-04 07:08:37 -08:00
Peter de Rivaz	f610f88be4	Corrected the renaming of CONFIG_VP9_HIGH ro CONFIG_VP9_HIGHBITDEPTH. Change 71789 renamed CONFIG_VP9_HIGH to CONFIG_VP9_HIGHBITDEPTH. However, one use of CONFIG_VP9_HIGH was missed. Change-Id: I0ebb9c71380c6d810a25708d15471abf9533e695	2014-12-04 11:01:46 +00:00
Tom Finegan	7339681ee9	Merge "sse2 visual studio build fix"	2014-12-03 18:05:03 -08:00
Deb Mukherjee	70d9dbd818	Fixes a missing highbitdepth convolve call bug Bug was introduced in https://gerrit.chromium.org/gerrit/#/c/72122/ Change-Id: Idb500ea619a30e7bc50e22fb8ee03be5282f41db	2014-12-03 17:48:50 -08:00
Adrian Grange	b56451f488	Merge "Use memset for initialization to 0"	2014-12-03 16:50:39 -08:00
Deb Mukherjee	6615706af2	sse2 visual studio build fix Change-Id: Id8c8c3be882bcd92afea3ccec6ebdf3f208d28ef	2014-12-03 16:35:26 -08:00
Adrian Grange	979ee6e4c9	Free motion vector array before re-allocating Change-Id: I0c39136d67e1e83020d61f86b062a04182ec9b00	2014-12-03 16:07:32 -08:00
Marco	fb20a07c36	Merge "Increase delta-qp for aq=3 mode, after key frame."	2014-12-03 16:03:06 -08:00
Adrian Grange	73caef0500	Use memset for initialization to 0 Change-Id: I714ca22b5d51016bf8b035cf457616c707257641	2014-12-03 15:22:02 -08:00
Marco	a047e7cdf8	Increase delta-qp for aq=3 mode, after key frame. For a few refresh periods after key frame, use large qp-delta to increase quality ramp-up. Change-Id: Ib5a150fb2dfa6bafd0d4e6b5d28dfd0724b61319	2014-12-03 13:04:45 -08:00
Jingning Han	17176cd452	Fix indent in source_var_based_partition_search_method Change-Id: I6e5e0571d6967b9b992966336715e35bb97f187e	2014-12-03 12:37:36 -08:00
Jingning Han	8f3db5f22e	Merge "Remove unused ONE_LOOP entry from speed feature"	2014-12-03 11:34:42 -08:00
Jingning Han	228ec17ff2	Merge "Rework coeff probability model update for rtc coding"	2014-12-03 11:34:35 -08:00
Marco	8fd3f9a2fb	Enable non-rd mode coding on key frame, for speed 6. For key frame at speed 6: enable the non-rd mode selection in speed setting and use the (non-rd) variance_based partition. Adjust some logic/thresholds in variance partition selection for key frame only (no change to delta frames), mainly to bias to selecting smaller prediction blocks, and also set max tx size of 16x16. Loss in key frame quality (~0.6-0.7dB) compared to rd coding, but speeds up key frame encoding by at least 6x. Average PNSR/SSIM metrics over RTC clips go down by ~1-2% for speed 6. Change-Id: Ie4845e0127e876337b9c105aa37e93b286193405	2014-12-03 09:18:08 -08:00
Jingning Han	a8d8c0f633	Remove unused ONE_LOOP entry from speed feature Change-Id: I56ead0ebc2491144c4e79e5859b05e126176702c	2014-12-03 09:17:08 -08:00
Jingning Han	8fe50191c6	Rework coeff probability model update for rtc coding This commit reworks the ONE_LOOP_REDUCED coefficient probability model update process. It allows model update for every coefficient across the spectrum at a coarser resolution, instead of performing precise update only for certain subset of probability models. The overall runtime remains nearly same (<1% change) for speed -6. The compression performance is improved by 7.5% in PSNR for speed -5 and 4.57% for speed -6, respectively. Change-Id: Ifb17136382ee7e39a9f34ff4a4f09a753125c8d1	2014-12-03 09:15:25 -08:00
Debargha Mukherjee	99874f55fb	Merge "Reinsert macro to fix issue 884."	2014-12-02 15:32:24 -08:00
Deb Mukherjee	1fbe0c7615	Merge "Fix a warning related to VPX_EFLAG_FORCE_KF check"	2014-12-02 14:03:55 -08:00
Peter de Rivaz	2c886953d1	Reinsert macro to fix issue 884. Change 72056 unfolded some macro definitions, but lost some alternative behaviour required for high bitdepth encodes. This causes the encoder to crash, see issue 884. Change-Id: I8ce4d73c9fe0a3c10ccb86fba210fabc8b2f0ccc	2014-12-02 13:45:26 -08:00
Deb Mukherjee	02941b0df2	Fix a warning related to VPX_EFLAG_FORCE_KF check Fixes a warning in chrome build. Change-Id: I8fa0fd3e7ba1aecf89e5f79ce94cd64ed6a9567c	2014-12-02 11:35:52 -08:00

... 7 8 9 10 11 ...

5518 Commits