generic-library/vpx

Author	SHA1	Message	Date
Yaowu Xu	56d0b36092	Merge "Fixed a bug where no valid partition is allowed"	2014-02-12 10:13:49 -08:00
Yaowu Xu	69a6871904	Fixed a bug where no valid partition is allowed Change-Id: I4d2729dc5c46db2847700256941a66b0957c105d	2014-02-12 09:00:34 -08:00
Dmitry Kovalev	79dd1f8441	Removing vp9_foreach_transformed_block_uv() function. Change-Id: I35ec77b71e6fd686865cead9281e4dd9e9bc9e86	2014-02-11 18:06:00 -08:00
Yunqing Wang	507fd5220b	Enable encode_breakout in real time encoding In real time encoding, we enable encode_breakout to make encoding fast. A speed feature "use_encode_breakout" is defined to set encode_breakout thresholds for different speeds. However, currently, static_thresh is an encoder option. The encode_ breakout can be turned off if user sets static_thresh=0 specifically. The rtc set borg test result: (need to set --static_thresh=1) speed -5, psnr loss -3.543%; speed -4, psnr loss -2.358%; speed -3, psnr loss -0.771%. Encoding speed test: speed -5, 11% - 60% speedup; speed -4, 5.5% - 28% speedup; speed -3, 0.8% - 7% speedup. Change-Id: Icde592ffbe77eac7446f872a2e9eb2051733677b	2014-02-11 15:30:54 -08:00
Dmitry Kovalev	4fff8566f8	Merge "Cleaning up compute_fast_motion_search_level()."	2014-02-11 11:12:29 -08:00
Jingning Han	220e9a932e	Merge "Use more meaningful names for speed features"	2014-02-11 08:49:35 -08:00
Paul Wilkins	f30b323180	Modified Aq1 and Aq2 Aq 1 only updates segment map on kf and arf and only uses 3 segments. With these settings AQ1 is + for most clips in SSIM but negative in psnr. However, the penalty in PSNR is much less than previously. Old version aq1 average results for std hd -20.899% psnr, -5.809% SSIM New version aq1 for std hd -3.57% psnr, +1.23% SSIM Aq2 Now uses only 2 segments and rd. This mode is still slightly negative for most clips on psnr and SSIM but seems to have a much bigger visual impact on several problem clips than aq mode 1. Old results for std hd: -2.578% psnr, -1.151% SSIM New results for std hd: -1.561% psnr, -0.85% SSIM Change-Id: I94f57f8a73121629ce598fb921aad761c1450e1c	2014-02-11 16:27:05 +00:00
Dmitry Kovalev	4a13d53523	Merge "Cleaning up update_stats() function."	2014-02-10 17:30:29 -08:00
Dmitry Kovalev	7e7ae66f74	Merge "Making vp9_activity_masking() static."	2014-02-10 17:29:40 -08:00
Jingning Han	734938dc6b	Use more meaningful names for speed features Use frame_parameter_update to precisely describe the functionality. Change-Id: Ia9a55ba8efef7b987e30d949dd00ac716189bdb9	2014-02-10 15:20:11 -08:00
Yaowu Xu	855070e254	Merged two similar functions to reduce duplication Function encode_rtc_frame_internal() and encode_frame_internal() only differed by a couple of speed features, this commit relocation those difference into the setup of speed features and merged two functions into one to remove duplication. It also fixed a subtle bug super_fast_rtc was used before it was initialized. Change-Id: I234a5a1d11a4450930e5b4943dbab434208d5030	2014-02-10 11:36:42 -08:00
Dmitry Kovalev	1a30a8743b	Making vp9_activity_masking() static. Change-Id: Ic6a733f1fe92458da89c8459c5686ba1e08b92bf	2014-02-08 19:41:37 -08:00
Dmitry Kovalev	e1fdcbcb82	Cleaning up compute_fast_motion_search_level(). Change-Id: I151bd3de689bceb72969120095257c37656db92f	2014-02-07 16:39:40 -08:00
Dmitry Kovalev	6c17ab6384	Cleaning up update_stats() function. Change-Id: I9139210fc6b9878de7844d74dd97784a6d289230	2014-02-07 15:21:31 -08:00
Dmitry Kovalev	9f528c5dbd	Removing redundant is_inter_mode() call. Block type was already detected by is_inter_block() call. Change-Id: I7923ce11b6a0071ce9df8c744a78c816651a15dc	2014-02-05 11:04:53 -08:00
Dmitry Kovalev	b9fea167f9	Removing DBG_PRNT_SEGMAP. Change-Id: I71d85e3455545960938e525ae8aa0a667e1db94c	2014-02-04 16:33:03 -08:00
Dmitry Kovalev	3ffb204360	Merge "Removing ENC_DEBUG."	2014-02-03 17:11:52 -08:00
Yongzhe Wang	513faceaed	Build fix with config internal stats Fixes a build issue when internal stats is enabled Change-Id: I822cc60274e34b5f29ccbaa1f986fb9da6a8de4b	2014-02-03 14:35:48 -08:00
Jim Bankoski	9dec7712ab	static function convert to inline or global vp9_blockd.h Change-Id: Ifdd951f24932839f06d1c700371662511dde6ebe	2014-01-31 19:50:40 -08:00
Jim Bankoski	5ccd193219	Merge "bsize problem 360p"	2014-01-31 16:21:13 -08:00
Dmitry Kovalev	a8a2f22958	Merge "Renaming "mbskip" to "skip"."	2014-01-31 15:52:35 -08:00
Jim Bankoski	1833028681	bsize problem 360p Fixes an assert that crashed for 360p.. Change-Id: I2faf15c93cbdb0e62a27a3b663f0d09ba62774a8	2014-01-31 15:14:02 -08:00
Yaowu Xu	538b1c6d52	Only allow interp_filter change in SWITCHABLE mode This commit added a logic to prevent the inter_filter type from being changed if the default interp_filter mode is not switchable. Also, it sets the default interp_filter to BILINEAR at very and super fast rtc encoding modes Change-Id: Ic41e6d31de29795a4ce536ec79afb01cab6daad3	2014-01-31 15:10:08 -08:00
Yaowu Xu	6a4e2ddabc	Properly merge two different real time modes --rt --cpu-used=-5 uses the progressive rtc mode --rt --cpu-used=-6 uses the new super fast rtc mode Change-Id: Id6469ca996100cdf794a0e42d76430161f22f976	2014-01-31 15:07:51 -08:00
Jim Bankoski	da6b18622f	remove confusing compressor_speed use mode instead Change-Id: I419d7a2dc4b0714ca6ff723c5e824521c150c460	2014-01-31 07:55:19 -08:00
Yaowu Xu	5ebed3e861	Replace inline with INLINE So x86_64-win64-vs11 can build successfully. Change-Id: If354c2ea3921fac8c9b413ed39223e70bc20c535	2014-01-30 11:48:16 -08:00
Yaowu Xu	6f81942f0e	Fix a build issue for --enable-intern-stats Change-Id: Iea7c9fa0726dbf9792eea79e6a05eb8a3c718d45	2014-01-30 08:20:08 -08:00
Yaowu Xu	96dc80da61	Merge "create super fast rtc mode"	2014-01-29 16:36:20 -08:00
Yaowu Xu	08b912b4d1	Merge "Add a strict mode for auto_min_max_partition_size feature"	2014-01-29 16:36:06 -08:00
Yaowu Xu	1ca1186529	Add a strict mode for auto_min_max_partition_size feature In this new mode, the size range is strictly determined by the min and max partition size in neighborhood blocks. Niklas720 encoding time at cpu-used -5 goes from 56250ms to 50676ms, a 10% reduction. Change-Id: I316b0e2ac967ff3fad57b28d69c0ec80b7d8b34e	2014-01-29 14:51:51 -08:00
Dmitry Kovalev	b107f2c470	Renaming "mbskip" to "skip". Change-Id: I27a30b43eae026a77f92958e2238d02d9cdf7832	2014-01-29 14:48:42 -08:00
Dmitry Kovalev	70cde0af3d	Removing ENC_DEBUG. Change-Id: I101017621003314f000a454725ea13fc9db43177	2014-01-29 12:58:57 -08:00
Dmitry Kovalev	b00eb5c464	Finally removing vp9_setup_interp_filters() function. Change-Id: If446225afbb49f6033c2a4516a37c377de6f70f7	2014-01-29 11:29:34 -08:00
Jim Bankoski	ea8aaf15b5	create super fast rtc mode This patch only works if the video is a width and height that are both a multiple of 32.. It sets every partition to 16x16, and does INTRADC only on the first frame and ZEROMV on every other frame. It always does does the largest possible transform, and loop filter level is set to 4. Was ~20% faster than speed -5 of vp8 Now 20% slower but adds motion search ( every block ), nearest, near and zeromv The SVC test was changed because - while this realtime mode produces bad quality albeit quickly, it isn't obeying all the rules it should about which frames are available. Change-Id: I235c0b22573957986d41497dfb84568ec1dec8c7	2014-01-29 08:39:39 -08:00
Paul Wilkins	c382136122	Trap divide by 0. Trap divide by 0 that could occur with a 0 rate target in aq mode COMPLEXITY_AQ. Change-Id: I034514f512b2a0db470ae8d37ea395278bf473cf	2014-01-29 14:59:04 +00:00
Dmitry Kovalev	e5b31a1d8c	Decoupling set_ref_ptrs() and vp9_setup_interp_filters(). Change-Id: I8d17867a4772554cbba2bd113cc5b4c99d50146d	2014-01-27 16:00:20 -08:00
Dmitry Kovalev	4264c93844	Renaming INTERPOLATION_TYPE to INTERP_FILTER. Corresponding renames: subpel_kernel => interp_kernel vp9_get_filter_kernel() => vp9_get_interp_kernel() pred_filter_type => pred_interp_filter adaptive_pred_filter_type => adaptive_pred_interp_filter mcomp_filter_type => interp_filter read_interp_filter_type() => read_interp_filter() write_interp_filter_type() => write_interp_filter() fix_mcomp_filter_type() => fix_interp_filter() Change-Id: I1fa61fa1dc81ebbf043457c3ee2d8d4515bee6d3	2014-01-24 15:57:28 -08:00
Yaowu Xu	ebe160568b	Prevent invaid memory access Reading second motion vector only when it has a second ref_frame Change-Id: Ica72c1cd955832e15ceccda5e5a17b0bfcd83044	2014-01-22 09:10:44 -08:00
Jingning Han	b461c0884e	Deprecate best_mv from encoder This commit deprecates the use of best_mv from encoding and bit-stream writing stages. It hence removes the definition from MACROBLOCKD. Change-Id: I8e5302775a2aa4a18900726df407bff881f2dfb1	2014-01-17 17:15:34 -08:00
Jingning Han	98b01c038f	Rename pick_sb_modes to rd_pick_sb_modes Keep naming consistency for RD and non-RD mode decision functions, respectively. Change-Id: I904282b675fc511a46c13cb1f8287aa5d1c8ac94	2014-01-16 18:05:42 -08:00
Jingning Han	2f52decd22	Inter-frame non-RD mode decision This commit setups a test framework for real-time coding. It enables a light motion search for non-RD mode decision purpose. Change-Id: I8bec656331539e963c2b685a70e43e0ae32a6e9d	2014-01-16 12:35:04 -08:00
Jim Bankoski	73cd22f8d4	As you go mbmi->skip_coeff Calculate the skip_coeff as part of the encode process, rather than checking the eobs after the fact with another pass. Change-Id: Ib41b139e96a97dee30e4b993b4cc53d86337128d	2014-01-14 17:58:25 -08:00
Dmitry Kovalev	a8bb1ffd89	Merge "Reusing get_frame_new_buffer() function."	2014-01-14 14:40:48 -08:00
Dmitry Kovalev	f3728f20ea	Merge "Cleaning up vp9_encodeframe.c."	2014-01-14 14:14:49 -08:00
Dmitry Kovalev	9855b6e510	Reusing get_frame_new_buffer() function. Change-Id: Iac5c5aeaef62a4095a60d91285d2c7ad717db0fb	2014-01-13 14:04:56 -08:00
Jingning Han	4f969ccc1b	Merge "Enable skipping reference frame check in rd loop"	2014-01-10 16:00:56 -08:00
Dmitry Kovalev	3df5c54ad7	Cleaning up vp9_encodeframe.c. Change-Id: I6d9f595249dc71752abe16c042d3b07aa2e4248d	2014-01-10 13:48:44 -08:00
Jingning Han	d66c748635	Enable skipping reference frame check in rd loop This commit allows encoder to compare the SAD cost associated with the best motion vector predictor, per frame. If one reference frame has this cost more than 4 times of the best SAD cost given by other reference frames, skip NEARESTMV, NEARMV, ZEROMV mode check of this reference frame. This setting is turned on in speed 2 and above. Compression quality change in speed 2: derf -0.014% yt -0.097% hd -0.023% stdhd 0.046% It reduces the speed 2 runtime of test sequences: pedestrian_area_1080p 4000 kbps 310763 ms -> 303595 ms bluesky_1080p 6000 kbps 259852 ms -> 251920 ms Change-Id: I7f59cf79503d51836d61d56d50dc5bdf0e502e22	2014-01-09 18:25:53 -08:00
Dmitry Kovalev	0ecd583d8d	Cleanups around cpi->common. Change-Id: I0c42a729038d0f4cb7bc07f587d066fcb1dfe9d9	2014-01-08 14:51:00 -08:00
Dmitry Kovalev	7b496783c2	Merge "Adding get_ref_frame_buffer() function."	2014-01-07 09:56:06 -08:00
Dmitry Kovalev	ff655420b5	Reusing ROUND_POWER_OF_TWO macro. Change-Id: I064ba32d5358bfbf080a4300fc1793b345080006	2014-01-06 17:38:57 -08:00
Dmitry Kovalev	7919bf6afd	Adding get_ref_frame_buffer() function. Encapsulating direct references to lst_fb_idx, gld_fb_idx, alt_fb_idx. Change-Id: I7e65ba3f131286e433e6651970c5647311fa4687	2014-01-06 14:50:54 -08:00
Dmitry Kovalev	ba41e9d459	Adding RefBuffer struct. Adding RefBuffer to simplify reference buffer management. The struct has a pointer to image data and scale factors relative to the current frame. Change-Id: If38eb1491ff687cc11428aee339f3e052e2c5d9e	2014-01-03 15:21:55 -08:00
Dmitry Kovalev	2336853be1	Merge "Pre planes configuration cleanup."	2014-01-03 15:04:53 -08:00
Dmitry Kovalev	a8ba34d299	Pre planes configuration cleanup. Change-Id: I1d50f8701d9c9dedb84387a773a3e9b4daaad720	2014-01-03 12:50:57 -08:00
Dmitry Kovalev	5b04962cf4	Merging best_ref_mv and second_best_ref_mv into best_ref_mv[2]. Change-Id: If04b57828847cee09a79c94e1098d1aa4990ea0d	2014-01-03 11:31:00 -08:00
Dmitry Kovalev	f16b186b8e	Reusing vp9_get_skip_context() function in encoder. Change-Id: Ic0345622115941f49b6a568c7b8154ba892cbf0d	2014-01-02 18:29:56 -08:00
Dmitry Kovalev	1e8b5bf4ac	Merge "Removing vp9_findnearmv.{h, c} files."	2013-12-26 13:38:38 -08:00
Dmitry Kovalev	b3b9f4a4d0	Merge "Using single struct to represent scale factors."	2013-12-20 11:22:02 -08:00
Dmitry Kovalev	47d482cb0a	Merge "Reusing FRAME_COUNTS in the encoder."	2013-12-20 10:56:31 -08:00
Dmitry Kovalev	987810ad95	Removing vp9_findnearmv.{h, c} files. Moving all code from that files to vp9_mvref_common.{h, c}. Change-Id: Ibc4afcb8cea6847166ff411130e93611ebe63b20	2013-12-19 17:39:57 -08:00
Dmitry Kovalev	a3fbcc88bb	Using single struct to represent scale factors. Moving back to scale_factors struct. We don't need anymore x_offset_q4 and y_offset_q4 because both values are calculated locally inside vp9_scale_mv function. Change-Id: I78a2122ba253c428a14558bda0e78ece738d2b5b	2013-12-19 16:06:33 -08:00
Dmitry Kovalev	40e173ac42	Merge "vp9_encode_frame() cleanup."	2013-12-19 15:37:13 -08:00
Dmitry Kovalev	f06187f125	vp9_encode_frame() cleanup. Change-Id: I82ecbe7fe0baa890ce251043f3c7159188c00665	2013-12-19 14:28:42 -08:00
Dmitry Kovalev	431aaefbec	Replacing 1 << mi_{width, height}_log2() with lookup tables. Change-Id: Iba91ff1e797a83517e2cd7c3ab86cba39f39415b	2013-12-19 13:43:45 -08:00
Dmitry Kovalev	e4b85c9ed8	Merge "Adding get_zbin_mode_boost() function."	2013-12-19 11:03:23 -08:00
Dmitry Kovalev	4e84ad1fc6	Reusing FRAME_COUNTS in the encoder. Replacing: intra_inter_count, y_mode_count, y_uv_mode_count. Change-Id: I5d70f73288af6effe6176e26400138067a2ae2a3	2013-12-18 18:52:58 -08:00
Dmitry Kovalev	829ec56b47	Merge "Reusing FRAME_COUNTS in the encoder."	2013-12-18 18:27:08 -08:00
Dmitry Kovalev	de49895804	Adding get_zbin_mode_boost() function. Change-Id: Ia356178d6a3c40b512d3123390781ef94dec72d6	2013-12-18 10:39:08 -08:00
Dmitry Kovalev	1d23a6594b	Reusing FRAME_COUNTS in the encoder. Change-Id: I6ab9fe2326ebbadf0dd10cca9f66cf8277e3f43b Replacing: comp_inter_count, single_ref_count, comp_ref_count.	2013-12-16 20:12:47 -08:00
Deb Mukherjee	1e59cbf23b	Rate control changes on active_worst_quality Various cleanups and refactoring. Removes feedback of active worst qaulity and uses last_q instead to make the interface cleaner. Active worst quality is now decided only once for a frame being coded in the beginning based on last_q and other stats. Also, adds other cleaups on last_q to store also the last_q for altref frames, and reduces the altref interval a little. The output does change a little. derfraw300: +0.224% (global psnr) stdhdraw250: +0.442% (global psnr) Change-Id: Ie634cdc032697044c472dd0fe79c109b3e7f9767	2013-12-16 17:08:16 -08:00
hkuang	fb53409d2a	Merge "Remove border extension in intra frame prediction."	2013-12-16 14:48:54 -08:00
Dmitry Kovalev	4f0a381b49	Merge "Reusing nmv_frame_counts from FRAME_COUNTS in encoder."	2013-12-16 14:10:13 -08:00
hkuang	25e5552630	Remove border extension in intra frame prediction. Change-Id: Id677df4d3dbbed6fdf7319ca6464f19cf32c8176	2013-12-16 14:05:58 -08:00
Dmitry Kovalev	1a23a34419	Merge "Cleaning up encode_sb() and encode_b() functions."	2013-12-16 12:21:38 -08:00
Jingning Han	3b5a90bd86	Enable adaptive pred filter type for sub8x8 This commit enables an adaptive prediction filter type selection for sub8x8 block sizes. In speed 1, it re-uses the filter type of collocated 8x8 block if it is tested in the rate-distortion optimization loop, for the sub8x8 blocks. Otherwise, it runs the normal test over all the three filter types. In speed 2, it re-uses the 8x8 block's prediction filter type, if available. Otherwise, force it to be EIGHTTAP. Compression and speed performance wise: speed 1 derf -0.266% yt -0.138% bus at 2000 kbps: 33766ms -> 30451ms (10% speed-up) football at 600 kbps: 48173ms -> 43786ms (9% speed-up) speed 2 derf -0.026% yt +0.134% bus at 2000 kbps: 18973ms -> 17698ms (6% speed-up) football at 600 kbps: 26748ms -> 25096ms (6% speed-up) Change-Id: I77e097533b969fd3472147225fa79fc98095d342	2013-12-12 17:54:34 -08:00
Dmitry Kovalev	efe5b28c09	Reusing nmv_frame_counts from FRAME_COUNTS in encoder. Change-Id: Iadf2fcc9a5bfa5d02fc166f31963be1cc814831c	2013-12-11 15:16:10 -08:00
Dmitry Kovalev	b8dc52f4a3	Cleaning up encode_sb() and encode_b() functions. Trying to make encode_sb() more similar to write_modes_sb() and decode_mode_sb() because essentially all branching logic should be the same. Change-Id: Ib7dec7b48fce29418142abad4d1dcfdb1c770735	2013-12-11 14:38:22 -08:00
Yaowu Xu	014b9c70f7	Merge "Fix a bug"	2013-12-10 16:06:42 -08:00
Yaowu Xu	e0f82c6ed6	Fix a bug In evaluating partition split case, Wrong partition size is used in calling partition_plane_context(). This commit change to use the correct sub partition size. The incorrect partition size used were causing an ASAN error in unit test. Change-Id: Iab695b764bc51cc61580075f2ae4001421132362	2013-12-10 14:34:32 -08:00
Dmitry Kovalev	e18eb7721e	Merge "Renaming comp_pred_mode to reference_mode."	2013-12-10 10:52:34 -08:00
Dmitry Kovalev	08c48ddc01	Renaming comp_pred_mode to reference_mode. Change-Id: I83ffed2b1878a35ac35f07f9ee74309adc9c7b11	2013-12-09 15:13:34 -08:00
Dmitry Kovalev	cb92f4f042	Renaming vp9_get_pred_context_tx_size() function. Change-Id: Ia6d6f4dfb1fd1ec0f8ba53796b59a802e9d7881d	2013-12-06 15:31:06 -08:00
Jim Bankoski	dcb17eaefc	Merge "Disable early exit based on distortion in lossless"	2013-12-06 14:46:28 -08:00
Yaowu Xu	f8c06fb2ac	Disable early exit based on distortion in lossless In lossless coding, distortion is always 0. Early exit based on this metric was incorrect. This CL also changed to use best_rd instead of distortion as the metric for easly exit as requested by Jim. Change-Id: I8ef3e407ac03b4abc3283b273f936a68fad5c2ab	2013-12-06 13:37:55 -08:00
Dmitry Kovalev	63963f51ef	Renaming reference mode context calculation function. Renames: vp9_get_pred_context_comp_inter_inter => vp9_get_reference_mode_context vp9_get_pred_prob_comp_inter_inter => vp9_get_reference_mode_prob Change-Id: I3bbb69481e6b0c848028667c9269f567f293d3bd	2013-12-06 11:23:01 -08:00
Dmitry Kovalev	6fd71e1b09	vp9_get_pred_context_intra_inter() clean up. Renaming: vp9_get_pred_context_intra_inter => vp9_get_intra_inter_context vp9_get_pred_prob_intra_inter => vp9_get_intra_inter_prob Change-Id: I2c1affea2e84f4e616137c6df82adb11c7845781	2013-12-05 17:01:03 -08:00
Dmitry Kovalev	3eb0170ea6	Using lookup to determine tx_size in encode_superblock(). Change-Id: I68d6217db6f67da15380cd59ec5eda0c44da7d34	2013-12-05 12:25:03 -08:00
Dmitry Kovalev	f00d157c12	Moving eob array to the encoder. In the decoder we don't need to save eobs, we can pass eob as an argument. That's why removing eob arrays from VP9Decompressor and TileWorkerData, and moving eob pointer from macroblockd_plane to macroblock_plane. Change-Id: I8eb919acc837acfb3abdd8319af63d1bbca8217a	2013-12-03 17:59:32 -08:00
Alex Converse	962fc2e1e7	Disable partitioning in the dominant subsampling direction. E.g. disable vertical partioning for 4:2:2. Until we come up with something better to do with the chroma block size, this prevents an assert error. Change-Id: I9394fb3f14ec1343abc3ad4769de208e6278f285	2013-12-02 13:38:11 -08:00
Dmitry Kovalev	d3a2e55af4	Removing qcoeff buffers from the decoder. We only need qcoeff buffers in the encoder. Reducing TileWorkerData struct and VP9Decompressor struct sizes by 24K. Change-Id: Id148868461f7ffa3d3dd634b371503ae9c57e207	2013-11-26 18:52:10 -08:00
Dmitry Kovalev	7ba7a5f817	Merge "Removing redundant call of vp9_init_mbmode_probs()."	2013-11-25 16:08:42 -08:00
Dmitry Kovalev	e8af3db88a	Merge "Renaming COMPPREDMODE_TYPE enum and its members."	2013-11-25 10:59:08 -08:00
Paul Wilkins	644bd87e8e	In frame Q adjustment experiment. The idea here is to allow "in frame" adjustment of the final Q value used to encode each SB64, using segmentation. There is also adjustment of the rd mult in regions of overspend. Activated using aq_mode=2 Change-Id: I2f140cd898c9f877c32cd6d2e667f5e11ada4b1c	2013-11-25 10:22:55 -08:00
Dmitry Kovalev	fb9c19c62d	Renaming COMPPREDMODE_TYPE enum and its members. List of renames: COMPPREDMODE_TYPE => REFERENCE_MODE SINGLE_PREDICTION_ONLY => SINGLE_REFERENCE COMP_PREDICTION_ONLY => COMPOUND_REFERENCE HYBRID_PREDICTION => REFERENCE_MODE_SELECT (like TX_MODE_SELECT) NB_PREDICTION_TYPES => REFERENCE_MODES Change-Id: If723dabe9435325d0165dcd028142a2c78b417b4	2013-11-22 16:35:37 -08:00
Dmitry Kovalev	75e4377d81	Using partition counts from FRAME_COUNTS struct in the encoder. Change-Id: I6c3d47b00acabe7ffba22ffc73741173aa9a0bff	2013-11-22 14:26:39 -08:00
Dmitry Kovalev	c90b6bb101	Removing redundant call of vp9_init_mbmode_probs(). This function is called from vp9_setup_past_independence() which is called before the modified piece of code. Moving reset of inter_mode_probs into vp9_init_mbmode_probs() for consistency. Change-Id: Ib188e8798e1fbe15407fd501406761b746fdda95	2013-11-20 21:56:38 -08:00
Guillaume Martres	b00057c88a	Merge "vpxenc: add --aq-mode flag to control adaptive quantization"	2013-11-20 08:13:28 -08:00
Yaowu Xu	1c61e1960d	Move vp9_extend.{h,c} from common to encoder Since they used in encoder only. This commit also re-order includes for the files that include vp9_extend.h Change-Id: I929fc113f2135d3198cd1fc6a17434e5a2f8a459	2013-11-18 12:43:36 -08:00
Dmitry Kovalev	5380739a87	Removing vp9_encodeintra.{h, c} files. There was only one function in *.c file, so moving it to vp9_encodemb.c. Change-Id: I728859d08b3d6c05c33c1c5b21f0ea1d0e0f83af	2013-11-15 12:17:16 -08:00
Guillaume Martres	17084657e6	vpxenc: add --aq-mode flag to control adaptive quantization Change-Id: I57e1ad4bed3487df12893ced77c49093f8755706	2013-11-15 19:42:20 +01:00
Jingning Han	b6b9143218	Dual buffer encoding for intra modes Overall change (using dual buffer scheme for superblocks of both inter and intra modes) reduces speed 2 runtime: bluesky_1080p at 6000kbps: 263553ms -> 257441ms riverbed_1080p at 8000kbps: 233230ms -> 225308ms. Change-Id: Idf8d70f768a4b0d97b2a8506372c57b7b4022119	2013-11-13 12:57:03 -08:00
Dmitry Kovalev	3f3d14e1d3	Moving q_index from MACROBLOCKD to MACROBLOCK. Moving because q_index is used only by encoder. Change-Id: I0b96175614ed4fd3d76ee56a0ba36258e1e896f6	2013-11-12 18:13:19 -08:00
Jingning Han	e69461593d	Merge "Enable dual buffer rd search and encoding scheme"	2013-11-12 18:11:41 -08:00
Dmitry Kovalev	73a5cbeba4	Merge "Using max_tx_size instead of bsize when possible."	2013-11-12 16:54:30 -08:00
Dmitry Kovalev	3a2ea76469	Merge "Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK."	2013-11-12 15:59:28 -08:00
Jingning Han	34b6abefa2	Enable dual buffer rd search and encoding scheme This commit enables the dual buffer rate-distortion optimization and encoding scheme. It stacks the original transform coefficients, quantized levels, and reconstructed coefficients, in the rate- distortion optimization search process, hence eliminates the need to re-run residual generation, forward transform, and quantization in the encoding stage. Change-Id: I011bfad3a59a380a869ee552e91dae0394ec492e	2013-11-11 18:32:55 -08:00
Jingning Han	3b3aea6834	Allocate dual buffer sets for encoding Allocate memory space of dual buffer sets that store the coeff, qcoeff, dqcoeff, and eobs. Connect the pointers of macroblock_plane and macroblockd_plane to the actual buffer in use accordingly. Change-Id: I2f0b5f482ca879fae39095013eaf8901db20a5a4	2013-11-11 16:24:39 -08:00
Dmitry Kovalev	3551e25099	Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK. We use {sb, mb, b, ab}_index only inside encoder, so moving them into appropriate data structure. Change-Id: Ib5c1036716354d9d321e11a60c1634c1cb8f9716	2013-11-11 15:58:57 -08:00
Paul Wilkins	84b3b03705	Removed unused rate parameter. Change-Id: I6e4a266fdbad1d222eb45d45b67bbb82d091821a	2013-11-07 09:59:45 +00:00
Dmitry Kovalev	7b011c5467	Replacing mi_{width,height}_log2 with num_8x8_blocks_{wide,high}_lookup. Change-Id: I04c55daef89bca2b85cb7db0850f9b052abc5a7c	2013-11-06 13:34:23 -08:00
Dmitry Kovalev	4a96e64dc2	Using max_tx_size instead of bsize when possible. Change-Id: I246364bc4270ca13aefb4bc3445bcf102b3170dc	2013-11-05 17:36:43 -08:00
Adrian Grange	a0a6590e0f	Remove unused member variables from VP9_COMP Removed three members from the VP9_COMP data structure: inter_zz_count, gf_bad_count, gf_update_recommended. These were part of the VP8 real-time mode implementation that was removed from the initial VP9 codecbase. Change-Id: I866b083b88ef02c74837277d50ce532ca88492f3	2013-11-04 11:01:43 -08:00
Dmitry Kovalev	6761872e49	Replacing (SWITCHABLE_FILTERS + 1) with SWITCHABLE_FILTER_CONTEXTS. Change-Id: I9781a62bc1a4cd9176554d1271d87dbcafda9cb0	2013-10-30 14:40:34 -07:00
Dmitry Kovalev	fa1ac00aee	Making get_tx_counts() similar to get_tx_probs(). Change-Id: I5b17f40e515c4bcf9ebef5380270a214af4e0115	2013-10-28 19:52:38 -07:00
James Zern	58a0f6dbdd	vp9: add TileInfo replaces use of cur_tile_mi_(row\|col)_(start\|end) by VP9_COMMON, making it less stateful and more reusable for parallel tile decoding Change-Id: I1df09382b4567a0e5f4434825d47c79afe2399be	2013-10-28 20:54:43 +01:00
James Zern	3ffa41aae3	Merge changes If9b16f7d,I75aab21c,I9cbb768c,If5cea3d3,I96940657,I025595d8,Ie0bc3935,I3ebb172d * changes: vp9: remove partition+entropy contexts from common vp9: add above/left_context to MACROBLOCKD vp9: add above/left_seg_context to MACROBLOCKD vp9: add above/left_context to encoder vp9: add above/left_seg_context to encoder vp9: pass entropy context directly to set_skip_context vp9: pass context directly to partition functions vp9/decode: add alloc_tile_storage()	2013-10-28 12:45:11 -07:00
James Zern	ce2c337261	vp9: add above/left_context to encoder Change-Id: If5cea3d389bb1135ee490d273e57cc2c43325d01	2013-10-25 22:01:14 +02:00
James Zern	d72dfab296	vp9: add above/left_seg_context to encoder Change-Id: I969406574c6658936e9f6db5752f1b295025aab5	2013-10-25 22:01:14 +02:00
James Zern	d2bf696ee0	vp9: pass entropy context directly to set_skip_context this will allow for separate storage to be used in tile decoding Change-Id: I025595d83118bdc82a545dae69bc6602e8d2a6e3	2013-10-25 22:01:13 +02:00
James Zern	88d79eabdc	vp9: pass context directly to partition functions update_partition_context / partition_plane_context: this will allow for separate storage to be used in tile decoding Change-Id: Ie0bc393531ab7e9d2ce35c95111849b294aad4ed	2013-10-25 22:01:13 +02:00
Dmitry Kovalev	237ce8724a	Adding get_frame_new_buffer() function to replace duplicated code. Change-Id: I6e0e19231a48364c1de7dfab730b121ab227f111	2013-10-24 12:20:35 -07:00
Dmitry Kovalev	8001ed71ed	Merge "Renaming vp9_short_fdct4x4 and vp9_short_walsh4x4."	2013-10-24 10:08:42 -07:00
Dmitry Kovalev	ad867fe237	Renaming INTERPOLATIONFILTERTYPE to INTERPOLATION_TYPE. Change-Id: I1868fb75ed88bfa65c1c2ca24677d65f2894d713	2013-10-23 17:45:52 -07:00
Dmitry Kovalev	fd724f13b0	Renaming vp9_short_fdct4x4 and vp9_short_walsh4x4. For consistency with idct function names. Renames: vp9_short_fdct4x4 -> vp9_fdct4x4 vp9_short_walsh4x4 -> vp9_fwht4x4 Change-Id: Id15497cc1270acca626447d846f0ce9199770f58	2013-10-23 14:28:39 -07:00
Dmitry Kovalev	f6d870f7ae	Merge "Inlining set_partition_seg_context function."	2013-10-21 14:43:37 -07:00
Dmitry Kovalev	a0be71c703	Inlining set_partition_seg_context function. We used set_partition_seg_context() only before calls to: 1. update_partition_context() 2. partition_plane_context() Moving these functions from vp9_blockd.h to vp9_onyxc_int.h and inlining set_partition_seg_context into them. After that it is not necessary to have {above, left}_seg_context fields in MACROBLOCKD struture, so removing them also. Change-Id: I4723f59e1c8f3788432b7f51185d8d747b3a97f9	2013-10-21 12:02:19 -07:00
Jingning Han	deb10ac6f9	Merge "Make memory alloc in pick_mode_context bsize aware"	2013-10-21 11:45:59 -07:00
Yaowu Xu	db1045f2c0	Merge "Use lookup table to simplify logic"	2013-10-18 12:55:24 -07:00
Jingning Han	72033fcff8	Make memory alloc in pick_mode_context bsize aware This commit makes the buffer allocation of zcoeff_blk array in pick_mode_context block size aware. It calculates the number of 4x4 blocks in the partition and assigns the memory space accordingly. This process (and the uninitialization) is done once for each encoding pass. It allows memory copy of smaller buffer when possible. For football at 600kbps, the runtimes improve by about 1%: speed 1, 45961ms -> 45472ms speed 2, 23863ms -> 23598ms Change-Id: Id2ca24906fa89f46fa5fe742ec4b8efc2a61f877	2013-10-18 12:42:44 -07:00
Dmitry Kovalev	a8ffa96e9b	Passing block index explicitly instead of using get_sb_index(). That makes decoder and encoder (only bitstream writing part) a little bit simpler and faster. Moving get_sb_index() function to the encoder. Change-Id: Ie91aaeefd69c84b085948267b33556a7666c6278	2013-10-18 11:02:32 -07:00
Yaowu Xu	30d1ec38a7	Use lookup table to simplify logic In deciding the transform size for a given block in a given TX_MODE. Change-Id: I1467da09853e69cd320695a24c04e19a2f3d04fb	2013-10-17 14:54:16 -07:00
Guillaume Martres	ff3aada6cb	Add missing calls to emms in the adaptive quantization code Also avoid using floating-point operations when adaptive quantization is disabled. Change-Id: I54936d7afb661df049cdb3ecd246d04ac2a9d8d3	2013-10-17 14:04:41 -07:00
Guillaume Martres	5b984b36ca	Use a separate MODE_INFO stream for each tile column This should make parallel tiles decoding easier to implement. Change-Id: I6226456dd11f275fa991e4a7a930549da6675915	2013-10-16 16:24:48 -07:00
Guillaume Martres	acf0d56f0b	Get rid of "this_mi", use "mi_8x8[0]" everywhere instead The only case where they were intentionally pointing to different structures was in mbgraph, and this didn't have the expected behavior because both of these pointers are used interchangeably through the code Change-Id: I979251782f90885fe962305bcc845bc05907f80c	2013-10-16 16:24:03 -07:00
Guillaume Martres	e55f60240a	Implement variance-based adaptive quantization This should be similar to what x264 does with --aq-mode 1. It works well with clips like parkjoy and touhou (http://x264.nl/developers/Dark_Shikari/LosslessTouhou.mkv). At low bitrates, the segmentation signaling overhead may negate the benefits of this feature. (PGW) Default changed to feature OFF to allow provisional merge. Change-Id: I938abf9bb487e1d4ad3b0264ea03d9826275c70b	2013-10-16 11:55:13 +01:00
Adrian Grange	12b2c712ca	Merge "Updated encoder to handle intra-only frames"	2013-10-15 17:19:28 -07:00
Jingning Han	9b05f23e05	Merge "Make vp9_zero use cases of consistent format"	2013-10-15 16:49:05 -07:00
Alexander Voronov	d6a59fb12c	Updated encoder to handle intra-only frames Updated the encoder to handle frames that are coded intra-only. Intra-only frames must be non-showable, that is, the "show frame" flag must be set to 0 in the frame header. Tested by forcing the ARF frames to be coded intra- only. Note: The rate control code will need to be modified to account for intra-only frames better than they are currently handled. Change-Id: I6a9dd5337deddcecc599d3a44a7431909ed21079	2013-10-15 16:44:02 -07:00
Jingning Han	355db16734	Merge "Remove unused variable vp9_64x64_zeros"	2013-10-15 16:24:34 -07:00
Jingning Han	fd1cd89da6	Merge "Remove unused comment"	2013-10-15 16:23:44 -07:00
Jingning Han	c8e48f4b02	Make vp9_zero use cases of consistent format Remove the semicolon in the definition of vp9_zero macro. Make all the use cases of vp9_zero of consistent format. Change-Id: Ibaf9751e8595872b12766381a93d185a4d90df8f	2013-10-15 16:12:21 -07:00
Jingning Han	9115d84509	Remove unused variable vp9_64x64_zeros Remove the unused variable vp9_64x64_zeros from vp9_encodeframe_. Change-Id: I34bfdcab9a9105440ad05154c1e0516e70258785	2013-10-15 11:53:46 -07:00
Jingning Han	9622271033	Remove unused comment Change-Id: I2d96940fae4c7a16661a43c2bf6907d8b1c1a127	2013-10-15 11:45:38 -07:00
Dmitry Kovalev	a4585285ed	Removing unused 8x4 transform from the encoder. Change-Id: Icbcf68b5b685a56f255ebc3859c9692accdadf9e	2013-10-15 11:27:28 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Jingning Han	03fe08ca30	Deprecate the use of PARTITION_INFO from encoder Use b_mode_info to store the inter prediction mode of sub8x8 block, in replacement of the use of partition_info. Remove redundant buffer update for partition_info. For bus_cif at 2000 kbps, this seem to make speed 0 about 1% faster. Change-Id: Id1b3be45e75a24fb4b42335ac480c23e440978f6	2013-10-09 09:23:52 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Dmitry Kovalev	9dba044be2	Merge "Giving consistent names to IDCT/IWHT functions."	2013-10-05 23:44:05 -07:00
Jim Bankoski	de5cb8b140	vp9_encodeframe.c cpplint issues resolved Change-Id: Id9d837e062d9c4a94def4b4ed1f49a67c75d3618	2013-10-04 14:37:31 -07:00
Dmitry Kovalev	3a0602578e	Giving consistent names to IDCT/IWHT functions. The idea is to have the following names for each transform size: vp9_idct4x4_add vp9_idct4x4_1_add vp9_idct4x4_10_add vp9_idct4x4_16_add vp9_idct8x8_add vp9_idct8x8_1_add vp9_idct8x8_10_add vp9_idct8x8_64_add etc for 16x16, 32x32 The actual list of renames in this patch: vp9_idct_add_lossless -> vp9_iwht4x4_add vp9_short_iwalsh4x4_add -> vp9_iwht4x4_16_add vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add vp9_idct_add -> vp9_idct4x4_add vp9_short_idct4x4_add -> vp9_idct4x4_16_add vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1	2013-10-04 14:17:06 -07:00
Jingning Han	a55625873f	Merge "Refactor inter mode rate-distortion search"	2013-10-03 12:19:53 -07:00
Jingning Han	11abab356e	Refactor inter mode rate-distortion search This commit separates the rate-distortion optimization loop of superblocks from that of sub8x8 blocks. This allows better design rate-distortion optimization search loop for each setting. It also removes the use of SPLITMV and I4X4_PRED therein. No performance change in speed 0 settings. For bus@CIF at 2000kbps, the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster). The overall compression performance on derf changed by -0.021%. Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the overall coding efficiency goes up by 1.629% for derf, 1.236% for yt. Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4	2013-10-03 11:36:49 -07:00
Dmitry Kovalev	9250d1529c	Using vp9_zero instead of vpx_memset. Change-Id: I9a0d0e9c3459954aa7b9c68f92cc5d56385ebd18	2013-10-03 10:59:36 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to insure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Jim Bankoski	f1d3e5e4d6	make use last partition consider motion This commit causes use last partition to consider whether a 64x64 has motion that might make a new partitioning worth while. Change-Id: I3a57bedef4f3cd961fadbfa96651c206fa36da4a	2013-10-03 10:22:39 +01:00
Paul Wilkins	ece99b3da0	Merge "Improved auto_partition_range."	2013-10-03 02:06:13 -07:00
Dmitry Kovalev	0a5e9ee054	Moving get_token_alloc function from common to the encoder. Also renaming mb_row -> mi_row, mb_col -> mi_col arguments and calculate mb_rows/mb_cols values from mi_rows/mi_cols. Change-Id: I6919a279f560648e23bc9a12f507d17c21ffd5d7	2013-10-01 11:54:10 -07:00
Jingning Han	195061feda	Fix rectangular partition check in speed 1 Make encoder skip rectangular partition check in speed 1 and above, when early termination was triggered in partition split. Thanks Guillaume (gmartres@) for catching this issue. This change makes bus_cif at 2000kbps speed 1 runtime goes down from 25612ms to 23438ms (about 9% speed-up), at the expense of -0.235% performance down. Change-Id: I98613fad081a261d30d5fa206f934ca70601c180	2013-09-30 12:14:36 -07:00
Paul Wilkins	65b93c7e52	Improved auto_partition_range. The code now takes into account temporal and spatial information to determine the partition size range, but the frequency counts have been removed. The net effect is similar in quality but about 10% faster. Change-Id: I39a513fb79cec9177b73b2a7218f0da70963ae95	2013-09-30 11:32:57 +01:00
Paul Wilkins	a76caa7ff4	Alter Speed 3. This patch deletes the variance based speed three partitioning. Speed 3 now uses the same partitioning method as speed 2 but with some stricter conditions. The speed and quality are now somewhere between speeds 2 and 4 whereas before it was worse in both than speed 4. Change-Id: Ia142e7007299d79db3ceee6ca8670540db6f7a41	2013-09-30 11:26:46 +01:00
Dmitry Kovalev	eda4e24c0d	Using is_inter_block and has_second_ref functions. Change-Id: I60dee58a4fd24d3c4f3c101a49d30e217309f43a	2013-09-25 19:03:04 -07:00
Dmitry Kovalev	d0365c4a2c	Replacing txfm with tx. Renaming txfm_stepdown_count to tx_stepdown_count and max_txfm_size to max_tx_size. Change-Id: Ifc173e22c78240e561a57c4c741b64b1b8fc6fef	2013-09-24 17:24:35 -07:00
Dmitry Kovalev	450cbfe53a	Cleaning up vp9_update_nmv_count function. Using best_mv[2] array instead of two separate variables. Change-Id: Iefa0a41f5c42c42f2c66cef26750da68405f0f25	2013-09-24 15:55:49 -07:00
Yaowu Xu	fe533c9741	Merge "Change to prevent invalid memory access"	2013-09-24 10:37:17 -07:00
Dmitry Kovalev	f24b9b4f87	Merge "Adding best_mv[2] array instead of two variables."	2013-09-24 10:17:53 -07:00
Yaowu Xu	92a29c157f	Change to prevent invalid memory access After change of MI context storage , mi_8x8[] pointer may be null for a block outside of image border. The commit changes to access the data only after validation of mi_row and mi_col. Change-Id: I039c4eb486a228ea9d8e5f35ab9ae6717d718bf3	2013-09-24 08:36:59 -07:00
Jingning Han	a517343ca3	Enable per transformed block zero coeffs forcing This commit enables forcing all coefficients zero per transformed block, when its rate-distortion cost is lower than regular coeff quantization. The overall performance improvement (including its parent patch on calculating rd cost per transformed block) at speed 1: derf: 0.298% yt: 0.452% hd: 0.741% stdhd: 0.006% Change-Id: I66005fe0fd7af192c3eba32e02fd6d77952accb5	2013-09-23 10:39:35 -07:00
Dmitry Kovalev	bb5e2bf86a	Adding best_mv[2] array instead of two variables. Change-Id: I584fe50f73879f6a72fada45714ef80893b6d549	2013-09-20 17:08:53 +04:00
Yaowu Xu	014acfa2af	fix integer overflow errors Change-Id: I76f440a917832c02d7a727697b225bac66b99f56	2013-09-19 08:14:26 -07:00
Yaowu Xu	a783da80e7	Silence a bunch of MSVC warnings Change-Id: I16633269582a640809dca27572bbe99efa6369fc	2013-09-17 12:08:51 -07:00
Yaowu Xu	eeae6f946d	fix a problem where an invalid mv used in search The commit added reset of pred_mv at the beginning of each SB64x64 partition mv search, also limited the usage of pred_mv only when search on the largest partition is already done. This is to fix a crash at speed 1/2 encoder where an invalid mv is used in mv search. Change-Id: I39010177da76d054e3c90b7899a44feb2e3a5b1b	2013-09-16 12:49:27 -07:00
Jingning Han	c4826c5941	Adaptive motion search control This commit enables adaptive constraint on motion search range for smaller partitions, given the motion vectors of collocated larger partition as a candidate initial search point. It makes speed 0 runtime of bus at CIF and 2000 kbps goes from 167s down to 162s (3% speed-up), at 0.01dB performance gains. In the settings of speed 1, this makes the runtime goes from 33687 ms to 32142 ms (4.5% speed-up), at 0.03dB performance gains. Compression performance wise, it gains at speed 1: derf 0.118% yt 0.237% hd 0.203% stdhd 0.438% Change-Id: Ic8b34c67810d9504a9579bef2825d3fa54b69454	2013-09-13 13:58:10 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Jim Bankoski	d09abfa9f7	Merge "resolve clang issue : implicit convert tx_mode -> tx_size"	2013-09-11 13:40:11 -07:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), eliminating unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Jingning Han	cb24406da5	Merge "Remove the use of uninitialized_safe in encode_sb_"	2013-09-10 12:05:22 -07:00
Yunqing Wang	0607abc3dd	Stop partition checking when distortion is small If the current obtained distortion is very small, which happens for static image case, we pick the current partition type without further split checking. This won't affect regular videos. For static videos, we got 10%~12% encoding speed gain. PSNR was better for some clips, and worse for others. Overall it was even. Change-Id: If787a57bedf46fc595ca4f5ded2b0c0a69e9fdef	2013-09-10 10:13:24 -07:00
Paul Wilkins	4f660cc018	Modified mode skip functionality. A previous speed feature skipped modes not used in earlier partitions but this not longer worked as intended following changes to the partition coding order and in conjunction with some other speed features (Especially speed 2 and above). This modified mode skip feature sets a mask after the first X modes have been tested in each partition depending on the reference frame of the current best case. This patch also makes some changes to the order modes are tested to fit better with this skip functionality. Initial testing suggests speed and rd hit count improvements of up to 20% at speed 1. Quality results. (derf -1.9%, std hd +0.23%). Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1	2013-09-10 13:30:10 +01:00
Paul Wilkins	901c495482	Added extra check to rd_auto_partition_range() Added check that the returned max and minimum are valid in bottom and right border cases. Change-Id: I2d6cdc9b5f04c7d0ff512ddcf3228331e028bf9b	2013-09-10 13:29:23 +01:00
Jingning Han	18c780a0ff	Remove the use of uninitialized_safe in encode_sb_ Initialize the probability model context with default value in encode_sb. Change-Id: Id826114024dfc21c7ef41aea9f4a0316d4a5cb95	2013-09-09 15:41:16 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Jim Bankoski	9faa7e8186	resolve clang issue : implicit convert tx_mode -> tx_size Change-Id: Ifc9da470358f58e800e3d0d70a565b61e5f7834a	2013-09-08 07:17:12 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of a pointer to a MODE_INFO struct and a "in the image" flag. The MODE_INFO structs are now stored as a stream, eliminating unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Paul Wilkins	49317cddad	Attempt to fix speed 4 Speed 4 fixed partition size. Use fixed size unless it does not fit inside image, in which case use the largest size that does. Change-Id: I250f7a80506750dd82ab355721624a1344247223	2013-09-03 17:46:25 +01:00
Dmitry Kovalev	e80bf802a9	Merge "Renaming txfm_size to tx_size."	2013-08-29 12:30:18 -07:00
Dmitry Kovalev	b62ddd5f8b	General code cleanup. Switching from mi_{width, height}_log2 and b_{width, height}_log2 to num_8x8_blocks_{wide, high} and num_4x4_blocks_{wide, high}. Removing redundant code, adding const. Change-Id: Iaab2207590fd24d0b76999071778d1395dc5cd5d	2013-08-28 12:22:37 -07:00
Dmitry Kovalev	851a2fd72c	Renaming txfm_size to tx_size. Change-Id: I752e374867d459960995b24d197301d65ad535e3	2013-08-27 19:47:53 -07:00
Dmitry Kovalev	7b95f9bf39	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the encoder. Change-Id: I62bb07c377f947cb72fac68add7a6b199e42c6b9	2013-08-27 11:05:08 -07:00
Dmitry Kovalev	ba10aed86d	Merge "Using num_8x8_* lookup tables instead of mi_*_log2."	2013-08-27 10:49:36 -07:00
Dmitry Kovalev	78e670fcf8	Merge "Renaming D27 to D207."	2013-08-27 10:03:57 -07:00
Dmitry Kovalev	b25589c6bb	Using num_8x8_* lookup tables instead of mi_*_log2. Change-Id: I8a246b3d056c98be614d05a90bc261e2441ffc10	2013-08-26 14:22:54 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
Dmitry Kovalev	50ee61db4c	Renaming D27 to D207. I've already renamed d27_predictor to d207_predictor but forgot about the corresponding constant. Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857	2013-08-23 17:33:48 -07:00
Yaowu Xu	13930cf569	Limit mv range to be based on partition size Previous change `c4048dbd` limits the mv search range assuming max block size of 64x64, this commit change the search range using actual block size instead. Change-Id: Ibe07ab02b62bf64bd9f8675d2b997af20a2c7e11	2013-08-23 15:43:57 -07:00
Jingning Han	9655c2c7a6	Merge "Fix rectangular partition check flag"	2013-08-22 18:59:18 -07:00
Dmitry Kovalev	33104cdd42	Merge "vp9_encodeframe.c cleanup."	2013-08-22 18:07:35 -07:00
Jingning Han	84f3b76e1c	Fix rectangular partition check flag Put rectangular partition check flag change according to the rd costs of NONE and SPLIT partition types under the speed feature. Change-Id: If681e1e078a8d43d86961ea4b748da5cd1b6c331	2013-08-22 17:15:01 -07:00
Dmitry Kovalev	604022d40b	vp9_encodeframe.c cleanup. Removing unused get_sbuv_perpixel_variance function, using has_second_ref/ is_inter_block functions, organizing includes. Change-Id: I016de4af12fbbb8b4ece26a70759b2392651b095	2013-08-22 15:50:51 -07:00
James Zern	40ae02c247	rename LOG2_* defines to *_LOG2 gets rid of a mix of styles Change-Id: I3591d312157bc6f53a25438bf047765c671fd8a8	2013-08-22 14:45:24 -07:00
Jingning Han	01a37177d1	Refactor rd_pick_partition for parameter control This commit changes the partition search order of superblocks from {SPLIT, NONE, HORZ, VERT} to {NONE, SPLIT, HORZ, VERT} for consistency with that of sub8x8 partition search. It enable the use of early termination in partition search for all block sizes. For ped_area_1080p 50 frames coded at 4000 kbps, it makes the runtime goes down from 844305ms -> 818003ms (3% speed-up) at speed 0. This will further move towards making the in-search partition types configurable, hence unifying various speed-up approaches. Some speed 1 and 2 features are turned off during the refactoring process, including: disable_split_var_thresh using_small_partition_info Stricter constraints are applied to use_square_partition_only for right/bottom boundary blocks. Will bring back/refine these features subsequently. At this point, it makes derf set at speed 1 about 0.45% higher in compression performance, and 9% down in run-time. Change-Id: I3db9f9d1d1a0d6cbe2e50e49bd9eda1cf705f37c	2013-08-22 12:36:02 -07:00
Deb Mukherjee	8b810c7a78	Fixes on feature disabling split based on variance Adds a couple of minor fixes, which may be absorbed in Jingning's patch. Thanks to Guillaume for pointing these out. Also adjusts the thresholds for speed 1 and 2 to 16 and 32 respectively, to keep quality drops small. Results: -------- derfraw300: threshold = 16, psnr -0.082%, speedup 2-3% threshold = 32, psnr -0.218%, speedup 5-6% stdhdraw250: threshold = 16, psnr -0.031%, speedup 2-3% threshold = 32, psnr -0.273%, speedup 5-6% Change-Id: I4b11ae8296cca6c2a9f644be7e40de7c423b8330	2013-08-22 07:05:44 -07:00
Scott LaVarnway	f39bf458e5	Merge "Initialize mb_skip_coeff before picking modes"	2013-08-22 06:26:04 -07:00
Scott LaVarnway	94bfbaa84e	Initialize mb_skip_coeff before picking modes It appears that the above/left mb_skip_coeff used during the pick modes, is left over from the previously encode frame. This patch initializes the flag to the default value of zero. Change-Id: Ida4684cc99611d6e3e82628db35ed717e28ce550	2013-08-22 08:51:04 -04:00
Dmitry Kovalev	048ccb2849	Cleaning up sum_intra_stats function. Using size_group_lookup table and better variable names. Change-Id: I6e67f2ce091845db43ace7d21b7ae31c6f165aec	2013-08-21 16:25:02 -07:00
Adrian Grange	ce28d0ca89	Fix typos and minor stylistic cleanup Change-Id: I32e43474e8651ef2eb181d24860a8f118cfea7bf	2013-08-21 08:45:42 -07:00
Paul Wilkins	e8923fe492	Changes to auto partition size selection. Changes to code to auto select a partition size range based on data from spatial neighbors. Now looks at the sb_type in each 8x8 block of above and left SB64. The effect on speed 1 is now weaker giving better quality but less speed gain. Now also used in speed 2. Change-Id: Iace33a97d5c3498dd2a9a8a4067351941abcbabc	2013-08-20 14:05:39 +01:00
Yaowu Xu	c4048dbdd3	Change to limit the mv search range As the pixel values beyond image border are duplicates of pixels on edge, the change limits the mv search range, any mv beyond the limits no longer produce new/different prediction values as entire block with pixels used for subpel interpolation are outside image border. Change-Id: I4c6fdf06e33c1cef1489f5470ce0fb4e5e01fb79	2013-08-19 17:19:36 -07:00
Dmitry Kovalev	26e5b5e25d	Removing unused or redundant arguments from *_args structures. Redundant dst, pre[2] from build_inter_predictors_args, unused cm from encode_b_args. Change-Id: I2c476cd328c5c0cca4c78ba451ca6ba2a2c37e2d	2013-08-16 12:51:20 -07:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place to segmentatation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Deb Mukherjee	24856b6abc	Speed feature to skip split partition based on var Adds a speed feature to disable split partition search based on a given threshold on the source variance. A tighter threshold derived from the threshold provided is used to also disable horizontal and vertical partitions. Results on derfraw300: threshold = 16, psnr = -0.057%, speedup ~1% (football) threshold = 32, psnr = -0.150%, speedup ~4-5% (football) threshold = 64, psnr = -0.570%, speedup ~10-12% (football) Results on stdhdraw250: threshold = 32, psnr = -0.18%, speedup is somewhat more than derf because of a larger number of smoother blocks at higher resolution. Based on these results, a threshold of 32 is chosen for speed 1, and a threshold of 64 is chosen for speeds 2 and above. Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6	2013-08-15 10:01:45 -07:00
Paul Wilkins	26fead7ecf	Renaming in MB_MODE_INFO The macro block mode info context originally contained an entry for each 16x16 macroblock. In VP9 each entry refers to an 8x8 region not a macro block, so the naming is misleading. This first stage clean up changes the names of 3 entries in the structure to remove the mb_ prefix. TODO clean up the nomenclature more widely in respect of mbmi and bmi. Change-Id: Ia7305c6d0cb805dfe8cdc98dad21338f502e49c6	2013-08-14 12:47:52 +01:00
Guillaume Martres	fc50477082	Honor min_partition_size properly for non-square splits Don't do vertical or horizontal splits if subsize < min_partition_size, except for edge blocks where it makes sense. Change-Id: I479aa66ba1838d227b5de8312d46be184a8d6401	2013-08-13 15:24:03 -07:00
Paul Wilkins	8e35263bed	Merge "Honor min_partition_size properly"	2013-08-13 05:19:51 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Guillaume Martres	58b07a6f9d	Honor min_partition_size properly It represents the minimum partition size, so don't split if bsize == min_partition_size . Change-Id: Id77c32d6afef7d2ddec0368eaae18fb13227d30e	2013-08-09 17:28:33 -07:00
Dmitry Kovalev	816d6c989c	Moving loopfilter struct to VP9_COMMON. Loop filter configuration doesn't belong to macroblock, so moving it from MACROBLOCKD to VP9_COMMON. Also moving the declaration of loopfilter struct from vp9_blockd.h to vp9_loopfilter.h. Change-Id: I4b3e34be9623b47cda35f9b1f9951f8c5b1d5d28	2013-08-09 14:41:51 -07:00
Scott LaVarnway	41251ae558	Bug fix: call set_offsets before rd_auto_partition_range The set_offsets call is necessary inorder to set the mode_info_context ptr correctly. Change-Id: I644910cc5bacc50ee9cd78458843274ad8ee636d	2013-08-09 14:09:49 -04:00
Adrian Grange	83ee80c045	Moved fast motion search level decision to function Moving this block of code into a function makes the code easier to read and change. Change-Id: If4ede570cce1eab1982b188c4d3e4fd3d4db236e	2013-08-08 11:01:44 -07:00
Adrian Grange	aae6a4c895	Simplify & fix potential bug in rd_pick_partition Different partitionings were not being evaluated against best_rd and there were unnecessary calls to RDCOST. This could have resulted in a non-optimal partioning being selected. I simplified the variables used to track the rate, distortion and RD values throughout the function. Change-Id: Ifa7085ee80d824e86791432a5bc6d8fea5a3e313	2013-08-08 09:55:45 -07:00
Jingning Han	debb9c68c8	Use low precision 32x32fdct for encodemb in speed1 The low precision 32x32 fdct has all the intermediate steps within 16-bit depth, hence allowing faster SSE2 implementation, at the expense of larger round-trip error. It was used in the rate-distortion optimization search loop only. Using the low precision version, in replace of the high precision one, affects the compression performance by about 0.7% (derf, stdhd) at speed 0. For speed 1, it makes derf set down by only 0.017%. Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b	2013-08-07 15:34:12 -07:00
Dmitry Kovalev	0c80065694	Inlining vp9_get_pred_probs_switchable_interp function. There was no benefit having this function. For example, inside read_switchable_filter_type switchable filter context was calculated twice. Change-Id: I79cd5bf95cbc0f6d8bf91a2e32289e01b18dcff1	2013-08-06 11:04:31 -07:00
Dmitry Kovalev	3e51acafec	Merge "Finally removing all old block size constants."	2013-08-06 10:30:37 -07:00
Dmitry Kovalev	4a692e4168	Merge "Changing the order switchable filter enum constants."	2013-08-06 10:30:26 -07:00
Deb Mukherjee	33afddadb9	Merge "Add variance based mode/skipping"	2013-08-06 10:19:15 -07:00
Dmitry Kovalev	b9c7d04e95	Finally removing all old block size constants. Change-Id: I3aae21e88b876d53ecc955260479980ffe04ad8d	2013-08-05 15:23:49 -07:00
Deb Mukherjee	8b3faccb9e	Add variance based mode/skipping Adds a speed feature to skip all intra modes other than DC_PRED if the source variance is small. This feature is made part of speed 1 and up. Results on derf300: psnr -0.07%, speedup about 1-2% Also uses the source variance to fine-tune the early termination criteria when FLAG_EARLY_TERMINATE is on. This feature is made part of speed 2 and up. Results on derf300: psnr -0.52%, speedup about 5-7% Change-Id: I59e38aa836557cfa5405ae706fc64815cbfe4232	2013-08-05 14:14:01 -07:00
Jim Bankoski	9f988a2edf	Merge "cleanups after bw bh code"	2013-08-05 14:02:02 -07:00
Dmitry Kovalev	3f611555d7	Changing the order switchable filter enum constants. This changeset allows to remove vp9_switchable_interp and vp9_switchable_interp_map arrays and make code much clear. Actually we still have to use these mapping but only inside read_interp_filter_type and write_interp_filter_type functions. Change-Id: I4026c6f8c4acefba6c81421b7bacbaa52cc45f50	2013-08-05 12:26:15 -07:00
Jim Bankoski	5d2cb7ead0	cleanups after bw bh code Cons bw/bh parms that should have been const. Additional formatting. Change-Id: Icd36a5c9dc17dadd7284315ac0d6fef1a565ca16	2013-08-05 12:15:52 -07:00
Dmitry Kovalev	d007446b3f	Replacing long block size enum values with shorter ones (2). Change-Id: I428c4d42212b757112e3acfe5b81314cfbb5fd6b	2013-08-05 10:51:02 -07:00
Dmitry Kovalev	fe2a201eb1	Replacing "txfm" with "tx" in identifiers. Consistent names with TX_SIZE, TX_MODE, and TX_MODE. Change-Id: I79592218bf5a40ace89197a34a06ee7de581ed8d	2013-08-02 17:28:23 -07:00
Dmitry Kovalev	680ec32d18	Adding is_inter_block function. Using it instead of long unclear verbose check "mbmi->ref_frame[0] != INTRA_FRAME". Change-Id: I9c7b4b3797942fa962bf3ba7460fff3084beabe9	2013-08-02 16:25:33 -07:00
Yunqing Wang	d340c114fb	Merge "Add more checking to using_small_partition_info"	2013-08-02 15:55:09 -07:00
Adrian Grange	60ff123536	Merge "Fixed typos and added a few explanatory comments"	2013-08-02 11:37:47 -07:00
Dmitry Kovalev	b47153deed	Replacing long block size enum values with shorter ones. Change-Id: I0e9329490828684a4fd46f540d89114cc68e8407	2013-08-02 10:48:27 -07:00
Yunqing Wang	0d68080445	Merge "Comment out 2 unused speed features"	2013-08-02 09:58:46 -07:00
Dmitry Kovalev	741537f3ce	Cleanup: replacing xd->seg with seg, and xd->lf with lf. Change-Id: I73b59d7699a8e7e7acd3bf8041cb6c98ce9ba4bf	2013-08-01 15:38:16 -07:00
Dmitry Kovalev	9f4f001ba5	Merge "Cleanup: removing unused function arguments."	2013-08-01 15:07:12 -07:00
Dmitry Kovalev	ce8dedc353	Cleanup: removing unused function arguments. Change-Id: I27471768980fc631916069f24bc7c482a5c9ca17	2013-08-01 13:41:38 -07:00
Deb Mukherjee	dbea726daf	Adds a source variance computation function Adds a function to compute source variance for various sb_types to be used for pruning mode and partition searches. [The existing activity measure function is currently specialized for only 16x16 MBs and needs to be updated]. Change-Id: I22a41e6f1430184201487326fdbebb9b47e6fc24	2013-08-01 13:01:54 -07:00
Yunqing Wang	215b010f4b	Add more checking to using_small_partition_info If the partition is out of partition size range, we don't need to process small partition information. Change-Id: Ice9bfbbdebe1f2ef79271a3aee17de0ed4608376	2013-08-01 11:37:41 -07:00
Yunqing Wang	7965a6ea34	Comment out 2 unused speed features use_min_partition_size and use_max_partition_size are not used currently, and could be added back if needed later. Change-Id: Ib22a9c06b064567a7c1d6d5445567ed77e0d3acc	2013-08-01 11:03:34 -07:00
Adrian Grange	89e73c63c0	Fixed typos and added a few explanatory comments Change-Id: Ib4e4b41094b54874ee34343dd77c0c131ceed9d2	2013-08-01 09:23:49 -07:00
Jingning Han	12f5762756	Remove unnecessary arguments in rd_pick_ref_frame This commit removes redundant arguments passing in the function of rd_pick_reference_frame. This resolves the clang warnings about potential use of uninitialized values. Change-Id: Ic68f949a9f8fcd0a583786b0c75321104ea44739	2013-07-31 17:04:13 -07:00
Jingning Han	ac7bab7575	Merge "Make the use of ref_frame index consistent"	2013-07-31 09:11:37 -07:00
Jingning Han	86c384d398	Make the use of ref_frame index consistent Refactor the frame buffer referencing in choose_partition and make it consistent with other places. This means to prevent potential issues when we extend reference frame buffer. Change-Id: I5ff33ed5f671e1f4cc7049622212769a9b4578d9	2013-07-30 19:49:36 -07:00
Adrian Grange	fbd73648dd	Merge "Cleanup typos, remove unnecessary lines, replace switch"	2013-07-30 12:59:46 -07:00
Adrian Grange	b30a06b930	Cleanup typos, remove unnecessary lines, replace switch Removed unnecessary code lines, replaced switch with an if, fixed spelling errors and formatting. Change-Id: Ie48aa4604aa0ed48362ca359d792fb21b2ec1dc6	2013-07-30 12:10:32 -07:00
Dmitry Kovalev	730a34416f	Renaming NB_TXFM_MODES constant to TX_MODES. Change-Id: I10bf06e3a3d5271221ae6a42a36074d01d493039	2013-07-29 13:38:40 -07:00
Dmitry Kovalev	23391ea835	Renaming TX_SIZE_MAX_SB to TX_SIZES. Change-Id: I6aa4191935aa93461a07c41b59fdae1eb5f5f107	2013-07-29 12:25:34 -07:00
Dmitry Kovalev	c09b81719f	Merge "General cleanups."	2013-07-26 13:59:39 -07:00
Paul Wilkins	fe5e2a91bb	Auto min and max partition size experiment. Speed feature experiment to set an upper and lower partition size limit based on what has been seen in spatial neighbors. This seems to gives quite reasonable speed gains in local (10-15%) and when used with speed 0 the losses are small (0.25% derf, 0.35% stdhd). However, for now I am only enabling it on speed 1 as there may be clashes with the existing temporal partition selection in speed 2. Using a tighter min / max around the range derived from the neighbors increases speed further but at the cost of a bigger quality loss. However, I think this spatial method could be combined with data from either the last frame or a variance method (or both) to refine the range of minimum and maximum partition size. I.e. consider the min and max from spatial and temporal neighbors and the variance recommendation. Change-Id: I1b96bf8b84368d6aad0c7aa600fe141b4f07435f	2013-07-26 18:30:49 +01:00
Yunqing Wang	845fd5011c	Merge "Add encoding option --static-thresh"	2013-07-25 14:58:00 -07:00
Yunqing Wang	d36852b702	Add encoding option --static-thresh This option exists in VP8, and it was rewritten in VP9 to support skipping on different partition levels. After prediction is done, we can check if the residuals in the partition block will be all quantized to 0. If this is true, the skip flag is set, and only prediction data are needed in reconstruction. Based on DCT's energy conservation property, the skipping check can be estimated in spatial domain. The prediction error is calculated and compared to a threshold. The threshold is determined by the dequant values, and also adjusted by partition sizes. To be precise, the DC and AC parts for Y, U, and V planes are checked to decide skipping or not. Test showed that 1. derf set: when static-thresh = 1, psnr loss is 0.666%; when static-thresh = 500, psnr loss is 1.162%; 2. stdhd set: when static-thresh = 1, psnr loss is 1.249%; when static-thresh = 500, psnr loss is 1.668%; For different clips, encoding speedup range is between several percentage and 20+% when static-thresh <= 500. For example, clip bitrate static-thresh psnr time akiyo(cif) 500 0 48.923 5.635s(50f) akiyo 500 500 48.863 4.402s(50f) parkjoy(1080p) 4000 0 30.380 77.54s(30f) parkjoy 4000 500 30.384 69.59s(30f) sunflower(1080p) 4000 0 44.461 85.2s(30f) sunflower 4000 500 44.418 78.1s(30f) Higher static-thresh values give larger speedup with larger quality loss. Change-Id: I857031ceb466ff314ab580ac5ec5d18542203c53	2013-07-25 14:28:05 -07:00
Dmitry Kovalev	7131cb0e3d	General cleanups. Removing unused constants, macros, and function declarations. Using ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving #include from .h to .c. Merging for loops for motion vectors. Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13	2013-07-25 14:13:48 -07:00
Adrian Grange	e862c6f9eb	Merge "Simplify handling of sub-partition motion vectors"	2013-07-25 12:58:38 -07:00
Adrian Grange	be700e140a	Simplify handling of sub-partition motion vectors Simplified the code that extracts and uses the motion vectors for the 4 sub-partitions in rd_pick_partition. Change-Id: Iaf698ef7ee3aef9edd59015e1ae065dd359b17d9	2013-07-25 11:51:51 -07:00
Yaowu Xu	3e386aefc2	fix a bug where flags are not reset The feature that uses small partition results as a measure to skip mode evaluation at larger partition requires the flags to be reset. The reset was missing in the code path that calls rd_use_partition(). Change-Id: Ia0a3a0aee1a862b6e2333d596808db7c48033d50	2013-07-25 10:28:38 -07:00
Adrian Grange	a183f17d33	Merge "Correct spelling mistakes"	2013-07-24 09:48:57 -07:00
Adrian Grange	bc8b0529db	Correct spelling mistakes Change-Id: Id4138293efeac4503b2e01ce7a6c150a5abeef77	2013-07-24 07:58:26 -07:00
Dmitry Kovalev	1099a436d3	Moving counts from FRAME_CONTEXT to new struct FRAME_COUNTS. Counts are separate from frame context. We have several frame contexts but need only one copy of all counts. Change-Id: I5279b0321cb450bbea7049adaa9275306a7cef7d	2013-07-23 17:02:08 -07:00
Adrian Grange	719cd35f3a	Merge "Rolled-up several for loops into one"	2013-07-23 15:00:06 -07:00
Adrian Grange	646edbc1b2	Rolled-up several for loops into one Several consecutive for loops executed over the same index range, so I rolled them into one. Change-Id: I5cfcc8c38c738478965768409cca9d09adf224e1	2013-07-23 14:32:21 -07:00
Dmitry Kovalev	2855d8aea1	Merge "Adding update_tx_counts function."	2013-07-23 13:57:59 -07:00
Jim Bankoski	86a9dec73c	clean up bw, bh many structures use bw and bh and they have different meanings. This cl attempts to start this clean up and remove unneccessary 2 step look up log and then shift operations... also removed partition type multiple operation code in bitstream.c. Change-Id: I7e03e552bdfc0939738e430862e3073d30fdd5db	2013-07-23 06:51:44 -07:00
Dmitry Kovalev	b2fc6fa969	Adding update_tx_counts function. Moving common encoder/decoder code to update_tx_counts. Also renaming vp9_get_pred_probs_tx_size to get_tx_probs2 and adding get_tx_probs to call vp9_get_pred_context_tx_size inside read_selected_tx_size only once (twice before). Change-Id: Ia50247f3893de88ef8e9041b0d44be44a40aaa4d	2013-07-22 14:57:43 -07:00
Jim Bankoski	2ac8b50cd8	fix left over overflow This cl fixes issues rbultje brought up. that I somehow neglected when I submitted yaowu's patch. Change-Id: I07ad18796317822510b96e951c88d29f194a3c2e	2013-07-22 06:39:39 -07:00
Dmitry Kovalev	8962d975b2	Merge "Moving all loop filter related variables into new struct."	2013-07-20 22:45:24 -07:00
Dmitry Kovalev	f66821afbb	Merge "Removing frame_type field from MACROBLOCKD struct."	2013-07-20 22:40:06 -07:00
Yaowu Xu	ea284d6281	added checks to prevent rate/distortion overflow At speed 2, due to the threshold scheme used, it is possible the rate and distortion assigned with INT_MAX value. The patch added checking to prevent the INT_MAX value is used in further calculation of RD scores. The patch also changed the assertion in rd_use_partition() to be mirror similar assertion in rd_pick_partition(). Change-Id: Idb52c543cc1e10abdf6e6a5d6e9cb535a42214dc	2013-07-19 17:52:50 -07:00
Dmitry Kovalev	ee1771ebaa	Moving all loop filter related variables into new struct. Adding loopfilter struct with fields from MACROBLOCKD and VP9Common. Eventually it will be moved to vp9_loopfilter.h for better code structure. Change-Id: Iaf5fb71c33719cdfa1b991f671caf071be9ea035	2013-07-19 16:19:10 -07:00
Dmitry Kovalev	e71a4a77bb	Merge "Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE)."	2013-07-19 12:14:32 -07:00
Dmitry Kovalev	97e96bc4e9	Removing frame_type field from MACROBLOCKD struct. Change-Id: Ia4e83913251c1cdc7aa2abd64bf01ecb1a962119	2013-07-19 11:55:36 -07:00
Dmitry Kovalev	c0eb57406c	Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE). Moving TX_MODE enum to vp9_enums.h. Renaming txfm_mode variables to tx_mode. Change-Id: I459d1af6dd928ce7fccdf8ce30b6f1ca057bef92	2013-07-19 11:37:13 -07:00
Dmitry Kovalev	afe43d4089	Removing redundant VP9_COMMON* from function signatures. Functions: vp9_get_pred_context_switchable_interp, vp9_get_pred_context_intra_inter, vp9_get_pred_context_single_ref_p1, vp9_get_pred_context_single_ref_p2. Change-Id: I3d6fb8aee23c9062270768e1e6da416dd9bb8f96	2013-07-19 11:20:49 -07:00
Ronald S. Bultje	e4686c589e	Fix slightly quality drop caused at speed 1. We would skip the rectangular blocks for sub8x8 partitions because we would conclude that PARTITION_NONE was better than PARTITION_SPLIT, however, that conclusion was made before we actually really tested PARTITION_SPLIT. Change-Id: I8fa91e59894badc1d8cee3ba8a49e40ae4c4a489	2013-07-18 17:52:08 -07:00
Ronald S. Bultje	5ebe503f04	Merge scale_factors and scale_factors_uv. This prevents a duplicate memcpy of a 128-byte struct every time set_scale_factors() is called (which is a lot), thus leading to a decrease from 3.7 MB to 1.85 MB of struct copying per 64x64 block RD/partition loop. Overall, this decreases encoding time of the first 50 frames of bus @ 1500kbps (speed 0) from 1min5.9 to 1min4.9, i.e. about a 1.5% overall speedup. We can likely get more gains by removing the copy of the other struct (and replacing it with an indexing) as well. Change-Id: I3dceb7e79f71e6fe911b11cc994cf89a869dde7a	2013-07-18 14:10:56 -07:00
Ronald S. Bultje	2d4929e340	Remove motion vectors from PARTITION_INFO. The same information already exists in union b_mode_info. Change-Id: Iac5086b99a3c3cc270380138062bb693e58f9e6d	2013-07-18 14:10:52 -07:00
Ronald S. Bultje	607424449c	Merge "Best_rd breakout in rd partition search."	2013-07-17 16:10:22 -07:00
Dmitry Kovalev	a7a1e96136	Merge changes Ieffea49e,Idf610746 * changes: Removing two unused arguments from vp9_inc_mv signature. Changing signature of vp9_get_pred_probs_tx_size.	2013-07-17 14:44:20 -07:00
Ronald S. Bultje	9f427bfe98	Best_rd breakout in rd partition search. About 15% faster for bus (speed 0) first 50 frames @ 1500kbps, which goes from 1min36 to 1min24. Results become slightly better (+0.2% on derf/yt, +0.4% on hd), probably because of a bugfix for skipmode in super_block_yrd(). Overall speed change (on derfraw300) is roughly -13%. This can probably be improved further by caching best_yrd between partition searches. Also, we might be able to get more speedups by always doing PARTITION_NONE before PARTITIONS_SPLIT, not just at the sb8x8 level. Change-Id: I83736949ebd5b4a3b400ee688d7661913fefc98b	2013-07-17 09:56:46 -07:00
Yunqing Wang	df90d58f4f	Speed up motion estimation using small partitions' result(experiment) Current partition checking starts from small sizes, and then goes up to large sizes. This experiment uses the small partitions' motion estimation result, which is already available, to speed up the large partition's motion estimation. We can decide to skip some patition checkings if they are unlikely choices. We could use the motion vector(MV) result as current partition's prediction MV, limit the search range and reference frame. Current result at speed 1: psnr loss: 1.19% for stdhd, 0.287% for derf. speed gain: 14% for sunflower(hd), 11% for akiyo. Further improvement will be done later. Change-Id: I5abfd070e9cace2e91e2a0247d1325df313887ab	2013-07-17 09:11:47 -07:00
Dmitry Kovalev	5b65a71cdc	Changing signature of vp9_get_pred_probs_tx_size. Removing VP9_COMMON* argument and adding struct tx_probs* instead of MACROBLOCKD*. Change-Id: Idf61074631a90ec51eac22c8dcd977f44ac0757c	2013-07-16 16:34:54 -07:00
Dmitry Kovalev	9482a0bf10	Cleaning up tile code. Removing tile_rows and tile_columns from VP9Common, removing redundant constants MIN_TILE_WIDTH and MAX_TILE_WIDTH, changing signature of vp9_get_tile_n_bits. Change-Id: I8ff3104a38179b2c6900df965c144c1d6f602267	2013-07-16 14:47:15 -07:00
Dmitry Kovalev	5de96b3ce6	Merge "Rewriting vp9_set_pred_flag_{seg_id, mbskip}."	2013-07-16 13:34:42 -07:00
James Zern	5baa416b6c	Merge "vp9: remove frames_{since,till}.. from MACROBLOCKD"	2013-07-16 13:00:14 -07:00
Dmitry Kovalev	863138a2ad	Rewriting vp9_set_pred_flag_{seg_id, mbskip}. Making implementation of vp9_set_pred_flag_{seg_id, mbskip} consistent with vp9_get_segment_id without using confusing sub(a, b) macro. Passing mi_row and mi_col to functions explicitly instead of replying on mb_to_right_edge and mb_to_bottom_edge. Change-Id: I54c1087dd2ba9036f8ba7eb165b073e807d00435	2013-07-16 10:44:48 -07:00
Jingning Han	faff6ed0fb	Skip duplicate block encoding in the rd loop This speed feature allows the encoder to largely remove the spatial dependency between blocks inside a 64x64 superblock, thereby removing the need to repeatedly encode superblocks per partition type in the rate-distortion optimization loop. A major challenge lies in the intra modes tested in the rate-distortion optimization loop. The subsequent blocks do not have access to the reconstructed boundary pixels without the intermediate coding steps. This was resolved by using the original pixels for intra prediction in the rd loop, followed by an appropriately designed distortion modeling on the quantization parameters. Experiments also suggested that the performance impact is more discernible at lower bit-rate/psnr settings. Hence a quantizer dependent threshold is applied to deactivate skip of block coding. For bus_cif at 2000 kbps, speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB performance loss. speed 1: runtime 65312ms -> 61536ms, (7% speed-up) at 0.04dB performance loss. This operation is currently turned on in settings of speed 1. Change-Id: Ib689741dfff8dd38365d8c1b92860a3e176f56ec	2013-07-15 11:08:58 -07:00
James Zern	dc1d2331f6	vp9: remove frames_{since,till}.. from MACROBLOCKD frames_since_golden / frames_till_alt_ref_frame are unused. Change-Id: I348e7689d4d75412cf4de7703d885be942e4a26b	2013-07-13 18:02:11 -07:00
Dmitry Kovalev	429070987a	Using vp9_copy and vp9_zero instead of custom code. Change-Id: Id9b6ceeddca3f9b34bfada5c499b1e7a2f42c30b	2013-07-12 18:07:43 -07:00
Dmitry Kovalev	cc662dd768	Adding struct tx_probs and struct tx_counts to cleanup the code. Also removing unused declarations from vp9_entropymode.h file. Change-Id: Ib9c5826db3584a32f6bb3297a76c522b99d83402	2013-07-12 15:22:38 -07:00
Dmitry Kovalev	dd150e8ea9	Removing redundant code mostly from vp9_pred_common.{h, c}. Removing redundant function arguments and curly braces. Change-Id: I46e02561f33fe02e84a3b19756f03b9504bd6a1b	2013-07-11 18:39:10 -07:00
Dmitry Kovalev	c4ad3273c7	Moving segmentation related vars into separate struct. Adding segmentation struct to vp9_seg_common.h. Struct members are from macroblockd and VP9Common structs. Moving segmentation related constants and enums to vp9_seg_common.h. Change-Id: I23fabc33f11a359249f5f80d161daf569d02ec03	2013-07-11 11:57:57 -07:00
Dmitry Kovalev	544d8c3316	Removing unused TOKENEXTRA arg from pick_sb_modes function. Change-Id: I0543e72fa092eef3976b65e16bb597197c364873	2013-07-10 15:57:28 -07:00
Jim Bankoski	68ef7a6b8a	configure with internal stats not working Change-Id: I5dea4570cb05df27a522abf6e7b695998654284a	2013-07-10 15:07:53 -07:00
Jim Bankoski	6591cf2f7e	remove warnings when NDEBUG is set Change-Id: Ie0cb732fdcb98616a422c4463bff80642248d136	2013-07-10 14:27:20 -07:00
Jim Bankoski	fb027a7658	removing case statements around prediction entropy coding Removes SEG_ID Removes MBSKIP Removes SWITCHABLE_INTERP Removes INTRA_INTER Removes COMP_INTER_INTER Removes COMP_REF_P Removes SINGLE_REF_P1 Removes SINGLE_REF_P2 Removes TX_SIZE Change-Id: Ie4520ae1f65c8cac312432c0616cc80dea5bf34b	2013-07-09 20:10:16 -07:00
Dmitry Kovalev	c6c279aff0	Merge "Using mi_cols instead of mb_cols."	2013-07-08 20:09:19 -07:00
Dmitry Kovalev	1c65c580d6	Merge "Refactoring setup_pre_planes function."	2013-07-08 20:08:05 -07:00
Ronald S. Bultje	a5062cc635	Don't call encode_sb() for the final of 4-split subpartitions. The resulting reconstruction is never used, thus it just wastes CPU cycles. Reduces encode time of first 50 frames of bus (speed 0) @ 1500kbps from 2min2.0 to 2min1.2, i.e. a 0.65% overall speedup. Change-Id: I74755ca3aadc21e2be220f486259060bd4088c45	2013-07-08 16:22:39 -07:00
Ronald S. Bultje	ed995afba1	Make frame-wide filter-type decision fully RD-based. Overall, on all test sets, this gains about +0.2% on all metrics. City is a clip where this really hurts (-1.0% on all metrics), I'm not quite sure why yet. Maybe interesting to look into in the future. Change-Id: I6f0eecb20e72f0194633270d30bf00d76d9eae78	2013-07-08 16:22:37 -07:00
Dmitry Kovalev	b7559258a4	Using mi_cols instead of mb_cols. Eliminating usage of mb-units, switching to mi-units. Adding ALIGN_POWER_OF_TWO macro. Change-Id: I2491c969f713207c062011878b57e4e531818607	2013-07-08 14:54:04 -07:00
Deb Mukherjee	d9b62160a0	Implements several heuristics to prune mode search Skips mode searches for intra and compound inter modes depending on the best mode so far and the reference frames. The various heuristics to be used are selected by bits from a flag. The previous direction based intra mode search pruning is also absorbed in this framework. Specifically the flags and their impact are: 1) FLAG_SKIP_INTRA_BESTINTER (skip intra mode search for oblique directional modes and TM_PRED if the best so far is an inter mode) derfraw300: -0.15%, 10% speedup 2) FLAG_SKIP_INTRA_DIRMISMATCH (skip D27, D63, D117 and D153 mode search if the best so far is not one of the closest hor/vert/diagonal directions. derfraw300: -0.05%, about 9% speedup 3) FLAG_SKIP_COMP_BESTINTRA (skip compound prediction mode search if the best so far is an intra mode) derfraw300: -0.06%, about 7-8% speedup 4) FLAG_SKIP_COMP_REFMISMATCH (skip compound prediction search if the best single ref inter mode does not have the same ref as one of the two references being tested in the compound mode) derfraw300: -0.56%, about 10% speedup Change-Id: I1a736cd29b36325489e7af9f32698d6394b2c495	2013-07-08 12:17:12 -07:00
Dmitry Kovalev	f72e072555	Refactoring setup_pre_planes function. Removing set_refs, adding set_ref function. Change-Id: I5635c478b106ae4e57d317f1c83d929644307e63	2013-07-03 17:42:01 -07:00
Dmitry Kovalev	430bd0c94a	Merge "Replacing 64 / MI_SIZE with MI_BLOCK_SIZE."	2013-07-03 14:16:02 -07:00
Dmitry Kovalev	5a21de8418	Replacing 64 / MI_SIZE with MI_BLOCK_SIZE. Change-Id: I32276552b3ea6dc1dce8e298be114cfe1019b31c	2013-07-03 10:54:50 -07:00
Paul Wilkins	72c5778ec5	Added two new skip experiments. sf->unused_mode_skip_lvl. Tests modes as normal for all sizes at or below the given level. At larger sizes it skips all modes that were not chosen at any smaller size. Hence setting BLOCK_SIZE_SB64X64 is in effect off. Setting BLOCK_SIZE_AB4X4 will only consider modes that were chosen for one or more 4x4 blocks at larger sizes. sf->reference_masking. Do a test encode of the NONE partition at one size and create a reference frame mask based on the best rd choice. In the full search only allow this reference frame. Currently it is testing 64x64 and repeats this in the full search. This does not work well with Jim's Partition code just now and is disabled by default. Change-Id: I8f8c52d2ef4a0c08100150b0ea4155d1aaab93dd	2013-07-03 16:56:06 +01:00
Dmitry Kovalev	1f6e95e76a	Merge "Removing redundant struct from union b_mode_info."	2013-07-02 18:09:31 -07:00
Dmitry Kovalev	be77f6bbbf	Removing redundant struct from union b_mode_info. Change-Id: I08fc6e474ff2c12cfa065bae4989c724276e2c83	2013-07-02 16:51:57 -07:00
Yaowu Xu	0d7b7c09cb	Added a speed feature use_square_partition_only This commit adds a speed feature where only squared partition are evaluated in partition picking. Enable this feature in cpu-used 2 reduces encoding time by ~30%. loss of compression: -0.9% on cif set -1.23% on stdhd Change-Id: Ia6fad11210f0b78365abb889f9245604513be5b9	2013-07-02 16:40:15 -07:00
Deb Mukherjee	8d3d2b76f3	Tx size selection enhancements (1) Refines the modeling function and uses that to add some speed features. Specifically, intead of using a flag use_largest_txfm as a speed feature, an enum tx_size_search_method is used, of which two of the types are USE_FULL_RD and USE_LARGESTALL. Two other new types are added: USE_LARGESTINTRA (use largest only for intra) USE_LARGESTINTRA_MODELINTER (use largest for intra, and model for inter) (2) Another change is that the framework for deciding transform type is simplified to use a heuristic count based method rather than an rd based method using txfm_cache. In practice the new method is found to work just as well - with derf only -0.01 down. The new method is more compatible with the new framework where certain rd costs are based on full rd and certain others are based on modeled rd or are not computed. In this patch the existing rd based method is still kept for use in the USE_FULL_RD mode. In the other modes, the count based method is used. However the recommendation is to remove it eventually since the benefit is limited, and will remove a lot of complications in the code (3) Finally a bug is fixed with the existing use_largest_txfm speed feature that causes mismatches when the lossless mode and 4x4 WH transform is forced. Results on derf: USE_FULL_RD: +0.03% (due to change in the tables), 0% encode time reduction USE_LARGESTINTRA: -0.21%, 15% encode time reduction (this one is a pretty good compromise) USE_LARGESTINTRA_MODELINTER: -0.98%, 22% encode time reduction (currently the benefit of modeling is limited for txfm size selection, but keeping this enum as a placeholder) . USE_LARGESTALL: -1.05%, 27% encode-time reduction (same as existing use_largest_txfm speed feature). Change-Id: I4d60a5f9ce78fbc90cddf2f97ed91d8bc0d4f936	2013-07-02 13:54:00 -07:00
Dmitry Kovalev	3140c443e4	Merge "Removing vp9_mbpitch.c, moving vp9_setup_block_dptrs to vp9_block.h."	2013-07-02 11:31:35 -07:00
Jim Bankoski	d4158283e7	use partitioning from last frame This cl converts use partition from last frame to do the following: if part is none,horz, vert -> try split if part != none and one of the children is not split - try none Change-Id: I5b6c659e35f3ac9f11c051b92ba98af6d7e8aa87 Signed-off-by: Jim Bankoski <jimbankoski@google.com>	2013-07-01 18:18:50 -07:00
Dmitry Kovalev	1ac0540296	Removing vp9_mbpitch.c, moving vp9_setup_block_dptrs to vp9_block.h. Change-Id: Ia547a5dd7650b771fd00edd673ab9f920270731c	2013-07-01 17:28:08 -07:00
Yaowu Xu	632289b31f	fix a mismatch in cpuused 2 Change-Id: I921c9faba6386535aaf717a54301dd346a9b8540	2013-07-01 08:54:50 -07:00
Dmitry Kovalev	59070f6e3c	Merge "Removing CONFIG_DEBUG checks on assertions."	2013-06-28 14:03:28 -07:00
Dmitry Kovalev	0345fc3ad9	Merge "Decoder's code cleanup."	2013-06-28 10:38:54 -07:00
Dmitry Kovalev	8e6ce6bb9e	Removing CONFIG_DEBUG checks on assertions. Adding CHECK_MEM_ERROR macro to vp9_common.h and removing two duplicated ones from vp9_onyx_int.h and vp9_onyxd_int.h. Change-Id: I916afec61b3019f18193135dac7c35ed0f89b8b6	2013-06-28 10:36:20 -07:00
Yaowu Xu	64bb996e03	Merge "Optimize partition search order"	2013-06-28 09:29:39 -07:00
Yaowu Xu	1374a06bd8	Optimize partition search order This commit change the partition search order to allow checking of rectangular partition to be done after square partitions. It also added a speed feature to skip rectangular partition check when NONE is better than SPLIT in RD sense. This feature roughly speed up encoder by 1.5X with loss on compression -0.91% on cif set -0.56% on stdhd set Change-Id: I0d2d06993041aa9ea9073fcc39c54f73a127dfa4	2013-06-28 07:13:54 -07:00
Ronald S. Bultje	fd4eed3b08	Fix tile independence with both column tiling and static_thresh set. Change-Id: I0b2be0ec2c410a527f88b95a44f24ac967b2dac1	2013-06-27 21:56:40 -07:00
Dmitry Kovalev	3231da0a9e	Decoder's code cleanup. Using vp9_set_pred_flag function instead of custom code, adding decode_tokens function which is now called from decode_atom, decode_sb_intra, and decode_sb. Change-Id: Ie163a7106c0241099da9c5fe03069bd71f9d9ff8	2013-06-27 16:15:43 -07:00
Dmitry Kovalev	a3664258c5	Merge "General cleanup in segmentation-related code."	2013-06-27 14:57:07 -07:00
Paul Wilkins	59af9049d3	Merge "Start adaptive threshold for each mode at max."	2013-06-27 02:28:36 -07:00
Jingning Han	bd9bac0391	Remove empty function vp9_build_block_offsets This function is empty, hence is removed. Change-Id: Ia9d01710806bffe0398a6dc9405f8a5a81b27d74	2013-06-26 14:55:47 -07:00
Dmitry Kovalev	be07485e9a	General cleanup in segmentation-related code. Using consistent function and variable names. Change-Id: I2deb3fded8797453a2081836c9ce2e79ade06eb7	2013-06-26 10:27:28 -07:00
Paul Wilkins	689957e3ad	Start adaptive threshold for each mode at max. Each frame we reset all adaptive thresholds to MAX rather than base. As modes are picked their thresholds drop down. Change-Id: Ia37f03a73003c2d9bfcda57edea07205e9a0e5e8	2013-06-26 17:04:47 +01:00
Dmitry Kovalev	70e9622185	Merge "Removing find_seg_id and using vp9_get_pred_mi_segid instead."	2013-06-25 10:16:06 -07:00
Yaowu Xu	e371cd73a3	change to enable use_largest_txform feature for all regular inter frames at speed 1 Change-Id: I0a8b301273ecf2b8730ab1f6b7a05f89f4d498e0	2013-06-24 16:43:26 -07:00
Dmitry Kovalev	40141681c0	Removing find_seg_id and using vp9_get_pred_mi_segid instead. Change-Id: Ia40229903c08f14020e90e94cfdf494aba1be827	2013-06-21 13:05:10 -07:00
Ronald S. Bultje	54b2a59623	Implement SSE2 block_error. Change vp9_block_error() to return a 64bit error variable, change all callers to expect a 64bit return value (this will prevent overflows, which we basically don't check for at all right now). Remove duplicate block_error() function, which fixed that through truncation. Remove old (incompatible) mmx/sse2 block_error SIMD versions and replace with a new one that returns a 64bit value. Encoding time of first 50 frames of bus @ 1500kbps goes from 3min29 to 3min23, i.e. a 3% overall speedup. Change-Id: Ib71ac5508b5ee8a80f1753cd85d72df1629abe68	2013-06-21 12:54:52 -07:00
Yaowu Xu	ee07a261a0	rename variables to avoid build error in MSVC Change-Id: I7960178c95c54d5c4497e44cfc8c493566294b34	2013-06-20 18:31:48 -07:00
Jim Bankoski	9f2a1ae23e	adds force partitioning greater than or less than block size adds a new speed feature to force partitioning to be greater than or less than a certain size Change-Id: I8c048eeeef93700ae822eccf98f8751a45b2e7d0	2013-06-20 09:51:42 -07:00
Jim Bankoski	18bdf708e7	adds a set partitioning to speed features this feature lets you set a partitioning size to be used by the entire frame. Change-Id: I208a4c8c701375cbb054418266f677768b6f8f06	2013-06-20 09:50:44 -07:00
Jim Bankoski	476d73d294	partition by variance using var from last frame This uses variance to split partition. Variance is calculated using nearest mv, always from last ref frame. Change-Id: Idd015b4a9aa3bc82591759eac239680c07496896	2013-06-20 09:48:22 -07:00
Jim Bankoski	727fa7b1e4	new partition via variance Change-Id: Ideee45cad8b38087c509cd404484728e85d0c427	2013-06-20 09:42:05 -07:00
Jim Bankoski	0fad6a9d99	fix to set up new speed feature This uses the speed feature functionality for code. Change-Id: I9cd16c0c5f98520ae27ebba81aa2c178546587f8	2013-06-20 09:35:02 -07:00
Jim Bankoski	df2314cfdd	don't copy partitions for key frames or altrefs force us to go through slow partitioning for keyframes, altref and overlays. Change-Id: I1a286361bf74083e71973575a7296be46eb98742	2013-06-20 09:34:32 -07:00
Jim Bankoski	fbcce4dd6f	Merge "copy partitioning from last fame"	2013-06-20 09:32:43 -07:00
Jim Bankoski	f033b44e74	copy partitioning from last fame Change-Id: I26e80ede80cb4389378a95afa95d229092a9859a	2013-06-20 09:32:19 -07:00
Jingning Han	7088426976	Merge "Make fdct32 computation flow within 16bit range"	2013-06-18 11:40:14 -07:00
Jingning Han	a41a4860c0	Make fdct32 computation flow within 16bit range This commit makes use of dual fdct32x32 versions for rate-distortion optimization loop and encoding process, respectively. The one for rd loop requires only 16 bits precision for intermediate steps. The original fdct32x32 that allows higher intermediate precision (18 bits) was retained for the encoding process only. This allows speed-up for fdct32x32 in the rd loop. No performance loss observed. Change-Id: I3237770e39a8f87ed17ae5513c87228533397cc3	2013-06-18 09:46:24 -07:00
Dmitry Kovalev	686b99741c	Removing vp9_invtrans.{c, h} files. Moving single function from vp9_invtrans.c to vp9_encodemb.c. Change-Id: I26bf6bb90de342a3036c0dbfba78a7dd75a61fe7	2013-06-17 16:09:03 -07:00
Ronald S. Bultje	8a0808a145	Fix row tiling. Change-Id: I57be4eeaea6e4402f6a0cc04f5c6b7a5d9aedf9b	2013-06-12 13:42:59 -04:00
Jim Bankoski	fca6c82b29	Fix rd partition search for corner blocks This commit enables proper partition type search for the bottom- right corner blocks. Change-Id: Id1123d0e4e81eba648ed4f3c0c7ab587e174f650	2013-06-11 09:29:21 -07:00
Ronald S. Bultje	eedd98ac0a	Fix crash on RD iterations with segmentation enabled. Change-Id: I3baf93c2fa5c2f7f45c6bc5514d317040975da71	2013-06-10 10:42:09 -07:00
Yaowu Xu	c08317e4f2	Merge "Fix the rd loop over partition types" into experimental	2013-06-08 13:30:27 -07:00
Deb Mukherjee	17da2cab78	TX_SIZE contexts simplification. Reduces TX_SIZE contexts to 2 for each kind. The code is cleaner and there is hardly any performance difference with more than two contexts. Results: almost neutral Change-Id: I17656bd6db76224ae2856adf882504560e7dbaa4	2013-06-08 12:32:26 -07:00
Jingning Han	e1d63c010e	Fix the rd loop over partition types This commit enables boundary blocks properly tested over allowable partition types. Change-Id: I405a9a46ddcfa0c7af2b63e3644cabfa3b6a951d	2013-06-07 23:36:35 -07:00
Yaowu Xu	b7da6d0c5a	Merge "Handle partition type coding of boundary blocks" into experimental	2013-06-07 18:16:16 -07:00
Deb Mukherjee	21401942b0	Coding tx-size selection by use of spatial context Adds coding of transform size within a frame by use of context of transform sizes selected in left and above blocks. Also incorporates code for generating stats. TODO: generate and incorporate new default stats Change-Id: I6a7af099f6ad61d448521d9a51167aedaf638ed6	2013-06-07 16:07:58 -07:00
Deb Mukherjee	869a39ba60	Cleans up mbskip encoding Refactors mbskip coding to be compatible with coding of the rest of the symbols. Adds forward/backward adaptation and removes a lot of the legacy code. Results: fast50: +1.6% derfraw300: +0.317% Change-Id: I395a2976d15af044d3b8ded5acfa45f6f065f980	2013-06-07 16:00:26 -07:00
Jingning Han	78b8190cc7	Handle partition type coding of boundary blocks The partition types of blocks sitting on the frame boundary are constrained by the block size and the position of each sub-block relative to the frame. Hence we use truncated probability models to handle the coding of such information. 100 frames run: yt 0.138% Change-Id: I85d9b45665c15280069c0234ea6f778af586d87d	2013-06-07 14:19:40 -07:00
Ronald S. Bultje	6462afe088	Fix ref_frame segment feature when it is intra. Change-Id: Ifbf790c14cee0c08a27f6728e3c637404e1f8477	2013-06-07 13:57:55 -07:00
Paul Wilkins	340c7a48e6	Change to segment ref frame feature. Simplify feature to only support a single reference frame instead of a mask. Change-Id: I5dd3a98c7a224aafb35708850ab82e2f220e68fb	2013-06-07 21:42:22 +01:00
Deb Mukherjee	78fbaf4d84	Merge "Coding updates for tx-size selection" into experimental	2013-06-07 09:19:36 -07:00
Deb Mukherjee	3ee1a21a42	Coding updates for tx-size selection Changes to the coding of transform sizes, along with forward and backward probability updates. Results: derf300: +0.241% Context based coding of transform sizes will be in a separate patch. Change-Id: I97241d60a926f014fee2de21fa4446ca56495756	2013-06-07 08:54:00 -07:00
Paul Wilkins	653a25569b	Compound inter encoder bug fix. In the longer term the encoder should allow compound as long as one of the buffers has opposite sign bias and as per the decoder this buffer is then set as the fixed reference. However at the moment the encoder and RD loop only supports the case where the ALTREF_FRAME buffer (or third of the 3 allowed in any given frame) is the odd one out. This patch fixes a bug that would allow compound inter and set fixed ref to ALTREF_FRAME when it is not the odd one out. Change-Id: Ic83a69486e088a147ba83a4aedc2a0042f6b3721	2013-06-07 12:31:54 +01:00
Yaowu Xu	e127bdc04c	fix a typo Change-Id: I8fd21e3a8435b873c5687d8b273922fc60988295	2013-06-06 22:25:13 -07:00
Ronald S. Bultje	6ef805eb9d	Change ref frame coding. Code intra/inter, then comp/single, then the ref frame selection. Use contextualization for all steps. Don't code two past frames in comp pred mode. Change-Id: I4639a78cd5cccb283023265dbcc07898c3e7cf95	2013-06-06 17:28:09 -07:00
Ronald S. Bultje	ad34368786	New intra mode and partitioning probabilities. Split partition probabilities between keyframes and non-keyframes, since they are fairly different. Also have per-blocksize interframe y intramode probabilities, since these vary heavily between different blocksizes. Lastly, replace default probabilities for partitioning and intra modes with new ones generated from current codec. Replace counts with actual probabilities also. Change-Id: I77ca996e25e4a28e03bdbc542f27a3e64ca1234f	2013-06-06 10:45:30 -07:00
Jim Bankoski	5a88271b09	don't tokenize & encode tokens for blocks in UMV This avoids encoding tokens for blocks that are entirely in the UMV border. This changes the bitstream. Change-Id: I32b4df46ac8a990d0c37cee92fd34f8ddd4fb6c9	2013-06-06 06:10:25 -07:00
Deb Mukherjee	83885235a7	Clean-ups on switchable interpolation and mv_ref Adds backward adaptation and differential forward updates of switchable interpolation filter probabilities. Also adds some cosmetic cleanups and minor fixes on mv_ref probabilities. derfraw300: +0.353% (with most coming from switchable interp changes) Change-Id: Ie2718be73528c945fd0d80cfd63ca2d9cb3032de	2013-06-05 10:11:52 -07:00
Dmitry Kovalev	3b9ec31eaf	Replacing memcpy with struct assignment. Change-Id: Ib557cc6351404b9e178e95a545883eb3666f11f0	2013-05-31 16:00:32 -07:00
Dmitry Kovalev	75cf80ee8e	Adding new encode_txfm function. Moving some code from vp9_pack_bitstream to encode_txfm function. Change-Id: Icc25d6083e54f09886216fea632ceac002042d7f	2013-05-31 12:33:44 -07:00
Ronald S. Bultje	a288cb3b10	Merge "Merge all various transform size data trackers into single variables." into experimental	2013-05-31 09:59:24 -07:00
Scott LaVarnway	1e025dbfd1	Merge "Moved use_prev_in_find_mv_refs check to frame level" into experimental	2013-05-31 09:35:51 -07:00
Ronald S. Bultje	e9d68a5e36	Merge all various transform size data trackers into single variables. Change-Id: I2dfc569106b29fbe4da20585a0e85e5e9ea6a4db	2013-05-31 09:18:59 -07:00
Jim Bankoski	21595f8e38	Merge "Creates a new speed 1:" into experimental	2013-05-30 20:36:05 -07:00
Jim Bankoski	ced21bd6a6	Creates a new speed 1: This speed 1 - uses variance threshold stolen from static-thresh to determine split. Any superblock with greater than the variance set by static thresh * quantizer index squared is split. In addition transform size is set to largest size less than or equal to partition size, sub pixel filter is set to normal, and only 12 modes are used at all. Change-Id: If7a2858ee70f96d1eb989c04fd87a332b147abef	2013-05-30 19:53:00 -07:00
Ronald S. Bultje	e6485581fe	Remove splitmv. We leave it in rdopt.c as a local define for now - this can be removed later. In all other places, we remove it, thereby slightly decreasing the size of some arrays in the bitstream. Change-Id: Ic2a9beb97a4eda0b086f62c039d994b192f99ca5	2013-05-30 17:21:01 -07:00
Ronald S. Bultje	98c192ae83	Merge all intra mode coding trees into a single one. Also merge all counters. This removes a few unused probability updates from the bitstream. Change-Id: I20f58853e9dac84d8c0d9703ae012c55917516eb	2013-05-30 09:58:53 -07:00
Scott LaVarnway	353642bc53	Moved use_prev_in_find_mv_refs check to frame level This patch checks at the frame level to see if the previous mode info context can be used. This patch eliminates the flag check that was done for every mode and removes another check that was done prior to every vp9_find_mv_refs(). Change-Id: I9da5e18b7e7e28f8b1f90d527cad087073df2d73	2013-05-29 16:42:23 -04:00
Ronald S. Bultje	5cac66078e	Remove splitmv. Also do per-partition motion vector referencing in <sb8x8 partitions, and adjust mvref finding for sub8x8 partitions. Change-Id: Id3ed1ed4d2a8910d11d327db6cc63b8eb79f941f	2013-05-26 14:40:49 -07:00
Jingning Han	d093027791	Fix transform size coding mismatch This commit fixes a transform size enc/dec mismatch issue in the key frame coding. Change-Id: I0c4f40464a367b33dd91ace84506650b1aec2873	2013-05-24 11:30:58 -07:00
Yaowu Xu	a2db88fc26	Fix two bugs 1) Added an initialization of rd_tx_select_threshs[]. 2) Made updating transform size counts to be consistent Change-Id: Iaa9d6c6be825b0364c9d61a9802873d01356815c	2013-05-24 09:28:19 -07:00
Yaowu Xu	f116abf774	update txfm size counting Change-Id: I3a26baf8b2f945fea4f1aea156e60fa79f620f86	2013-05-23 18:27:07 -07:00
Jingning Han	7ac5ac52f9	Merge 4x4 block level partition into codebase Move 4x4/4x8/8x4 partition coding out of experimental list. This commit fixed the unit test failure issues. It also resolved the merge conflicts between 4x4 block level partition and iterative motion search for comp_inter_inter. Change-Id: I898671f0631f5ddc4f5cc68d4c62ead7de9c5a58	2013-05-23 11:58:50 +01:00
Jingning Han	d2cacdc530	Merge "Make the intra rd search support 8x4/4x8" into experimental	2013-05-22 10:00:15 -07:00
Yaowu Xu	8ba92a0bed	changes intra coding to be based on txfm block This commit changed the encoding and decoding of intra blocks to be based on transform block. In each prediction block, the intra coding iterates thorough each transform block based on raster scan order. This commit also fixed a bug in D135 prediction code. TODO next: The RD mode/txfm_size selection should take this into account when computing RD values. Change-Id: I6d1be2faa4c4948a52e830b6a9a84a6b2b6850f6	2013-05-22 11:53:19 +01:00
Yaowu Xu	232d90d8fd	Generalized intra 4x4 encoding for all sizes Change-Id: I1b86744fa247233c8df031b3f4b87b212c8dd094	2013-05-22 11:44:12 +01:00
Jingning Han	f153a5d063	Make the intra rd search support 8x4/4x8 This commit allows the rate-distortion optimization of intra coding capable of supporting 8x4 and 4x8 partition settings. It enables the entropy coding of intra modes in key frame using a unified contextual probability model conditioned on its above/left prediction modes. Coding performance: derf 0.464% Change-Id: Ieed055084e11fcb64d5d5faeb0e706d30268ba18	2013-05-21 21:03:00 -07:00
John Koleszar	ddf13be8ef	Merge "Initial version of alpha channel support" into experimental	2013-05-21 17:29:51 -07:00
Jingning Han	1f7d810a72	Merge "Deprecate 4x4 intra modes from bit-stream" into experimental	2013-05-21 15:59:58 -07:00
Scott LaVarnway	1db6373267	Merge "WIP: 4x4 idct/recon merge" into experimental	2013-05-21 10:45:53 -07:00
Dmitry Kovalev	4ac70bd7d3	Adding get_ref_frame_idx function. Change-Id: I4f1a4eca6794cda78d00512196caacd5567e2dcc	2013-05-20 16:09:00 -07:00
Jingning Han	1e6be7bc75	Deprecate 4x4 intra modes from bit-stream Replace B_DC_PRED like syntax element writing/reading with sb_ymode set (e.g., DC_PRED, etc). Change-Id: I293006a6b3bcd130c08ea9f053e7a79c6819c6f8	2013-05-20 12:14:24 -07:00
Scott LaVarnway	ba48a11130	WIP: 4x4 idct/recon merge This patch eliminates the intermediate diff buffer usage by combining the short idct and the add residual into one function. The encoder can use the same code as well. Change-Id: I296604bf73579c45105de0dd1adbcc91bcc53c22	2013-05-20 13:03:17 -04:00
Jingning Han	810b612c23	Enable bit-stream support to 8x4 and 4x8 partition The recursive partition type search is enabled down to 4x4, 4x8 and 8x4, followed by the corresponding rate-distortion optimization for the per-partition encoding mode decisions. The bit-stream writing/reading synchronized in supporting the rectangular partition of 8x8 block. This provides above 1% coding performance gains on derf. To do next: 1. re-design the rate-distortion loop for inter prediction below 8x8. 2. re-design the rate-distortion loop for intra prediction below 4x4. 3. make the loop-filter aware of rectangular partition of 8x8 block. 4. clean the unused probability models. 5. update default probability values. Change-Id: Idd41a315b16879db08f045a322241f46f1d53f20	2013-05-19 14:59:04 -07:00
Jingning Han	49068dd985	Merge "Refactor encode_sb_ for 4x8/8x4 partition" into experimental	2013-05-17 09:19:48 -07:00
Paul Wilkins	51bc4bf4a0	Remove MODE_STATS flag and code Change-Id: I6c70a8a8a4633399842ac74792003ae5f7859ffa	2013-05-17 12:34:10 +01:00
John Koleszar	679e4abdd5	Initial version of alpha channel support This is a mostly-working implementation of an extra channel in the bitstream. Configure with --enable-alpha to test. Notable TODOs: - Add extra channel to all mismatch tests, PSNR, SSIM, etc - Configurable subsampling - Variable number of planes (currently always uses all 4) - Loop filtering - Per-plane lossless quantizer - ARNR support This implementation just uses the same contents as the Y channel for the A channel, due to lack of content and general pain in playing back 4 channel content. A later patch will use the actual alpha channel passed in from outside the codec. Change-Id: Ibf81f023b1c570bd84b3064e9b4b8ae52e087592	2013-05-16 22:21:09 -07:00
Jingning Han	5a1c953310	Refactor encode_sb_ for 4x8/8x4 partition Deprecate set_block_index. Replace it with get_sb_index_ for consistency with partition search and bit-stream writing/reading. Use b_width/height_log2 instead of mi_width/height_log2, to support 4x4 resolution partition types. Change-Id: Ic1e71981e163c669f7ea6b3c12b831c284c4a494	2013-05-16 16:33:31 -07:00
John Koleszar	f07602e403	Merge "Remove vp9_extend_mb_row()" into experimental	2013-05-16 15:22:38 -07:00
John Koleszar	16ac5a5cde	Remove vp9_extend_mb_row() This code is no longer needed for correct intra prediction. Change-Id: I822d1a8b0ad0a00e7c4c6e7b2931790c39d1267d	2013-05-16 11:56:00 -07:00
Dmitry Kovalev	b0c101e2b4	Removing lossless flag from the bitstream. Change-Id: If6aee510cbc4910f2f24fcd92dddc65fdf8edeea	2013-05-15 18:20:51 -07:00
Jingning Han	1f26840fbf	Enable recursive partition down to 4x4 This commit allows the rate-distortion optimization recursion at encoder to go down to 4x4 block size. It deprecates the use of I4X4_PRED and SPLITMV syntax elements from bit-stream writing/reading. Will remove the unused probability models in the next patch. The partition type search and bit-stream are now capable of supporting the rectangular partition of 8x8 block, i.e., 8x4 and 4x8. Need to revise the rate-distortion parts to get these two partition tested in the rd loop. Change-Id: I0dfe3b90a1507ad6138db10cc58e6e237a06a9d6	2013-05-14 12:39:56 -07:00
Jingning Han	6910f178f1	Use consistent partition context setup in enc/dec Move set_partition_seg_context_ to common file. Use consistent context setup conditions for partition probability model update at encoder and decoder. Change-Id: I24b7ed3b1c48e3d2568191a46b70136b99b67b1a	2013-05-11 15:22:13 -07:00

... 6 7 8 9 10 ...

922 Commits