generic-library/vpx

Author	SHA1	Message	Date
Paul Wilkins	4a28da5843	Enable more split modes for animated content. For content that is identified as likely to contain some animation or graphics content, increase the availability of split modes for good quality speeds 1-3. On a problem test animation clip this improves metrics results by about 0.25 db and makes a noticeable difference visually. It also causes a small drop in file size (~0.5%) but a rise in encode time of about 5-6% at speed 2. For more normal content it should have no effect. Change-Id: Ic4cd9a8de065af9f9402f4477a17442aebf0e439	2015-06-09 14:50:44 +01:00
Paul Wilkins	b19b16cfa1	Merge "Animation and dead zone detection."	2015-06-08 14:26:07 +00:00
Marco	8710cceb45	Fix to spatial svc: set reference_frame masking. For real-time mode: keep reference_frame masking off for spatial svc. Change-Id: I15e123c06f67ea040172b8d4042a672f3525b9d8	2015-06-05 08:25:33 -07:00
Paul Wilkins	668e804504	Animation and dead zone detection. Adds code to detect dead zone bars at the top and bottom of reformatted letterbox video (note that the code only looks at the top of the image and assumes any dead zone is symmetrical). Use of this to adapt rate control etc. will follow in a subsequent patch. Also counts other blocks (excluding the dead zone) that have no intra signal. The presence of a significant number of such blocks can be used as a identify that the frame may be artificial (e.g. animation, screen capture, graphics). This patch contains plumbing only and does not use the signal. Change-Id: I59bc93529cd4065416cef773e405fda3ae006a20	2015-06-04 01:01:20 +01:00
Marco	e88de49faa	Change tx_size_search_method setting for non-rd speed 5. Use the same settting as in speed >=6. This will use same logic for tx_size selecton as in speed >=6, which limits the transform size and reduces ringing artifact. Also metrics go up on average with this change: ~2% for PSNR, ~10% for SSIM. Change-Id: Ia2d50db236ae1cc72f742bfa6c9ec5ea50ff0e0a	2015-05-15 11:12:47 -07:00
paulwilkins	aecb1770d5	Merge "Image size restriction to rd auto partition search."	2015-05-07 14:12:14 +00:00
paulwilkins	af76953448	Merge "Remove CONSTRAIN_NEIGHBORING_MIN_MAX."	2015-05-05 09:32:11 +00:00
Marco	b9a72d3c4d	Allow for H and V intra modes for non-rd mode. For non-rd mode (speed >=5): use mask based on prediction block size, and (for non-screen content mode) allow for checking horiz and vert intra modes for blocks sizes < 16x16. Avg psnr/ssim metrics go up by about ~0.2%. Only allowing H/V intra on block sizes below 16x16 for now, to keep encoding time increase very small, and also when allowing H/V on 16x16 blocks, metrics went down on a few clips which need to be further examined. Change-Id: I8ae0bc8cb2a964f9709612c76c5661acaab1381e	2015-05-04 09:48:41 -07:00
paulwilkins	4a7dcf8eb2	Image size restriction to rd auto partition search. Impose a limit on the rd auto partition search based on the image format. Smaller formats require that the search includes includes a smaller minimum block size. This change is intended to mitigate the visual impact of ringing in some problem clips, for smaller image formats. Change-Id: Ie039e5f599ee079bbef5d272f3e40e2e27d8f97b	2015-05-01 16:16:02 +01:00
paulwilkins	287b0c6da9	Remove CONSTRAIN_NEIGHBORING_MIN_MAX. Remove one of the auto partition size cases. This case can behaves badly in some types of animated content and was only used for the rd encode path. A subsequent patch will add additional checks to help further improve visual quality. Change-Id: I0ebd8da3d45ab8501afa45d7959ced8c2d60ee4e	2015-05-01 15:15:16 +01:00
Marco	fa20a60f0d	Speed 5: use non-rd mode for key frame coding. Metrics on RTC set go down by ~1.5% on average. Key frame encoding time goes down by factor of ~5. Change-Id: Ia83acc55848613870e5ac6efe7f3d904d877febb	2015-03-27 16:19:26 -07:00
Adrian Grange	23ebacdb81	Auto-adaptive encoder frame resizing logic Note: This feature is still in development. Add an option for the encoder to decide the resolution at which to encode each frame. Each KF/GF/ARF goup is tested to see if it would be better encoded at a lower resolution. At present, each KF/GF/ARF is coded first at full-size and if the coded size exceeds a threshold (twice target data rate) at the maximum active Q then the entire group is encoded at lower resolution. This feature is enabled in vpxenc by setting: --resize-allowed=1 In addition, if the vpxenc command line also specifies valid frame dimensions using: --resize-width=XXXX & --resize_height=YYYY then all frames will be encoded at this resolution. Change-Id: I13f341e0a82512f9e84e144e0f3b5aed8a65402b	2015-02-10 09:59:32 -08:00
Yaowu Xu	65a1a3e85d	adjust rtc setting and threshold 1. Adjusted the threshold for coef update computation based on counts of tx used, avoid coef update computation when count is low (<20) 2. Move sf->lpf_pick = LPF_PICK_MINIMAL_LPF to speed 8. Change-Id: I02b44309e40fcdbf135c7934ae067a3f42502d30	2015-02-02 17:43:46 -08:00
Adrian Grange	527e073163	Remove elevate_newmv_thresh from SPEED_FEATURES (unused) Change-Id: I78ef7f89586a329787f6bc4c58ec83af210989a3	2015-01-22 16:12:50 -08:00
Yaowu Xu	a16f075375	Corrected value range of --cpu-used for vp9 This commit removes undefined value options of cpu-used for VP9 and changed vpxenc prompt to reflect the usable range of [-8,8] Change-Id: Ib80fef3dbb6ec9aabac45ed13e8ab6fbaf94f55e	2014-12-17 15:18:01 -08:00
Jingning Han	74ded4863e	Enable conditional skip path in rd_pick_intra_sby_mode These speed-up features for key frame coding are only turned on in the settings of hybrid non-RD and RD mode decision. It provides about 20% speed-up to the hybrid key frame coding at the expense of certain compression performance loss. For vidyo1, the key frame coding statistics are changed 9838F, 35.020 dB, 61677 us -> 9920F, 34.834 dB, 47556 us Overall rtc set compression performance is down by -0.257%. Change-Id: I0025447fda26bb7855e982955642b5f55d71b51f	2014-12-05 09:36:09 -08:00
Jingning Han	07711e9b27	Use hybrid RD and non-RD coding flow for key frame coding When block size is below 16x16, the encoder swap from non-RD to RD mode for key frame coding. This largely brough back the key frame compression performance. For vidyo1 at 1000 kbps, the key frame coding statistics are changed 9978F, 34.183 dB, 36807 us -> 9838F, 35.020 dB, 61677 us As compared to the full RD case 7187F, 34.930 dB, 214470 us The overall rtc set coding performance (single key frame setting) is improved by 1.5%. Change-Id: I78a4ecf025d7b24ec911e85be94e01da05e77878	2014-12-05 09:35:27 -08:00
Jingning Han	228ec17ff2	Merge "Rework coeff probability model update for rtc coding"	2014-12-03 11:34:35 -08:00
Marco	8fd3f9a2fb	Enable non-rd mode coding on key frame, for speed 6. For key frame at speed 6: enable the non-rd mode selection in speed setting and use the (non-rd) variance_based partition. Adjust some logic/thresholds in variance partition selection for key frame only (no change to delta frames), mainly to bias to selecting smaller prediction blocks, and also set max tx size of 16x16. Loss in key frame quality (~0.6-0.7dB) compared to rd coding, but speeds up key frame encoding by at least 6x. Average PNSR/SSIM metrics over RTC clips go down by ~1-2% for speed 6. Change-Id: Ie4845e0127e876337b9c105aa37e93b286193405	2014-12-03 09:18:08 -08:00
Jingning Han	8fe50191c6	Rework coeff probability model update for rtc coding This commit reworks the ONE_LOOP_REDUCED coefficient probability model update process. It allows model update for every coefficient across the spectrum at a coarser resolution, instead of performing precise update only for certain subset of probability models. The overall runtime remains nearly same (<1% change) for speed -6. The compression performance is improved by 7.5% in PSNR for speed -5 and 4.57% for speed -6, respectively. Change-Id: Ifb17136382ee7e39a9f34ff4a4f09a753125c8d1	2014-12-03 09:15:25 -08:00
Jingning Han	a6df0cbcca	Remove repeated search_type_check_frequency assign This parameter is initialized as 50. No need to re-assign the same value in speed -6. Change-Id: I8735a5593412df2fdcee53ae45c8ebd1c3d792e7	2014-11-25 18:36:41 -08:00
Yunqing Wang	edbd61e136	vp9_ethread: modify VP9_COMP structure This patch modified struct VP9_COMP. Created a struct ThreadData to include data that need to be copied for each thread. In multiple thread case, one thread processes one tile. all threads share one copy of VP9_COMP, (refer to VP9_COMP cpi in the code) but each thread has its own copy of ThreadData, (refer to ThreadData td in the code). Therefore, within the scope of encode_tiles(), both cpi and td need to be passed as function parameters. In single thread case, the FRAME_COUNTS pointer in ThreadData points to "counts" in VP9_COMMON. Change-Id: Ib37908b2d8e2c0f4f9c18f38017df5ce60e8b13e	2014-11-24 17:57:38 -08:00
Jingning Han	2fbdfd2c66	Key frame non-RD mode decision process This commit makes a non-RD coding mode decision process for key frame coding. It can be optionally turned on in speed -6 and above. Change-Id: I0847258b392877a0210b4768bef88ebc9ad009b5	2014-11-24 09:04:28 -08:00
Alex Converse	bc1b3d8412	Allow DC/H/V/TM on screen content. 6.3% better compression less than 1% compression time increase Change-Id: Ie83c059436e54c09de9e7c87e06e0a6d40dc38fe	2014-11-20 18:04:57 -08:00
Alex Converse	722e9d611b	Drop special inter mode selection for screen content. Better mode selection was implemented for all content. Change-Id: I479778ed21d3968892f4dce396c83733583f4f23	2014-11-20 18:04:57 -08:00
Yunqing Wang	54ba65a63e	Merge "vp9_ethread: move max/min partition size to mb struct"	2014-11-20 14:00:37 -08:00
Yunqing Wang	ad7586a9e1	vp9_ethread: move max/min partition size to mb struct The max_partition_size and max_partition_size are set at the beginning while setting speed features, and then adjusted at SB level. Moving them to mb struct ensures there is a local copy for each thread. Change-Id: I7dd08dc918d9f772fcd718bbd6533e0787720ad4	2014-11-20 09:24:50 -08:00
Yunqing Wang	70c9d2983b	Revert "vp9_ethread: include a pointer to mb in VP9_COMP" This reverts commit `6906d218dd`. Another way will be used to handle mb struct. Change-Id: Ic1111a46b2b1ee00f8f9e3fcd4cf3eb6030b2dc4	2014-11-20 08:31:12 -08:00
Yaowu Xu	1687c47bfd	change to call vp9_refining_search_sad() directly The function pointer in compressor instance does not change, so this commit changes to call the function directly. Change-Id: I9c9c460e3475711c384b74c9842f0b4f3d037cc5	2014-11-17 11:30:17 -08:00
Yunqing Wang	6906d218dd	vp9_ethread: include a pointer to mb in VP9_COMP Modified VP9_COMP struct to include MACROBLOCK *mb. This change makes it feasible in multi-thread case to allocate a mb for each thread. Change-Id: I624d6d1aa9c132362200753e5d90b581b1738d6e	2014-11-14 12:31:06 -08:00
Adrian Grange	35de9db312	Merge "Prepare for dynamic frame resizing in the recode loop"	2014-11-13 15:01:49 -08:00
Adrian Grange	0d085ebc0a	Prepare for dynamic frame resizing in the recode loop Prepare for the introduction of frame-size change logic into the recode loop. Separated the speed dependent features into separate static and dynamic parts, the latter being those features that are dependent on the frame size. Change-Id: Ia693e28c5cf069a1a7bf12e49ecf83e440e1d313	2014-11-13 11:41:20 -08:00
Jingning Han	e717d22b63	Use reconstructed pixels for intra prediction This commit makes the speed -6 and above use the reconstructed boundary pixels for precise intra prediction. This allows more intra prediction modes to be tested in the non-RD coding process. Enabling horizontal and vertical intra prediction modes can improve the speed -6 compression performance for rtc set by 0.331%. Change-Id: I3a99f9d12c6af54de2bdbf28c76eab8e0905f744	2014-11-11 10:04:43 -08:00
Yaowu Xu	0271ff7775	Fix speed 7 and speed 12 for rt A recent change has introduced big quality drops for speed 7 and 12 for --rt mode. The change reverted the big drop and improved quality by 9.5% for speed 7 and 13.4% for speed 12. Change-Id: I07b82e3bb6002a73af486a083458c88877bdad01	2014-10-31 17:29:02 -07:00
Yunqing Wang	aed48c786a	Remove unused speed feature Partition_check was unused and removed. Change-Id: I15ec9162d86dc61f04c09229c498629878ed7155	2014-10-29 17:05:04 -07:00
Jingning Han	9349a28e80	Enable mode search threshold update in non-RD coding mode Adaptively adjust the mode thresholds after each mode search round to skip checking less likely selected modes. Local tests indicate 5% - 10% speed-up in speed -5 and -6. Average coding performance loss is -1.055%. speed -5 vidyo1 720p 1000 kbps 16533 b/f, 40.851 dB, 12607 ms -> 16556 b/f, 40.796 dB, 11831 ms nik 720p 1000 kbps 33229 b/f, 39.127 dB, 11468 ms -> 33235 b/f, 39.131 dB, 10919 ms speed -6 vidyo1 720p 1000 kbps 16549 b/f, 40.268 dB, 10138 ms -> 16538 b/f, 40.212 dB, 8456 ms nik 720p 1000 kbps 33271 b/f, 38.433 dB, 7886 ms -> 33279 b/f, 38.416 dB, 7843 ms Change-Id: I2c2963f1ce4ed9c1cf233b5b2c880b682e1c1e8b	2014-10-29 10:55:34 -07:00
Yaowu Xu	87665f16f4	Merge "Change speed features for good quality(cpu-used=5)"	2014-10-22 08:40:15 -07:00
Yaowu Xu	c30f7e6cc5	Change speed features for good quality(cpu-used=5) The existing speed features produce horrible encoding results, almost 30% worse than cpu-used=4, this commit adjust the speed features to produce relatively resonable results to be within 3%-5% of cpu-used=4. Change-Id: I0ca6ebafb33024d4a0cbcf04c78a4a00b8dd1ecf	2014-10-21 11:59:12 -07:00
Jingning Han	abb2fbb10e	Remove deprecated constrain_copy_partitioning function Its functionality has been replaced with choose_partitioning and threshold based control on split mode check. Change-Id: Ic9bb321df06b524f5c38ea5874dc6f6a8f93c5e3	2014-10-20 17:08:21 -07:00
Jingning Han	e62ce79e1a	Remove deprecated use_lastframe_partitioning feature This speed feature has been deprecated in both yt and rtc coding modes. This commit removes the related operations. Change-Id: I079c79c6adafe45581af2ebf8b98faebcface1ce	2014-10-20 17:03:38 -07:00
Jingning Han	9f128b3ed9	Hybrid partition search for rtc coding mode This commit re-designs the recursive partition search scheme in rtc speed -5. It first checks if the current block is under cyclic refresh mode. If so, apply recursive partition search. Otherwise, perform sub-sampled pixel based partition selection. When the pre-selection finds the partition size should be 32x32 or above, use the partition size directly. Otherwise, apply partition search at nearby levels around the preset partition size. It is enabled in speed -5. The compression performance of rtc speed -5 is improved by 9.4%. Speed wise, the run-time goes slower from 1% to 10%. nik_720p, 1000 kbps 33220 b/f, 38.977 dB, 10109 ms -> 33200 b/f, 39.119 dB, 10210 ms vidyo1_720p, 1000 kbps 16536 b/f, 40.495 dB, 10119 ms -> 16536 b/f, 40.827 dB, 11287 ms Change-Id: I65adba352e3adc03bae50854ddaea1b421653c6c	2014-10-20 13:02:12 -07:00
Jingning Han	5e766ccee0	Use rate/distortion thresholds to control non-RD partition search Compare the estimated rate and distortion to the thresholds scaled according to the operating block size and determine if further split partition search will be run. The compression performance of speed -5 is changed by -0.074%. The encoding speed is 10% - 15% faster. vidyo1 720p 16545 b/f, 40.492 dB, 11475 ms -> 16535 b/f, 40.486 dB, 10100 ms nik720p 16624 b/f, 36.310 dB, 10071 ms -> 16617 b/f, 36.313 dB, 8346 ms Change-Id: Ic9197ab5761279ae55d2fb7813b2af0e0db497b8	2014-10-15 13:40:33 -07:00
Jingning Han	89b8c7a513	Replace copy_partitioning use case with choose_partitioning This commit replaces the use of copy_partitioning with choose_partitioning based on the sse of subsamped pixels, which provides significantly better coding performance and runs at similar speed, as compared to copy_partitioning. It improves rtc speed 5 coding performance by 3%. Change-Id: I52d3682a12dce0147f5e52383a594fc242ca3228	2014-10-15 11:37:20 -07:00
Deb Mukherjee	3117830af3	Merge "Subpel search cleanups and enhancements"	2014-10-09 11:14:51 -07:00
Deb Mukherjee	d78dbff09a	Subpel search cleanups and enhancements - Some fixes to surface fit. - Returns variance function as cost rather than sad in the pattern search and diamond search functions. Only vp9_pattern_search_sad function used in bigdia search uses sad as integer 1-away costs. - Deploys SUBPEL_TREE_PRUNED_MORE for speed 4+. Results: derf [Speed 3]: About +0.036% in coding efficiency without any discernible speed loss. derf [Speed 4]: About 2-3% faster at -0.199% loss in coding efficiency. derf [Speed 5]: About 3-4% faster at -0.149% loss in coding efficiency. Change-Id: I8462f94f6adb46966ca964f2bd0400977357fd63	2014-10-08 23:59:43 -07:00
Yunqing Wang	e18edd5eb6	Allow mode search breakout at very low prediction errors In model_rd_for_sb function, the spatial domain SSE and variance are checked to see if transform coefficients are quantized to 0. Besides that, this patch adds another set of thresholds that are much more strict. These thresholds are used to conduct a partition block level check to measure if all its TX blocks are skippable for YUV planes. If it is true, x->skip is set for this partition block, and thus its mode search is terminated. This speeds up the encoding at very low prediction error case, such as screen sharing application. This patch covers what rd_encode_breakout_test() does, so that function is removed. Borg test at speed 3 shows: For stdhd set, psnr: +0.008%, ssim: +0.014%; For derf set, psnr: +0.018%, ssim: +0.025%. No noticeable speed change. Change-Id: I4e5f15cf10016a282a68e35175ff854b28195944	2014-10-08 17:46:22 -07:00
Jim Bankoski	0ce51d823f	experimental : partition using 1/8 x 1/8 image The concept: There's too much noise in source pixels for variance and at low bitrate the reconstructed looks nothing like the source so we have problems getting good partitionings with either. This skirts the issue by using a box blur scaled down version for variance calculations. To compare against source_var_ moved keyframe to be rd based like source_var. Change-Id: Ie3babdbfadae324b7b5a76bea192893af27f0624	2014-10-07 16:36:14 -07:00
Jingning Han	bb260d9076	Rework partition search skip scheme This commit enables the encoder to skip split partition search if the bigger block size has all non-zero quantized coefficients in low frequency area and the total rate cost is below a certain threshold. It logarithmatically scales the rate threshold according to the current block size. For speed 3, the compression performance loss: derf -0.093% stdhd -0.066% Local experiments show 4% - 20% encoding speed-up for speed 3. blue_sky_1080p, 1500 kbps 51051 b/f, 35.891 dB, 67236 ms -> 50554 b/f, 35.857 dB, 59270 ms (12% speed-up) old_town_cross_720p, 1500 kbps 14431 b/f, 36.249 dB, 57687 ms -> 14108 b/f, 36.172 dB, 46586 ms (19% speed-up) pedestrian_area_1080p, 1500 kbps 50812 b/f, 40.124 dB, 100439 ms -> 50755 b/f, 40.118 dB, 96549 ms (4% speed-up) mobile_calendar_720p, 1000 kbps 10352 b/f, 35.055 dB, 51837 ms -> 10172 b/f, 35.003 dB, 44076 ms (15% speed-up) Change-Id: I412e34db49060775b3b89ba1738522317c3239c8	2014-10-03 11:54:30 -07:00
Yunqing Wang	b1b6fd85db	Merge "Skip the partition search for still frames"	2014-09-30 11:59:05 -07:00
Deb Mukherjee	4e9c0d2ad4	Adds two new subpel search methods One is a more aggressive version of the pruned subpel tree search where only a single halfpel candidate is searched. The search candidate is based on a surface fit result. The other is a method to obtain the subpel position at one shot based on the same surface fit. The methods have not been deployed in any speed setting yet. Change-Id: I34fef3f2e34f11396c9d1ba97f4be8c4ffca62d3	2014-09-29 12:51:20 -07:00
Yunqing Wang	1fcbf6ed56	Skip the partition search for still frames This patch re-enabled the feature in Pengchong's patch (commit `1286126073`). Originally, it was turned on while use_lastframe_partitioning > 0(not used anymore). Now it was added as a feature, and turned on while speed >= 2. As described in the original patch, this feature helps speed up the slideshows in YouTube. Change-Id: I1b0f18d65da1ee1c8d1e117dabba910c5207c471	2014-09-26 09:03:52 -07:00
Yaowu Xu	8751e49a6f	Merge "Adapt mode based rd_threshold for similar block size"	2014-09-23 22:28:08 -07:00
Deb Mukherjee	6c6213d960	Merge "Pruned subpel search for speed 3."	2014-09-23 17:12:03 -07:00
Yaowu Xu	4a101310e8	Adapt mode based rd_threshold for similar block size The rd_thresholds are adaptively changed based on best mode tested. It was only changed for the same block size, this commit makes the adaptation for similar block sizes too. The commit also made minor adjustment and code cleanups. The impact on encoding time for _ped: 118089 ms -> 111927 ms The impact on compression: derf: -0.339% stdhd: -0.303% Change-Id: I8817fed1102350497f2ec631849e43f753878e5d	2014-09-23 16:10:59 -07:00
Deb Mukherjee	c94b17f4b2	Pruned subpel search for speed 3. Adds code to return an integer cost list for NSTEP search. Then uses it for pruned subpel search in speed 3. derf: -0.06% Speed on mobcal 720p increaes from 10.28 fps to 10.65 fps. [Subject to further testing]. Change-Id: Ib591382d25b2c11bcaba9d3a27a93a9d1ab27a96	2014-09-23 11:27:58 -07:00
Jingning Han	eee904c9b9	Adaptive mode search scheduling This commit enables an adaptive mode search order scheduling scheme in the rate-distortion optimization. It changes the compression performance by -0.433% and -0.420% for derf and stdhd respectively. It provides speed improvement for speed 3: bus CIF 1000 kbps 24590 b/f, 35.513 dB, 7864 ms -> 24696 b/f, 35.491 dB, 7408 ms (6% speed-up) stockholm 720p 1000 kbps 8983 b/f, 35.078 dB, 65698 ms -> 8962 b/f, 35.054 dB, 60298 ms (8%) old_town_cross 720p 1000 kbps 11804 b/f, 35.666 dB, 62492 ms -> 11778 b/f, 35.609 dB, 56040 ms (10%) blue_sky 1080p 1500 kbps 57173 b/f, 36.179 dB, 77879 ms -> 57199 b/f, 36.131 dB, 69821 ms (10%) pedestrian_area 1080p 2000 kbps 74241 b/f, 41.105 dB, 144031 ms -> 74271 b/f, 41.091 dB, 133614 ms (8%) Change-Id: Iaad28cbc99399030fc5f9951eb5aa7fa633f320e	2014-09-22 09:28:16 -07:00
Jingning Han	f02e0b6cf6	Merge "Remove unused speed feature"	2014-09-13 10:43:03 -07:00
Deb Mukherjee	c0dfecfb89	Merge "Use bigdia search with pruned subpel search"	2014-09-12 16:42:18 -07:00
Deb Mukherjee	83c76118eb	Use bigdia search with pruned subpel search Improves function to return sad of integer pels by reusing integer pels already visited in the smallest scale. Turns on BIGDIA search for speed 4. Also, turns on the first version of the pruned subpel search at this speed. derf: -0.32% (speed 4) Speed seems to improve by at least 5% but subject to verification. Change-Id: Iaec8eaffd61d6237ac029e6a2a1b0a88b2a35271	2014-09-12 10:25:12 -07:00
Jingning Han	00fe92c22f	Remove unused speed feature The speed feature that skips compound inter prediction modes was subsumed by other speed features and effectively was not in use. This commit removes it. Change-Id: I22b0c71a8ddd15d93b25d86fa63a1dce2ba6a1a9	2014-09-11 15:54:53 -07:00
Jingning Han	82757250d6	Merge "Refactor to remove speed feature dependency on mode search order"	2014-09-11 11:14:26 -07:00
Jingning Han	f9f0879756	Refactor to remove speed feature dependency on mode search order This commit refactor the rate-distortion optimization search for regular block sizes to remove the speed feature dependency on mode search order. Change-Id: Ied033ee484c2957e17baa7b6450b720fe7dd0e7d	2014-09-10 17:09:14 -07:00
Yunqing Wang	f10d7eeda2	Remove the use of use_lastframe_partitioning at speed 4 The use of use_lastframe_partitioning is totally removed in good- quality encoding. Its usage in real-time encoding needs to be evaluated to see if it can be removed too. The Borg tests at speed 4 showed: stdhd set: 0.220% psnr gain, 0.166% ssim gain; derf set: 0.329% psnr gain, 0.476% ssim gain. Speed test on selected clips showed 1.54% speedup.(Worst case: pedestrian_area_1080p25.y4m, speed loss: 1.5%) Change-Id: I1c844d329b0b5678558439b887297c1be7ddab00	2014-09-09 10:54:07 -07:00
Yunqing Wang	1092140379	No longer use use_lastframe_partitioning speed feature The speedup in rd_pick_partition() function makes it possible to drop use_lastframe_partitioning feature. By doing that, we achieve good PSNR gain with small speed loss. Also, this makes encoding loop less complicated. The code cleanup patch will follow. Borg tests showed: 1. At speed 2, stdhd set: 0.201% PSNR gain, 0.133% SSIM gain; derf set: 0.262% PSNR gain, 0.276% SSIM gain. 2. At speed 3, stdhd set: 0.139% PSNR gain, 0.109% SSIM gain; derf set: 0.447% PSNR gain, 0.442% SSIM gain. The average speed loss over selected test clips is within 1% with the worst case of 4%. Change-Id: Icfd2ded7869372b585a6972855d933b3d0280d90	2014-09-05 16:24:41 -07:00
Yaowu Xu	7a33712475	Change last_partition_redo_frequency for speed 3 From 3 to 2, which seems to be slightly positive on compression for all test sets, also reduces encoding time by 2%-5%, varying on the test clips. Change-Id: If045417bd27311700c919b4a335eff0dc1130ae0	2014-09-03 09:34:10 -07:00
Yaowu Xu	cdda17ed77	Remove redundant code Change-Id: I453b167f03811a3cd3592089593b3f2823f62ab3	2014-09-03 09:34:10 -07:00
Jingning Han	4282955ee1	Skip intra mode tests depending on inter residuals This commit allows encoder to skip intra coding mode test, when the known inter residual is less than the source variance. It reduces the runtime of speed 3 for test clips: bus cif 1000 kbps: 8587 ms -> 8260 ms, 3.8% speed-up pedestrian 1080p 2000 kbps: 161381 ms -> 155241 ms, 3.7% speed-up. The compression performance is down by derf -0.36% stdhd -0.25% Change-Id: I75ce1e035b4da2153cb1ac14111d1a07c05a735d	2014-08-29 08:37:35 -07:00
Yunqing Wang	4d2c376923	Early termination in encoding partition search In the partition search, the encoder checks all possible partitionings in the superblock's partition search tree. This patch proposed a set of criteria for partition search early termination, which effectively decided whether or not to terminate the search in current branch based on the "skippable" result of the quantized transform coefficients. The "skippable" information was gathered during the partition mode search, and no overhead calculations were introduced. This patch gives significant encoding speed gains without sacrificing the quality. Borg test results: 1. At speed 1, stdhd set: psnr: +0.074%, ssim: +0.093%; derf set: psnr: -0.024%, ssim: +0.011%; 2. At speed 2, stdhd set: psnr: +0.033%, ssim: +0.100%; derf set: psnr: -0.062%, ssim: +0.003%; 3. At speed 3, stdhd set: psnr: +0.060%, ssim: +0.190%; derf set: psnr: -0.064%, ssim: -0.002%; 4. At speed 4, stdhd set: psnr: +0.070%, ssim: +0.143%; derf set: psnr: -0.104%, ssim: +0.039%; The speedup ranges from several percent to 60+%. speed1 speed2 speed3 speed4 (1080p, 100f): old_town_cross: 48.2% 23.9% 20.8% 16.5% park_joy: 11.4% 17.8% 29.4% 18.2% pedestrian_area: 10.7% 4.0% 4.2% 2.4% (720p, 200f): mobcal: 68.1% 36.3% 34.4% 17.7% parkrun: 15.8% 24.2% 37.1% 16.8% shields: 45.1% 32.8% 30.1% 9.6% (cif, 300f) bus: 3.7% 10.4% 14.0% 7.9% deadline: 13.6% 14.8% 12.6% 10.9% mobile: 5.3% 11.5% 14.7% 10.7% Change-Id: I246c38fb952ad762ce5e365711235b605f470a66	2014-08-28 11:27:28 -07:00
Deb Mukherjee	bb2a9abb1e	Merge "Updates vp9_pattern search to return integer sads"	2014-08-28 09:38:56 -07:00
Deb Mukherjee	04b100b23e	Updates vp9_pattern search to return integer sads Updates the vp9_pattern_search function to return integer one-away neighbors' sad values, for subsequent use in speeding up the sub-pel search. Also, removes code for the do_refine option which is not being used currently. Updates the integer and subpel functions to pass in a 5-element sad list for output or input. A new pruned sub-pel search algorithm is implemented that uses the sad returned from the integer pel search. But it is not deployed yet. Change-Id: Ifa9f5ad024b5b660570366d2bd900343e1891520	2014-08-28 06:49:58 -07:00
Yaowu Xu	bcfb1ffb9d	Merge "add a new interp filter search strategy."	2014-08-26 17:30:42 -07:00
Yaowu Xu	1144fee3d5	add a new interp filter search strategy. This commit addes a new strategy to reduce the search for optimal interpolation filter type. The encoder counts and store how many each filter type is selected and used for each of the reference frames. A filter type that is rarely used for all three reference frames is masked out to avoid computation. The impact on compression is neglectible: -0.02% on derf +0.02% on stdhd Encoding time is seen to reduce by 2~3%. Change-Id: Ibafa92291b51185de40da513716222db4b230383	2014-08-26 09:05:04 -07:00
Dmitry Kovalev	0082727cb7	Merge "Adding is_keyframe temp var."	2014-08-25 18:36:59 -07:00
Dmitry Kovalev	98c8eb85e6	Adding is_keyframe temp var. Change-Id: I5fec955c8b8f5a9b5027a0f92afb22d22770d84a	2014-08-21 17:41:03 -07:00
Dmitry Kovalev	45425f8c1e	Removing is_best_mode() function. Change-Id: Iccd7cec885e8aeb0e54613d888f9960c393cee0b	2014-08-21 11:32:33 -07:00
Dmitry Kovalev	f617889be7	Moving frame_is_boosted() to vp9_speed_features.c. Change-Id: I9261ded5fbba7a625d8224d91be296265a932410	2014-08-19 10:31:29 -07:00
Yunqing Wang	4d98b50be5	Merge "Add early termination in transform size search"	2014-08-18 19:00:24 -07:00
Yunqing Wang	ba70f16011	Add early termination in transform size search In the full-rd transform size search, we go through all transform sizes to choose the one with best rd score. In this patch, an early termination is added to stop the search once we see that the smaller size won't give better rd score than the larger size. Also, the search starts from largest transform size, then goes down to smallest size. A speed feature tx_size_search_breakout is added, which is turned off at speed 0, and on for other speeds. The transform size search is turned on at speed 1. Borg test results: 1. At speed 1, derf set: psnr gain: 0.618%, ssim gain: 0.377%; stdhd set: psnr gain: 0.594%, ssim gain: 0.162%; No noticeable speed change. 3. At speed 2, derf set: psnr loss: 0.157%, ssim loss: 0.175%; stdhd set: psnr loss: 0.090%, ssim loss: 0.101%; speed gain: ~4%. Change-Id: I22535cd2017b5e54f2a62bb6a38231aea4268b3f	2014-08-18 16:27:04 -07:00
Jingning Han	6a464eca05	Speed up mode search depending on relative ref frame position This commit enables the encoder to record the location of the center frame to generate alter reference frame. It then allows to skip checking prediction modes of other reference frame types when it comes to encode this frame. The speed 3 runtime is reduced for the test sequences: bus at CIF 1000 kbps, 9791 ms -> 9446 ms, i.e., 3.5% speed-up, pedestrian at 1080p 2000 kbps, 184043 ms -> 175730 ms, i.e., 4.5% speed-up. No compression performance change observed. Change-Id: Iacfde3bcc1445964e7a241f239bd6ea11cb94bd1	2014-08-18 16:06:54 -07:00
Pengchong Jin	997db6fc3f	Merge "Add a speed feature to give the tighter search range"	2014-08-15 19:51:04 -07:00
Pengchong Jin	eca93642e2	Add a speed feature to give the tighter search range Add a speed feature to give the tighter partition search range. Before partition search, calculate the histogram of the partition sizes of the left, above and previous co-located blocks of the current block. If the variance of observed partition sizes is small enough, adjust the search range around the mean partition size, which will be tigher. The feature is currently turned on at speed 2. Experiments on sample youtube clips show on average the runtime is reduced by 3-7%. For hard stdhd clips: park_joy_1080p @ 15000kbps: 509251 ms -> 491953 ms (3.3%) pedestrian_area_1080p @ 2000kbps: 223941 ms -> 214226 ms (4.3%) The PSNR performance is changed: derf: -0.112% yt: -0.099% hd: -0.090% stdhd:-0.102% Change-Id: Ie205ec5325bf92ec5676c243e30ba9d0adca10f2	2014-08-15 16:14:20 -07:00
Dmitry Kovalev	dc35b40a67	Merge "Simplifying vp9_set_speed_features() function."	2014-08-15 15:31:43 -07:00
Yunqing Wang	28b1437d77	Remove a unused speed feature Removed disable_split_var_thresh, which is not used anymore. Change-Id: I50119b150442e1571157433b5effc6aae0dbe0fd	2014-08-15 14:10:27 -07:00
Yaowu Xu	5966586aef	Mask out H_PRED and V_PRED for 32x32 blocks Change-Id: I2847af5062b5fa320629fcabb9fa6b23ba3e5513	2014-08-14 10:52:10 -07:00
Yaowu Xu	4d6d061316	Set max_intra_bsize to 32x32 At --good and speed 3 or above for resolution less than 720p. This disables the tests for 64x64 intra prediction modes. Encoding time reduction is about 1%. Change-Id: Ib396e3d1417fece416e3f0fee929b128acbb130f	2014-08-14 10:51:44 -07:00
Jingning Han	ccef8842d2	Allow full coeff probability model and cost update This commit moves the simplified coefficient probability model and costing update to speed 4, and turns on chessboard pattern mode search for sub 720p sequences. The overall coding performance of speed 3 is improved: derf 0.889% stdhd 1.744% The speed 3 runtime for test sequences are improved: bus cif at 1000 kbps 9823 ms -> 9642 ms pedestrian 1080p 2000 kbps 189559 ms -> 183284 ms Change-Id: Iecbc7496a68f31fd49fb09f8dfd97c028d675a5d	2014-08-13 14:17:14 -07:00
Jingning Han	6e086548cb	Merge "Enable motion field based mode seach skip"	2014-08-13 14:13:19 -07:00
Jingning Han	0daadeb60c	Enable motion field based mode seach skip This commit allows the encoder to check the above and left neighbor blocks' reference frames and motion vectors. If they are all consistent, skip checking the NEARMV and ZEROMV modes. This is enabled in speed 3. The coding performance is improved: pedestrian area 1080p at 2000 kbps, from 74773 b/f, 41.101 dB, 198064 ms to 74795 b/f, 41.099 dB, 193078 ms park joy 1080p at 15000 kbps, from 290727 b/f, 30.640 dB, 609113 ms to 290558 b/f, 30.630 dB, 592815 ms Overall compression performance of speed 3 is changed derf -0.171% stdhd -0.168% Change-Id: I8d47dd543a5f90d7a1c583f74035b926b6704b95	2014-08-13 12:15:13 -07:00
Jim Bankoski	5c55202c6b	intra blocks disallowed inadvertently At speed 6 the smallest partitioning was 16x16 and biggest intra block was 8x8, essentially disallowing all intra blocks which produces ugly artifacts when revealing new video. Change-Id: I364042d4c64e09be0666ade64aac94d0a1b586cf	2014-08-12 16:22:32 -07:00
Dmitry Kovalev	cd1fbc67f9	Simplifying vp9_set_speed_features() function. Change-Id: I3e67230690b81ef54ef48ae26107fe7bc880ab8e	2014-08-08 16:29:24 -07:00
Dmitry Kovalev	91c2f1e45a	Moving pass from VP9_COMP to VP9EncoderConfig. We had a very complicated way to initialize cpi->pass from cfg->g_pass: switch (cfg->g_pass) { case VPX_RC_ONE_PASS: oxcf->mode = ONE_PASS_GOOD; break; case VPX_RC_FIRST_PASS: oxcf->mode = TWO_PASS_FIRST; break; case VPX_RC_LAST_PASS: oxcf->mode = TWO_PASS_SECOND_BEST; break; } cpi->pass = get_pass(oxcf->mode). Now pass is moved to VP9EncoderConfig and initialization is simple: switch (cfg->g_pass) { case VPX_RC_ONE_PASS: oxcf->pass = 0; break; case VPX_RC_FIRST_PASS: oxcf->pass = 1; break; case VPX_RC_LAST_PASS: oxcf->pass = 2; break; } Change-Id: I8f582203a4575f5e39b071598484a8ad2b72e0d9	2014-08-08 14:27:54 -07:00
Alex Converse	2be9ea610f	Use INTER_ALL for VAR based partitions for screencast material. This offers 25% more compression on my HD screencast testset. Change-Id: I85eaef95fd8f2e03e326443e9514482b2ee35cef	2014-08-05 15:23:50 -07:00
Jingning Han	ca2dcb7fed	Chessboard pattern partition search This commit enables a chessboard pattern constrained partition search for 720p and above resolutions. The scheme applies stricter partition search to alternative blocks based on its above/left neighboring blocks' partition range, as well as that of the collocated blocks in the previous frame. It is currently turned on at 16x16 block size level. The chessboard pattern is flipped per coding frame. The speed 3 runtime is reduced: park_joy_1080p, 652832 ms -> 607738 ms (7% speed-up) pedestrian_area_1080p, 215998 ms -> 200589 ms (8% speed-up) The compression performance is changed: hd -0.223% stdhd -0.295% Change-Id: I2d4d123ae89f7171562f618febb4d81789575b19	2014-07-30 10:32:41 -07:00
Jingning Han	54ad09586c	Enable chessboard inter prediction filter type search This commit enables a chessboard pattern prediction filter type search scheme for rate-distortion optimization speed-up. For the inferred motion vector modes, the encoder can re-use its above/left neighbor blocks' prediction filter type and skip a full test on all possible filter types. Such operation is turned on/off alternatively in a chessboard manner. It is turned on in speed 3. For test clip pedestrian 1080p, the runtime is reduced from 231500 ms -> 221700 ms. The compression performance is changed: derf: -0.147% yt: -0.134% hd: -0.079% stdhd: -0.220% Change-Id: I1912f278e7576c2dc632688e3ad7a257410c605a	2014-07-22 16:49:03 -07:00
Jingning Han	ffd948bbd5	Turn on adaptive pred filter scheme for sub8x8 below 720p For sequences of resolution below 720p, the encoder will check intra prediction modes and inter prediction modes from LAST_FRAME. This commit turns on adaptive prediction filter scheme for sub8x8 blocks, where inter prediction modes are enabled. For the test sequence bus at CIF, the speed 2 runtime goes down from 17879 ms to 16783 ms, i.e., 6% speed up. The compression performance of derf set is down by -0.128%. Change-Id: I01d5321a5ceab4e0666ac5be56c52d896c7a8d45	2014-07-21 16:22:56 -07:00
Yaowu Xu	51c60a891e	make default_interp_filter choice a speed feature This commit changed the hard-coded DEFAULT_INTERP_FILTER to a speed feature with the same default value: SWITCHABLE. Change-Id: I7f54f40f1bd3f5277841d04b85db7a84e47313f1	2014-07-16 14:28:51 -07:00
Yaowu Xu	faa686bb1b	Added a rt speed 12 We target this speed to achieve similar encoding speed and better compression than vp8 rt mode with cpu-used at -12. Change-Id: Ic1bb4371c81a17ea80e83459c1cbf4c09a3498e8	2014-07-15 16:46:22 -07:00
Jingning Han	b957439c87	Fix a potential invalid memory access in non-RD coding flow This commit fixes a potential out-of-boundary memory access due to the use of reuse_inter_pred_sby in the non-RD coding flow. It resolves the corresponding asan error. Change-Id: Iff605f5921230966990013541cd855d698810922	2014-07-11 15:50:43 -07:00
Yunqing Wang	a581da218e	Remove repetitive code in mcomp.c Deleted vp9_find_best_sub_pixel_comp_tree(), and combined it in vp9_find_best_sub_pixel_tree(). Change-Id: Ifb25763c8b19822df5537cc1daa76ce88dc3b056	2014-07-09 14:50:50 -07:00
Yunqing Wang	9bd3be69a4	Adjust full-pixel search method in real-time mode Use FAST_HEX in speed 5 and 6, which covers more points than FAST_DIAMOND and improves motion search quality. At speed 6, RTC set borg tests showed slight quality gain (psnr gain: 0.143%, ssim gain: 0.226%). No noticeable encoding speed change. Change-Id: Ifa62875d9a52ee382ec494f271382bb77d8c67bf	2014-07-09 12:56:25 -07:00
Jingning Han	f6bf614b2f	Merge "Re-design quantization process for 32x32 transform block"	2014-07-09 11:55:26 -07:00
Jingning Han	9ad1b9fc67	Re-design quantization process for 32x32 transform block This commit enables a new quantization process for 32x32 2D-DCT transform coefficient blocks. It improves the compression performance of speed 5 by 1.4%. The overall compression gains of speed 5 due to the new quantization scheme is 4.7%. It also includes the SSSE3 implementation of the 32x32 quantization process. Change-Id: I0855b124fd6462418683f783f5bcb44255c9993b	2014-07-08 16:55:28 -07:00
Alex Converse	f60a1178c6	Cleanup motion search speed features. * Replace max_step_search_steps with constant MAX_MVSEARCH_STEPS * Fold (reduce_first_step_size + speed > 5) into reduce_first_step_size replacing uses of reduce_first_step_size that don't add the speed check with zero. Change-Id: Iae46395dbf3eaca138bf4d18b838a9e364b5a198	2014-07-07 10:08:45 -07:00
Yaowu Xu	92a6db7928	Added a speed feature controlling a motion search parameter This commit added a speed feature to control the step_param used in full pixel motion search. The intention is to reduced the search steps for high speed real time coding. Change-Id: I21d2f0105c2b647783a6688615da7fcf2b6d670b	2014-07-02 09:30:43 -07:00
Yaowu Xu	82fd084b35	Merge "Re-design quantization process"	2014-07-01 19:04:01 -07:00
Jingning Han	9ac2f66320	Re-design quantization process This commit re-designs the quantization process for transform coefficient blocks of size 4x4 to 16x16. It improves compression performance for speed 7 by 3.85%. The SSSE3 version for the new quantization process is included. The average runtime of the 8x8 block quantization is reduced from 285 cycles -> 255 cycles, i.e., over 10% faster. Change-Id: I61278aa02efc70599b962d3314671db5b0446a50	2014-07-01 17:00:07 -07:00
Yunqing Wang	f31ff029df	Elevate NEWMV mode checking threshold in real time The current threshold is knid of low, and in many cases NEWMV mode is checked but not picked as the best mode. This patch added a speed feature to increase NEWMV threshold, so that less partition mode checking goes to check NEWMV. This feature is enabled for speed 6 and 7. Rtc set borg tests showed: 1. Speed 6, overall psnr: -0.088%, ssim: -1.339%; Average speedup on rtc set is 11.1%. 2. Speed 7, overall psnr: -0.505%, ssim: -2.320% Average speedup on rtc set is 12.9%. Change-Id: I953b849eeb6e0d5a1f13eacba30c14204472c5be	2014-07-01 14:50:39 -07:00
Yunqing Wang	dee5782f93	Enable encode breakout in real time For real time speed 7, once encode breakout is on(i.e. encoding setting --static-thresh=1), a proper encode breakout threshold is set to speed up the encoder. Set --static-thresh=1, RTC set borg test showed a slight overall psnr loss of 0.162%, but ssim gain of 0.287%. The average speedup on RTC set is 6%, and for some clips, the speedup can be 10+%. Change-Id: Id522d9ce779ff7c699936d13d0c47083de4afb85	2014-06-30 10:41:12 -07:00
Yunqing Wang	9d41313e4b	Decide the partitioning threshold from the variance histogram Before encoding a frame, calculate and store each 16x16 block's variance of source difference between last and current frame. Find partitioning threshold T for the frame from its variance histogram, and then use T to make partition decisions. Comparing with fixed 16x16 partitioning, rtc set test showed an overall psnr gain of 3.242%, and ssim gain of 3.751%. The best psnr gain is 8.653%. The overall encoding speed didn't change much. It got faster for some clips(for example, 12% speedup for vidyo1), and a little slower for others. Also, a minor modification was made in datarate unit test. Change-Id: Ie290743aa3814e83607b93831b667a2a49d0932c	2014-06-30 09:36:23 -07:00
Yaowu Xu	d0cb273e04	Allow encoder to set lpf level to 0 As a way to speed-up rtc encoding at speed 7. Change-Id: Ie36a010392cf7b741dc130df21a4e733622a75b7	2014-06-27 15:23:41 -07:00
Yaowu Xu	3f92b7b994	Added a new speed 7 in rt mode To experiment with different speed/quality compromises. Change-Id: Ia9d4b85243554d620498a327da37c356e752b07f	2014-06-27 13:29:09 -07:00
Jingning Han	5a3e3c6d3f	Adaptive txfm size selection depending on residual sse/variance This commit enables an adaptive transform size selection method for speed -6. It uses largest transform size when the sse is more than 4 times of variance, i.e., most energy is compacted in the DC coefficient. Otherwise, use the default TX_8X8. It improves the compression efficiency for rtc set of speed -6 by 0.8%, no speed change observed. Change-Id: Ie6ed1e728ff7bf88ebe940a60811361cdd19969c	2014-06-26 16:00:42 -07:00
Jingning Han	2aa50eafb2	Make non-RD intra mode search txfm size dependent This commit fixes the potential issue in the non-RD mode decision flow that only checks part of the block to estimate the cost. It was due to the use of fixed transform size, in replacing the largest transform block size. This commit enables per transform block cost estimation of the intra prediction mode in the non-RD mode decision. Change-Id: I14ff92065e193e3e731c2bbf7ec89db676f1e132	2014-06-25 18:52:18 -07:00
Yunqing Wang	bccc785f63	Merge "Reuse inter prediction result in real-time speed 6"	2014-06-25 08:18:33 -07:00
Yunqing Wang	0aae100076	Reuse inter prediction result in real-time speed 6 In real-time speed 6, no partition search is done. The inter prediction results got from picking mode can be reused in the following encoding process. A speed feature reuse_inter_pred_sby is added to only enable the resue in speed 6. This patch doesn't change encoding result. RTC set tests showed that the encoding speed gain is 2% - 5%. Change-Id: I3884780f64ef95dd8be10562926542528713b92c	2014-06-24 12:46:33 -07:00
Paul Wilkins	8160a26fa0	Fix some bugs in multi-arf Fix some bugs relating to the use of buffers in the overlay frames. Fix bug where a mid sequence overlay was propagating large partition and transform sizes into the subsequent frame because of :- sf->last_partitioning_redo_frequency > 1 and sf->tx_size_search_method == USE_LARGESTALL Change-Id: Ibf9ef39a5a5150f8cbdd2c9275abb0316c67873a	2014-06-24 13:07:48 +01:00
Jingning Han	48b8ce21f0	Merge "Allow key frame more flexibility in mode search"	2014-06-20 09:38:02 -07:00
Jingning Han	c99a8fd7c8	Allow key frame more flexibility in mode search This commit allows the key frame to search through more prediction modes and more flexible block sizes. No speed change observed. The coding performance for rtc set is improved by 1.7% for speed -5 and 3.0% for speed -6. Change-Id: Ifd1bc28558017851b210b4004f2d80838938bcc5	2014-06-19 14:47:12 -07:00
Yunqing Wang	55834d42cc	Modify non-rd intra mode checking Speed 6 uses small tx size, namely 8x8. max_intra_bsize needs to be modified accordingly to ensure valid intra mode checking. Borg test on RTC set showed an overall PSNR gain of 0.335% in speed -6. This also changes speed -5 encoding by allowing DC_PRED checking for block32x32. Borg test on RTC set showed a slight PSNR gain of 0.145%, and no noticeable speed change. Change-Id: I1502978d8fbe265b3bb235db0f9c35ba0703cd45	2014-06-18 11:38:44 -07:00
Dmitry Kovalev	4ff1a614f1	Adding MV_SPEED_FEATURES struct. Moving all motion vector related speed parameters from SPEED_FEATURES to MV_SPEED_FEATURES. Change-Id: I3e9af0039c7162f8671878c5920bce3cb256a84e	2014-06-12 14:15:27 -07:00
Dmitry Kovalev	22368479c0	Merge "Removing chessboard_index from SPEED_FEATURES."	2014-06-10 10:53:53 -07:00
Yunqing Wang	b04d766800	Use small transform size in non-rd real-time mode In non-rd real-time mode, choosing smaller transform size in encoding gives better video quality and good speed gain than choosing larger transform size. This patch set tx size search method to ALLOW_8X8, which is better than using 4x4 or other larger sizes. Borg tests on rtc set at speed 6 showed significant gain on quality. PSNR gain: 11.034% and SSIM gain: 15.466%. The speed gain is 5% - 12% for <720p clips, and 2% - 7% for 720p clips. Change-Id: If4dc74ed2df359346b059f47fb73b4a0193ec548	2014-06-09 08:26:50 -07:00
Dmitry Kovalev	923c30a174	Removing chessboard_index from SPEED_FEATURES. This is not a speed feature, adding inline function instead. Change-Id: Ia48c41802eec9e92cf990339d724097279695c9a	2014-06-05 18:17:54 -07:00
Dmitry Kovalev	bd0bb363bd	Removing lossless field from VP9EncoderConfig. Right now there is just one place to check: xd->lossless and for the first pass there is a function is_lossless_requested(). Change-Id: I949a6834e64ce51e422e2892f097f2b871b5429a	2014-06-03 12:52:49 -07:00
Dmitry Kovalev	5132e6da1a	Merge "Converting disable_inter_mode_mask to inter_mode_mask."	2014-05-31 00:08:45 -07:00
Dmitry Kovalev	403719963e	Converting disable_inter_mode_mask to inter_mode_mask. Making this consistent with intra mode masks: you need to specify allowed inter/intra modes to use. Change-Id: Iaecd28bf79047259707d8e7a59a57bb7b856383e	2014-05-29 12:25:41 -07:00
Dmitry Kovalev	26bdf26ddc	Consistent names for intra mask flags. Change-Id: Ibdd5255d37200fb8a1d50f71a2a49c6089ae21e7	2014-05-29 12:11:02 -07:00
Dmitry Kovalev	d262cda524	Making speed checks consistent in set_rt_speed_feature(). Change-Id: Id3d0a49836fe996b806707d29a8130acf9d7ea0e	2014-05-29 11:11:50 -07:00
Alex Converse	b9c24dfa23	Always partition check after keyframe (rt speed 5) Prevents too small partitions from being copied to the next frame. Change-Id: I4b97c30b27d06051574d54aaaca5434407a0c9ff	2014-05-22 16:51:06 -07:00
Yaowu Xu	04cf82fb04	Merge "Enable various thresholds of motion detection"	2014-05-22 09:09:42 -07:00
Yaowu Xu	3bda7ec1ba	Enable various thresholds of motion detection This commit changed to enable the encoder to adjust motion dection speed threshold based on picture size. In addition, cpu-used 1 now does a partition search every other frame instead of every third frame for low resolution inputs. The change has no quality/speed impact for 720p and above. Test showed the change increase encoding time by between 3% to 6% for cpu-used 2 encodiong of 360p sequences. It also has a compression gain about .3%. For cpu-used 2, the change resolved some very disturbing visual artifacts in certain sequences when large block partitionings and transforms are used as a result of copying the partition from a previous frame. Change-Id: Ic7fd22508cdb811d4ca935655adbf20109286cfa	2014-05-21 12:08:56 -07:00
Yunqing Wang	b91b146d1d	Add static-threshold skipping in non-rd mode Added a skipping test in non-rd inter-mode. After interpolation prediction step, the residuals are tested to see if they will be quantized to 0 based on modeling between spatial domain and frequency domain. Set static-thresh to 800 for >=720p and 300 for <720p, rtc set tests showed 1. Speed 5, psnr: -0.514%; ssim: -1.748%; speedup on related clips: 5% -11% 2. Speed 6, psbr: -0.628%; ssim: -1.637%; speedup on related clips: 4% - 9% Change-Id: I62fbf26bc043ecd2b584f255f1a4ee5ab52bfcf3	2014-05-19 11:47:13 -07:00
Jingning Han	ace194a059	Merge "Chessboard pattern prediction filter type search in non-RD coding"	2014-04-23 12:48:27 -07:00
Jingning Han	8969f7c892	Chessboard pattern prediction filter type search in non-RD coding This commit introduces a chessboard pattern search for the prediction filter type search. It runs extensive search in alternate blocks and allows the rest blocks to refer coding decisions of their nearby neighbors. For pedestrian 1080p at 4000 kbps, the runtime of speed -5 goes down from 43990 ms to 42200 ms. The overall compression performance for RTC set is changed by -1.37%. Change-Id: Icfe220c49451cda796f0ca91d935c9ed01e56c9d	2014-04-23 10:41:07 -07:00
Dmitry Kovalev	ef003078e8	Renaming "onyx" to "encoder". Actual renames: vp9_onyx_if.c -> vp9_encoder.c vp9_onyx_int.h -> vp9_encoder.h Change-Id: I80532a80b118d0060518e6c6a0d640e3f411783c	2014-04-22 14:57:05 -07:00
Yaowu Xu	d928b34efe	Allow full RD TX size search for GF/ALT at speed 2 For speed 3 and above, such search is only allowed at speed 3. The change helped cif and stdhd set by 1.2% and .7% in compression, but increased the encoding time by around 5%. Change-Id: Ifa4832327f1c1bef3decb032ceb769cbf50e059f	2014-04-21 12:31:46 -07:00
Dmitry Kovalev	07f86d0944	Renaming VP9_CONFIG to VP9EncoderConfig. Change-Id: Id48edd12c6f649c82113128491ef6ea7410e93b2	2014-04-18 11:01:36 -07:00
Dmitry Kovalev	2c8c1f5370	Replacing cpu_used with speed in VP9_CONFIG. Change-Id: I86b85b5c11388e84a48f8936330c0d920df5d1f0	2014-04-16 18:31:42 -07:00
Dmitry Kovalev	617a367c54	Merge "Consistent mode names."	2014-04-15 22:59:37 -07:00
Dmitry Kovalev	e58ea39fd0	Merge "Using anonymous enum instead of macros."	2014-04-15 10:25:14 -07:00
Dmitry Kovalev	c1981bdda0	Using anonymous enum instead of macros. Change-Id: I5ed360585dae2c9fea6c32058dbfb8ec07700677	2014-04-14 15:11:13 -07:00
Dmitry Kovalev	f3739f9052	Consistent mode names. Change-Id: Icb4851d98c951506fe5d73d8d5a8ac7e53fecfd0	2014-04-14 14:09:39 -07:00
Paul Wilkins	19536d692e	Remove vp9_ prefix on static inline function. This prefix reserved for global scope functions etc. Change-Id: I671cda2a63e01a40aae3d7407dd1bb4d338d709c	2014-04-14 10:08:04 -07:00
Paul Wilkins	0639b5cff5	Speed 2 alternative balance. A previous path improved speed 2 quality a little but more extensive testing showed that it slowed encode by a few %. The change will have a similar effect for speed 3 but should not impact speeds 4+; This experiment should reverse that and give a speed up at the cost of a small quality loss. Borg results pending. Change-Id: I4493fc1541aaf44587f1a41ff219f7088da9252c	2014-04-14 17:55:19 +01:00
Yaowu Xu	585e01bfe2	Remove duplicate code Change-Id: Ie26b6ca8de733a527f820b16148e7a2231073f0f	2014-04-09 12:58:10 -07:00
Yunqing Wang	2e7d327789	Merge "Use source frame difference to make partition decision"	2014-04-09 10:26:42 -07:00
Yunqing Wang	4e66293fcb	Use source frame difference to make partition decision Calculate the difference variance between last source frame and current source frame. The variance is calculated at 16x16 block level. The variances are compared to several thresholds to decide final partition sizes. An adaptive strategy is implemented to decide using SOURCE_VAR_BASED_PARTITION or FIXED_PARTITION based on motions in the video. The switching test is done once every search_type_check_frequency frames. The selection of source_var_thresh needs to be investigated further later. RTC set Borg test showed 0.424% overall psnr gain, and 0.357% ssim gain. For clips with large enough static area, the encoding speedup is around 2% to 15%. Change-Id: Id7d268f1d8cbca7fb8026aa4a53b3c77459dc156	2014-04-08 17:03:02 -07:00
Marco Paniconi	181949948e	Add constrained-copy partition to speed features. Copy up to a certain bsize, otherwise set to a fixed bsize. This helsp to reduce artifact near moving boundary caused by full partition copy without checking motion of super-block. This artifact can occur at speeds 3,4 in real-time mode. Issue: https://code.google.com/p/webm/issues/detail?id=738. Change-Id: I05812521fd38816a467f72eb6a951cae4c227931	2014-04-07 14:58:28 -07:00
Dmitry Kovalev	233b581cef	Cleaning up vp9_speed_features.c. Change-Id: I7b2b690d2d51d55daa642be4131e5febd78cbe0a	2014-04-07 12:54:11 -07:00
Dmitry Kovalev	107929dc6b	Adding FAST_COEFF_UPDATE enum. Change-Id: I75ad328c6d719df81cc24f3ae21c152af4ebdacc	2014-04-04 10:31:34 -07:00
Marco Paniconi	2a2e8a9d72	Merge "Add codec control function for enabling frame_periodic_boost."	2014-04-03 16:57:44 -07:00
Dmitry Kovalev	30a15633f8	Merge "Replacing magic constants with expressions."	2014-04-03 15:54:49 -07:00
Marco Paniconi	0eb88c9064	Add codec control function for enabling frame_periodic_boost. Change-Id: I6371ef7301c6bc3138552bd349e9bd154dee3e08	2014-04-03 15:49:03 -07:00
Jingning Han	2536a89a4f	Adjust key frame quantization index for rtc coding This commit slightly increases the bit allocation for key frame. This improves speed -5 coding performance by 2.77% with aq-mode=0 and by 2.78% with aq-mode=3. Change-Id: Iaa3e777f80b9706306606af06e89852bac146659	2014-04-02 22:52:22 -07:00
Jingning Han	c42e8f1429	Merge "Make frame level quantization change a parameter"	2014-04-02 12:48:05 -07:00
Jingning Han	409f48ec52	Merge "Adjust rate allocation in non-RD coding mode"	2014-04-02 12:47:50 -07:00
Jingning Han	d8b8d87eb7	Make frame level quantization change a parameter Parameterize the frame level quantization index used in non-RD coding flow with cbr. Change-Id: I79cfbd9c32c0d0fc33ad720d2180432e910baf7a	2014-04-02 09:51:22 -07:00
Jingning Han	751d002f2f	Adjust rate allocation in non-RD coding mode This commit reduces the frequency of frames using finer quantizer in non-RD coding flow, and slightly tune up the quantizer resolution when used. It provides 1.7% compression gains in speed -5 at no speed difference. Change-Id: I430249a51260a841a0402666e5ec1566e4f7d5a6	2014-04-02 09:51:22 -07:00
Paul Wilkins	769cd78ff2	Add speed feature for recode tolerance. The new tolerance is a little higher than before (especially for kf/gf/arf) so this change gives an encode speed up for some clips up for speeds 0-2. Change-Id: I63f7d6c9cc11c7f58742f41e250dcd3eab1741eb	2014-04-02 12:18:05 +01:00
Dmitry Kovalev	5f3773d0f7	Replacing magic constants with expressions. Change-Id: I434682bbd4cb2089cd213d77ed5a5e06256dc45d	2014-04-01 17:42:14 -07:00
Yaowu Xu	6fc20a8434	Adjust rt speed steps. This commit adjusted the speed steps in rt mode to make the steps more evenly spaced on speed and quality, specifically: 1. Merged 3 and 4 into one single step 3 and removed confilicting features. 2. Move 8, 7, 6, 5 to be 7, 6, 5, 4 repsectively. Change-Id: I38d56d61531f3561d772aef953c411c8fb38c063	2014-03-28 16:40:28 -07:00
Dmitry Kovalev	0a6d5547e2	Adding vp9_speed_features.{h, c}. Change-Id: I7d9874da8ff78a2d7e0cf11073af9c30538bc9a6	2014-03-28 10:30:28 -07:00

... 2 3 4 5 6 ...

312 Commits