generic-library/vpx

Author	SHA1	Message	Date
Yunqing Wang	b1b6fd85db	Merge "Skip the partition search for still frames"	2014-09-30 11:59:05 -07:00
Yunqing Wang	c8d01b1eaf	Merge "Refactor encode_rd_sb_row function"	2014-09-30 11:58:39 -07:00
Yunqing Wang	1fcbf6ed56	Skip the partition search for still frames This patch re-enabled the feature in Pengchong's patch (commit `1286126073`). Originally, it was turned on while use_lastframe_partitioning > 0(not used anymore). Now it was added as a feature, and turned on while speed >= 2. As described in the original patch, this feature helps speed up the slideshows in YouTube. Change-Id: I1b0f18d65da1ee1c8d1e117dabba910c5207c471	2014-09-26 09:03:52 -07:00
Deb Mukherjee	993d10a217	Adds various high bit-depth encode functions Change-Id: I6f67b171022bbc8199c6d674190b57f6bab1b62f	2014-09-25 01:50:36 -07:00
Yunqing Wang	14ee2805a3	Refactor encode_rd_sb_row function Simplified the code and removed some code that was not used anymore. This patch didn't change encoding result. Change-Id: I7e54a74c8f35a6726dfc8a1c55b337448b7ea124	2014-09-24 10:24:18 -07:00
hkuang	c70cea97ac	Remove mi_grid_* structures. mi_grid_* are arrays of pointer to pointer. They save the pointers that point to the MIs in cm->mi. But they are unnecessary and complicated. The original goal was to remove MODE_INFO_t copy. But with an extra MODE_INFO_t pointer inside MODE_INFO_t, same goal could be achieved. This commit totally removes the mi_grid_* structures. But there are still many dummy MODE_INFO_t inside cm->mi which are a waste of memory. Next commit will do on-demand MODE_INFO_t allocation in order to save these memories. Change-Id: I3a05cf1610679fed26e0b2eadd315a9ae91afdd6	2014-09-19 21:27:11 -07:00
Yunqing Wang	1bf0beb5fc	Refactor encode_superblock function The code covers both x->skip=0 & x->skip=1 cases. Change-Id: I09745c10e5994dc700ae4c01b4b62979cdaf3306	2014-09-12 15:58:17 -07:00
Yunqing Wang	f10d7eeda2	Remove the use of use_lastframe_partitioning at speed 4 The use of use_lastframe_partitioning is totally removed in good- quality encoding. Its usage in real-time encoding needs to be evaluated to see if it can be removed too. The Borg tests at speed 4 showed: stdhd set: 0.220% psnr gain, 0.166% ssim gain; derf set: 0.329% psnr gain, 0.476% ssim gain. Speed test on selected clips showed 1.54% speedup.(Worst case: pedestrian_area_1080p25.y4m, speed loss: 1.5%) Change-Id: I1c844d329b0b5678558439b887297c1be7ddab00	2014-09-09 10:54:07 -07:00
Yaowu Xu	c1058e5bbe	select_tx_mode(): remove special case for key frame This commit removes the special case for key frame, as transform size decision is controlled by the appropriate speed feature for all lossy coding modes: tx_size_search_method. Change-Id: I9677171e3f2432ec23705f7c5ea8170dd4562fae	2014-09-03 09:34:10 -07:00
Yunqing Wang	4d2c376923	Early termination in encoding partition search In the partition search, the encoder checks all possible partitionings in the superblock's partition search tree. This patch proposed a set of criteria for partition search early termination, which effectively decided whether or not to terminate the search in current branch based on the "skippable" result of the quantized transform coefficients. The "skippable" information was gathered during the partition mode search, and no overhead calculations were introduced. This patch gives significant encoding speed gains without sacrificing the quality. Borg test results: 1. At speed 1, stdhd set: psnr: +0.074%, ssim: +0.093%; derf set: psnr: -0.024%, ssim: +0.011%; 2. At speed 2, stdhd set: psnr: +0.033%, ssim: +0.100%; derf set: psnr: -0.062%, ssim: +0.003%; 3. At speed 3, stdhd set: psnr: +0.060%, ssim: +0.190%; derf set: psnr: -0.064%, ssim: -0.002%; 4. At speed 4, stdhd set: psnr: +0.070%, ssim: +0.143%; derf set: psnr: -0.104%, ssim: +0.039%; The speedup ranges from several percent to 60+%. speed1 speed2 speed3 speed4 (1080p, 100f): old_town_cross: 48.2% 23.9% 20.8% 16.5% park_joy: 11.4% 17.8% 29.4% 18.2% pedestrian_area: 10.7% 4.0% 4.2% 2.4% (720p, 200f): mobcal: 68.1% 36.3% 34.4% 17.7% parkrun: 15.8% 24.2% 37.1% 16.8% shields: 45.1% 32.8% 30.1% 9.6% (cif, 300f) bus: 3.7% 10.4% 14.0% 7.9% deadline: 13.6% 14.8% 12.6% 10.9% mobile: 5.3% 11.5% 14.7% 10.7% Change-Id: I246c38fb952ad762ce5e365711235b605f470a66	2014-08-28 11:27:28 -07:00
Dmitry Kovalev	4478553efc	Removing tx_stepdown_count from VP9_COMP. The variable is never read. Change-Id: I94141c1667fa5d10604cd6f83c5f64df107dee94	2014-08-25 14:42:05 -07:00
Dmitry Kovalev	e576c42f1b	Cleaning up is_background(). Change-Id: I2b9609dd22bacbf26e669f70bf155613b0316eb3	2014-08-25 11:55:30 -07:00
Pengchong Jin	997db6fc3f	Merge "Add a speed feature to give the tighter search range"	2014-08-15 19:51:04 -07:00
Pengchong Jin	eca93642e2	Add a speed feature to give the tighter search range Add a speed feature to give the tighter partition search range. Before partition search, calculate the histogram of the partition sizes of the left, above and previous co-located blocks of the current block. If the variance of observed partition sizes is small enough, adjust the search range around the mean partition size, which will be tigher. The feature is currently turned on at speed 2. Experiments on sample youtube clips show on average the runtime is reduced by 3-7%. For hard stdhd clips: park_joy_1080p @ 15000kbps: 509251 ms -> 491953 ms (3.3%) pedestrian_area_1080p @ 2000kbps: 223941 ms -> 214226 ms (4.3%) The PSNR performance is changed: derf: -0.112% yt: -0.099% hd: -0.090% stdhd:-0.102% Change-Id: Ie205ec5325bf92ec5676c243e30ba9d0adca10f2	2014-08-15 16:14:20 -07:00
Yunqing Wang	28b1437d77	Remove a unused speed feature Removed disable_split_var_thresh, which is not used anymore. Change-Id: I50119b150442e1571157433b5effc6aae0dbe0fd	2014-08-15 14:10:27 -07:00
Jingning Han	80e5550723	Merge "Remove redundant vp9_init_plane_quantizers call"	2014-08-14 18:50:16 -07:00
Jingning Han	d67b608c5d	Remove redundant vp9_init_plane_quantizers call When aq mode is on, the quantizer will be reset later in the same function (line 571). Change-Id: I20635db31261d136d04d5deeb881ad3957078bf1	2014-08-14 14:21:08 -07:00
Yaowu Xu	741a23cd97	Replace current_video_frame with better alternatives In the encoder, current_video_frame is used in a couple of places to decide encoding strategy, this commit replaces with more appropriate variables. Change-Id: I3d3d8d8e2ea02c489e4639b9d4c446a63e357d29	2014-08-13 17:19:34 -07:00
Yaowu Xu	b6a41802c4	Simplify select_tx_mode() The function is called only once, right after all stats counters are reset to 0. Therefore all the computations have zero effect on return values. This commmit to removed those effectless code. Change-Id: I50d27c0802547921fa36c60aa4bd92d76247f595	2014-08-13 11:48:29 -07:00
Jingning Han	5b63c2797a	Merge "Integrate fast txfm and quant path into skip_recode system"	2014-08-11 08:53:34 -07:00
Jingning Han	9da4cd94f5	Merge "Extend skip_txfm flag into array to cover YUV planes"	2014-08-11 08:53:25 -07:00
Dmitry Kovalev	91c2f1e45a	Moving pass from VP9_COMP to VP9EncoderConfig. We had a very complicated way to initialize cpi->pass from cfg->g_pass: switch (cfg->g_pass) { case VPX_RC_ONE_PASS: oxcf->mode = ONE_PASS_GOOD; break; case VPX_RC_FIRST_PASS: oxcf->mode = TWO_PASS_FIRST; break; case VPX_RC_LAST_PASS: oxcf->mode = TWO_PASS_SECOND_BEST; break; } cpi->pass = get_pass(oxcf->mode). Now pass is moved to VP9EncoderConfig and initialization is simple: switch (cfg->g_pass) { case VPX_RC_ONE_PASS: oxcf->pass = 0; break; case VPX_RC_FIRST_PASS: oxcf->pass = 1; break; case VPX_RC_LAST_PASS: oxcf->pass = 2; break; } Change-Id: I8f582203a4575f5e39b071598484a8ad2b72e0d9	2014-08-08 14:27:54 -07:00
Dmitry Kovalev	2fe6fa72fc	Merge "Cleaning up vp9_encodeframe.c."	2014-08-08 13:55:34 -07:00
Alex Converse	2a5c46d8f5	Fix active_map speed 6. Fix the interaction between active map and reuse_inter_pred_sby. The reuse_inter_pred_sby feature expects inter predictors to already be built, but blocks with active map on skip this step. Change-Id: Ibb2bf0d228f678935d82a0ede9cb0919ab7c8878	2014-08-07 15:57:58 -07:00
Alex Converse	e874aea74c	Cleanup SEG_LVL_SKIP handling in encode_superblock. Change-Id: Ib7497ba08696765cbc1b2cc4218d37f4298f278c	2014-08-07 15:57:58 -07:00
Dmitry Kovalev	b539705916	Cleaning up vp9_encodeframe.c. Change-Id: Ia3001ae5c44faee3978fc3eb7a027cd9712a0373	2014-08-07 14:55:54 -07:00
Jingning Han	8684c23260	Integrate fast txfm and quant path into skip_recode system This commit integrates the fast transform and quantization process into skip_recode scheme in the rate-distortion optimization loop. Previously the fast transform and quantization process was only enabled for non-RD coding flow. Change-Id: Ib7db4d39b7033f1495c75897271f769799198ba8	2014-08-06 16:11:22 -07:00
Pengchong Jin	74593c1eae	Directly split the block in partition search This patch allows the encoder to directly split the block in partition search, therefore skip searching NONE. It computes a score which measures whether 16x16 motion vectors from the first pass in the current block are consistent with each others. If they are inconsistent and we have enough Q to encode, split the block directly, and skip searching NONE. This feature is under flag CONFIG_FP_MB_STATS. In speed 2, it further gives a speedup of 3-8% on sample yt clips as compared to the previous version under the same flag. Overall, the features under the flag will give 7-15% on typical yt clips at up to 6000kbps data rate. The speedup at very high data rate is not significant. For hard stdhd clips: park_joy_1080p @ 15000kbps: 504541ms -> 506293ms (-0.35%) pedestrian_area_1080p @ 2000kbps: 326610ms -> 290090ms (+11.2%) The compression performance using the features under the flag: derf: -0.068% yt: -0.189% hd: -0.318% stdhd:-0.183% To use the feature, set CONFIG_FP_MB_STATS and turn on cpi->use_fp_mb_stats. Change-Id: Iad58a2966515c8861aa9eb211565b1864048d47f	2014-08-05 16:13:40 -07:00
Jingning Han	1a8d45f309	Extend skip_txfm flag into array to cover YUV planes Change-Id: Ieae182d72d625d0d3fd4ed7c7d24cb521a0f21b0	2014-08-05 15:42:12 -07:00
Pengchong Jin	5971e8985b	Merge "Store first pass motion vector directions"	2014-08-04 17:35:42 -07:00
Pengchong Jin	233e0ccc73	Store first pass motion vector directions Re-organize the one-byte structure for 16x16 first pass block. Add bits to indicate motion vector directions. Change-Id: Id10754ba343dfc712c7fed5bcc85c67fa0bbcb89	2014-08-04 16:17:47 -07:00
Jim Bankoski	7f63dabfe9	break at the end of clauses with assert(0) to avoid gcc warning Change-Id: I1b3c5337f018dde27dc819ab18bd081d169a91e8	2014-08-04 08:52:53 -07:00
Jingning Han	1c3a80b9a1	Skip calling vp9_block_energy when aq-mode is off The mb_energy value is used by aq-mode. Turn off computing its value when aq-mode is off. Change-Id: I26c239f124eca45a5ee58b90d19eae00d9a7cda5	2014-07-31 11:51:59 -07:00
Jingning Han	a6a348b85e	Merge "Refactor rd_pick_parition interface"	2014-07-31 09:25:34 -07:00
Jingning Han	a3b062c56f	Merge "Chessboard pattern partition search"	2014-07-30 14:34:42 -07:00
Pengchong Jin	7f29d22e51	Merge "Early termination after partition NONE is done in RD."	2014-07-30 13:35:02 -07:00
Pengchong Jin	49866baae6	Early termination after partition NONE is done in RD. This patch allows the encoder to skip the search for partition SPLIT, HORZ, VERT after the search for partition NONE is done in RD optimization. It uses the first pass block-wise statistics to make the decision. If all 16x16 blocks in the current partition have zero motions and small residues from the frist pass statistics, and it has small difference variance, further partition search is skipped. For speed 2 setting, experiments on general youtube clips show that the speedup varies from 1% - 10%, 5% on average. On the performance side in PSNR, derf 0.004%, yt -0.059%, hd -0.106%, stdhd 0.032%. For hard stdhd clips: park_joy_1080p, 502952 ms -> 503307 ms (-0.07%) pedestrian_area_1080p, 227049 ms -> 220531 ms (+3%) This feature is under the compilation flag CONFIG_FP_MB_STATS and it is off in current setting. Change-Id: I554537e9242178263b65ebe14a04f9c221b58bae	2014-07-30 11:54:49 -07:00
Jingning Han	d82ff94284	Refactor rd_pick_parition interface Remove the variable that indicates the relative block index. This is explicitly covered by the use of pc_tree. Change-Id: Ib13142582fff926c85e375bde656aa050add8350	2014-07-30 10:53:57 -07:00
Jingning Han	ca2dcb7fed	Chessboard pattern partition search This commit enables a chessboard pattern constrained partition search for 720p and above resolutions. The scheme applies stricter partition search to alternative blocks based on its above/left neighboring blocks' partition range, as well as that of the collocated blocks in the previous frame. It is currently turned on at 16x16 block size level. The chessboard pattern is flipped per coding frame. The speed 3 runtime is reduced: park_joy_1080p, 652832 ms -> 607738 ms (7% speed-up) pedestrian_area_1080p, 215998 ms -> 200589 ms (8% speed-up) The compression performance is changed: hd -0.223% stdhd -0.295% Change-Id: I2d4d123ae89f7171562f618febb4d81789575b19	2014-07-30 10:32:41 -07:00
Jingning Han	6646ea73e2	Clean up max/min allowed block size in rd_pick_partition This commit replace the repetitive retrieve of max and min allowed partition from speed_feature with local variables max_size and min_size. Change-Id: Ib06f11f16615e4876e4dd5fb6a968c6bf5f7b216	2014-07-29 11:03:52 -07:00
Jingning Han	c36f78b054	Use frame index directly in get_chessboard_index The get_chessboard_index() used to call the entire VP9_COMMON struct pointer to retrieve the chessboard pattern index. This cl makes it call the frame index directly. Change-Id: I3cad9d209ea2e77a358085a04fe1ff0ddec5ba03	2014-07-29 10:55:56 -07:00
Jingning Han	ac1f06188d	Merge "Fix rd_pick_partition search loop for 4x4 blocks"	2014-07-25 15:57:35 -07:00
Jingning Han	84af0486f9	Fix rd_pick_partition search loop for 4x4 blocks The partition search for 4x4 blocks takes unnecessary steps to reconstruct pixels and an extra partition type update. This commit removes such operations. No visible compression/speed difference. Thanks to Yue (yuec@) for finding this issue. Change-Id: I3f83824aa3fd3717d63be0b280fa57258939a70a	2014-07-25 07:17:58 -07:00
Tim Kopp	9d337d34f2	s/CONFIG_DENOISING/CONFIG_VP9_TEMPORAL_DENOISING This should prevent confusion with the VP8 CONFIG_TEMPORAL_DENOISING and other flags. Change-Id: I1fe4e2977895b7966841d861ab74317ad875b6c8	2014-07-24 13:43:52 -07:00
Adrian Grange	1f3c43e602	Merge "Fix get_frame_type function"	2014-07-22 15:17:27 -07:00
Adrian Grange	caad1686d4	Fix get_frame_type function Fixed the function get_frame_type to return the correct frame type for golden and last frames. Change-Id: I8edddd9aa26cbe7a1de8ff211389410b22b1bd14	2014-07-22 12:12:16 -07:00
Alex Converse	5926e7c0e8	Remove unfinished VP9 alpha channel. Change-Id: Ic5d3a3a0dac10b49495771886a31e793bb78b5ca	2014-07-21 15:55:50 -07:00
Yunqing Wang	765485cab2	Add -DNDEBUG when config option debug is disabled For gcc, when libvpx config option debug is disabled, added the flag -DNDEBUG to disable the assertions in libvpx for some speedup. Change-Id: Ifcb7b9e8ef5cbe5d07a24407b53b9a2923f596ee	2014-07-21 09:20:03 -07:00
Pengchong Jin	ac638125ea	Merge "Fixed a bug of setting wrong first pass mb stats pointer"	2014-07-17 14:24:52 -07:00
Pengchong Jin	e358ab5fc9	Fixed a bug of setting wrong first pass mb stats pointer The bug sets the wrong pointer to the first pass mb stats if the encoder does the re-coding in the second pass. Change-Id: I8a11f45dd7dceb38de814adec24cecccae370d00	2014-07-17 12:04:15 -07:00
Tim Kopp	03819ed9ab	VP9 Denoiser denoises after mode/bsize search In vp8, statistics are collected about the different modes as they are searched. This process is more complicated due to the variable block size. Fields were added to the PICM_MODE_CONTEXT struct to hold this information for each point in the search. The information is then taken from the appropriate part of the tree during denoising. Change-Id: I89261ab77ad637821287ae157dfdf694702b8e77	2014-07-15 08:43:43 -07:00
Yunqing Wang	1b5e9871f7	Code refactoring: use defined inline functions Changed to use defined inline functions consistently through the code. Change-Id: I7644d24fa7a837378564a6e0790416d3725dd200	2014-07-11 10:30:25 -07:00
Jingning Han	3316918b3b	Remove an empty line Change-Id: Id6eedc502c86433df1456dd994aee6bc9a1359a2	2014-07-07 10:28:05 -07:00
Alex Converse	03c276ea17	Split vp9_rdopt into vp9_rdopt and vp9_rd. vp9_rdopt is for making rd optimal mode decisions. vp9_rd is for all other rd related routines. Anything used outside of making an rd optimal decision belongs in rd. Change-Id: I772a3073f7588bdf139f551fb9810b6864d8e64b	2014-07-02 15:33:33 -07:00
Yaowu Xu	82fd084b35	Merge "Re-design quantization process"	2014-07-01 19:04:01 -07:00
Jingning Han	9ac2f66320	Re-design quantization process This commit re-designs the quantization process for transform coefficient blocks of size 4x4 to 16x16. It improves compression performance for speed 7 by 3.85%. The SSSE3 version for the new quantization process is included. The average runtime of the 8x8 block quantization is reduced from 285 cycles -> 255 cycles, i.e., over 10% faster. Change-Id: I61278aa02efc70599b962d3314671db5b0446a50	2014-07-01 17:00:07 -07:00
Yunqing Wang	9ba1d60bd1	Fix visual studio build issue Fixed the signed/unsigned mismatch. Change-Id: Id83d603b8f1745b71f4cf695a0751e55518b1316	2014-07-01 08:58:05 -07:00
Yaowu Xu	370618ffb4	Merge "change to not force interp_type as SWITCHABLE"	2014-06-30 15:44:08 -07:00
Yaowu Xu	186bd4eb52	change to not force interp_type as SWITCHABLE Encoder still uses SWITCHABLE as default via DEFAULT_INTERP_FILTER, but does not override the default if it is not SWITCHABLE. Change-Id: I3c0f6653bd228381a623a026c66599b0a87d01d5	2014-06-30 12:48:21 -07:00
Jingning Han	30ab37019c	Remove unused set_mode_info function When the frame is intra coded only, the encoder takes the RD coding flow. Hence the function set_mode_info is not practically in use. This commit removes it and the associated conditional branches. Change-Id: I1e42659ceb55b771ba712d1cdecacb446aa6460d	2014-06-30 10:59:04 -07:00
Yunqing Wang	9d41313e4b	Decide the partitioning threshold from the variance histogram Before encoding a frame, calculate and store each 16x16 block's variance of source difference between last and current frame. Find partitioning threshold T for the frame from its variance histogram, and then use T to make partition decisions. Comparing with fixed 16x16 partitioning, rtc set test showed an overall psnr gain of 3.242%, and ssim gain of 3.751%. The best psnr gain is 8.653%. The overall encoding speed didn't change much. It got faster for some clips(for example, 12% speedup for vidyo1), and a little slower for others. Also, a minor modification was made in datarate unit test. Change-Id: Ie290743aa3814e83607b93831b667a2a49d0932c	2014-06-30 09:36:23 -07:00
Jim Bankoski	a13bf65315	remove unused parms from rd_pick_inter_mode_sb_seg_skip Change-Id: I7f989d197444d166133ad91eb23ac1033109f58d	2014-06-29 09:23:21 -07:00
Jingning Han	5a3e3c6d3f	Adaptive txfm size selection depending on residual sse/variance This commit enables an adaptive transform size selection method for speed -6. It uses largest transform size when the sse is more than 4 times of variance, i.e., most energy is compacted in the DC coefficient. Otherwise, use the default TX_8X8. It improves the compression efficiency for rtc set of speed -6 by 0.8%, no speed change observed. Change-Id: Ie6ed1e728ff7bf88ebe940a60811361cdd19969c	2014-06-26 16:00:42 -07:00
Pengchong Jin	73eeb3beff	Merge "Skip the partition search for the frame with no motion"	2014-06-26 14:36:10 -07:00
Pengchong Jin	1286126073	Skip the partition search for the frame with no motion This patch allows the encoder to skip the partition search for the frame if it is an inter frame and only zero motion vectors have been detected in the first pass. The partition size is directly assigned according to the difference variance. Borg tests show overall little performance changes in term of PSNR (derf -0.027%, yt 0.152%, hd 0.078%, stdhd 0%). The worst case of PSNR loss is -0.514% from yt. The best PSNR gain is 4.293% from yt. The second pass encoding speedup for slideshow clips is 15%-40%. Change-Id: I881f347d286553ee5594a9ea09ba1a61ac684045	2014-06-26 12:10:34 -07:00
Yunqing Wang	bccc785f63	Merge "Reuse inter prediction result in real-time speed 6"	2014-06-25 08:18:33 -07:00
Yunqing Wang	0aae100076	Reuse inter prediction result in real-time speed 6 In real-time speed 6, no partition search is done. The inter prediction results got from picking mode can be reused in the following encoding process. A speed feature reuse_inter_pred_sby is added to only enable the resue in speed 6. This patch doesn't change encoding result. RTC set tests showed that the encoding speed gain is 2% - 5%. Change-Id: I3884780f64ef95dd8be10562926542528713b92c	2014-06-24 12:46:33 -07:00
Paul Wilkins	8160a26fa0	Fix some bugs in multi-arf Fix some bugs relating to the use of buffers in the overlay frames. Fix bug where a mid sequence overlay was propagating large partition and transform sizes into the subsequent frame because of :- sf->last_partitioning_redo_frequency > 1 and sf->tx_size_search_method == USE_LARGESTALL Change-Id: Ibf9ef39a5a5150f8cbdd2c9275abb0316c67873a	2014-06-24 13:07:48 +01:00
Alex Converse	aeacaac574	Switch active map implementation to segment based. Change-Id: Ibb841a1fa4d08d164cf5461246ec290f582b1f80	2014-06-20 13:13:23 -07:00
Alex Converse	e8a4edf49e	Fork vp9_rd_pick_inter_mode_sb_seg_skip Change-Id: I549868725b789f0f4f89828005a65972c20df888	2014-06-20 13:13:18 -07:00
Alex Converse	173a86b2a2	Actually skip blocks in skip segments in non-rd encoder. Copy split from macroblock to pick mode context so it doesn't get lost. Change-Id: Ie37aa12558dbe65c4f8076cf808250fffb7f27a8	2014-06-20 11:49:02 -07:00
Dmitry Kovalev	f80a346e0e	Merge "Replacing txfm_size with tx_size."	2014-06-12 13:07:11 -07:00
Jingning Han	d5ae43318e	Merge "Fast computation path for forward transform and quantization"	2014-06-12 11:59:52 -07:00
Dmitry Kovalev	4345d12d28	Replacing txfm_size with tx_size. Change-Id: Ifa6374e9db5919322733b656e0865f5f19ee6f2c	2014-06-12 11:57:26 -07:00
Jingning Han	ccba289f8d	Fast computation path for forward transform and quantization This commit enables a fast path computational flow for forward transformation. It checks the sse and variance of prediction residuals and decides if the quantized coefficients are all zero, dc only, or more. It then selects the corresponding coding path in the forward transformation and quantization stage. It is currently enabled in rtc coding mode. Will do it for rd coding mode next. In speed -6, the runtime for pedestrian_area 1080p at 1000 kbps goes down from 14234 ms to 13704 ms, i.e., about 4% speed-up. Overall coding performance for rtc set is changed by -0.18%. Change-Id: I0452da1786d59bc8bcbe0a35fdae9f623d1d44e1	2014-06-12 11:10:54 -07:00
Alex Converse	6c3f311ba2	Fix SEG_LVL_SKIP in non-RD inter mode selection. Add a set_mode_info_seg_skip function that fills the requisite mode info. Change-Id: I460b1b6845d720d9b09ed5b64df0ea0aac443f62	2014-06-11 17:53:26 -07:00
Dmitry Kovalev	22368479c0	Merge "Removing chessboard_index from SPEED_FEATURES."	2014-06-10 10:53:53 -07:00
Dmitry Kovalev	9636601146	Merge "Removing unused motion_vector_context enum from vp9_encodeframe.c"	2014-06-10 10:53:25 -07:00
Yunqing Wang	70eb862fd3	Merge "Use small transform size in non-rd real-time mode"	2014-06-09 13:07:24 -07:00
Dmitry Kovalev	e0c6507229	Merge "Removing unused tt_activity_measure()."	2014-06-09 10:45:56 -07:00
Yunqing Wang	b04d766800	Use small transform size in non-rd real-time mode In non-rd real-time mode, choosing smaller transform size in encoding gives better video quality and good speed gain than choosing larger transform size. This patch set tx size search method to ALLOW_8X8, which is better than using 4x4 or other larger sizes. Borg tests on rtc set at speed 6 showed significant gain on quality. PSNR gain: 11.034% and SSIM gain: 15.466%. The speed gain is 5% - 12% for <720p clips, and 2% - 7% for 720p clips. Change-Id: If4dc74ed2df359346b059f47fb73b4a0193ec548	2014-06-09 08:26:50 -07:00
Dmitry Kovalev	923c30a174	Removing chessboard_index from SPEED_FEATURES. This is not a speed feature, adding inline function instead. Change-Id: Ia48c41802eec9e92cf990339d724097279695c9a	2014-06-05 18:17:54 -07:00
Dmitry Kovalev	31403fd7d7	Adding encode_tiles() function. Change-Id: Ib8187c8f2556e1e9268b0683cd2b6ff3489f0205	2014-06-05 18:03:40 -07:00
Dmitry Kovalev	580d72d3ea	Removing unused tt_activity_measure(). Change-Id: Ifcb46e6904730d14b9ef76b648b4d0dc3cd5d0c5	2014-06-04 17:11:30 -07:00
Dmitry Kovalev	8567739396	Removing unused motion_vector_context enum from vp9_encodeframe.c The same enum defined and used in vp9_mvref_common.c. Change-Id: I3975103997797add0a258d36c96d20ac9561a73d	2014-06-04 17:03:10 -07:00
Dmitry Kovalev	bd0bb363bd	Removing lossless field from VP9EncoderConfig. Right now there is just one place to check: xd->lossless and for the first pass there is a function is_lossless_requested(). Change-Id: I949a6834e64ce51e422e2892f097f2b871b5429a	2014-06-03 12:52:49 -07:00
Dmitry Kovalev	19c492a749	Merge "Reusing existing vp9_get{8x8, 16x16}var() instead of new ones."	2014-06-03 10:04:27 -07:00
Deb Mukherjee	fc88292ef2	Remove Wextra warnings from vp9_sad.c As a side-effect, the sad unit tests for VP8 and VP9 had to be separated. Fixes a bug in original patch: (https://gerrit.chromium.org/gerrit/#/c/70163/8) that was reverted due to a nightly test failure. Change-Id: Ia2a4e9e278fd3c89d6c3c82fcc6381320ec2a8a6	2014-06-02 13:50:20 -07:00
Frank Galligan	c40a968e13	Merge "Revert "Remove Wextra warnings from vp9_sad.c""	2014-06-01 16:58:11 -07:00
Frank Galligan	0b44988952	Revert "Remove Wextra warnings from vp9_sad.c" This reverts commit `916550428d` Change-Id: I500822b03f09c64ff6ec5396c68edee9ca3b75cb	2014-06-01 16:20:26 -07:00
Alex Converse	aaf3765606	Don't update encoder skip count for SEG_LVL_SKIP. This aligns the encoder behavior with the decoder. Change-Id: Ifa0840e4b07b19309e0bf1d1182498883249ec45	2014-05-29 11:24:03 -07:00
Dmitry Kovalev	e14f900ae3	Merge "Moving itxm_add pointer from MACROBLOCKD to MACROBLOCK."	2014-05-29 11:16:39 -07:00
Dmitry Kovalev	f7ff24cdd0	Reusing existing vp9_get{8x8, 16x16}var() instead of new ones. Change-Id: I87b7c657d8813d7fb383ab519d150c0ffb1dd377	2014-05-29 11:14:06 -07:00
Yunqing Wang	1f2200080b	Revert "Making vp9_get_sse_sum_{8x8, 16x16} static." This reverts commit `e8bbb3d9db`. Change-Id: Ie368d36fd249d323d859d208609c711f04537bbc	2014-05-27 13:37:08 -07:00
Deb Mukherjee	444f93945b	Merge "Remove Wextra warnings from vp9_sad.c"	2014-05-27 11:54:05 -07:00
Deb Mukherjee	916550428d	Remove Wextra warnings from vp9_sad.c As a side-effect, the sad unit tests for VP8 and VP9 had to be separated. Change-Id: I068cc2391eed51e9b140ea6aba78338c5fec8d71	2014-05-22 22:21:16 -07:00
Alex Converse	80e5326cf2	Use offset mode info when filling pc tree. Use the appropriate subblock offset mode info rather than the parent block base, when filling mbmi in the pc tree in nonrd_use_partition. This mimics what is done in the vertical case and what is done for both cases in nonrd_pick_partition. This change has little practical effect at the moment since in speed 5 rt horizontal and vertical partitions are currently only used unpaired at edges of the picture. Change-Id: I4632f66ca84086dac56c7d36b45ddbe38a06f42a	2014-05-22 16:24:40 -07:00
Yaowu Xu	04cf82fb04	Merge "Enable various thresholds of motion detection"	2014-05-22 09:09:42 -07:00
Yaowu Xu	3bda7ec1ba	Enable various thresholds of motion detection This commit changed to enable the encoder to adjust motion dection speed threshold based on picture size. In addition, cpu-used 1 now does a partition search every other frame instead of every third frame for low resolution inputs. The change has no quality/speed impact for 720p and above. Test showed the change increase encoding time by between 3% to 6% for cpu-used 2 encodiong of 360p sequences. It also has a compression gain about .3%. For cpu-used 2, the change resolved some very disturbing visual artifacts in certain sequences when large block partitionings and transforms are used as a result of copying the partition from a previous frame. Change-Id: Ic7fd22508cdb811d4ca935655adbf20109286cfa	2014-05-21 12:08:56 -07:00
Dmitry Kovalev	35a83677a5	Moving itxm_add pointer from MACROBLOCKD to MACROBLOCK. The final goal is eventually to get rid of both itxm_add and fwd_txm4x4. This patch does it in the decoder. Change-Id: Ibb3db57efbcbb1ac387c6742538a9fcf2c6f24a5	2014-05-21 11:09:44 -07:00

1 2 3 4 5 ...

931 Commits