generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	eda4e24c0d	Using is_inter_block and has_second_ref functions. Change-Id: I60dee58a4fd24d3c4f3c101a49d30e217309f43a	2013-09-25 19:03:04 -07:00
Guillaume Martres	7755b9dada	Merge "Correctly set the segment_id prediction flag and context"	2013-09-25 18:04:21 -07:00
Yaowu Xu	0c02bfcc2a	Merge "Limit mv search range for first pass and mbgraph"	2013-09-25 17:21:13 -07:00
Dmitry Kovalev	8266da1cd1	Moving from int_mv* to MV* (3). Change-Id: I9795d0937bc07793c13d067281995e0750f694d9	2013-09-25 16:44:19 -07:00
Dmitry Kovalev	f9e2140cab	Merge "Moving from int_mv* to MV* (2)."	2013-09-25 16:12:13 -07:00
Dmitry Kovalev	64eff7f360	Removing vp9_subpelvar.h from common. Moving all code from that file to vp9_variance_c.c in the encoder. Change-Id: Ic803d5b4c78d5191e4d25541b3df97337878fc3e	2013-09-25 16:10:43 -07:00
Dmitry Kovalev	2b5670238b	Merge "Replacing txfm with tx."	2013-09-25 15:57:56 -07:00
Dmitry Kovalev	87a214c277	Merge "Adding vp9_get_entropy_contexts function."	2013-09-25 15:43:55 -07:00
Dmitry Kovalev	9cd14ea6ed	Merge "Removing redundant 'extern' keyword."	2013-09-25 15:42:48 -07:00
Dmitry Kovalev	d445945a84	Adding vp9_get_entropy_contexts function. Change-Id: Ife0dd29fb4ad65c7e12ac5f1db8cea4ed81de488	2013-09-24 17:26:05 -07:00
Dmitry Kovalev	d0365c4a2c	Replacing txfm with tx. Renaming txfm_stepdown_count to tx_stepdown_count and max_txfm_size to max_tx_size. Change-Id: Ifc173e22c78240e561a57c4c741b64b1b8fc6fef	2013-09-24 17:24:35 -07:00
Dmitry Kovalev	450cbfe53a	Cleaning up vp9_update_nmv_count function. Using best_mv[2] array instead of two separate variables. Change-Id: Iefa0a41f5c42c42f2c66cef26750da68405f0f25	2013-09-24 15:55:49 -07:00
Dmitry Kovalev	12d57a9409	Removing redundant 'extern' keyword. Change-Id: Ie51306689c0dc527a8aa12d3984389dd8f360dea	2013-09-24 15:13:09 -07:00
Guillaume Martres	57272e41dd	Correctly set the segment_id prediction flag and context This fixes a bug introduced by `ac6093d179`. Change-Id: I0700a4daf7a6a2471074f81a4596352287fb2ac9	2013-09-24 14:18:27 -07:00
Yaowu Xu	35c5d79e6b	Limit mv search range for first pass and mbgraph Both first pass and mbgraph search use block size 16x16 for motion estimation. This commit puts a limit on the motion vector range. The effective range allows the entire 16x16 block, with the input required for subpel interpolation, to be completely outside the image border, but not any further away from the image border. Change-Id: Id70a5ed08be49e70959f064859d72adc7d775d08	2013-09-24 13:47:29 -07:00
Dmitry Kovalev	b87696ac37	Moving from int_mv* to MV* (2). Updating fractional_mv_step_fp and fractional_mv_step_comp_fp function types. Change-Id: I601c4378bc39ac3ffd4e295d9cbd8e1f74829d46	2013-09-24 12:48:12 -07:00
Dmitry Kovalev	30888742f4	Merge "Moving from int_mv to MV."	2013-09-24 12:25:56 -07:00
Yaowu Xu	71cfaaa689	Merge "Replace memcpy with vpx_memcpy"	2013-09-24 11:35:03 -07:00
Yaowu Xu	9be0bb19df	Replace memcpy with vpx_memcpy Also removed obsolete comment Change-Id: Iae1664777d76383639c637ee786e0d50fc45819a	2013-09-24 10:56:06 -07:00
Yaowu Xu	6037f17942	Rename defined constants The change is to better reflect the nature of the constants. Change-Id: Icabac6e9bceefbdb3f03f8218f88ef75943c30fb	2013-09-24 10:53:01 -07:00
Yaowu Xu	ff1ae7f713	Prevent using uninitialized value in RD decision INT64_MAX may be assigned as RDCOST when RDCOST computation is skipped for speed; this commit prevents INT64_MAX from being used as a real RDCOST in the transform size decision. Change-Id: I89a945134191bbdea1f1431ade70424ac079eaac	2013-09-24 10:53:01 -07:00
Yaowu Xu	fe533c9741	Merge "Change to prevent invalid memory access"	2013-09-24 10:37:17 -07:00
Dmitry Kovalev	f24b9b4f87	Merge "Adding best_mv[2] array instead of two variables."	2013-09-24 10:17:53 -07:00
Deb Mukherjee	f1a627e8a2	Merge "Small tweak in the constant quality parameter"	2013-09-24 09:51:08 -07:00
Jingning Han	9bcd750565	Merge "Enable per transformed block zero coeffs forcing"	2013-09-24 09:18:17 -07:00
Jingning Han	24ad692572	Merge "Calculate rd cost per transformed block"	2013-09-24 09:18:03 -07:00
Deb Mukherjee	b7a93578e5	Small tweak in the constant quality parameter Improves results a little. Change-Id: I7bcac02dbb65b43a993445cf557c520197114e5c	2013-09-24 09:09:35 -07:00
Yunqing Wang	bacb5925ff	Merge "Number of instructions in fdct4_1d_sse2 reduced by two."	2013-09-24 08:40:56 -07:00
Yaowu Xu	92a29c157f	Change to prevent invalid memory access After the change of MI context storage, the mi_8x8[] pointer may be null for a block outside of the image border. The commit changes the code to access the data only after validation of mi_row and mi_col. Change-Id: I039c4eb486a228ea9d8e5f35ab9ae6717d718bf3	2013-09-24 08:36:59 -07:00
A.Mahfoodh	13c7715a75	Number of instructions in fdct4_1d_sse2 reduced by two. Mathematically the results are the same. Change-Id: I1c5126cd3ca64e8515ca6331e0989c6f7dd651a0	2013-09-23 17:23:27 -07:00
Yaowu Xu	838eae3961	Correct 3 step search site initialization `39c7b01d` accidentally reverted the row/col initialization, which broke the mv clamps, which depend on the sites for the valid motion vector range. This commit fixed the issue. Change-Id: Ibcce0226e0360b1ef483fe760b2e33f1af4bf494	2013-09-23 16:11:49 -07:00
Jingning Han	a517343ca3	Enable per transformed block zero coeffs forcing This commit enables forcing all coefficients zero per transformed block, when its rate-distortion cost is lower than regular coeff quantization. The overall performance improvement (including its parent patch on calculating rd cost per transformed block) at speed 1: derf: 0.298% yt: 0.452% hd: 0.741% stdhd: 0.006% Change-Id: I66005fe0fd7af192c3eba32e02fd6d77952accb5	2013-09-23 10:39:35 -07:00
Jingning Han	54c87058bf	Merge "Remove redundant mv_pred use for sub8x8 blocks"	2013-09-23 08:47:21 -07:00
Deb Mukherjee	d11221f433	Improves constant qual, constrained qual turned on Adds modeled functions to decide the qp for altref frames in constant q mode similar to other functions in use in bitrate mode. Also turns on the constrained quality mode (end-usage=2) option which was turned off before. Basic testing shows the mode works in principle, to cap bitrate to the target-bitrate specified, while allowing lower bitrate depending on the cq-level specified. The mode will need to be improved over time. Results for constant quality vs bitrate control mode: derfraw300/fullderfraw: +3.0% at constant quality over bitrate control. fullstdhdraw: +4.341% stdhdraw250: +5.361% Change-Id: If5027c9ec66c8e88d33e47062c6cb84a07b1cda9	2013-09-22 23:04:50 -07:00
Jingning Han	78fbb10642	Calculate rd cost per transformed block This commit makes the rate-distortion optimization loop evaluate the rd costs of regular quantization and all zero coeffs, per transformed block. It improves speed 1 compression performance: derf: 0.245% yt: 0.515% For a large partition that consists of multiple transformed blocks, this allows more flexibility to selectively force a portion of them to be coded as all zero coeffs, as will be continued in the next patches. Change-Id: I211518be4179747b57375696f017d1160cc91851	2013-09-20 12:40:17 -07:00
Dmitry Kovalev	bb5e2bf86a	Adding best_mv[2] array instead of two variables. Change-Id: I584fe50f73879f6a72fada45714ef80893b6d549	2013-09-20 17:08:53 +04:00
Dmitry Kovalev	e51e7a0e8d	Moving from int_mv to MV. Converting vp9_mv_bit_cost, mv_err_cost, and mvsad_err_cost functions for now. Change-Id: I60e3cc20daef773c2adf9a18e30bc85b1c2eb211	2013-09-20 13:52:43 +04:00
Dmitry Kovalev	39c7b01d3c	Cleanup in vp9_init3smotion_compensation. Change-Id: Ie47f53e76bc9530475c8c6d24e9b7a5a0189de56	2013-09-20 12:54:14 +04:00
Dmitry Kovalev	24df77e951	Merge "Adding get_scan_and_band function."	2013-09-20 00:15:06 -07:00
Jingning Han	44b708b4c4	Remove redundant mv_pred use for sub8x8 blocks The sub8x8 blocks have their own motion vector reference scheme. The mv_pred is only used for blocks of sizes 8x8 and above, to find the starting point for motion search. This change does not change any coding behavior. It makes the encoding process slightly faster. (0.5% speed-up for local test on speed 1.) Change-Id: I746ee6ef0eac19aa3621be014afa12be8d82cbb9	2013-09-19 10:32:44 -07:00
Yaowu Xu	79af591368	change to avoid invalid memory read. The fake token EOSB may cause an invalid memory read in pack token; this commit reworked the loop to avoid such an invalid read. Change-Id: I37fdfce869b44a7f90003f82a02f84c45472a457	2013-09-19 08:22:10 -07:00
Yaowu Xu	014acfa2af	fix integer overflow errors Change-Id: I76f440a917832c02d7a727697b225bac66b99f56	2013-09-19 08:14:26 -07:00
Dmitry Kovalev	a23c2a9e7b	Adding get_scan_and_band function. Extracting get_scan_and_band function from get_entropy_context to remove duplicated code. Change-Id: I5da1f5a60263017e887da68bc834317b5f084cb2	2013-09-19 16:53:48 +04:00
Dmitry Kovalev	1600707d35	Merge "Removing redundant code from vp9_mcomp.c."	2013-09-19 00:30:18 -07:00
Dmitry Kovalev	cda802ac86	Merge "Removing redundant coef calculation + cleanup."	2013-09-19 00:28:31 -07:00
Dmitry Kovalev	98cf0145b1	Removing redundant coef calculation + cleanup. Adding temp variable for &x->plane[0], inlining src_diff values. Change-Id: I24c08a5425a6da6fd66f5b0278f2fce74f9989b2	2013-09-18 16:20:10 +04:00
Dmitry Kovalev	72fd127f8c	Removing redundant code from vp9_mcomp.c. Replacing ((1 << MV_MAX_BITS) - 1) with MV_MAX, adding const qualifiers, reusing computed values. Change-Id: I7b46d47f6c644b079d9c3478116a9de465a9baec	2013-09-18 13:11:38 +04:00
Dmitry Kovalev	245ca04bab	Fixing typo in the encoder. Change-Id: I168efdc366eecf638694f357ccad2f4eba7e2fdb	2013-09-18 12:02:22 +04:00
Yaowu Xu	85fd8bdb01	Merge "Silence a bunch of MSVC warnings"	2013-09-17 17:10:58 -07:00
Jingning Han	c437bbcde0	Clean up second ref check in sub8x8 rd loop This commit cleans up the second reference check in the rate-distortion optimization loop of sub8x8 blocks. Change-Id: Ife68feaa4cddbfad2878c9b44d3012788d634f97	2013-09-17 15:59:49 -07:00
Yaowu Xu	a783da80e7	Silence a bunch of MSVC warnings Change-Id: I16633269582a640809dca27572bbe99efa6369fc	2013-09-17 12:08:51 -07:00
Paul Wilkins	84758960db	Merge "Minor clean up."	2013-09-17 03:39:24 -07:00
Paul Wilkins	90a52694f3	Merge "Adjustment to mode_skip_start."	2013-09-17 03:39:15 -07:00
Yaowu Xu	eeae6f946d	fix a problem where an invalid mv used in search The commit added a reset of pred_mv at the beginning of each SB64x64 partition mv search, and also limited the usage of pred_mv to cases where the search on the largest partition is already done. This is to fix a crash at speed 1/2 encoder where an invalid mv is used in mv search. Change-Id: I39010177da76d054e3c90b7899a44feb2e3a5b1b	2013-09-16 12:49:27 -07:00
Paul Wilkins	cb50dc7f33	Minor clean up. Removed some unused code and minor cleanup / reordering. Change-Id: I4083ae56aeb8edfe9b85aa2f42a16aa28d19da94	2013-09-16 13:45:20 +01:00
Paul Wilkins	3b01778450	Adjustment to mode_skip_start. Corrected values relating to modified mode order. Change-Id: I24fccba3af4bc16721d5e7e51888a66305bfa7fe	2013-09-16 13:44:48 +01:00
Jingning Han	e8a967d960	Merge "Adaptive motion search control"	2013-09-13 14:43:23 -07:00
Jingning Han	c4826c5941	Adaptive motion search control This commit enables adaptive constraint on motion search range for smaller partitions, given the motion vectors of the collocated larger partition as a candidate initial search point. It makes speed 0 runtime of bus at CIF and 2000 kbps go from 167s down to 162s (3% speed-up), at 0.01dB performance gains. In the settings of speed 1, this makes the runtime go from 33687 ms to 32142 ms (4.5% speed-up), at 0.03dB performance gains. Compression performance wise, it gains at speed 1: derf 0.118% yt 0.237% hd 0.203% stdhd 0.438% Change-Id: Ic8b34c67810d9504a9579bef2825d3fa54b69454	2013-09-13 13:58:10 -07:00
Deb Mukherjee	0c3038234d	Merge "Clean up of the search best filter speed feature"	2013-09-13 11:03:59 -07:00
Paul Wilkins	5d8642354e	Merge "Fix VP9_mode_order[]"	2013-09-13 09:19:31 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Paul Wilkins	1407cf8588	Fix VP9_mode_order[] Mis-merge of the following change managed to break mode order and delete two mode options (new alt ref and near alt ref). It also created a situation where we could test two undefined modes off the end of the VP9_mode_order[] data structure. "clang warnings : remove split and i4x4_pred fake modes" "Change Id: I8ef3c*" Initial testing on Akiyo at speed 2. 101.35 44.567 44.447 improves to 96.82 44.915 44.815 Approx 0.3-0.4dB gain and 2.5% size reduction Change-Id: Icff813e7c0778d140ad4f0eea18cf1ed203c4e34	2013-09-13 13:33:26 +01:00
Jim Bankoski	324ebb704a	Merge "fix clang warning in rdopt"	2013-09-12 16:39:05 -07:00
Jim Bankoski	9ee9918dad	fix clang warning in rdopt either missed this or it crept back in Change-Id: I6cc1519d09e558be7250254c25bde2ae720555ea	2013-09-12 06:39:42 -07:00
Jim Bankoski	cddde51ec5	Merge "clang warnings : remove split and i4x4_pred fake modes"	2013-09-12 06:20:45 -07:00
Paul Wilkins	66755abff4	Merge "Changes in speed 2 settings"	2013-09-12 02:22:45 -07:00
Jim Bankoski	7fb42d909e	clang warnings : remove split and i4x4_pred fake modes Change-Id: I8ef3c7c0f08f0f1f4ccb8ea4deca4cd8143526ee	2013-09-11 16:34:55 -07:00
Deb Mukherjee	b964646756	Clean up of the search best filter speed feature Removes this speed feature since it is very slow and unlikely to be used in practice. This cleanup removes a bunch of unnecessary complications in the outer encode loop. Change-Id: I3c66ef1ca924fbfad7dadff297c9e7f652d308a1	2013-09-11 15:16:36 -07:00
Jim Bankoski	d09abfa9f7	Merge "resolve clang issue : implicit convert tx_mode -> tx_size"	2013-09-11 13:40:11 -07:00
Deb Mukherjee	69fe840ec4	Changes in speed 2 settings Propose some changes to the speed 2 settings to improve quality. In particular, turns off the adjust_thresholds_by_speed feature which improves results by 6%. Also removes the code for adjust_thresholds_by_speed since it conflicts with the adaptive rd thresh feature. Overall, with this change speed 2 is -15.2% from speed 0 settings, on derf, which is significantly better than -21.6% down before. Change-Id: I6e90a563470979eb0c258ec32d6183ed7ce9a505	2013-09-11 10:54:07 -07:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now consists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), which eliminates unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Jingning Han	65fe7d7605	Merge "Remove redundant condition check in 32x32 quant"	2013-09-10 16:39:18 -07:00
Jingning Han	cb24406da5	Merge "Remove the use of uninitialized_safe in encode_sb_"	2013-09-10 12:05:22 -07:00
Jingning Han	5d93feb6ad	Remove redundant condition check in 32x32 quant The c code implementation of 32x32 quantization does the zbin check of all coefficients prior to the quant/dequant loop, hence removing the redundant zbin check inside the loop. This only affects the c code version. SSSE3 version does not separate the zbin check out. Change-Id: Ic197a7d61d0b25fcac3cc092987651378cb56e4e	2013-09-10 12:04:33 -07:00
Deb Mukherjee	3d22d3ae0c	Merge "Small tweaks on the constant quality mode"	2013-09-10 11:16:47 -07:00
Deb Mukherjee	09830aa0ea	Small tweaks on the constant quality mode Improves results a little. derf is now +1.078% over bitrate control. Change-Id: I4812136f3e67be21d14ec089419976a32a841785	2013-09-10 10:16:19 -07:00
Yunqing Wang	0607abc3dd	Stop partition checking when distortion is small If the current obtained distortion is very small, which happens for static image case, we pick the current partition type without further split checking. This won't affect regular videos. For static videos, we got 10%~12% encoding speed gain. PSNR was better for some clips, and worse for others. Overall it was even. Change-Id: If787a57bedf46fc595ca4f5ded2b0c0a69e9fdef	2013-09-10 10:13:24 -07:00
Yunqing Wang	939791a129	Modify encode breakout for static frames Thanks to Paul for the suggestions. While turning on static-thresh for static-image videos, a big jump in bitrate was seen. In this patch, we detect static frames in the video using first-pass stats. For different cases, we disable encode breakout or reduce the encode breakout threshold to limit the skipping. More modifications need to be done to break the incorrect partition picking pattern for static frames while skipping happens. Change-Id: Ia25f47041af0f04e229c70a0185e12b0ffa6047f	2013-09-10 09:06:03 -07:00
Paul Wilkins	4f660cc018	Modified mode skip functionality. A previous speed feature skipped modes not used in earlier partitions but this no longer worked as intended following changes to the partition coding order and in conjunction with some other speed features (especially speed 2 and above). This modified mode skip feature sets a mask after the first X modes have been tested in each partition depending on the reference frame of the current best case. This patch also makes some changes to the order modes are tested to fit better with this skip functionality. Initial testing suggests speed and rd hit count improvements of up to 20% at speed 1. Quality results. (derf -1.9%, std hd +0.23%). Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1	2013-09-10 13:30:10 +01:00
Paul Wilkins	901c495482	Added extra check to rd_auto_partition_range() Added check that the returned max and minimum are valid in bottom and right border cases. Change-Id: I2d6cdc9b5f04c7d0ff512ddcf3228331e028bf9b	2013-09-10 13:29:23 +01:00
Ivan Maltz	20abe595ec	Merge "API extensions and sample app for spatial scalable encoder"	2013-09-09 16:57:01 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spatial scalable encoder Sample app: vp9_spatial_scalable_encoder. vpx_codec_control extensions: VP9E_SET_SVC, VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER, VP9E_SET_MIN_Q, VP9E_SET_MAX_Q. Expanded buffer size for vp9_convolve. Modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode. Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS). Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers. Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
Jingning Han	18c780a0ff	Remove the use of uninitialized_safe in encode_sb_ Initialize the probability model context with default value in encode_sb. Change-Id: Id826114024dfc21c7ef41aea9f4a0316d4a5cb95	2013-09-09 15:41:16 -07:00
James Zern	c1913c9cf4	Merge "Revert "New mode_info_context storage""	2013-09-09 14:38:01 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Paul Wilkins	740acd6891	Merge "Enable kf restrictions at speed 4"	2013-09-09 05:39:13 -07:00
Jim Bankoski	9faa7e8186	resolve clang issue : implicit convert tx_mode -> tx_size Change-Id: Ifc9da470358f58e800e3d0d70a565b61e5f7834a	2013-09-08 07:17:12 -07:00
Jim Bankoski	e378566060	Merge "New mode_info_context storage"	2013-09-08 07:16:25 -07:00
Jingning Han	09bc942b47	Fix overflow issue in 16x16 quantization SSSE3 The 16x16 transform unit test suggested that the peak coefficient value can reach 32639. This could cause a potential overflow issue in the SSSE3 implementation of 16x16 block quantization. This commit fixes this issue by replacing addition with saturated addition. Change-Id: I6d5bb7c5faad4a927be53292324bd2728690717e	2013-09-06 21:06:10 -07:00
Paul Wilkins	f15cdc7451	Enable kf restrictions at speed 4 Change-Id: I453409d3be3f5fe118b15affde45cb52184aef20	2013-09-06 11:16:04 -07:00
Deb Mukherjee	e378a89bd6	Support a constant quality mode in VP9 Adds a new end-usage option for constant quality encoding in vpx. This first version implemented for VP9, encodes all regular inter frames using the quality specified in the --cq-level= option, while encoding all key frames and golden/altref frames at a quality better than that. The current performance on derfraw300 is +0.910% up from bitrate control, but achieved without multiple recode loops per frame. The decision for qp for each altref/golden/key frame will be improved in subsequent patches based on better use of stats from the first pass. Further, the qp for regular inter frames may also be varied around the provided cq-level. Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3	2013-09-06 10:30:53 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now consists of a pointer to a MODE_INFO struct and an "in the image" flag. The MODE_INFO structs are now stored as a stream, which eliminates unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Jingning Han	1c263d6918	Merge "Use saturated addition in SSSE3 of 32x32 quant"	2013-09-05 14:09:40 -07:00
Jingning Han	458c2833c0	Use saturated addition in SSSE3 of 32x32 quant The 32x32 forward transform can potentially reach a peak coefficient value close to 32700, while the rounding factor can go up to 610. This could cause an overflow issue in the SSSE3 implementation of the 32x32 quantization process. This commit resolves this issue by replacing the addition operations with saturated addition operations in 32x32 block quantization. Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70	2013-09-05 12:49:12 -07:00
Jim Bankoski	9fc3d32a50	Merge "faster accounting of inc_mv"	2013-09-05 12:38:56 -07:00
Paul Wilkins	e5deed06c0	Merge "Attempt to fix speed 4"	2013-09-04 17:19:22 -07:00
Jim Bankoski	bb2313db28	Merge "make vp9 postproc a config option"	2013-09-04 10:35:26 -07:00
Yunqing Wang	9fd2767200	Merge "Use correct bit cost while static-thresh is on"	2013-09-04 10:26:37 -07:00
Jim Bankoski	79401542f7	make vp9 postproc a config option Vp9 postproc is disabled for now as it's not been shown to help and may be merged with vp8. Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057	2013-09-04 10:02:08 -07:00
Jim Bankoski	532179e845	faster accounting of inc_mv Moves counting of mv branches to where we have a new mv, instead of after the whole frame is summed. Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd	2013-09-04 09:47:57 -07:00
Paul Wilkins	49317cddad	Attempt to fix speed 4 Speed 4 fixed partition size. Use fixed size unless it does not fit inside image, in which case use the largest size that does. Change-Id: I250f7a80506750dd82ab355721624a1344247223	2013-09-03 17:46:25 +01:00
Jingning Han	010c0ad0eb	Merge "Fix 32x32 forward transform SSE2 version"	2013-09-03 08:58:03 -07:00
Jingning Han	3cf46fa591	Fix 32x32 forward transform SSE2 version This commit fixed the potential overflow issue in the SSE2 implementation of 32x32 forward DCT. It resolved the corrupted coded frames in the border of scenes. Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9	2013-08-31 18:47:08 -07:00
Yunqing Wang	0ca7855f67	Use correct bit cost while static-thresh is on While static-thresh is on, we only need to transmit skip flag if skip = 1. The cost of skip bit is added to the total rate cost. Change-Id: I64e73e482bc297eba22907026298a15fa8cc3920	2013-08-30 15:25:13 -07:00
Paul Wilkins	2b9baca4f0	Merge "Added per pixel inter rd hit count stats"	2013-08-30 08:56:01 -07:00
Jingning Han	c86c5443eb	Merge "Fix overflow issue in SSSE3 32x32 quantization"	2013-08-29 16:49:04 -07:00
Paul Wilkins	1f4bf79d65	Added per pixel inter rd hit count stats Added some code to output normalized rd hit count stats. In effect this approximates to the average number of rd operations/tests per pixel for the sequence. The results are not quite accurate and I have not bothered to account for partial SB64s at frame edges and for key frames. However they do give some idea of the number of modes / prediction methods being tested for each pixel across the different partition sizes. This indicates how much scope there is for further gains either by reducing the number of partitions examined or the modes per partition through heuristics. Patch 3 moved the place where the count is incremented so partial rd tests that are aborted with INT_MAX return are also counted. Example numbers for first 50 frames of Akiyo: Speed 0 ~84.4 rd operations / pixel; Speed 1 ~28.8; Speed 2 ~11.9. Change-Id: Ib956e787e12f7fa8b12d3a1a2f6cda19a65a6cb8	2013-08-30 00:13:51 +01:00
Deb Mukherjee	b6dbf11ed5	Merge "Adds a speed feature for fast 1-loop forw updates"	2013-08-29 15:54:04 -07:00
James Zern	e83e8f0426	Merge changes Ib1e853f9,Ifd75c809,If3e83404 * changes: consistently name VP9_COMMON variables #3 consistently name VP9_COMMON variables #2 consistently name VP9_COMMON variables #1	2013-08-29 15:50:56 -07:00
Yaowu Xu	ee961599e1	Merge "Fixed potential overflows"	2013-08-29 15:43:26 -07:00
James Zern	d765df2796	consistently name VP9_COMMON variables #3 stragglers Change-Id: Ib1e853f9a331b7b66639dc34d79568d84d1930f1	2013-08-29 13:27:41 -07:00
James Zern	924d74516a	consistently name VP9_COMMON variables #1 pc -> cm Change-Id: If3e83404f574316fdd3b9aace2487b64efdb66f3	2013-08-29 13:25:57 -07:00
Dmitry Kovalev	e80bf802a9	Merge "Renaming txfm_size to tx_size."	2013-08-29 12:30:18 -07:00
Jingning Han	abff678866	Fix overflow issue in SSSE3 32x32 quantization The 32x32 quantization process can potentially have the intermediate stacks over 16-bit range, thereby causing enc/dec mismatch. This commit fixes this overflow issue in the SSSE3 implementation, as well as the prototype, of 32x32 quantization. This fixes issue 607 from webm@googlecode. Change-Id: I85635e6ca236b90c3dcfc40d449215c7b9caa806	2013-08-29 11:00:54 -07:00
Yaowu Xu	aaa7b44460	Fixed potential overflows The two arrays are typically initialized to INT64_MAX, if they are not filled with valid values before the addition, the values can overflow and lead to wrong results. Change-Id: I515de22cf3e8f55af4b74bdb2c8eb821a02d3059	2013-08-29 10:26:52 -07:00
Dmitry Kovalev	b62ddd5f8b	General code cleanup. Switching from mi_{width, height}_log2 and b_{width, height}_log2 to num_8x8_blocks_{wide, high} and num_4x4_blocks_{wide, high}. Removing redundant code, adding const. Change-Id: Iaab2207590fd24d0b76999071778d1395dc5cd5d	2013-08-28 12:22:37 -07:00
Deb Mukherjee	e02dc84c1a	Adds a speed feature for fast 1-loop forw updates Incorporates a speed feature for fast forward updates of coefficients. This feature takes 3 values: 0 - use standard 2-loop version; 1 - use a 1-loop version; 2 - use a 1-loop version with reduced updates. Results: derfraw300 +0.007% (on speed 0) at feature value = 1; -0.160% (on speed 0) at feature value = 2. There is substantial speed up at speeds 2 and above for low resolution sequences where the entropy updates are a big part of the overall computations. Change-Id: Ie96fc50777088a5bd441288bca6111e43d03bcae	2013-08-28 10:56:52 -07:00
Dmitry Kovalev	851a2fd72c	Renaming txfm_size to tx_size. Change-Id: I752e374867d459960995b24d197301d65ad535e3	2013-08-27 19:47:53 -07:00
Jingning Han	eb7acb5524	Merge "Fix buf alignment in sub8x8 comp inter-inter pred"	2013-08-27 19:03:12 -07:00
Dmitry Kovalev	a93992e725	Adding get_entropy_context function. Moving common code from encoder and decoder to this function. Change-Id: I60fa643fb1ddf7ebbff5e83b6c4710137b0195ef	2013-08-27 14:17:53 -07:00
Dmitry Kovalev	7b95f9bf39	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the encoder. Change-Id: I62bb07c377f947cb72fac68add7a6b199e42c6b9	2013-08-27 11:05:08 -07:00
Dmitry Kovalev	ba10aed86d	Merge "Using num_8x8_* lookup tables instead of mi_*_log2."	2013-08-27 10:49:36 -07:00
Dmitry Kovalev	f389ca2acc	Merge "Cleaning up model_rd_for_sb_y_tx."	2013-08-27 10:17:10 -07:00
Dmitry Kovalev	78e670fcf8	Merge "Renaming D27 to D207."	2013-08-27 10:03:57 -07:00
Jingning Han	2d6aadd7e2	Fix buf alignment in sub8x8 comp inter-inter pred This commit resolved a mis-alignment issue in compound inter-inter prediction of sub8x8. This patch follows solution from dkovalev@. Change-Id: I3cc0cf7e55b84110e0c42ef4b2e6ca7ac3f8f932	2013-08-27 09:28:05 -07:00
Yaowu Xu	9482c07953	fixed the reading of too many bytes In subpel_avg_variance functions, code similar to the following: punpckldq m2, [addr] actually reads 8 bytes. For functions that are supposed to work on buffers that have fewer than 8 bytes a line, this caused a valgrind error of reading uninitialized memory. Change-Id: I2a4c079dbdbc747829bd9e2ed85f0018ad2a3a34	2013-08-27 08:39:20 -07:00
Dmitry Kovalev	657ee2d719	Cleaning up model_rd_for_sb_y_tx. Removing references to plane_block_width and plane_block_height (we are going to delete the latter ones). Change-Id: I7982da4d373aebb54d2209dc8886f6192df4d287	2013-08-26 16:18:28 -07:00
Dmitry Kovalev	b25589c6bb	Using num_8x8_* lookup tables instead of mi_*_log2. Change-Id: I8a246b3d056c98be614d05a90bc261e2441ffc10	2013-08-26 14:22:54 -07:00
Yaowu Xu	4505e8accb	Merge "Fix the reading of too many input pixels"	2013-08-26 14:01:50 -07:00
Paul Wilkins	aa823f8667	Merge "Changes to adaptive inter rd thresholds."	2013-08-26 12:48:11 -07:00
Yaowu Xu	6c5433c836	Fix the reading of too many input pixels in VP9_get4x4var_mmx Change-Id: I4b4a8f45f25ebdfad281f169cc87aba5e2d6f227	2013-08-26 12:35:27 -07:00
Paul Wilkins	642696b678	Merge "Limit Key frame Intra modes checks."	2013-08-26 12:34:56 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
Dmitry Kovalev	50ee61db4c	Renaming D27 to D207. I've already renamed d27_predictor to d207_predictor but forgot about the corresponding constant. Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857	2013-08-23 17:33:48 -07:00
Dmitry Kovalev	e6c435b506	Merge "Cleanup in mvref_common.{h, c}."	2013-08-23 17:09:49 -07:00
Yaowu Xu	13930cf569	Limit mv range to be based on partition size Previous change `c4048dbd` limits the mv search range assuming a max block size of 64x64; this commit changes the search range to use the actual block size instead. Change-Id: Ibe07ab02b62bf64bd9f8675d2b997af20a2c7e11	2013-08-23 15:43:57 -07:00
Yaowu Xu	8e04257bc5	Merge "Added border extension"	2013-08-23 14:43:58 -07:00
Dmitry Kovalev	21d8e8590b	Cleanup in mvref_common.{h, c}. Making code more compact, adding consts, removing redundant arguments, adding do/while(0) for macros. Change-Id: Ic9ec0bc58cee0910a5450b7fb8cfbf35fa9d0d16	2013-08-23 12:00:30 -07:00
Yaowu Xu	656632b776	Added border extension To the source buffer to be encoded as an alt ref frame. This is to fix the problem of using uninitialized memory in encoder. See https://code.google.com/p/webm/issues/detail?id=605 Change-Id: I97618a2fc207e08abcf5301b734aa9e3ad695e2c	2013-08-23 11:31:28 -07:00
Dmitry Kovalev	1c159c470a	Merge "Checking scale factors on access."	2013-08-23 11:05:17 -07:00
Paul Wilkins	aa5b67add0	Changes to adaptive inter rd thresholds. Values now carried over frame to frame. Change to algorithm for decreasing threshold after a hit and to max threshold (now based on speed) Removed some old commented out code relating to VP8 adaptive thresholds. The impact of these changes tested on Akiyo (50 frames) and measured in terms of unit rd hits is as follows: Speed 0 84.36 -> 84.67 Speed 1 29.48 -> 22.22 Speed 2 11.76 -> 8.21 Speed 3 12.32 -> 7.21 Encode speed impact is broadly in line with these. Change-Id: I5b886efee3077a11553fa950d796fd6d00c8cb19	2013-08-23 16:18:45 +01:00
Paul Wilkins	f76f52df61	Limit Key frame Intra modes checks. Most of the focus so far has been on inter frames. At high speed settings the key frame is now taking a high % of the cycles. This patch puts in some masking to reduce the number of INTRA modes searched during key frame coding (as already happens for inter frames) at higher speed settings TODO: Develop this further with either adaptive rd thresholds when choosing which intra modes to consider or some other heuristic. Impact. At high speed settings on some clips the key frame was starting to dominate. In a coding of the first 50 frames of AKIYO at speed 2 limiting the key frame intra modes to DC or TM_PRED resulted in ~30% overall speedup. For Bus the number was lower at ~4-5%. Change-Id: I7bde68aee04995f9d9beb13a1902143112e341e2	2013-08-23 16:10:30 +01:00
Jingning Han	9655c2c7a6	Merge "Fix rectangular partition check flag"	2013-08-22 18:59:18 -07:00
Dmitry Kovalev	33104cdd42	Merge "vp9_encodeframe.c cleanup."	2013-08-22 18:07:35 -07:00
James Zern	711aff9d9d	Merge "vp9/encoder: fix last_frame_seg_map mem leak"	2013-08-22 18:04:03 -07:00
James Zern	d843ac5132	Merge "rename LOG2_* defines to *_LOG2"	2013-08-22 18:02:42 -07:00
Jingning Han	84f3b76e1c	Fix rectangular partition check flag Put rectangular partition check flag change according to the rd costs of NONE and SPLIT partition types under the speed feature. Change-Id: If681e1e078a8d43d86961ea4b748da5cd1b6c331	2013-08-22 17:15:01 -07:00
Dmitry Kovalev	604022d40b	vp9_encodeframe.c cleanup. Removing unused get_sbuv_perpixel_variance function, using has_second_ref/ is_inter_block functions, organizing includes. Change-Id: I016de4af12fbbb8b4ece26a70759b2392651b095	2013-08-22 15:50:51 -07:00
Dmitry Kovalev	335b1d360b	check_bsize_coverage cleanup. Change-Id: Ib7803857b35c00e317c9deb8630e777e25eb278f	2013-08-22 15:45:56 -07:00
Dmitry Kovalev	3c42657207	Checking scale factors on access. It is possible to have invalid scale factors and not access them during decoding. Error is reported if we really try to use invalid scale factors. Change-Id: Ie532d3ea7325ee0c7a6ada08269f804350c80fdf	2013-08-22 15:19:05 -07:00
James Zern	40ae02c247	rename LOG2_* defines to *_LOG2 gets rid of a mix of styles Change-Id: I3591d312157bc6f53a25438bf047765c671fd8a8	2013-08-22 14:45:24 -07:00
Dmitry Kovalev	13eed79c77	Merge "Adding vp9_is_scaled function."	2013-08-22 14:39:55 -07:00
James Zern	a5726ac453	vp9/encoder: fix last_frame_seg_map mem leak remove duplicate allocation from vp9_create_compressor, it was added to vp9_alloc_frame_buffers in: `d5bec52` Added resizing & initialization of last frame segment map Change-Id: I996723226a16a62aff8f9a52ac74e0b73cc98fdf	2013-08-22 14:13:04 -07:00
Dmitry Kovalev	640dea4d9d	Adding vp9_is_scaled function. Change-Id: Ieb7077ca3586b9491912027eed450a4f6fd38d30	2013-08-22 14:04:59 -07:00
Jingning Han	01a37177d1	Refactor rd_pick_partition for parameter control This commit changes the partition search order of superblocks from {SPLIT, NONE, HORZ, VERT} to {NONE, SPLIT, HORZ, VERT} for consistency with that of sub8x8 partition search. It enables the use of early termination in partition search for all block sizes. For ped_area_1080p 50 frames coded at 4000 kbps, the runtime goes down from 844305ms -> 818003ms (3% speed-up) at speed 0. This will further move towards making the in-search partition types configurable, hence unifying various speed-up approaches. Some speed 1 and 2 features are turned off during the refactoring process, including: disable_split_var_thresh and using_small_partition_info. Stricter constraints are applied to use_square_partition_only for right/bottom boundary blocks. Will bring back/refine these features subsequently. At this point, it makes derf set at speed 1 about 0.45% higher in compression performance, and 9% down in run-time. Change-Id: I3db9f9d1d1a0d6cbe2e50e49bd9eda1cf705f37c	2013-08-22 12:36:02 -07:00
Deb Mukherjee	8b810c7a78	Fixes on feature disabling split based on variance Adds a couple of minor fixes, which may be absorbed in Jingning's patch. Thanks to Guillaume for pointing these out. Also adjusts the thresholds for speed 1 and 2 to 16 and 32 respectively, to keep quality drops small. Results: -------- derfraw300: threshold = 16, psnr -0.082%, speedup 2-3% threshold = 32, psnr -0.218%, speedup 5-6% stdhdraw250: threshold = 16, psnr -0.031%, speedup 2-3% threshold = 32, psnr -0.273%, speedup 5-6% Change-Id: I4b11ae8296cca6c2a9f644be7e40de7c423b8330	2013-08-22 07:05:44 -07:00
Scott LaVarnway	f39bf458e5	Merge "Initialize mb_skip_coeff before picking modes"	2013-08-22 06:26:04 -07:00
Scott LaVarnway	94bfbaa84e	Initialize mb_skip_coeff before picking modes It appears that the above/left mb_skip_coeff used during mode picking is left over from the previously encoded frame. This patch initializes the flag to the default value of zero. Change-Id: Ida4684cc99611d6e3e82628db35ed717e28ce550	2013-08-22 08:51:04 -04:00
Dmitry Kovalev	cb05a451c6	Merge "Cleaning up optimize_init_b function."	2013-08-22 01:35:27 -07:00
Dmitry Kovalev	64c0f5c592	Merge "Cleaning up sum_intra_stats function."	2013-08-22 01:34:39 -07:00
Jingning Han	fcb890d751	Merge "Enable zero coeff check in sub8x8 UV rd loop"	2013-08-21 22:07:00 -07:00
Dmitry Kovalev	be60924f29	Cleaning up optimize_init_b function. Change-Id: Ib2c975e1d96deefb7ac4d6b600c8c5388035d111	2013-08-21 16:40:16 -07:00
Dmitry Kovalev	048ccb2849	Cleaning up sum_intra_stats function. Using size_group_lookup table and better variable names. Change-Id: I6e67f2ce091845db43ace7d21b7ae31c6f165aec	2013-08-21 16:25:02 -07:00
Dmitry Kovalev	3286abd82e	Merge "Adding scale factor check."	2013-08-21 14:11:13 -07:00
Dmitry Kovalev	2f1a0a0e2c	Removing PLANE_TYPE argument from cost_coeffs function. We can determine plane_type from other function arguments. Change-Id: I85331877aedb357632ae916a37b5b15f22c0bb1f	2013-08-21 13:02:28 -07:00
Dmitry Kovalev	27a984fbd3	Removing a lot of duplicated code. Adding set_contexts function and calling it instead of set_contexts_on_border. Calling txfrm_block_to_raster_xy to get aoff and loff. Change-Id: I41897e344afd2cae1f923f4fdbe63daccf6fe80e	2013-08-21 11:55:12 -07:00
Dmitry Kovalev	a3ae4c87fd	Adding scale factor check. We support only [1/16, 2] scale factors, enforcing this now. Change-Id: I0822eb7cea51720df6814e42d3f35ff340963061	2013-08-21 11:24:47 -07:00
Adrian Grange	ce28d0ca89	Fix typos and minor stylistic cleanup Change-Id: I32e43474e8651ef2eb181d24860a8f118cfea7bf	2013-08-21 08:45:42 -07:00
Dmitry Kovalev	7f814c6bf8	Merge "Passing plane_bsize to foreach_transformed_block_visitor."	2013-08-20 14:25:01 -07:00
Jingning Han	1bf1428654	Enable zero coeff check in sub8x8 UV rd loop Check the minimum rate-distortion cost of regular quantization and all zero coeffs cases in the sub8x8 inter prediction rd loop for luma components. Use this as the cumulative rdcost sent to UV rd estimation. Change-Id: Ia4bc7700437d5e13d7cdad4cf9ae57ab036d3e97	2013-08-20 10:33:42 -07:00
Deb Mukherjee	246381faf2	Merge "Cleanup/enhancements of switchable filter search"	2013-08-20 10:16:51 -07:00
Dmitry Kovalev	5826407f2a	Merge "Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c."	2013-08-20 10:06:22 -07:00
Deb Mukherjee	2ffe64ad5c	Cleanup/enhancements of switchable filter search Cleans up the switchable filter search logic. Also adds a speed feature - a variance threshold - to disable filter search if source variance is lower than this value. Results: derfraw300 threshold = 16, psnr -0.238%, 4-5% speedup (tested on football) threshold = 32, psnr -0.381%, 8-9% speedup (tested on football) threshold = 64, psnr -0.611%, 12-13% speedup (tested on football) threshold = 96, psnr -0.804%, 16-17% speedup (tested on football) Based on these results, the threshold is chosen as 16 for speed 1, 32 for speed 2, 64 for speed 3 and 96 for speed 4. Change-Id: Ib630d39192773b1983d3d349b97973768e170c04	2013-08-20 09:47:04 -07:00
Jingning Han	bb64c9a355	Merge "Enable early termination in uv rd loop"	2013-08-20 09:07:26 -07:00
Paul Wilkins	e8923fe492	Changes to auto partition size selection. Changes to code to auto select a partition size range based on data from spatial neighbors. Now looks at the sb_type in each 8x8 block of above and left SB64. The effect on speed 1 is now weaker giving better quality but less speed gain. Now also used in speed 2. Change-Id: Iace33a97d5c3498dd2a9a8a4067351941abcbabc	2013-08-20 14:05:39 +01:00
Yaowu Xu	c4048dbdd3	Change to limit the mv search range As the pixel values beyond the image border are duplicates of pixels on the edge, the change limits the mv search range: any mv beyond the limits no longer produces new/different prediction values, as the entire block, together with the pixels used for subpel interpolation, is outside the image border. Change-Id: I4c6fdf06e33c1cef1489f5470ce0fb4e5e01fb79	2013-08-19 17:19:36 -07:00
Yaowu Xu	f70330a906	fix a bug when null function pointer is used. For certain partition sizes, the function pointer may not be initialized at all. The patch prevents the call if the pointer is not set. Change-Id: I78b8c3992b639e8799a16b3c74f0973d07b8b9ac	2013-08-19 17:16:12 -07:00
Dmitry Kovalev	569ca37d09	Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c. Change-Id: Ib8af21f2e7f603c2fb407e5d15a3bba64b545b49	2013-08-19 16:44:10 -07:00
Jingning Han	3275ad701a	Enable early termination in uv rd loop This commit enables early termination in the rate-distortion optimization search loop for chroma components. When the cumulative rd cost is above the current best value, skip the rest per-block transform/quantization/coeff_cost and continue to the next prediction mode. For bus_cif at 2000 kbps, the average run-time goes down from 168546ms -> 164678ms, (2% speed-up) at speed 0 36197ms -> 34465ms, (4% speed-up) at speed 1 Change-Id: I9d3043864126e62bd0166250d66b3170d520b3c0	2013-08-19 16:31:19 -07:00
Dmitry Kovalev	82d4d9a008	Passing plane_bsize to foreach_transformed_block_visitor. Updating all foreach_transformed_block_visitor functions to work with plane block size instead of general block. Removing a lot of duplicated code. Change-Id: I6a9069e27528c611f5a648e1da0c5a5fd17f1bb4	2013-08-19 15:47:24 -07:00
Jingning Han	31c97c2bdf	Merge "Fix potential use of uninitialized value"	2013-08-19 15:15:58 -07:00
Jingning Han	5dc0b309ab	Merge "Fix the returned distortion value in rd_pick_intra"	2013-08-19 14:34:19 -07:00
Dmitry Kovalev	2e3478a593	Using plane_bsize instead of bsize. This change set is intermediate. The next one will remove all repetitive plane_bsize calculations, because it will be passed as argument to foreach_transformed_block_visitor. Change-Id: Ifc12e0b330e017c6851a28746b3a5460b9bf7f0b	2013-08-19 13:20:21 -07:00
Jingning Han	b34ce04378	Fix potential use of uninitialized value Initialize the best mode and tx_size values in the rate-distortion optimization search loop. Change-Id: Ibfb5c0895691f172abcd4265c23aef4cb99fa8af	2013-08-19 11:15:53 -07:00
Jingning Han	f67919ae86	Fix the returned distortion value in rd_pick_intra Return the distortion value in vp9_rd_pick_intra_mode_sb as sum of dist_y and dist_uv. Remove the right shift operation on dist_uv, and make it consistent with that of vp9_rd_pick_inter_mode_sb. Change-Id: I9d564e242d9add38e32595d33b0e0dddb1d55e5b	2013-08-16 21:23:22 -07:00
Dmitry Kovalev	26e5b5e25d	Removing unused or redundant arguments from *_args structures. Redundant dst, pre[2] from build_inter_predictors_args, unused cm from encode_b_args. Change-Id: I2c476cd328c5c0cca4c78ba451ca6ba2a2c37e2d	2013-08-16 12:51:20 -07:00
Dmitry Kovalev	367cb10fcf	Merge "Moving from ss_txfrm_size to tx_size."	2013-08-16 12:46:45 -07:00
Adrian Grange	79f4c1b9a4	Fixed typos and formatting Change-Id: I3814984a624bc64147c57efa74fbdda8eda47262	2013-08-16 09:15:26 -07:00
Dmitry Kovalev	afd9bd3e3c	Moving from ss_txfrm_size to tx_size. Updating foreach_transformed_block_visitor and corresponding functions to accept tx_size instead of ss_txfrm_size. List of functions per file: vp9_decodframe.c decode_block decode_block_intra vp9_detokenize.c decode_block vp9_encodemb.c optimize_block vp9_xform_quant vp9_encode_block_intra vp9_rdopt.c dist_block rate_block block_yrd_txfm vp9_tokenize.c set_entropy_context_b tokenize_b is_skippable Change-Id: I351bf563eb36cf34db71c3f06b9bbc9a61b55b73	2013-08-15 17:03:03 -07:00
Jingning Han	5e80a49307	Merge "Refactor rd loop for chroma components"	2013-08-15 16:02:12 -07:00
Dmitry Kovalev	9451e8d37e	Merge "Converting code from using ss_txfrm_size to tx_size."	2013-08-15 15:21:09 -07:00
Dmitry Kovalev	939b1e4a8c	Merge "Moving segmentation struct from MACROBLOCKD to VP9_COMMON."	2013-08-15 15:14:32 -07:00
Jingning Han	68369ca897	Refactor rd loop for chroma components This commit makes the rate-distortion optimization search of chroma components consistent across all block sizes. It removes redundant codes. Change-Id: I7e76f54d045e8efdd41d84a164c71f55b484471b	2013-08-15 14:54:48 -07:00
Jingning Han	c2ff1882ff	Merge "Remove unused RDCOST_8X8 macro"	2013-08-15 13:48:25 -07:00
Jingning Han	ca983f34f7	Merge "Unify luma and chroma rd-cost estimation"	2013-08-15 13:48:15 -07:00
Dmitry Kovalev	bb3b817c1e	Converting code from using ss_txfrm_size to tx_size. Updated function signatures: txfrm_block_to_raster_block txfrm_block_to_raster_xy extend_for_intra vp9_optimize_b Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a	2013-08-15 11:44:57 -07:00
Dmitry Kovalev	6f4fa44c42	Using { 0 } for initialization instead of memset. Change-Id: I4fad357465022d14bfc7e13b348c6da267587314	2013-08-15 11:37:56 -07:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place for the segmentation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Jingning Han	b0646f9e98	Remove unused RDCOST_8X8 macro Change-Id: I17c7d7eaa60fe69c543403c340f7c1078bfd339f	2013-08-15 10:40:44 -07:00
Dmitry Kovalev	4d73416099	Merge "Quantization code cleanup."	2013-08-15 10:23:01 -07:00
Deb Mukherjee	24856b6abc	Speed feature to skip split partition based on var Adds a speed feature to disable split partition search based on a given threshold on the source variance. A tighter threshold derived from the threshold provided is used to also disable horizontal and vertical partitions. Results on derfraw300: threshold = 16, psnr = -0.057%, speedup ~1% (football) threshold = 32, psnr = -0.150%, speedup ~4-5% (football) threshold = 64, psnr = -0.570%, speedup ~10-12% (football) Results on stdhdraw250: threshold = 32, psnr = -0.18%, speedup is somewhat more than derf because of a larger number of smoother blocks at higher resolution. Based on these results, a threshold of 32 is chosen for speed 1, and a threshold of 64 is chosen for speeds 2 and above. Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6	2013-08-15 10:01:45 -07:00
Jingning Han	ec01f52ffa	Unify luma and chroma rd-cost estimation This commit unifies the rate-distortion cost calculation process of luma and chroma components. It allows early termination to be enabled later in the rd search loop of chroma components, consistent with luma pixels. Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d	2013-08-15 09:41:33 -07:00
Paul Wilkins	1a3641d91b	Merge "Renaming in MB_MODE_INFO"	2013-08-15 02:12:48 -07:00
Dmitry Kovalev	bb072000e8	foreach_transformed_block_in_plane cleanup, explicit tx_size var. Making foreach_transformed_block_in_plane more clear (it's not finished yet). Using explicit tx_size variable consistently instead of (ss_txfrm_size / 2) or (ss_txfrm_size >> 1) expression. Change-Id: I1b9bba2c0a9f817fca72c88324bbe6004766fb7d	2013-08-14 11:39:31 -07:00
Paul Wilkins	26fead7ecf	Renaming in MB_MODE_INFO The macro block mode info context originally contained an entry for each 16x16 macroblock. In VP9 each entry refers to an 8x8 region not a macro block, so the naming is misleading. This first stage clean up changes the names of 3 entries in the structure to remove the mb_ prefix. TODO clean up the nomenclature more widely in respect of mbmi and bmi. Change-Id: Ia7305c6d0cb805dfe8cdc98dad21338f502e49c6	2013-08-14 12:47:52 +01:00
Paul Wilkins	54979b4350	Merge "Honor min_partition_size properly for non-square splits"	2013-08-14 04:45:18 -07:00
Guillaume Martres	fc50477082	Honor min_partition_size properly for non-square splits Don't do vertical or horizontal splits if subsize < min_partition_size, except for edge blocks where it makes sense. Change-Id: I479aa66ba1838d227b5de8312d46be184a8d6401	2013-08-13 15:24:03 -07:00
Guillaume Martres	ecb78b3e0c	Merge "Trivial clean up."	2013-08-13 12:40:37 -07:00
Jingning Han	7e0f88b6be	Use lookup table to find largest txfm size Refactor choose_largest_txfm_size_ and make it find the largest transform size via lookup table. Change-Id: I685e0396d71111b599d5367ab1b9c934bd5490c8	2013-08-13 10:32:14 -07:00
Jingning Han	dc70fbe42d	Merge "Refactor model based tx search in super_block_yrd"	2013-08-13 08:48:49 -07:00
Paul Wilkins	5459f68d71	Trivial clean up. Delete unused / commented out variable references. Change-Id: Iaf20c0c3744f89adb296d153b516b5ea41b4f3b4	2013-08-13 13:26:18 +01:00
Paul Wilkins	8e35263bed	Merge "Honor min_partition_size properly"	2013-08-13 05:19:51 -07:00
Jingning Han	78136edcdc	SSE2 high precision 32x32 forward DCT Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56	2013-08-12 16:52:53 -07:00
Jingning Han	14cc7b319f	Refactor model based tx search in super_block_yrd Remove unnecessary conditional branches in model-based transform size search. Change-Id: Ic862dc33ed6710a186f6248239dd5f09b5c19981	2013-08-12 16:34:48 -07:00
Dmitry Kovalev	98e3d73e16	Merge "Using MV* instead of int_mv* as argument of vp9_clamp_mv_min_max."	2013-08-12 15:53:25 -07:00
Dmitry Kovalev	9d5885b0ab	Quantization code cleanup. Change-Id: I77b42418b852093f79260cbd880533a0bd86678f	2013-08-12 15:23:47 -07:00
Dmitry Kovalev	c66320b3e4	Merge "Entropy context related cleanups."	2013-08-12 15:18:24 -07:00
Dmitry Kovalev	1aedfc992a	Using MV* instead of int_mv* as argument of vp9_clamp_mv_min_max. Change-Id: I3c45916a9059f11b41e9d798e34ffee052969a44	2013-08-12 13:56:04 -07:00
Jingning Han	3984b41c87	Fix a compile failure in vp9_get_compressed_data The lf struct is now with VP9_COMMON, instead of MACROBLOCKD. Change-Id: Idfdd4f91f78f486078a138322d58bb61e93e1bc9	2013-08-12 11:42:17 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Dmitry Kovalev	097046ae28	Merge "Removing redundant code and function arguments."	2013-08-11 12:20:58 -07:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Guillaume Martres	58b07a6f9d	Honor min_partition_size properly It represents the minimum partition size, so don't split if bsize == min_partition_size . Change-Id: Id77c32d6afef7d2ddec0368eaae18fb13227d30e	2013-08-09 17:28:33 -07:00
Dmitry Kovalev	67fe9d17cb	Removing redundant code and function arguments. Change-Id: Ia5cdda0f755befcd1e64397452c42cb7031ca574	2013-08-09 17:24:40 -07:00
Dmitry Kovalev	e7c5ca8983	Merge "Inlining 16 as a stride for BLOCK_OFFSET macro."	2013-08-09 17:22:46 -07:00
James Zern	ef101af8ae	Merge "vp9_rd_pick_inter_mode_sb: fix uninitialized value"	2013-08-09 17:13:32 -07:00
Dmitry Kovalev	f1559bdeaf	Inlining 16 as a stride for BLOCK_OFFSET macro. Change-Id: I7f23d174eb089e5500f268a10db09648634c1b82	2013-08-09 16:40:05 -07:00
James Zern	f295774d43	vp9_rd_pick_inter_mode_sb: fix uninitialized value 'skippable' can remain unset and negatively affect later decisions address one aspect of issue #599 Change-Id: Iffdf0ac2e49ac481c27dc27c87fa546d4167bb28	2013-08-09 16:26:22 -07:00
Dmitry Kovalev	cd0629fe68	Merge "Removing plane_block_{width, height}_log2by4 functions."	2013-08-09 15:26:51 -07:00
Dmitry Kovalev	816d6c989c	Moving loopfilter struct to VP9_COMMON. Loop filter configuration doesn't belong to macroblock, so moving it from MACROBLOCKD to VP9_COMMON. Also moving the declaration of loopfilter struct from vp9_blockd.h to vp9_loopfilter.h. Change-Id: I4b3e34be9623b47cda35f9b1f9951f8c5b1d5d28	2013-08-09 14:41:51 -07:00
Scott LaVarnway	41251ae558	Bug fix: call set_offsets before rd_auto_partition_range The set_offsets call is necessary in order to set the mode_info_context ptr correctly. Change-Id: I644910cc5bacc50ee9cd78458843274ad8ee636d	2013-08-09 14:09:49 -04:00
Yaowu Xu	6ec2b85bad	Added lpf level picking using partial frame Change-Id: I599ab1bd22b5f3f10d5962c609952abdef8ff67a	2013-08-09 07:37:08 -07:00
Yaowu Xu	6a7a4ba753	renamed vp8_yv12_copy_y to vpx_yv12_copy_y Because the routine is used by both vp8 and vp9 Change-Id: I2d35b287b5bc2394865d931a27da61f4ce7edeeb	2013-08-09 07:37:08 -07:00
Yaowu Xu	c7c9901845	added a speed feature on lpf level picking Change-Id: Id578f8afdeab3702fc8386969f2d832d8f1b5420	2013-08-09 07:36:32 -07:00
Dmitry Kovalev	6a8ec3eac2	General code cleanup. Removing redundant parenthesis and curly braces. Combining declarations with initializations. Adding useful intermediate variables instead of recalculating expressions every time. Change-Id: I00106f404afd60bfc189905b0fded881684f941a	2013-08-08 21:12:34 -07:00
Deb Mukherjee	2158909fc3	Merge "Adds a new subpel motion function"	2013-08-08 12:26:55 -07:00
Deb Mukherjee	1ba91a84ad	Adds a new subpel motion function Adds a new subpel motion estimation function that uses a 2-level tree-structured decision tree to eliminate redundant computations. It searches fewer points than iterative search (which can search the same point multiple times) but has the same quality roughly. This is made the default setting at speeds 0 and 1, while at speed 2 and above only a 1-level search is used. Also includes various cleanups for consistency and redundancy removal. Results: derf: +0.012% psnr stdhd: +0.09% psnr Speedup of about 2-3% Change-Id: Iedde4866f5475586dea0f0ba4cb7428fba24eee9	2013-08-08 11:41:49 -07:00
Adrian Grange	83ee80c045	Moved fast motion search level decision to function Moving this block of code into a function makes the code easier to read and change. Change-Id: If4ede570cce1eab1982b188c4d3e4fd3d4db236e	2013-08-08 11:01:44 -07:00
Adrian Grange	aae6a4c895	Simplify & fix potential bug in rd_pick_partition Different partitionings were not being evaluated against best_rd and there were unnecessary calls to RDCOST. This could have resulted in a non-optimal partitioning being selected. I simplified the variables used to track the rate, distortion and RD values throughout the function. Change-Id: Ifa7085ee80d824e86791432a5bc6d8fea5a3e313	2013-08-08 09:55:45 -07:00
Jingning Han	6bfcce8c7a	Merge "Use low precision 32x32fdct for encodemb in speed1"	2013-08-07 19:05:14 -07:00
Dmitry Kovalev	61c33d0ad5	Removing plane_block_{width, height}_log2by4 functions. Change-Id: I040b82b8e32aee272d10cbb021c7ba1c76343d7a	2013-08-07 17:06:33 -07:00
Dmitry Kovalev	1492698ed3	Merge "Adding ss_size_lookup table."	2013-08-07 16:08:24 -07:00
Jingning Han	debb9c68c8	Use low precision 32x32fdct for encodemb in speed1 The low precision 32x32 fdct has all the intermediate steps within 16-bit depth, hence allowing faster SSE2 implementation, at the expense of larger round-trip error. It was used in the rate-distortion optimization search loop only. Using the low precision version in place of the high precision one affects the compression performance by about 0.7% (derf, stdhd) at speed 0. For speed 1, it makes derf set down by only 0.017%. Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b	2013-08-07 15:34:12 -07:00
Dmitry Kovalev	8db2675b97	Adding ss_size_lookup table. Removing the old one bsize_from_dim_lookup. Now we have a way to determine block size for plane using its subsampling values (ss_size_lookup). And then we can find the number of pixels in the block (num_pels_log2_lookup). Change-Id: I6fc981da2ae093de81741d3d78eaefed11015db9	2013-08-07 15:33:17 -07:00
Dmitry Kovalev	ea2348ca29	Merge "Removing NMS_STATS defines."	2013-08-07 15:28:30 -07:00
Deb Mukherjee	296931c817	Merge "Clean ups of the subpel search functions"	2013-08-06 17:28:48 -07:00
Deb Mukherjee	71b43b0ff0	Clean ups of the subpel search functions Removes some unused code and speed features, and organizes the interfaces for fractional mv step functions for use in new speed features to come. In the process a new speed feature - number of iterations per step during the subpel search - is exposed. No change when this parameter is set as the original value of 3. Results: subpel_iters_per_step = 3: baseline subpel_iters_per_step = 2: psnr -0.067%, 1% speedup subpel_iters_per_step = 1: psnr -0.331%, 3-4% speedup Change-Id: I2eba8a21f6461be8caf56af04a5337257a5693a8	2013-08-06 17:23:50 -07:00
Jingning Han	2c091f9768	Merge "Place holder for high-precision 32x32 fdct"	2013-08-06 14:47:30 -07:00
Jim Bankoski	5b307886fb	variance x86inc guards also fixed bug in sad calcs Change-Id: I6571fcbe37556c16ae32be66dc0fd879852aac1d	2013-08-06 14:17:13 -07:00
Deb Mukherjee	fac7c8c9f9	Merge "Flexible support for various pattern searches"	2013-08-06 14:03:27 -07:00
Dmitry Kovalev	8725ca2ed2	Merge "Inlining vp9_get_pred_probs_switchable_interp function."	2013-08-06 11:57:45 -07:00
Deb Mukherjee	15b5a6a2c7	Flexible support for various pattern searches Adds a few pattern searches to achieve various tradeoffs between motion estimation complexity and performance. The search framework is unified across these searches so that a common pattern search function is used for all. Besides, it will be easier to experiment with various patterns or combinations thereof at different scales in the future. The new pattern search is multi-scale and is capable of using different patterns at different scales. The new hex search uses 8 points at the smallest scale and 6 points at other scales. Two other pattern searches - big-diamond and square - are also added. Big diamond uses 4 points at the smallest scale and 8 points in diamond shape at the larger scales. Square is very similar conceptually to the default n-step search but is somewhat faster since it keeps only one survivor across all scales. Psnr/speed-up results on derf300: hex: -1.6% psnr, 6-8% speed-up big-diamond: -0.96% psnr, 4-5% speedup square: -0.93% psnr, 4-5% speedup Change-Id: I02a7ef5193f762601e0994e2c99399a3535a43d2	2013-08-06 11:56:39 -07:00
Jingning Han	28566a6cd5	Place holder for high-precision 32x32 fdct Resolve compile warnings on re-define FDCT32x32_2D template. Change-Id: Idb3a54ef8d2710ce7245b726379a0e5c875f5cad	2013-08-06 11:44:08 -07:00
Dmitry Kovalev	0c80065694	Inlining vp9_get_pred_probs_switchable_interp function. There was no benefit having this function. For example, inside read_switchable_filter_type switchable filter context was calculated twice. Change-Id: I79cd5bf95cbc0f6d8bf91a2e32289e01b18dcff1	2013-08-06 11:04:31 -07:00
Jingning Han	7d61f8fe53	Merge "Move fdct32x32 SSE2 implementation in separate file."	2013-08-06 10:46:41 -07:00
Dmitry Kovalev	3e51acafec	Merge "Finally removing all old block size constants."	2013-08-06 10:30:37 -07:00
Dmitry Kovalev	4a692e4168	Merge "Changing the order switchable filter enum constants."	2013-08-06 10:30:26 -07:00
Dmitry Kovalev	25b7dc08cd	Merge "Removing unused functions."	2013-08-06 10:29:57 -07:00
Deb Mukherjee	33afddadb9	Merge "Add variance based mode/skipping"	2013-08-06 10:19:15 -07:00
Christian Duvivier	3d98205fce	Move fdct32x32 SSE2 implementation in separate file. This is in preparation for the SSE2 version of the high-precision 32x32 forward DCT which will share a lot of code with the existing low precision version used for rate-distortion search. Change-Id: I7084b6bdfb480b1fabb8493fb14e3f7fcc7888c0	2013-08-06 10:17:11 -07:00
Dmitry Kovalev	b9c7d04e95	Finally removing all old block size constants. Change-Id: I3aae21e88b876d53ecc955260479980ffe04ad8d	2013-08-05 15:23:49 -07:00
Deb Mukherjee	8b3faccb9e	Add variance based mode/skipping Adds a speed feature to skip all intra modes other than DC_PRED if the source variance is small. This feature is made part of speed 1 and up. Results on derf300: psnr -0.07%, speedup about 1-2% Also uses the source variance to fine-tune the early termination criteria when FLAG_EARLY_TERMINATE is on. This feature is made part of speed 2 and up. Results on derf300: psnr -0.52%, speedup about 5-7% Change-Id: I59e38aa836557cfa5405ae706fc64815cbfe4232	2013-08-05 14:14:01 -07:00
Jim Bankoski	9f988a2edf	Merge "cleanups after bw bh code"	2013-08-05 14:02:02 -07:00
Dmitry Kovalev	3f611555d7	Changing the order switchable filter enum constants. This changeset allows us to remove the vp9_switchable_interp and vp9_switchable_interp_map arrays and makes the code much clearer. Actually we still have to use these mappings, but only inside the read_interp_filter_type and write_interp_filter_type functions. Change-Id: I4026c6f8c4acefba6c81421b7bacbaa52cc45f50	2013-08-05 12:26:15 -07:00
Jim Bankoski	5d2cb7ead0	cleanups after bw bh code Const bw/bh parms that should have been const. Additional formatting. Change-Id: Icd36a5c9dc17dadd7284315ac0d6fef1a565ca16	2013-08-05 12:15:52 -07:00
Dmitry Kovalev	d007446b3f	Replacing long block size enum values with shorter ones (2). Change-Id: I428c4d42212b757112e3acfe5b81314cfbb5fd6b	2013-08-05 10:51:02 -07:00
Dmitry Kovalev	fe2a201eb1	Replacing "txfm" with "tx" in identifiers. Consistent names with TX_SIZE, TX_MODE, and TX_MODE. Change-Id: I79592218bf5a40ace89197a34a06ee7de581ed8d	2013-08-02 17:28:23 -07:00
Dmitry Kovalev	5edc65d00d	Removing NMS_STATS defines. Change-Id: Iabab0e59042a33456df1d449c0d0f01debc00c7c	2013-08-02 17:10:15 -07:00
Dmitry Kovalev	7b50333e8f	Merge "Adding is_inter_block function."	2013-08-02 16:54:32 -07:00
Dmitry Kovalev	fec4ec4edd	Removing unused functions. Removed functions: model_rd_for_sb_y, block_error_sby, get_sb_variance Change-Id: Iec458df180caf6f8eac3605773841a4121dd3a8f	2013-08-02 16:41:09 -07:00
Dmitry Kovalev	603931e291	Merge "Changing function arg type from int_mv* to MV*."	2013-08-02 16:30:06 -07:00
Dmitry Kovalev	a6adc82e78	Merge "Cleanups around allow_high_precision_mv flag."	2013-08-02 16:27:05 -07:00
Dmitry Kovalev	680ec32d18	Adding is_inter_block function. Using it instead of long unclear verbose check "mbmi->ref_frame[0] != INTRA_FRAME". Change-Id: I9c7b4b3797942fa962bf3ba7460fff3084beabe9	2013-08-02 16:25:33 -07:00
Dmitry Kovalev	d4e020c4b1	Merge "Cleaning up set_contexts_on_border function."	2013-08-02 16:22:50 -07:00
Yunqing Wang	d340c114fb	Merge "Add more checking to using_small_partition_info"	2013-08-02 15:55:09 -07:00
Dmitry Kovalev	769bcab3f5	Cleaning up set_contexts_on_border function. Change-Id: I8f21c18b29f54b277fb1c167f278f109d9f3b996	2013-08-02 15:52:26 -07:00
Dmitry Kovalev	25b77e2569	Changing function arg type from int_mv* to MV*. Change-Id: Ic878d31df2ce783a2c9a8c4bc9ed301ec8ffe25e	2013-08-02 15:26:32 -07:00
Adrian Grange	60ff123536	Merge "Fixed typos and added a few explanatory comments"	2013-08-02 11:37:47 -07:00
Adrian Grange	075b11f004	Merge "Changed name of rd_pick_intra4x4mby_modes"	2013-08-02 11:36:46 -07:00
Dmitry Kovalev	86053d3ae2	Cleanups around allow_high_precision_mv flag. Change-Id: Ic07f5f8ffeaedd5b7513b464871f83afc82dcd5c	2013-08-02 11:21:16 -07:00
Dmitry Kovalev	b47153deed	Replacing long block size enum values with shorter ones. Change-Id: I0e9329490828684a4fd46f540d89114cc68e8407	2013-08-02 10:48:27 -07:00
Yunqing Wang	0d68080445	Merge "Comment out 2 unused speed features"	2013-08-02 09:58:46 -07:00
Dmitry Kovalev	741537f3ce	Cleanup: replacing xd->seg with seg, and xd->lf with lf. Change-Id: I73b59d7699a8e7e7acd3bf8041cb6c98ce9ba4bf	2013-08-01 15:38:16 -07:00
Dmitry Kovalev	9f4f001ba5	Merge "Cleanup: removing unused function arguments."	2013-08-01 15:07:12 -07:00
Dmitry Kovalev	ddf02e323a	Merge "Nice looking motion vector clamping functions."	2013-08-01 14:50:14 -07:00
Dmitry Kovalev	ce8dedc353	Cleanup: removing unused function arguments. Change-Id: I27471768980fc631916069f24bc7c482a5c9ca17	2013-08-01 13:41:38 -07:00
Dmitry Kovalev	b621e2d72e	Nice looking motion vector clamping functions. Removing assign_and_clamp_mv function, making implementation of clamp_mv and clamp_mv2 more clear and consistent. Change-Id: Iecd08e1c1bf0379f8314ebe01811f8253f4ade58	2013-08-01 13:40:26 -07:00
Deb Mukherjee	dbea726daf	Adds a source variance computation function Adds a function to compute source variance for various sb_types to be used for pruning mode and partition searches. [The existing activity measure function is currently specialized for only 16x16 MBs and needs to be updated]. Change-Id: I22a41e6f1430184201487326fdbebb9b47e6fc24	2013-08-01 13:01:54 -07:00
Yunqing Wang	215b010f4b	Add more checking to using_small_partition_info If the partition is out of partition size range, we don't need to process small partition information. Change-Id: Ice9bfbbdebe1f2ef79271a3aee17de0ed4608376	2013-08-01 11:37:41 -07:00
Yunqing Wang	7965a6ea34	Comment out 2 unused speed features use_min_partition_size and use_max_partition_size are not used currently, and could be added back if needed later. Change-Id: Ib22a9c06b064567a7c1d6d5445567ed77e0d3acc	2013-08-01 11:03:34 -07:00
Dmitry Kovalev	ff4bfa726b	Merge "Adding missing const to vp9_extra_bits array."	2013-08-01 10:19:51 -07:00
Adrian Grange	89e73c63c0	Fixed typos and added a few explanatory comments Change-Id: Ib4e4b41094b54874ee34343dd77c0c131ceed9d2	2013-08-01 09:23:49 -07:00
Adrian Grange	5271d47892	Changed name of rd_pick_intra4x4mby_modes The function name rd_pick_intra4x4mby_modes is confusing, so I changed it to rd_pick_intra_sub_8x8_y_modes to better reflect what the function does. Also added const qualifiers to some of the input parameters and removed camel-case. Change-Id: I23d53d4c7af5d79ed8a471acd59a09bbb47add39	2013-08-01 09:23:49 -07:00
Dmitry Kovalev	5b65246a71	Adding missing const to vp9_extra_bits array. Change-Id: Icd128ab58719e0b9066bdfa66a5d0d427a84d6df	2013-07-31 18:51:18 -07:00
Jingning Han	12f5762756	Remove unnecessary arguments in rd_pick_ref_frame This commit removes redundant arguments passed to rd_pick_reference_frame. This resolves the clang warnings about potential use of uninitialized values. Change-Id: Ic68f949a9f8fcd0a583786b0c75321104ea44739	2013-07-31 17:04:13 -07:00
Dmitry Kovalev	9239e96536	Removing get_mi_{row, col} functions. Passing mi_row and mi_col parameters to functions explicitly. Removing unused xd argument from scale_mv function. Change-Id: Icb4c495ec72d26fb066c14470d3ae0b741fbf18a	2013-07-31 14:06:55 -07:00
Dmitry Kovalev	3be9fd9120	Merge "Removing unused "ishp" arguments."	2013-07-31 12:03:04 -07:00
Dmitry Kovalev	0e0a6f840b	Merge "Consistent update for inter_mode probabilities."	2013-07-31 12:02:35 -07:00
Dmitry Kovalev	500ade243a	Removing unused "ishp" arguments. Using different variable names "allow_hp" and "use_hp" instead of "usehp". Change-Id: I0cd5996ddeb46bd754473b680a993c0aaf8eb879	2013-07-31 11:27:53 -07:00
Jingning Han	ac7bab7575	Merge "Make the use of ref_frame index consistent"	2013-07-31 09:11:37 -07:00
Jingning Han	86c384d398	Make the use of ref_frame index consistent Refactor the frame buffer referencing in choose_partition and make it consistent with other places. This is meant to prevent potential issues when we extend the reference frame buffer. Change-Id: I5ff33ed5f671e1f4cc7049622212769a9b4578d9	2013-07-30 19:49:36 -07:00
Dmitry Kovalev	8701bc11df	Consistent update for inter_mode probabilities. Using inter-mode counts instead of inter-mode-tree branch counts inside FRAME_COUNTS structure. Change-Id: I60dde13af37d06146d7d15543311c1b5044e9e04	2013-07-30 18:06:34 -07:00
Adrian Grange	fbd73648dd	Merge "Cleanup typos, remove unnecessary lines, replace switch"	2013-07-30 12:59:46 -07:00
Adrian Grange	b30a06b930	Cleanup typos, remove unnecessary lines, replace switch Removed unnecessary code lines, replaced switch with an if, fixed spelling errors and formatting. Change-Id: Ie48aa4604aa0ed48362ca359d792fb21b2ec1dc6	2013-07-30 12:10:32 -07:00
Yaowu Xu	88e48444da	Merge "removed duplication"	2013-07-30 09:38:02 -07:00
Yaowu Xu	a15d1f3134	removed duplication Change-Id: Ica23b66f6664e5a5b168499584f0afffbc54794f	2013-07-30 09:09:14 -07:00
Jingning Han	525745b17a	Remove a redundant branching in tokenize_b The tokenize_b function is only called when output flag is on. Hence removing the conditional branch on it therein. Change-Id: Ib709f47f23f39ca05a695faf86fa3377f11f2dd0	2013-07-29 17:08:13 -07:00
Jingning Han	455f2de20b	Tune tokenization/detokenization flow for speed-up This commit optimizes the tokenization and detokenization operational flow for speed-up. It makes the coding process about 0.3% faster at speed 0. Change-Id: I28008df7482874e4b5f237f2d418ff82a249dd56	2013-07-29 16:15:30 -07:00
Jingning Han	b5323ed89a	Skip redundant tokenization in rd loop This commit makes the encoder skip the redundant tokenization process in the rate-distortion optimization search loop, while updating the entropy contexts accordingly. It makes the speed 0 encoding process about 0.5% faster at no performance change. Change-Id: I34a4155a0b5332afeb45c93a51c7f35a294d685c	2013-07-29 16:09:16 -07:00
Jingning Han	5875d7a4a4	Merge "16x16 inverse 2D-DCT with DC only"	2013-07-29 15:29:25 -07:00
Jingning Han	a7c4de22e1	16x16 inverse 2D-DCT with DC only This commit adds special handling for the 16x16 inverse 2D-DCT when only the DC coefficient is quantized to a non-zero value. Change-Id: I7bf71be7fa13384fab453dc8742b5b50e77a277c	2013-07-29 14:45:53 -07:00
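Why a DC-only block allows a shortcut can be sketched as follows, in Python for brevity. The sketch assumes an orthonormal floating-point 2D DCT; the codec's fixed-point transform uses different scaling and rounding, and both function names are hypothetical. With only the DC coefficient non-zero, every output sample is the same constant, so the full transform can be replaced by one division and a fill.

```python
import math


def idct2d_dc_only(dc, n):
    """Inverse 2D DCT of an n x n block whose only non-zero coefficient is DC."""
    value = dc / n  # orthonormal scaling: c(0) * c(0) == 1 / n
    return [[value] * n for _ in range(n)]


def idct2d_full(coeffs):
    """Naive orthonormal inverse 2D DCT, used here only as a reference."""
    n = len(coeffs)

    def c(k):
        return math.sqrt((1.0 if k == 0 else 2.0) / n)

    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            acc = 0.0
            for u in range(n):
                for v in range(n):
                    acc += (c(u) * c(v) * coeffs[u][v]
                            * math.cos((2 * i + 1) * u * math.pi / (2 * n))
                            * math.cos((2 * j + 1) * v * math.pi / (2 * n)))
            out[i][j] = acc
    return out
```

The reference transform costs on the order of n^4 multiplications in this naive form, while the shortcut costs a single division, which is the reason such a special case can pay off when DC-only blocks are common.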
Dmitry Kovalev	828119d6ab	Renaming txfm to tx for consistency in some places. Change-Id: I2a6a646570e2af66315e7c658d00d99f80c4b127	2013-07-29 14:35:55 -07:00
Dmitry Kovalev	730a34416f	Renaming NB_TXFM_MODES constant to TX_MODES. Change-Id: I10bf06e3a3d5271221ae6a42a36074d01d493039	2013-07-29 13:38:40 -07:00
Dmitry Kovalev	23391ea835	Renaming TX_SIZE_MAX_SB to TX_SIZES. Change-Id: I6aa4191935aa93461a07c41b59fdae1eb5f5f107	2013-07-29 12:25:34 -07:00
Jingning Han	decb1b94de	Merge "Shortcut 8x8/16x16 inverse 2D-DCT"	2013-07-29 11:04:07 -07:00
Ronald S. Bultje	118ccdcd30	Inverse dimension order in token_cost array. This allows us to increment the position at the band-level only as we go from one band to the next; more importantly, that allows us to use an add instead of multiply instruction, and omit the instruction altogether if the band doesn't change from one coef to the next, thus being slightly faster (probably more noticeable on systems where a multiply is expensive, like arm). Change-Id: I4343fe35b9f9a47fa00b217bdcbf5f91ff96c381	2013-07-26 17:30:04 -07:00
Ronald S. Bultje	dcacce6dd9	Merge "Save pixels instead of coefficients in intra4x4 RD loop."	2013-07-26 17:20:58 -07:00
Ronald S. Bultje	d30c8f41ef	Merge "Add best_rd breakout in intra4x4 RD loop."	2013-07-26 17:20:51 -07:00
Jingning Han	38fa487164	Shortcut 8x8/16x16 inverse 2D-DCT This commit brought back the shortcut implementation of 8x8/16x16 inverse 2D-DCT. When the eob <= 10, it skips the inverse transform operations on row 4:7/4:15 in the first round. For bus_cif at 1000 kbps, this provides about 2% speed-up at speed 0. Change-Id: I453e2d72956467d75be4ad8c04b4482ab889d572	2013-07-26 17:19:14 -07:00
Jingning Han	b9c3dd481a	Merge "Special handle on DC only inverse 8x8 2D-DCT"	2013-07-26 16:04:14 -07:00
Jingning Han	325e0aa650	Special handle on DC only inverse 8x8 2D-DCT This commit enables special handling for the 8x8 inverse 2D-DCT when only the DC coefficient is quantized to a non-zero value. For bus_cif at 2000 kbps, it provides about 1% speed-up at speed 0. Change-Id: I2523222359eec26b144cf8fd4c63a4ad63b1b011	2013-07-26 14:16:51 -07:00
Dmitry Kovalev	c09b81719f	Merge "General cleanups."	2013-07-26 13:59:39 -07:00
Yaowu Xu	4f75a1f4ed	Merge "Auto min and max partition size experiment."	2013-07-26 12:10:27 -07:00
Paul Wilkins	fe5e2a91bb	Auto min and max partition size experiment. Speed feature experiment to set an upper and lower partition size limit based on what has been seen in spatial neighbors. This seems to give quite reasonable speed gains in local tests (10-15%) and when used with speed 0 the losses are small (0.25% derf, 0.35% stdhd). However, for now I am only enabling it on speed 1 as there may be clashes with the existing temporal partition selection in speed 2. Using a tighter min / max around the range derived from the neighbors increases speed further but at the cost of a bigger quality loss. However, I think this spatial method could be combined with data from either the last frame or a variance method (or both) to refine the range of minimum and maximum partition size. I.e. consider the min and max from spatial and temporal neighbors and the variance recommendation. Change-Id: I1b96bf8b84368d6aad0c7aa600fe141b4f07435f	2013-07-26 18:30:49 +01:00
Yunqing Wang	52256cdbca	Modify static threshold calculation Used 3 * standard_deviation in internal threshold calculation instead of fit curve. This actually approached the algorithm better. For comparison, similar tests were done. The overall psnr loss is less than before. 1. derf set: when static-thresh = 1, psnr loss is 0.329%; when static-thresh = 500, psnr loss is 0.970%. 2. stdhd set: when static-thresh = 1, psnr loss is 0.922%; when static-thresh = 500, psnr loss is 1.307%. Similar speedup is achieved. For example (clip, bitrate, static-thresh, psnr, time): akiyo(cif), 500, 0, 48.952, 5.077s(50f); akiyo, 500, 500, 48.866, 4.169s(50f); parkjoy(1080p), 4000, 0, 30.388, 78.20s(30f); parkjoy, 4000, 500, 30.367, 70.85s(30f); sunflower(1080p), 4000, 0, 44.402, 74.55s(30f); sunflower, 4000, 500, 44.414, 68.69s(30f). Change-Id: Ic78833642ce1911dbbd1cb6c899a2d7e2dfcc1f3	2013-07-25 19:59:33 -07:00
Yunqing Wang	845fd5011c	Merge "Add encoding option --static-thresh"	2013-07-25 14:58:00 -07:00
Yunqing Wang	d36852b702	Add encoding option --static-thresh This option exists in VP8, and it was rewritten in VP9 to support skipping on different partition levels. After prediction is done, we can check if the residuals in the partition block will be all quantized to 0. If this is true, the skip flag is set, and only prediction data are needed in reconstruction. Based on DCT's energy conservation property, the skipping check can be estimated in spatial domain. The prediction error is calculated and compared to a threshold. The threshold is determined by the dequant values, and also adjusted by partition sizes. To be precise, the DC and AC parts for Y, U, and V planes are checked to decide skipping or not. Tests showed: 1. derf set: when static-thresh = 1, psnr loss is 0.666%; when static-thresh = 500, psnr loss is 1.162%. 2. stdhd set: when static-thresh = 1, psnr loss is 1.249%; when static-thresh = 500, psnr loss is 1.668%. For different clips, the encoding speedup ranges from several percent to 20+% when static-thresh <= 500. For example (clip, bitrate, static-thresh, psnr, time): akiyo(cif), 500, 0, 48.923, 5.635s(50f); akiyo, 500, 500, 48.863, 4.402s(50f); parkjoy(1080p), 4000, 0, 30.380, 77.54s(30f); parkjoy, 4000, 500, 30.384, 69.59s(30f); sunflower(1080p), 4000, 0, 44.461, 85.2s(30f); sunflower, 4000, 500, 44.418, 78.1s(30f). Higher static-thresh values give larger speedup with larger quality loss. Change-Id: I857031ceb466ff314ab580ac5ec5d18542203c53	2013-07-25 14:28:05 -07:00
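The spatial-domain skip check described above can be sketched roughly as follows, in Python for brevity. This is an illustration under stated assumptions, not the encoder's code: it assumes an orthonormal transform (so residual energy equals coefficient energy) and a round-to-nearest quantizer (so a coefficient smaller in magnitude than half the dequant step becomes zero). The function names are hypothetical, and the adjustments by partition size and by the --static-thresh value are left out.

```python
def block_energies(residual):
    """Split the energy of a flattened residual block into DC and AC parts."""
    n = len(residual)
    total = sum(residual)
    sse = sum(r * r for r in residual)
    dc_energy = total * total / n   # square of the orthonormal DC coefficient
    ac_energy = sse - dc_energy     # remaining energy, by energy conservation
    return dc_energy, ac_energy


def can_skip(planes, dc_dequant, ac_dequant):
    """Return True if every plane's residual is sure to quantize to all zeros.

    planes is a list of flattened residual blocks (for example Y, U, V).
    """
    dc_limit = (dc_dequant / 2.0) ** 2
    ac_limit = (ac_dequant / 2.0) ** 2
    for residual in planes:
        dc_energy, ac_energy = block_energies(residual)
        # If the total AC energy is below the limit, no single AC
        # coefficient can reach half a quantizer step.
        if dc_energy >= dc_limit or ac_energy >= ac_limit:
            return False
    return True
```

The AC test is a sufficient condition only: a block can fail it and still quantize to zero, so the sketch errs on the side of not skipping.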
Dmitry Kovalev	7131cb0e3d	General cleanups. Removing unused constants, macros, and function declarations. Using ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving #include from .h to .c. Merging for loops for motion vectors. Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13	2013-07-25 14:13:48 -07:00
Dmitry Kovalev	d53fc9ee4e	Merge "Adding lookup table for size group."	2013-07-25 13:57:28 -07:00
Dmitry Kovalev	08fd41ccd7	Adding lookup table for size group. Change-Id: Ia6144d77ebed66e0739b62e4d673e26a95aa9550	2013-07-25 12:58:54 -07:00
Adrian Grange	e862c6f9eb	Merge "Simplify handling of sub-partition motion vectors"	2013-07-25 12:58:38 -07:00
Adrian Grange	6f0f0e4907	Merge "Use local variables rather than structure members"	2013-07-25 12:57:52 -07:00
Dmitry Kovalev	d604914f09	Merge "Removing vp9_adapt_mode_context function."	2013-07-25 12:46:31 -07:00
Jingning Han	d571af76d3	Merge "Make coeff_optimize initialized per-plane"	2013-07-25 12:46:14 -07:00
Yaowu Xu	51a8458822	Merge "fix a bug where flags are not reset"	2013-07-25 12:18:51 -07:00
Adrian Grange	be700e140a	Simplify handling of sub-partition motion vectors Simplified the code that extracts and uses the motion vectors for the 4 sub-partitions in rd_pick_partition. Change-Id: Iaf698ef7ee3aef9edd59015e1ae065dd359b17d9	2013-07-25 11:51:51 -07:00
Jingning Han	2f58faffa4	Make coeff_optimize initialized per-plane This commit makes the initialization of trellis coeff optimization a per-plane operation, thereby eliminating the redundant steps in encode_sby and encode_sbuv. It makes the encoder at speed 0 slightly faster. Change-Id: Iffe9faca6a109dafc0dd69dc7273cbdec19b17cd	2013-07-25 11:44:29 -07:00
Dmitry Kovalev	47d61f008f	Removing vp9_adapt_mode_context function. Moving code from vp9_adapt_mode_context to vp9_adapt_mode_probs. Change-Id: I60829c30b28968cd813551ef3a206dfb98d323c9	2013-07-25 10:48:45 -07:00
Yaowu Xu	3e386aefc2	fix a bug where flags are not reset The feature that uses small partition results as a measure to skip mode evaluation at larger partition requires the flags to be reset. The reset was missing in the code path that calls rd_use_partition(). Change-Id: Ia0a3a0aee1a862b6e2333d596808db7c48033d50	2013-07-25 10:28:38 -07:00
Scott LaVarnway	a0e8b45fee	Merge "pack_inter_mode_mvs cleanup"	2013-07-25 04:47:56 -07:00
Dmitry Kovalev	fcc34796d2	Removing CONFIG_BALANCED_COEFTREE experiment. Change-Id: I61a8b0101eac3ee2e0621d56151b90c269fd4db4	2013-07-24 15:53:42 -07:00
Dmitry Kovalev	9139ee0908	Adding condition inside get_tx_type_{4x4, 8x8, 16x16}. Adding plane type check condition because it was always used outside of get_tx_type_{4x4, 8x8, 16x16}. Change-Id: I02f0bbfee8063474865bd903eb25b54d26e07230	2013-07-24 12:55:45 -07:00
Adrian Grange	4cfd36d8fd	Use local variables rather than structure members Although local copies of the mode member variables (mode, ref_frame) were made, they were not used in all places. Also, made a local copy of the second_ref_frame member. Change-Id: I84d8c822e5cb3d8a02fc3de8a4037ca3fea8bfad	2013-07-24 11:17:44 -07:00
Adrian Grange	a183f17d33	Merge "Correct spelling mistakes"	2013-07-24 09:48:57 -07:00
Ronald S. Bultje	7817d3221f	Save pixels instead of coefficients in intra4x4 RD loop. Prevents doing duplicate IDCTs; encoding of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.0 to 1min3.5, i.e. 0.87% faster overall. Change-Id: I2df39e29ed9d5ea5e7d2704a34940ba622832ddd	2013-07-24 09:03:20 -07:00
Ronald S. Bultje	b72ecbb1b9	Add best_rd breakout in intra4x4 RD loop. Encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min5.4 to 1min4.0, i.e. 2.2% faster overall. Change-Id: I8c32f2aff9a649ce7dd49d910dc5ba16b99c3bc6	2013-07-24 09:02:05 -07:00
Adrian Grange	bc8b0529db	Correct spelling mistakes Change-Id: Id4138293efeac4503b2e01ce7a6c150a5abeef77	2013-07-24 07:58:26 -07:00
Ronald S. Bultje	47336afd8d	Merge "More optimizations for cost_coeffs()."	2013-07-23 21:36:12 -07:00
Jingning Han	666c266623	Merge "Unify the use of encode_b_args/optimize_block_args"	2013-07-23 18:08:50 -07:00
Dmitry Kovalev	1099a436d3	Moving counts from FRAME_CONTEXT to new struct FRAME_COUNTS. Counts are separate from frame context. We have several frame contexts but need only one copy of all counts. Change-Id: I5279b0321cb450bbea7049adaa9275306a7cef7d	2013-07-23 17:02:08 -07:00
Jingning Han	ab77828b36	Unify the use of encode_b_args/optimize_block_args The struct optimize_block_args is defined same as encode_b_args. Remove this redundant definition, and use encode_b_args consistently. Change-Id: I1703aeeb3bacf92e98a34f4355202712110173d9	2013-07-23 16:04:02 -07:00
Dmitry Kovalev	8d13b0d1df	Removing LOW_PRECISION_MV_UPDATE define. Change-Id: I78d16ee758e1fae0200b746f00031f6d9c6d6ce7	2013-07-23 15:41:45 -07:00
Dmitry Kovalev	a9bbabd94b	Merge "Removing vp9_is_interpolating_filter array."	2013-07-23 15:01:19 -07:00
Adrian Grange	719cd35f3a	Merge "Rolled-up several for loops into one"	2013-07-23 15:00:06 -07:00
Adrian Grange	646edbc1b2	Rolled-up several for loops into one Several consecutive for loops executed over the same index range, so I rolled them into one. Change-Id: I5cfcc8c38c738478965768409cca9d09adf224e1	2013-07-23 14:32:21 -07:00
Dmitry Kovalev	db7f5d28b9	Removing vp9_is_interpolating_filter array. All filters are interpolating now, so we don't need this array, all values from this array are evaluated to true. Change-Id: I9af6d8219ae0eb984063cd15e4e2296374ae4961	2013-07-23 14:24:39 -07:00
Dmitry Kovalev	2855d8aea1	Merge "Adding update_tx_counts function."	2013-07-23 13:57:59 -07:00
Jingning Han	e9e2fe8ec3	Make xform_quant operations tx_type independent The xform_quant() module is only used by inter modes, hence removing the redundant switches therein conditioned on tx_type. Change-Id: Ib87ce5b2f2e4cbf3ceb133a1108afa173c933a3f	2013-07-23 12:37:25 -07:00
James Zern	8dede954c7	Merge "vp9: make some static tables const"	2013-07-23 11:37:01 -07:00
Jingning Han	4ef1d35abf	Merge "Skip inverse transform when eob is zero"	2013-07-23 10:31:19 -07:00
Deb Mukherjee	9360fd3dcf	Merge "Diamond search change to accelerate movement"	2013-07-23 10:14:10 -07:00
Jingning Han	0359ad7f9a	Skip inverse transform when eob is zero When all the transform coefficients were quantized to zero, skip the inverse transform operation. For bus_cif at 1000 kbps, the runtime goes from 154967ms -> 149842ms, i.e., about 3% speed-up, at speed 0. Change-Id: Ic0a813fff5e28972d4888ee42d8747846a6c3cc6	2013-07-23 10:06:41 -07:00
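The shortcut in the commit above can be sketched as follows, in Python for brevity. The function and parameter names are hypothetical; eob stands for the end-of-block position, which is zero when every coefficient was quantized to zero. In that case the residual is all zeros, so the reconstruction equals the prediction and the inverse transform can be bypassed.

```python
def reconstruct_block(prediction, dqcoeff, eob, inverse_transform):
    """Rebuild a block of 8-bit pixels from its prediction and coefficients.

    inverse_transform is any callable that maps the dequantized
    coefficients to a residual of the same length as prediction.
    """
    if eob == 0:
        # No non-zero coefficients: nothing to add to the prediction.
        return list(prediction)
    residual = inverse_transform(dqcoeff)
    # Add the residual and clip to the 8-bit pixel range.
    return [max(0, min(255, p + r)) for p, r in zip(prediction, residual)]
```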
Paul Wilkins	cedd24ec61	Merge "Renaming of segment constants."	2013-07-23 08:16:12 -07:00
Scott LaVarnway	7bc294a3fe	pack_inter_mode_mvs cleanup xd->mode_info_context is set to m prior to this call. Change-Id: Ibc442529961750c29ccf0c6cae08cb2b0431415f	2013-07-23 10:08:28 -04:00
Jim Bankoski	256ee00093	Merge "clean up bw, bh"	2013-07-23 06:58:28 -07:00
Jim Bankoski	86a9dec73c	clean up bw, bh many structures use bw and bh and they have different meanings. This cl attempts to start this clean up and remove unnecessary 2 step look up log and then shift operations... also removed partition type multiple operation code in bitstream.c. Change-Id: I7e03e552bdfc0939738e430862e3073d30fdd5db	2013-07-23 06:51:44 -07:00
Scott LaVarnway	2fd20eb37d	Merge "Eliminated prev_mip memsets/memcpys in encoder"	2013-07-23 06:43:52 -07:00
Paul Wilkins	7c134bc0cd	Merge "Reworked the auto_mv_step_size speed feature"	2013-07-23 04:49:55 -07:00
Paul Wilkins	32042af14b	Renaming of segment constants. Renamed: MAX_MB_SEGMENTS to MAX_SEGMENTS; MB_SEG_TREE_PROBS to SEG_TREE_PROBS. The minimum unit for segmentation in the segment map is now 8x8 so it is misleading to use MB_, as macro-block traditionally refers to a 16x16 region. Change-Id: I0b55a6f0426bb46dd13435fcfa5bae0a30a7fa22	2013-07-23 12:09:04 +01:00
James Zern	3c8cce353f	vp9: make some static tables const Change-Id: I8bcae51271673da8755c66a51aea005dfe6a3739	2013-07-22 19:19:13 -07:00
Ronald S. Bultje	e20fcd9585	More optimizations for cost_coeffs(). 4x4: 163 -> 123 cycles (33% faster); 8x8: 491 -> 399 cycles (23% faster); 16x16: 1889 -> 1763 cycles (7% faster); 32x32: 8311 -> 8180 cycles (1.6% faster). Overall encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.33 to 1min3.00, i.e. 2.11% faster. Change-Id: Ib52d1dbb5649b14de769d3e7a74af67440b5284f	2013-07-22 16:09:09 -07:00
Dmitry Kovalev	b2fc6fa969	Adding update_tx_counts function. Moving common encoder/decoder code to update_tx_counts. Also renaming vp9_get_pred_probs_tx_size to get_tx_probs2 and adding get_tx_probs to call vp9_get_pred_context_tx_size inside read_selected_tx_size only once (twice before). Change-Id: Ia50247f3893de88ef8e9041b0d44be44a40aaa4d	2013-07-22 14:57:43 -07:00
Yaowu Xu	6261d79206	Merge "fix a build error"	2013-07-22 13:02:15 -07:00
James Zern	76db4d599a	Merge "VP[89]_COMMON: remove golden/altref frame counts"	2013-07-22 12:55:07 -07:00
Yaowu Xu	fc186dcad6	fix a build error Change-Id: I3b05687f439ff6a7c426d2c97a6c58c831fa51ac	2013-07-22 12:37:30 -07:00
Jingning Han	416f315e82	Merge "Skip buffer update in sub8x8 rd loop"	2013-07-22 12:08:22 -07:00
Jingning Han	a5a9f5f7f3	Merge "Optimize operation flow in sub8x8 rd loop"	2013-07-22 12:08:15 -07:00
Deb Mukherjee	a1e2d50be9	Diamond search change to accelerate movement Optional change in diamond search to continue in the best move direction until that move turns worse. This is still WIP since the exact way the new method is to be used is under investigation. One option is to make it an option in diamond search and use it only when motion is large. Overall slightly positive on derfraw300 +0.02%, stdhdraw +0.13%, but works a lot better for high motion sequences (e.g. football: +1%). Change-Id: If88e01a6021daa0cda934680cdc70be1ee04f798	2013-07-22 11:19:15 -07:00
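The idea of continuing in the best move direction until that move turns worse can be sketched as follows, in Python for brevity. This is a simplified illustration, not the encoder's search: it uses a fixed one-pixel step and four neighbors, where a real diamond search works with a schedule of step sizes. The function name, the cost callback, and the search_range and max_iters parameters are all hypothetical.

```python
def diamond_search(cost, start, search_range=64, max_iters=100):
    """Greedy neighbor search that keeps moving in the best direction.

    cost maps a (row, col) position to a number; lower is better.
    """
    def bounded_cost(pos):
        # Positions outside the search window are never chosen.
        if abs(pos[0]) > search_range or abs(pos[1]) > search_range:
            return float("inf")
        return cost(pos)

    directions = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    best, best_cost = start, bounded_cost(start)
    for _ in range(max_iters):
        candidates = [((best[0] + dr, best[1] + dc), (dr, dc))
                      for dr, dc in directions]
        pos, step = min(candidates, key=lambda item: bounded_cost(item[0]))
        pos_cost = bounded_cost(pos)
        if pos_cost >= best_cost:
            break  # no neighbor improves on the current position
        # Keep stepping in the winning direction while the cost improves.
        while pos_cost < best_cost:
            best, best_cost = pos, pos_cost
            pos = (best[0] + step[0], best[1] + step[1])
            pos_cost = bounded_cost(pos)
    return best, best_cost
```

With a smooth cost surface and a distant minimum, the inner loop covers the distance with one cost evaluation per step instead of four, which is consistent with the gains reported for high-motion sequences.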
Paul Wilkins	3798d7a641	Merge "Re-order mode search in rd."	2013-07-22 10:46:04 -07:00
Jingning Han	409e77f2d4	Optimize operation flow in sub8x8 rd loop Stack the rate-distortion statistics in the sub8x8 rd loop. This allows the encoder to skip the forward transform, quantization, and coeff cost estimation, in the sub8x8 rd optimization search, if the motion vector(s) are of integer pixel value, and have been tested in the previous prediction filter type rd loops of the same block. This gives about 2% speed-up for bus_cif at 2000 kbps, for speed 0. Its efficacy depends on how frequently the motion search will select an integer motion vector. Change-Id: Iee15d4283ad4adea05522c1d40b198b127e6dd97	2013-07-22 10:40:33 -07:00
Paul Wilkins	1d189d6464	Re-order mode search in rd. Mode search order in rd loop changed to better reflect observed hit counts. Also some adjustment of the baseline mode rd thresholds to reflect the order change and observed frequencies. Change-Id: I47a131cc83e11551df8add6d6d8d413d78d3a63c	2013-07-22 17:21:12 +01:00
Jim Bankoski	9ad604c6fb	Merge "fix left over overflow"	2013-07-22 08:51:26 -07:00
Jim Bankoski	2ac8b50cd8	fix left over overflow This cl fixes issues rbultje brought up that I somehow neglected when I submitted yaowu's patch. Change-Id: I07ad18796317822510b96e951c88d29f194a3c2e	2013-07-22 06:39:39 -07:00
Paul Wilkins	888375d243	Fix build error. When CONFIG_POSTPROC is set there was a now invalid reference to cm->filter_level. Changed to cpi->mb.e_mbd.lf.filter_level in line with change Iaf5fb71c33719cdfa1b991f671caf071be9ea035 Change-Id: If746e60044903f7ba8d0d346225b3d015226c7d0	2013-07-22 14:01:43 +01:00
Dmitry Kovalev	ee1fe2f750	Merge "Removing pre probabilities from FRAME_CONTEXT."	2013-07-20 22:50:32 -07:00
Dmitry Kovalev	8962d975b2	Merge "Moving all loop filter related variables into new struct."	2013-07-20 22:45:24 -07:00
Dmitry Kovalev	39342db138	Merge "Consistent names for inter mode probabilities and encodings."	2013-07-20 22:40:51 -07:00
Dmitry Kovalev	f66821afbb	Merge "Removing frame_type field from MACROBLOCKD struct."	2013-07-20 22:40:06 -07:00
Dmitry Kovalev	2b089f149a	Merge "Removing unused static arrays from vp9_ratectrl.c."	2013-07-20 22:39:33 -07:00
Jingning Han	c725502bf3	Skip buffer update in sub8x8 rd loop This commit allows the encoder to skip a few buffer update steps in rd_pick_best_mbsegmentation, when early breakout has been triggered in the rd_check_segment_txsize. It provides about 1% speed-up for bus_cif at 2000 kbps, in the settings of speed 0. Change-Id: Ica034f10a24dec572b397d8389a2b81020ebc0b9	2013-07-20 21:38:12 -07:00
Yaowu Xu	ea284d6281	added checks to prevent rate/distortion overflow At speed 2, due to the threshold scheme used, it is possible for the rate and distortion to be assigned the INT_MAX value. The patch added checks to prevent the INT_MAX value from being used in further calculation of RD scores. The patch also changed the assertion in rd_use_partition() to mirror the similar assertion in rd_pick_partition(). Change-Id: Idb52c543cc1e10abdf6e6a5d6e9cb535a42214dc	2013-07-19 17:52:50 -07:00
Dmitry Kovalev	7e703de729	Removing pre probabilities from FRAME_CONTEXT. Using cm->frame_contexts[cm->frame_context_idx] as source of previous probabilities. Change-Id: Ie03778acf0e7bebdc3a1f6a51854d4a0712f24a1	2013-07-19 17:33:10 -07:00
Dmitry Kovalev	ee1771ebaa	Moving all loop filter related variables into new struct. Adding loopfilter struct with fields from MACROBLOCKD and VP9Common. Eventually it will be moved to vp9_loopfilter.h for better code structure. Change-Id: Iaf5fb71c33719cdfa1b991f671caf071be9ea035	2013-07-19 16:19:10 -07:00
Dmitry Kovalev	29f0f79317	Removing unused static arrays from vp9_ratectrl.c. Removed arrays: kf_boost_seperation_adjustment, gf_adjust_table, gf_intra_usage_adjustment, gf_interval_table. Change-Id: I62e400cb6e4d039787615169a3779e31ebf95893	2013-07-19 15:55:09 -07:00
Dmitry Kovalev	c3a56ee583	Merge "Moving Scale2Ratio function from vp9_onyx.h to vp9_onyx_if.c."	2013-07-19 15:27:24 -07:00
Deb Mukherjee	302698fb12	Reworked the auto_mv_step_size speed feature This patch modifies the auto_mv_step_size speed feature to use a combination of the maximum magnitude mv from the last inter frame, and the maximum magnitude mv for the two reference mvs with the same reference. For arf frames, the max mv step for the resolution is used. The bounds therefore are slightly tighter. The feature is made a speed 1 feature. Rebased. Results (when this feature is turned on over speed 0): derfraw300: -0.046% psnr, about 5+% speedup (tested on football: goes from 4m30.760s to 4m17.410s). Change-Id: If492797a61b0b4b3e58c0b8f86afb880165fc9f6	2013-07-19 15:12:56 -07:00
Dmitry Kovalev	e71a4a77bb	Merge "Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE)."	2013-07-19 12:14:32 -07:00
Dmitry Kovalev	97e96bc4e9	Removing frame_type field from MACROBLOCKD struct. Change-Id: Ia4e83913251c1cdc7aa2abd64bf01ecb1a962119	2013-07-19 11:55:36 -07:00
Dmitry Kovalev	c0eb57406c	Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE). Moving TX_MODE enum to vp9_enums.h. Renaming txfm_mode variables to tx_mode. Change-Id: I459d1af6dd928ce7fccdf8ce30b6f1ca057bef92	2013-07-19 11:37:13 -07:00
Dmitry Kovalev	afe43d4089	Removing redundant VP9_COMMON* from function signatures. Functions: vp9_get_pred_context_switchable_interp, vp9_get_pred_context_intra_inter, vp9_get_pred_context_single_ref_p1, vp9_get_pred_context_single_ref_p2. Change-Id: I3d6fb8aee23c9062270768e1e6da416dd9bb8f96	2013-07-19 11:20:49 -07:00

2054 Commits