generic-library/vpx

Author	SHA1	Message	Date
hkuang	25e5552630	Remove border extension in intra frame prediction. Change-Id: Id677df4d3dbbed6fdf7319ca6464f19cf32c8176	2013-12-16 14:05:58 -08:00
Dmitry Kovalev	b5c9261832	Converting vp9_treecoder.h to vp9_prob.{h, c} Moving vp9_norm probability table from vp9_entropy.c to vp9_prob.c Change-Id: Ie757b73860c6f43130790c332b292e2a1a81b788	2013-12-16 12:53:09 -08:00
Frank Galligan	fbada948fa	Add frame buffer lru cache. Add an option for libvpx to return the least recently used frame buffer. Change-Id: I886a96ffb94984f1c42de53086e0131922df3260	2013-12-15 19:57:42 -08:00
Frank Galligan	d0ee1fd797	Merge "Add support to pass in external frame buffers."	2013-12-15 19:18:25 -08:00
Frank Galligan	10f891696b	Add support to pass in external frame buffers. VP9 decoder can now use frame buffers passed in by the application. Change-Id: I599527ec85c577f3f5552831d79a693884fafb73	2013-12-15 18:45:46 -08:00
Dmitry Kovalev	4d2d1591a3	Converting mode_lf_lut struct member into static lookup table. Change-Id: I6e6c7cb5ff5b60fbe6a7c314daec5ccdc2cafcc3	2013-12-14 17:42:12 -08:00
Dmitry Kovalev	2aadc06e0d	Yet another vp9_pred_common.c cleanup. Change-Id: I617d6c610d181076773c5c3d6f3dbc6717b02580	2013-12-14 17:39:24 -08:00
Dmitry Kovalev	64cf398713	Merge "Using MV struct instead of int_mv union in encoder."	2013-12-13 16:42:54 -08:00
Dmitry Kovalev	33df4f0483	Merge "vp9_convolve.c cleanup."	2013-12-13 15:40:00 -08:00
Dmitry Kovalev	f54b515797	Merge "Cleaning up vp9_append_sub8x8_mvs_for_idx()."	2013-12-13 15:38:53 -08:00
Dmitry Kovalev	25da21b14e	Using MV struct instead of int_mv union in encoder. Change-Id: I8b81a3e4b4fa530a654c28d9c136afa0c1d379fd	2013-12-13 15:24:48 -08:00
Dmitry Kovalev	466cc94e7a	Getting rid of b_{width, height}_log2 calls in non-420 loop filter. Using num_{4x4, 8x8}_blocks_{wide, high}_lookup instead. Change-Id: I66a7ab807fa57395253b2d0e636c2479fa8c4adf	2013-12-13 12:53:41 -08:00
James Zern	178db94cd6	vp9 asserts: fix compile warning string literal to int within an assert Change-Id: I0c889256b67a078e6e2a79577f0b7ae084243258	2013-12-12 19:49:19 -08:00
Dmitry Kovalev	629fb85f17	vp9_convolve.c cleanup. Making overall logic clearer, moving "hacked" calculation of base filter array pointer to get_filter_base() function. Change-Id: Ibbd38a9f937e48d35bbbfef3ad933ab36664cccb	2013-12-12 11:14:06 -08:00
Deb Mukherjee	7edd5170b5	Merge "Changes interfaces to vp9_get_compressed_data fn"	2013-12-11 15:50:40 -08:00
Dmitry Kovalev	e79103166f	Merge "Renames for consistency in vp9_pred_common.{c, h} files."	2013-12-11 14:30:44 -08:00
Deb Mukherjee	e33855cc47	Changes interfaces to vp9_get_compressed_data fn Silences some lint warnings in previous patches Change-Id: I04bf47ebe7e63a95fd322719a3154e589c115d78	2013-12-11 14:22:51 -08:00
hkuang	9460226acd	Merge "Fix valgrind error."	2013-12-11 13:22:32 -08:00
hkuang	1339f3842c	Fix valgrind error. Temporarily change memcpy to memmove. Change-Id: I700a197bc1ce496be1ddad7118429c5da465b0ca	2013-12-11 13:21:28 -08:00
Dmitry Kovalev	3274fc30ee	Renames for consistency in vp9_pred_common.{c, h} files. Change-Id: Icba06e84ca55c419abbacedf5825eeb394a1b140	2013-12-10 18:31:46 -08:00
Dmitry Kovalev	098d13ba10	Cleaning up vp9_append_sub8x8_mvs_for_idx(). Replacing if-else with switch statement, reordering function arguments. Change-Id: I4825d2ef311ba8999b6d4ceb0eef003587a13434	2013-12-10 17:56:53 -08:00
Dmitry Kovalev	2dd20e468a	Cleaning up skip context calculation. Renames: vp9_get_pred_context_mbskip => vp9_get_skip_context vp9_get_pred_prob_mbskip => vp9_get_skip_prob Change-Id: I2af499848ef73f3f5cd8cdb27852d0bcdfe31d09	2013-12-10 14:11:26 -08:00
Dmitry Kovalev	35b7b0b549	Merge "Removing unused vp9_get_pred_flag_mbskip() function."	2013-12-10 13:58:35 -08:00
hkuang	19bbe41c71	Merge "Refactor inter_predictor function."	2013-12-10 13:34:24 -08:00
Dmitry Kovalev	48088f210d	Removing unused vp9_get_pred_flag_mbskip() function. Change-Id: Ib46a97d8ff9f2915b9fa2abba3cd18b6711fcb0c	2013-12-10 12:53:17 -08:00
Dmitry Kovalev	e18eb7721e	Merge "Renaming comp_pred_mode to reference_mode."	2013-12-10 10:52:34 -08:00
hkuang	6c9dcae532	Refactor inter_predictor function. Change-Id: Ic429b2f16462e926f30efb3af4da3080026359d8	2013-12-10 10:36:44 -08:00
Dmitry Kovalev	d2dad31e79	Merge "Cleaning up vp9_get_pred_context_switchable_interp() function."	2013-12-09 17:34:30 -08:00
hkuang	d70a8c09c6	Merge "Implement on demand border extension. In place extend the border now. Next commit will totally remove the border."	2013-12-09 17:16:31 -08:00
Dmitry Kovalev	9edd4d4db7	Cleaning up vp9_get_pred_context_switchable_interp() function. Change-Id: I67a45a41312ca0efd8fe00ccd8bdc0f97675d09f	2013-12-09 17:02:38 -08:00
hkuang	ff2c96be1f	Implement on demand border extension. In place extend the border now. Next commit will totally remove the border. Change-Id: Ic1e1ca9cc34f81c688715b3948689b47df63a151	2013-12-09 16:44:08 -08:00
Jingning Han	f92b5842bf	Merge "Full range motion search for regular block sizes"	2013-12-09 16:12:35 -08:00
Dmitry Kovalev	08c48ddc01	Renaming comp_pred_mode to reference_mode. Change-Id: I83ffed2b1878a35ac35f07f9ee74309adc9c7b11	2013-12-09 15:13:34 -08:00
Dmitry Kovalev	347df4ce55	Merge "Renaming vp9_get_pred_context_tx_size() function."	2013-12-09 15:10:49 -08:00
Dmitry Kovalev	2c3120274a	Removing max_uv_txsize_lookup lookup table. Adding get_uv_tx_size_impl() with tx size selection logic, rewriting get_uv_tx_size(). Change-Id: I3ecb108059a41be227a8c89a0710bd174f508951	2013-12-09 14:03:23 -08:00
Dmitry Kovalev	a19d694f09	Merge "Removing BLOCK_TYPES and adding PLANE_TYPES constant instead."	2013-12-07 02:20:41 -08:00
Dmitry Kovalev	cb92f4f042	Renaming vp9_get_pred_context_tx_size() function. Change-Id: Ia6d6f4dfb1fd1ec0f8ba53796b59a802e9d7881d	2013-12-06 15:31:06 -08:00
Dmitry Kovalev	b6e5bb27c9	Merge "Renaming reference mode context calculation function."	2013-12-06 14:22:47 -08:00
Jingning Han	b295092b8f	Full range motion search for regular block sizes Add a full range motion search for regular block sizes. This runs an exhaustive search within the given reference area. This commit further optimizes the search process by combining 4-point tests into one pipeline, which gives a 30% speed-up compared to testing each point individually. This full range search serves as a best possible motion search reference. When replacing the diamond search with full range search, the speed 0 runtime of bus CIF at 2000 kbps goes from 153872ms to 623051ms. The compression performance compared to speed 0 setting gains 0.585% for derf set. Change-Id: Ieef1225216b0b86b4ac4872fa7fb9e18bf2eabb3	2013-12-06 12:24:53 -08:00
Dmitry Kovalev	2da30a96d4	Merge "Removing duplicated C code from vp9_loopfilter_filters.c file."	2013-12-06 12:13:24 -08:00
Dmitry Kovalev	63963f51ef	Renaming reference mode context calculation function. Renames: vp9_get_pred_context_comp_inter_inter => vp9_get_reference_mode_context vp9_get_pred_prob_comp_inter_inter => vp9_get_reference_mode_prob Change-Id: I3bbb69481e6b0c848028667c9269f567f293d3bd	2013-12-06 11:23:01 -08:00
Dmitry Kovalev	d6b159d4a6	Removing BLOCK_TYPES and adding PLANE_TYPES constant instead. Change-Id: Ic3bb862e93aedf6a489a33ea6f7e5097d96855ee	2013-12-06 10:54:00 -08:00
Dmitry Kovalev	cf4dfdc8e7	Merge "Moving vp9_tree_probs_from_distribution() to encoder."	2013-12-06 10:18:30 -08:00
Dmitry Kovalev	8eac2ca840	Merge "Renaming constants."	2013-12-06 09:55:02 -08:00
Dmitry Kovalev	5be34ba80f	Merge "vp9_get_pred_context_intra_inter() clean up."	2013-12-06 09:14:36 -08:00
Adrian Grange	de2046275d	Merge "Remove redundant calls to vp9_update_mode_info_border"	2013-12-06 08:59:47 -08:00
Dmitry Kovalev	4ac6a2552b	Moving vp9_tree_probs_from_distribution() to encoder. Writing custom coeff branch count calculation (which is much clearer) in adapt_coef_probs() function. Removing vp9_treecoder.c file. Change-Id: I8880fb7a39996c8bcf6cd0acf9898a8c712ba91f	2013-12-05 18:13:26 -08:00
Dmitry Kovalev	377fa8aff8	Renaming PREV_COEF_CONTEXTS to COEFF_CONTEXTS. Also adding BAND_COEFF_CONTEXTS macro to simplify for loop logic. Change-Id: I12a78a49cf1addf81e6b3fe2a3736ec2b79bd79e	2013-12-05 17:08:06 -08:00
Dmitry Kovalev	6fd71e1b09	vp9_get_pred_context_intra_inter() clean up. Renaming: vp9_get_pred_context_intra_inter => vp9_get_intra_inter_context vp9_get_pred_prob_intra_inter => vp9_get_intra_inter_prob Change-Id: I2c1affea2e84f4e616137c6df82adb11c7845781	2013-12-05 17:01:03 -08:00
Dmitry Kovalev	f7396f3394	Merge "Removing vp9_default_coef_probs.h file."	2013-12-05 16:44:26 -08:00
Dmitry Kovalev	0d4b8d7e43	Renaming constants. NUM_YV12_BUFFERS => FRAME_BUFFERS ALLOWED_REFS_PER_FRAME => REFS_PER_FRAME NUM_REF_FRAMES_LOG2 => REF_FRAMES_LOG2 NUM_REF_FRAMES => REF_FRAMES NUM_FRAME_CONTEXTS_LOG2 => FRAME_CONTEXTS_LOG2 NUM_FRAME_CONTEXTS => FRAME_CONTEXTS Change-Id: I4e1ada08f25d8fa30fdf03aebe1b1c9df0f87e63	2013-12-05 16:23:09 -08:00
Dmitry Kovalev	2b95a05bf6	Removing duplicated C code from vp9_loopfilter_filters.c file. Change-Id: I299b621fca1c8ff5d296afde9698cdcccfecaf3f	2013-12-05 15:49:57 -08:00
Adrian Grange	93d8a3fd29	Remove redundant calls to vp9_update_mode_info_border Removed calls to vp9_update_mode_info_border since they immediately followed code that initialized the entire buffer to 0. Change-Id: Ife06794daa20439a0b607a83a87f88df59afac40	2013-12-05 15:02:32 -08:00
Dmitry Kovalev	6df9ec52a0	Merge "Cleaning up vp9_get_pred_context_tx_size() function."	2013-12-05 09:59:00 -08:00
Tero Rintaluoma	047b0b01bb	Fix show existing frame - Disable mode info update in case where current frame is coded as "show existing frame". - Should fix issue 676. Change-Id: Ibee681850eb307f982da6528d3e31cb94f881c08	2013-12-05 12:10:10 +02:00
Frank Galligan	7ecf3bc91c	Fix ref count decrement code. Buffer 0 would never be decremented, so it could only be used once. Change-Id: I605d99fa2a513eadae6a0e230161729880653282	2013-12-04 22:21:00 -08:00
Dmitry Kovalev	5eeffc9fc5	Cleaning up vp9_get_pred_context_tx_size() function. Change-Id: Ia6ef876e3d1e66b2182a9c0bce3fd758691cd381	2013-12-04 21:35:30 -08:00
Dmitry Kovalev	a1123538a5	Moving vp9_token from common to encoder. Change-Id: I40a070c353663e82c59e174d7c92eb84f72ed808	2013-12-04 19:36:58 -08:00
Frank Galligan	8363349b84	Merge "Fix the initial references to frame buffers."	2013-12-04 19:26:40 -08:00
Dmitry Kovalev	4afd141a05	Removing vp9_default_coef_probs.h file. Moving all probability tables from removed file to vp9_entropy.c. Change-Id: I12846f1da778c3016d96b82e53384d4634883430	2013-12-04 17:04:35 -08:00
Dmitry Kovalev	cf8e3d2c5c	Merge "Cleaning up vp9_dec_build_inter_predictors_sb function."	2013-12-04 16:57:54 -08:00
Frank Galligan	9ed616a56c	Fix the initial references to frame buffers. The old code would start in a mixed state, where all the reference frames were pointing to frame buffer 0, but the reference counts were 0. This is why we needed special code for the first frame. Change-Id: I734961012917654ff8c0c8b317aac00ab75ded1a	2013-12-04 16:53:18 -08:00
Dmitry Kovalev	3712b58c2f	Merge "Cleaning up vp9_entropy.h file."	2013-12-04 16:46:41 -08:00
Dmitry Kovalev	c6ca5c5ad9	Compact formatting default_coef_probs_{4x4, 8x8, 16x16, 32x32}. Change-Id: If40b930431766d5179b9769509b5e4ca1628e9cc	2013-12-04 15:45:28 -08:00
Dmitry Kovalev	da2da79012	Merge "Formatting vp9_pareto8_full array."	2013-12-04 12:22:50 -08:00
Dmitry Kovalev	beb35aba19	Cleaning up vp9_dec_build_inter_predictors_sb function. Using get_plane_block_size() instead of manipulation with subsampling values, calculating all required values only once without redundant calls to b_width_log2(). Change-Id: I00303f2a0926f9c4cb17f34591adda60615f8919	2013-12-04 12:11:01 -08:00
Yunqing Wang	f6582d6928	Revert "Simplify mask checking in loop filters" Jingning saw bitstream change with this patch. It could be true that (mask_16x16_0 & 1) is 1, but (mask_16x16_1 & 1) is 0 in some edge cases. This reverts commit `8f05e70340`. Change-Id: I0a529435ce816a1e14653eb510d5090de276070a	2013-12-04 11:31:19 -08:00
Dmitry Kovalev	1470789927	Merge "Moving eob array to the encoder."	2013-12-04 10:58:02 -08:00
Yunqing Wang	920a074e89	Merge "Improve idct16x16: _256_add_sse2(x1.107)&_10_add_sse2(x1.012)"	2013-12-04 08:50:51 -08:00
Dmitry Kovalev	ff6d6a9f07	Formatting vp9_pareto8_full array. Change-Id: Ic7f47a8d233daf5e61e82092865837ea4eda4095	2013-12-03 18:49:19 -08:00
Dmitry Kovalev	f00d157c12	Moving eob array to the encoder. In the decoder we don't need to save eobs, we can pass eob as an argument. That's why removing eob arrays from VP9Decompressor and TileWorkerData, and moving eob pointer from macroblockd_plane to macroblock_plane. Change-Id: I8eb919acc837acfb3abdd8319af63d1bbca8217a	2013-12-03 17:59:32 -08:00
Dmitry Kovalev	8e89e2f2e0	Cleaning up vp9_entropy.h file. Renaming constants for consistency: DCT_VAL_CATEGORY1 => CATEGORY1_TOKEN DCT_VAL_CATEGORY2 => CATEGORY2_TOKEN DCT_VAL_CATEGORY3 => CATEGORY3_TOKEN DCT_VAL_CATEGORY4 => CATEGORY4_TOKEN DCT_VAL_CATEGORY5 => CATEGORY5_TOKEN DCT_VAL_CATEGORY6 => CATEGORY6_TOKEN DCT_EOB_TOKEN => EOB_TOKEN DCT_EOB_MODEL_TOKEN => EOB_MODEL_TOKEN MAX_ENTROPY_TOKENS => ENTROPY_TOKENS Moving constants: INTER_MODE_CONTEXTS from vp9_entropy.h to vp9_blockd.h. EOSB_TOKEN from vp9_entropy.h to vp9_tokenize.h Change-Id: I5fcbf081318e1d365792b6d290a930c6cb0f3fc2	2013-12-03 17:23:03 -08:00
Dmitry Kovalev	09577b8c8d	Merge "Removing dummy assignments."	2013-12-03 10:59:34 -08:00
Abo Talib Mahfoodh	e4419ab691	Improve idct16x16: _256_add_sse2(x1.107)&_10_add_sse2(x1.012) The performance gain of idct16x16_10_add_sse2 function is not noticeable. However since both functions use the IDCT16_1D, idct16x16_10_add_sse2 should be modified as well. Tested with: park_joy_420_720p50.y4m Change-Id: I02b957e36fcf997c677d15baf496533895271bff	2013-12-02 21:08:56 -05:00
Yunqing Wang	8f182a1cac	Merge "improve vp9_idct32x32_34(x1.472)&1024(x1.032)_add_sse2"	2013-12-02 15:10:05 -08:00
Yunqing Wang	37e68aba55	Merge "Simplify mask checking in loop filters"	2013-12-02 12:06:26 -08:00
Dmitry Kovalev	862c22cf7d	Merge "Moving token-encoding related stuff from common to encoder."	2013-12-02 10:32:04 -08:00
Yunqing Wang	8f05e70340	Simplify mask checking in loop filters Considering a horizontal edge, if mask_16x16 is 1 for an even-indexed 8x8 block, then mask_16x16 is 1 for the next 8x8 block in the same row. Similarly, for a vertical edge, if mask_16x16 is 1 for an even-rowed 8x8 block, then mask_16x16 is 1 for the 8x8 block right below it in the next row. Based on that, the mask_16x16 checking can be simplified to save cycles. The corresponding 8-pixel vp9_mb_lpf_horizontal_edge code can also be removed. Change-Id: Ic3fe7a5674322239208cbe2731dc3216ce2084f3	2013-11-27 14:10:57 -08:00
Dmitry Kovalev	d83d61d942	Moving raster_block_offset{,_int16} from vp9_blockd.h to vp9_rdopt.h. Change-Id: I5a5888d4639cc6b7eb266be47581dd15ba08c91e	2013-11-27 12:57:21 -08:00
Dmitry Kovalev	f9da823216	Moving token-encoding related stuff from common to encoder. Change-Id: I0e59d320407b3bed0ba3622a7b29975f6fad7ebf	2013-11-27 11:27:57 -08:00
Dmitry Kovalev	e2f1d02eb3	Merge "Moving mode encodings from common to encoder + cleanup."	2013-11-27 11:00:54 -08:00
Yaowu Xu	e9c19617bf	Merge "vp9_short_fdct32x32_rd vp9_short_fdct32x32 optimized for AVX2"	2013-11-27 10:27:32 -08:00
Dmitry Kovalev	d3a2e55af4	Removing qcoeff buffers from the decoder. We only need qcoeff buffers in the encoder. Reducing TileWorkerData struct and VP9Decompressor struct sizes by 24K. Change-Id: Id148868461f7ffa3d3dd634b371503ae9c57e207	2013-11-26 18:52:10 -08:00
Dmitry Kovalev	fc3c3303f1	Removing dummy assignments. Change-Id: I10d1a4bcac751a982d9dd135f019e3a4d92f8522	2013-11-26 15:35:11 -08:00
Dmitry Kovalev	f4bf712fbb	Moving mode encodings from common to encoder + cleanup. Change-Id: I248ccb1532e2cd95314d0b95108f2c2e71cf084f	2013-11-26 14:53:17 -08:00
Yaowu Xu	b60293e1ce	Merge "Amended some comments for clarity"	2013-11-26 14:32:02 -08:00
Frank Galligan	b4874e2c82	Fix 16 wide neon horz loopfilter. Multiply by 3 was on 8bit vectors when it should have been on 16bit vectors. Change-Id: I248c1429b3134dfd171dfab0ebb109fd2437e1fc	2013-11-26 10:02:40 -08:00
Yunqing Wang	7a5fd6a1bf	Merge "Do vertical loopfiltering in parallel"	2013-11-26 09:35:14 -08:00
Abo Talib Mahfoodh	f97d91ab67	improve vp9_idct32x32_34(x1.472)&1024(x1.032)_add_sse2 vp9_idct32x32_34_add_sse2: speedup: 1.472 IDCT32_1D_34 and MULTIPLICATION_AND_ADD_2 are optimized based on the fact that only the upper-left 8x8 has non-zero values. vp9_idct32x32_1024_add_sse2: speedup: 1.032 Tested with: park_joy_420_720p50.y4m Change-Id: I8670ce547552b48695049de298e2fc46ce28dfbc	2013-11-26 12:28:26 -05:00
Dmitry Kovalev	5488da280d	Merge "Moving mv entropy encodings calculation to the encoder side."	2013-11-25 19:15:21 -08:00
Dmitry Kovalev	56d048c412	Moving mv entropy encodings calculation to the encoder side. Moved arrays: vp9_mv_joint_encodings vp9_mv_class_encodings vp9_mv_class0_encodings vp9_mv_fp_encodings Change-Id: Iaf5008c579fcbd6d77fdd81d1aef8c71b5f308b7	2013-11-25 16:36:28 -08:00
Dmitry Kovalev	7ba7a5f817	Merge "Removing redundant call of vp9_init_mbmode_probs()."	2013-11-25 16:08:42 -08:00
Dmitry Kovalev	cfc1f91c9f	Merge "Moving {left, right}_block_mode to vp9_blockd.h."	2013-11-25 10:59:24 -08:00
Dmitry Kovalev	e8af3db88a	Merge "Renaming COMPPREDMODE_TYPE enum and its members."	2013-11-25 10:59:08 -08:00
Yaowu Xu	dd69337e6e	Amended some comments for clarity Change-Id: I31c3908ba394095deb5d3a5d7b7c9b2b5328c3e8	2013-11-25 10:55:01 -08:00
Yaowu Xu	cc1e05ca5f	Merge "In frame Q adjustment experiment."	2013-11-25 10:52:22 -08:00
Jingning Han	f547fb8e07	Merge "Use separate inter predictors for enc/dec"	2013-11-25 10:29:07 -08:00
Paul Wilkins	644bd87e8e	In frame Q adjustment experiment. The idea here is to allow "in frame" adjustment of the final Q value used to encode each SB64, using segmentation. There is also adjustment of the rd mult in regions of overspend. Activated using aq_mode=2 Change-Id: I2f140cd898c9f877c32cd6d2e667f5e11ada4b1c	2013-11-25 10:22:55 -08:00
Yaowu Xu	3183135dd3	Merge "Fix a build issue with visual c."	2013-11-25 10:20:53 -08:00
Jingning Han	ba8b5e8d6d	Use separate inter predictors for enc/dec The decoder will construct inter predictor using lazy border extension, while the encoder, going with multiple runs of motion search in the rate-distortion optimization loop for each block, does border extension at frame level. This commit separates the inter predictors for the encoder and the decoder. Change-Id: Ieca2fecba3a7201a6d64ef9f219e5d91e50559c3	2013-11-25 09:43:34 -08:00
Jingning Han	12e5ec6aa8	Merge "Separate setup_scale_factor/extend_frame_borders"	2013-11-25 09:14:46 -08:00
Yaowu Xu	86368faca9	Fix a build issue with visual c. Change-Id: Ic8fc16ee1734cfde0d12a2e3abb3e9299382f3b1	2013-11-25 08:11:35 -08:00
Dmitry Kovalev	9fe88870c5	Merge "Cleaning up vp9_append_sub8x8_mvs_for_idx."	2013-11-24 16:08:20 -08:00
Dmitry Kovalev	52b43a2876	Inlining and removing vp9_set_pred_flag_seg_id() function. Change-Id: I0fd76937e847f78378a7ab3fa0af00a7c2c52b42	2013-11-22 17:32:11 -08:00
Dmitry Kovalev	fb9c19c62d	Renaming COMPPREDMODE_TYPE enum and its members. List of renames: COMPPREDMODE_TYPE => REFERENCE_MODE SINGLE_PREDICTION_ONLY => SINGLE_REFERENCE COMP_PREDICTION_ONLY => COMPOUND_REFERENCE HYBRID_PREDICTION => REFERENCE_MODE_SELECT (like TX_MODE_SELECT) NB_PREDICTION_TYPES => REFERENCE_MODES Change-Id: If723dabe9435325d0165dcd028142a2c78b417b4	2013-11-22 16:35:37 -08:00
Dmitry Kovalev	350731e8f9	Organizing all scan tables into lookup table. Change-Id: Ie829ee58a55157e6972c63cebe69a5d0a3221349	2013-11-22 16:20:45 -08:00
Dmitry Kovalev	52fa10a9a3	Cleaning up vp9_append_sub8x8_mvs_for_idx. Change-Id: Ic92f15d82ff5cfa3df655d08e460335c2ef8a325	2013-11-22 15:28:32 -08:00
Jingning Han	86d2a9b978	Separate setup_scale_factor/extend_frame_borders This commit takes out vp9_extend_frame_borders from vp9_setup_scale_factors. The refactoring is for the preparation of the use of lazy border extension at decoder. This makes it necessary to handle border extension separately at encoder/decoder. The use of vp9_extend_frame_borders will be removed, when lazy border extension is ready. Change-Id: Ia3baba3d179d5f11eee1634f19b3b319d2a59186	2013-11-22 12:02:08 -08:00
Dmitry Kovalev	e0ec61187e	Merge "Removing txfrm_block_to_raster_xy() call from extend_for_intra()."	2013-11-22 10:51:38 -08:00
Yunqing Wang	ed36720b66	Do vertical loopfiltering in parallel This patch followed "Add filter_selectively_vert_row2 to enable parallel loopfiltering" commit, and added x86 SSE2 optimization to do 16-pixel filtering in parallel. For other optimizations (neon and dspr2), current 16-pixel functions were done by calling 8-pixel functions twice, and real 16-pixel functions could be added later. Decoder speedup: tulip clip: 2% speed gain; old_town_cross: 1.2% speed gain; bus: 2% speed gain. Change-Id: I4818a0c72f84b34f5fe678e496cf4a10238574b7	2013-11-22 10:04:51 -08:00
Dmitry Kovalev	7c8cac3c21	Removing txfrm_block_to_raster_xy() call from extend_for_intra(). Change-Id: I6a48d1f35ed5fe7a2c7499675b339994c9c3bdf2	2013-11-21 19:30:58 -08:00
Dmitry Kovalev	ad3333e2cd	Merge "Removing plane_block_{width, height} functions."	2013-11-21 16:37:27 -08:00
levytamar82	8def766de2	vp9_short_fdct32x32_rd vp9_short_fdct32x32 optimized for AVX2 Change-Id: I6366e84490883b72362f762369d7e5bccb64f02f	2013-11-21 14:19:49 -08:00
Frank Galligan	97d1258375	Revert "Add 16 wide neon horz loopfilter." The change caused mismatches with some test vectors on neon. Original CL: https://gerrit.chromium.org/gerrit/#/c/67863/ Change-Id: I913891636d53783e93cb1865ca78ded1821dc4b0	2013-11-21 14:01:33 -08:00
Dmitry Kovalev	4896d5c7ef	Moving {left, right}_block_mode to vp9_blockd.h. Both functions have no relation to motion vectors, so moving them from vp9_findnearmv.h to vp9_blockd.h. Change-Id: I74f524267886ab0fff4a2da793a10c906ed0f43a	2013-11-21 11:43:53 -08:00
Yunqing Wang	e002bb99a8	Merge "Add filter_selectively_vert_row2 to enable parallel loopfiltering"	2013-11-21 11:25:55 -08:00
hkuang	370bf116a2	Merge "Remove unnecessary eob checking."	2013-11-21 11:24:02 -08:00
Frank Galligan	2dd77580c0	Merge "Add 16 wide neon horz loopfilter."	2013-11-21 10:29:30 -08:00
Yunqing Wang	b5e6d6cccf	Add filter_selectively_vert_row2 to enable parallel loopfiltering Added filter_selectively_vert_row2 to be ready for parallel loopfiltering in vertical direction. This change did 2-row filtering at a time. If 2 vertically adjacent 8x8 blocks do same type of filtering, we can do 16-pixel filtering in parallel. Next, we need to provide 16-pixel loopfiltering functions in c and optimized versions for codec speedup. Change-Id: Idf97bbdd70566e55bd30e1fd25cb8544e33291be	2013-11-21 09:53:15 -08:00
Yunqing Wang	6c4964602a	Merge "Correct ssse3 8/16-pixel wide sub-pixel filter calculation"	2013-11-21 09:40:02 -08:00
Frank Galligan	98de15137e	Add 16 wide neon horz loopfilter. Add support to do 16 pixel horizontal filtering in Neon. Nexus devices saw about 0.5% decode speed increase. Change-Id: I2993f6c2d49f31fa74976879eeaa289fd3f4e15d	2013-11-21 09:39:36 -08:00
Dmitry Kovalev	c90b6bb101	Removing redundant call of vp9_init_mbmode_probs(). This function is called from vp9_setup_past_independence() which is called before the modified piece of code. Moving reset of inter_mode_probs into vp9_init_mbmode_probs() for consistency. Change-Id: Ib188e8798e1fbe15407fd501406761b746fdda95	2013-11-20 21:56:38 -08:00
Dmitry Kovalev	a218a96784	Merge "Adding MV_FP_SIZE constant."	2013-11-20 14:39:58 -08:00
Yunqing Wang	256cf7ee7d	Correct ssse3 8/16-pixel wide sub-pixel filter calculation Although no mismatch was indicated for 8/16 wide sub-pixel filters in issue 661, they had similar problems that could cause mismatch potentially. This patch fixed calculations in HORIZx8/16 and VERTx8/16. Change-Id: I169961c9d40a20340995b7d22aafc89ccf30bfca	2013-11-20 12:52:56 -08:00
Dmitry Kovalev	79b5a2b142	Removing plane_block_{width, height} functions. Change-Id: I29c0dfcf41a1253d5e2a0d2ff740c0c38ebaa5a2	2013-11-20 12:39:29 -08:00
Jim Bankoski	302c33e49f	Merge "Clean up removal of vp9_pareto8 table."	2013-11-20 12:30:03 -08:00
Dmitry Kovalev	4956fcd31b	Adding MV_FP_SIZE constant. Change-Id: I98d750ee92ff51fb714980418ea28be3b1d0f3c6	2013-11-20 12:07:57 -08:00
hkuang	6debc446e0	Remove unnecessary eob checking. Change-Id: Ia568f70bddc1a2b62141a0197459119ca74c22b5	2013-11-20 11:58:11 -08:00
Jim Bankoski	25aae73a30	Merge "remove the model and copy in pack_mb_tokens"	2013-11-20 11:34:30 -08:00
Jim Bankoski	5bbb0c6295	Clean up removal of vp9_pareto8 table. Change-Id: I5556e8d1fc150be8a3e93af21900829b59a500dc	2013-11-20 11:17:26 -08:00
Jingning Han	81b9fd4310	Merge "Take out assertion from inverse transforms"	2013-11-20 10:55:27 -08:00
Jim Bankoski	03276bf6e6	remove the model and copy in pack_mb_tokens Change-Id: I00a5203c8ed76c184d936fccf93d76e7c06773d3	2013-11-20 10:06:04 -08:00
Yunqing Wang	0ef63f596d	Fix stack pointer in sub-pixel filters In commit "3d50da5397d20abc932d81453b26cde758293a40", the stack pointer was modified while aligning the stack, and it needed to be popped at the end. Change-Id: I062971e195f1f2ab9d0ab5fb84dcf215a0fcaa67	2013-11-20 09:42:44 -08:00
Guillaume Martres	b00057c88a	Merge "vpxenc: add --aq-mode flag to control adaptive quantization"	2013-11-20 08:13:28 -08:00
Jim Bankoski	7a8a68e2bd	Merge "scan order table lookup same for encoder and decoder"	2013-11-19 16:22:48 -08:00
Yunqing Wang	e8f8e77642	Merge "Fix decoder mismatch with ssse3 enabled"	2013-11-19 16:19:32 -08:00
Yaowu Xu	dd04ff506b	Merge "Move vp9_setup_interp_filter() to encoder"	2013-11-19 16:01:19 -08:00
Jim Bankoski	d6667dd54f	scan order table lookup same for encoder and decoder Change-Id: I473947b5ca70b7a81151926284bff86f8555492a	2013-11-19 15:31:43 -08:00
Yunqing Wang	3d50da5397	Fix decoder mismatch with ssse3 enabled This patch fixed issue 661: "Decoder produces mismatched outputs with ssse3 enabled and disabled." In sub-pixel filters, a pixel value was multiplied by a filter coefficient, and the results were added up. The order of adding up these multiplications had to be arranged carefully to prevent incorrect overflowing. Change-Id: Id08af4200fea9e1b896fc40157b8651c2c7e80f2	2013-11-19 15:10:04 -08:00
Dmitry Kovalev	65cee2f01a	Merge "Simplifying partition context calculation."	2013-11-19 15:09:01 -08:00
Jim Bankoski	60aba6558f	Merge "entropy code speedup"	2013-11-19 14:58:44 -08:00
Yaowu Xu	df78fea166	Move vp9_setup_interp_filter() to encoder As it is used in encoder only. Change-Id: I5f2a8abbe72bb18cbf6ce36a3dc7e132aeae8ec2	2013-11-19 14:57:58 -08:00
Yaowu Xu	f92cfa1ca6	Merge "Move vp9_sadmxn.h from common to encoder"	2013-11-19 14:41:33 -08:00
Jim Bankoski	8cf352abac	entropy code speedup Change-Id: Ic316d3374ff9a2b43897272260947d56765a0fdd	2013-11-19 14:31:38 -08:00
Jim Bankoski	ff4f1c4b76	scan order / neighbors converted to lookup Change-Id: I64b189dfeee1cf3e90134a1a93497072f3361e5e	2013-11-19 12:55:44 -08:00
Yaowu Xu	30b03050a2	Move vp9_sadmxn.h from common to encoder Change-Id: I6f6ba91b1b8b280902b171472314d665aa0baf0b	2013-11-19 12:46:08 -08:00
Dmitry Kovalev	f6ec323906	Simplifying partition context calculation. Reversing bit order of partition_context_lookup, and modifying accordingly update_partition_context() and partition_plane_context(). Change-Id: I64a11f1a94962a3bf217de2f50698cb781db71a5	2013-11-19 11:17:30 -08:00
Yunqing Wang	f16fb829e6	Merge "Improve vp9_iht4x4_16_add_sse2 (x1.341)"	2013-11-19 11:11:47 -08:00
Dmitry Kovalev	953b1e9683	Removing raster_block_offset_uint8() function. There is no need to use that function, it is much clearer to pass offset directly to the buffer. Change-Id: I9026cb0c5094c46f97df5d7f7daeb952f2843b24	2013-11-18 19:00:49 -08:00
Dmitry Kovalev	9e1e7bee48	Merge "Finally removing txfrm_block_to_raster_block() function."	2013-11-18 18:43:16 -08:00
Dmitry Kovalev	220af9ac2c	Merge "Cleaning up vp9_entropy.c file."	2013-11-18 18:04:56 -08:00
Abo Talib Mahfoodh	613e2d2e90	Improve vp9_iht4x4_16_add_sse2 (x1.341) This rebase is a better implementation of the previous ones. Modifications are done to reduce the total clock cycle. Speedup: 1.341 Compiled with -O3 Tested with: park_joy_420_720p50.y4m Change-Id: I940eaf283f60597ca0d9d2e13d518878d55ff02d	2013-11-18 20:53:13 -05:00
Dmitry Kovalev	d8c06d23da	Cleaning up vp9_entropy.c file. Change-Id: I568f5e2d4ef2f2affe013ba1691ffb546f1fe8c6	2013-11-18 17:18:14 -08:00
Yaowu Xu	a42ab027fd	Merge "Move vp9_extend.{h,c} from common to encoder"	2013-11-18 15:43:32 -08:00
Yaowu Xu	1c61e1960d	Move vp9_extend.{h,c} from common to encoder Since they are used in the encoder only. This commit also reorders includes for the files that include vp9_extend.h Change-Id: I929fc113f2135d3198cd1fc6a17434e5a2f8a459	2013-11-18 12:43:36 -08:00
Yunqing Wang	e3168b0c54	Merge "Do horizontal loopfiltering in parallel"	2013-11-18 10:03:41 -08:00
Jim Bankoski	83eb1975df	partition context update speedup This removes a lot of operations in setting partition context... Change-Id: I365e6f5607ece85190cb21443988816dfa510ce3	2013-11-17 06:58:08 -08:00
Yunqing Wang	64f728caef	Do horizontal loopfiltering in parallel This patch followed "Rewrite filter_selectively_horiz for parallel loopfiltering" commit, and added x86 SSE2 optimization to do 16-pixel filtering in parallel. Also, corrected the declaration of aligned arrays. For 8-pixel-in-parallel case, improved the calculation of the masks and filters. Updated the threshold loading since the thresholds were already duplicated. Updated neon C functions to call neon loopfilters twice. Using tulip clip, tests showed it gave a ~1.5% decoder speed gain. Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35	2013-11-15 16:18:43 -08:00
Jingning Han	bdc4371174	Take out assertion from inverse transforms Separate the rounding and right shift operations of forward transform from those of inverse transform. Take out the assertion check from inverse transforms. If the transform coefficients were constructed to cause intermediate steps of inverse transform overflow, the codec will just let it overflow without breaking the decoding flow. Change-Id: I73cfc3706c4e840fc543a77cbc4cdb0b05d07730	2013-11-15 15:30:47 -08:00
hkuang	7424492a0b	Let the idct vp9_idct32x32_34_add = vp9_idct32x32_1024_add on arm until we implement real vp9_idct32x32_34_add_neon. This issue is due to commit `47665452f0` Merge "Add 32x32 idct function for eob<=34 case". Change-Id: I56b5f0abc20e7dd1bba521f78a995e85d65ea296	2013-11-15 14:59:16 -08:00
Guillaume Martres	17084657e6	vpxenc: add --aq-mode flag to control adaptive quantization Change-Id: I57e1ad4bed3487df12893ced77c49093f8755706	2013-11-15 19:42:20 +01:00
Dmitry Kovalev	8d7bd4d126	Merge "Cleaning up vp9_loopfilter.c file."	2013-11-15 10:10:59 -08:00
Jingning Han	a9b9f22bcd	Merge "Fix coding format in vp9_idct"	2013-11-15 08:59:14 -08:00
Jim Bankoski	e1b6c42eed	partition plane context speed up Removes silly operations inside loop. Change-Id: I9eeab1e914e715a887f86cf1089de508e2364165	2013-11-15 08:00:43 -08:00
Jim Bankoski	ffb17e2c09	Merge "loop filter assert cleanout"	2013-11-15 07:48:36 -08:00
Dmitry Kovalev	38e6cb8c7b	Merge "Cleaning up vp9_tile_common.{h, c} files."	2013-11-14 20:55:01 -08:00
Jingning Han	7637387cf1	Fix coding format in vp9_idct Change-Id: If97ae16a4478717933345b6b9d5bc1b417b8dd84	2013-11-14 16:05:22 -08:00
Adrian Grange	38144ed8b2	fix scaling bug by buffer auto-reallocation Change-Id: Ib748eb287520c794631697204da6ebe19523ce95	2013-11-14 15:53:09 -08:00
Dmitry Kovalev	3f9fc6f6f8	Cleaning up vp9_loopfilter.c file. Change-Id: Ic6770072f80dfb54d2725ed96370d4f243a9f474	2013-11-14 15:04:14 -08:00
Dmitry Kovalev	49fbbf72fa	Finally removing txfrm_block_to_raster_block() function. We only use txfrm_block_to_raster_xy() now. Change-Id: I4242cd592da99e761041acf9fef1bac3d55a48e1	2013-11-14 13:45:51 -08:00
Dmitry Kovalev	f91ac9b436	Cleaning up vp9_tile_common.{h, c} files. Change-Id: I9d18f351abe7614107f34f47eeb38a234a9937c9	2013-11-14 13:40:56 -08:00
Jim Bankoski	ef99b7b884	loop filter assert cleanout Change-Id: I4e2ad4b7342681e6ac236356ef3a4927a54f105b	2013-11-14 12:25:32 -08:00
Deb Mukherjee	cfcd5c4f61	Simplifies band-getting with a static array Simplifies the code by implementing band mapping with static arrays. A lot of the code complexity introduced in a previous patch disappears. Change-Id: Ia3fac36e594fb5ad2d55ae141c58bba4c55c2d28	2013-11-13 22:15:16 -08:00
Dmitry Kovalev	26a1ad604f	Merge "Removing function pointers from inter prediction."	2013-11-13 13:54:15 -08:00
Dmitry Kovalev	60d1a52995	Merge "Optimizing set_contexts() function."	2013-11-13 10:01:05 -08:00
Yunqing Wang	8ce0967df8	Merge "Use 1D array to store super block filter levels"	2013-11-13 09:40:14 -08:00
Johann	4da2a8b718	Merge "mips dsp-ase r2 vp9 decoder intra module optimizations (rebase)"	2013-11-13 09:00:09 -08:00
Parag Salasakar	1530a6b77f	mips dsp-ase r2 vp9 decoder intra module optimizations (rebase) Change-Id: Ib27fc4f3dbe01fe8adfa04a61aaba21b3480e75c	2013-11-13 11:17:14 +05:30
Parag Salasakar	248cf6f69f	mips dsp-ase r2 vp9 decoder loopfilter module optimizations (rebase) Change-Id: Ia7f640ca395e8deaac5986f19d11ab18d85eec2d	2013-11-13 10:53:16 +05:30
Dmitry Kovalev	3f3d14e1d3	Moving q_index from MACROBLOCKD to MACROBLOCK. Moving because q_index is used only by encoder. Change-Id: I0b96175614ed4fd3d76ee56a0ba36258e1e896f6	2013-11-12 18:13:19 -08:00
Dmitry Kovalev	73a5cbeba4	Merge "Using max_tx_size instead of bsize when possible."	2013-11-12 16:54:30 -08:00
Dmitry Kovalev	3a2ea76469	Merge "Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK."	2013-11-12 15:59:28 -08:00
Dmitry Kovalev	58b004ff64	Merge "Adding const to tree pointer inside vp9_extra_bit struct."	2013-11-12 15:48:07 -08:00
Johann	8dd3905163	Merge "Added optimized vp9_idct32x32_34_add_dspr2"	2013-11-12 15:30:00 -08:00
Dmitry Kovalev	20f34ff0db	Adding const to tree pointer inside vp9_extra_bit struct. Change-Id: I60e02fa3de930ff1f969687ab5af93dee40d86ad	2013-11-12 14:21:15 -08:00
Yunqing Wang	ce89309b45	Use 1D array to store super block filter levels As Jim suggested, a 1D array is used to store filter levels instead of a 2D array. This uses shift_y in setup_mask directly and saves a few cycles. Change-Id: If61ab298784861f1806b1cd396d4e4e2e0f097b9	2013-11-12 12:07:57 -08:00
Deb Mukherjee	a33a84b11a	Merge "Removes conditional statements from band getting"	2013-11-12 11:22:21 -08:00
Johann	e72d49a97a	Use lowercase 'b' to branch iOS doesn't recognize B: bad instruction `B idct32_pass_loop' Change-Id: I3cf6aede4639f1d9efa97f7962fa287ba6feaaef	2013-11-12 10:41:06 -08:00
Yunqing Wang	17322275dd	Merge "Rewrite filter_selectively_horiz for parallel loopfiltering"	2013-11-12 10:20:49 -08:00
Yunqing Wang	7989768766	Merge "Improve loopfilter function"	2013-11-12 10:19:56 -08:00
Deb Mukherjee	5ade423774	Removes conditional statements from band getting Implements scan order to band map with arrays in both the encoder and decoder to remove conditional statements. Encoding seems to be about 1% faster at speed 0, tested on football. Decoding seems to be about 0.5-1% faster on a set of 25 videos. Change-Id: Idb233ca0b9e0efd790e30880642e8717e1c5c8dd	2013-11-12 10:13:27 -08:00
Dmitry Kovalev	50f97cf7fb	Removing function pointers from inter prediction. Removing foreach_predicted_block_visitor and calling build_inter_predictors directly. Change-Id: I11bb3c872b99b47c2680b01b0dbcc01c558c4a2b	2013-11-11 18:37:00 -08:00
Yunqing Wang	b45438181c	Rewrite filter_selectively_horiz for parallel loopfiltering Added loop filter mask checking, and made the caller function ready for implementation of parallel loopfiltering in horizontal direction. Next, we need to go through the loopfilter functions (both c and optimized versions), and provide 16-byte wide loopfiltering for each filter type. Change-Id: Ifef47e7ef9086ebc2fd6ca7ede8f27c9bbf79e66	2013-11-11 17:06:01 -08:00
Dmitry Kovalev	3551e25099	Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK. We use {sb, mb, b, ab}_index only inside encoder, so moving them into appropriate data structure. Change-Id: Ib5c1036716354d9d321e11a60c1634c1cb8f9716	2013-11-11 15:58:57 -08:00
Jingning Han	d8b4c79270	Decouple macroblockd_plane buffer usage Make the macroblockd_plane contain dynamic buffer pointers instead of static pointers to the memory space allocated therein. The decoder uses the buffer allocated in pbi, while the encoder will use a dual buffer approach for rate-distortion optimization search. Change-Id: Ie6f24be2dcda35df7c15b4014e5ccf236fb3f76c	2013-11-11 15:26:10 -08:00
hkuang	c689a126ed	Fix a bug in the assembly code. Change-Id: Ic416e3f8a11e82ee298e6f709b2119a9ddf1e2f8	2013-11-11 12:49:12 -08:00
Dmitry Kovalev	c53a9c70fb	Merge "Localizing NEARESTMV special cases in the code."	2013-11-11 11:12:06 -08:00
Dmitry Kovalev	22a001988b	Optimizing set_contexts() function. Inlining set_contexts_on_border() into set_contexts(). The only difference is the additional check that "has_eob != 0" in addition to "xd->mb_to_right_edge < 0" and "xd->mb_to_bottom_edge < 0". If has_eob == 0 then memset does the right thing and works faster. Change-Id: I5206f767d729f758b14c667592b7034df4837d0e	2013-11-08 12:44:56 -08:00
Yunqing Wang	e731b2ba2c	Merge "Improve vp9_idct4x4_1_add_sse2"	2013-11-08 12:00:36 -08:00
Yunqing Wang	49cf335e7f	Improve loopfilter function This patch continues the work done in "Rewrite loop_filter_info_n struct" (commit: 00dbd369c70270428d56da6d15ea5486fc821c52) to further improve the loopfilter function. 1. Instead of storing pointers to thresholds, store loopfilter levels within the 64x64 SB; 2. Since loopfilter levels are already calculated in setup_mask, we don't need to call build_lfi to look them up again. Just save the loopfilter levels in setup_mask. 3. Reorganized and simplified filter_block_plane(). Tests showed a ~0.8% decoder speedup. Change-Id: I723c7779738bbc2afcb9afa2c6f78580ee6c3af7	2013-11-08 11:48:31 -08:00
hkuang	a6462990e6	Merge "Add back vp9_short_idct32x32_1_add_neon which is deleted in cleanup I63df79a13cf62aa2c9360a7a26933c100f9ebda3."	2013-11-07 14:42:29 -08:00
Ivan Maltz	741c14fcf0	Merge "Move SVC per-frame loop from sample app into libvpx proper"	2013-11-06 17:24:05 -08:00
Ivan Maltz	1ed0e1beb5	Move SVC per-frame loop from sample app into libvpx proper SVC multiple layer per frame encoding is invoked with vpx_svc_init and vpx_svc_encode. These interfaces are designed to be invoked from ffmpeg. Additional improvements: - make dummy frame handling a bit more explicit - fixed bug with single layer encodes - track individual frame sizes and psnrs instead of averages - parameterized quantizer, 16th scalefactors, more logging, - enabled single layer encodes to generate baseline - include new mode for 3 layer I frame with 5 total layers Change-Id: I46cfa600d102e208c6af8acd6132e0cc25cda8d4	2013-11-06 14:49:27 -08:00
Dmitry Kovalev	7b011c5467	Replacing mi_{width,height}_log2 with num_8x8_blocks_{wide,high}_lookup. Change-Id: I04c55daef89bca2b85cb7db0850f9b052abc5a7c	2013-11-06 13:34:23 -08:00
Yaowu Xu	2f4bade348	Merge "Missing _ means no sse3 for vp9_h_predictor_32x32."	2013-11-06 13:04:28 -08:00
Paul Wilkins	0c39318a8b	Missing _ means no sse3 for vp9_h_predictor_32x32. Error in script means vp9_h_predictor_32x32 sse3 version is not enabled. Change-Id: Ia43672740da1ecdfb7fcd420490ef424b04accc4	2013-11-06 13:57:55 +00:00
Dmitry Kovalev	4a96e64dc2	Using max_tx_size instead of bsize when possible. Change-Id: I246364bc4270ca13aefb4bc3445bcf102b3170dc	2013-11-05 17:36:43 -08:00
hkuang	6b16f63332	Add back vp9_short_idct32x32_1_add_neon which is deleted in cleanup I63df79a13cf62aa2c9360a7a26933c100f9ebda3. Change-Id: I034848cf05031618818f7df2e7f9c35102686948	2013-11-05 14:57:32 -08:00
Dmitry Kovalev	815189613b	Localizing NEARESTMV special cases in the code. Removing special case handling from vp9_tree_probs_from_distribution(), tree_merge_probs(), and vp9_tokens_from_tree_offset() functions. Replacing inter_mode_offset() function with macro INTER_OFFSET which is used now for vp9_inter_mode_tree definition. Change-Id: Iff75a1499d460beb949ece543389c8754deaf178	2013-11-05 11:58:57 -08:00
Dmitry Kovalev	c622e1d18f	Unified approach for backward probability update. Replacing update_mode_probs() and adapt_probs() with tree_merge_probs(). Change-Id: I50b2c968d67c9265f5216c700cbeba25fb014654	2013-11-04 16:12:29 -08:00
Dmitry Kovalev	dde8069e57	Splitting partition_probs array into two arrays. We only update partition_probs for inter frames but they are constant for key frames. It is not necessary to have constants inside frame context and copy them every time. This change reduces FRAME_CONTEXT size by at least 48 bytes. Change-Id: If70a53be51043f37fe7d113853217937710932a7	2013-11-04 14:26:16 -08:00
Dmitry Kovalev	dd209fae3a	Merge "Removing 'new' probability calculation from convert_distribution()."	2013-11-04 11:14:58 -08:00
James Zern	152181b25c	Merge "vp9 ssse3 d207_predictor_32x32: add missing GLOBAL()"	2013-11-02 12:25:47 -07:00
James Zern	2d980b803a	vp9 ssse3 d207_predictor_32x32: add missing GLOBAL() removes a textrel for sh_b23456789abcdefff Change-Id: I80cb9dfd8e49a0fe884c8ff76472275b3a00cb57	2013-11-01 20:33:22 -07:00
Dmitry Kovalev	df19c6b64c	Removing 'new' probability calculation from convert_distribution(). We don't have to calculate 'new' probability in convert_distribution() because it is enough to calculate only 'new' counters which could be used to calculate probability if necessary. That's why removing a lot of unused temporary probability arrays and reducing number of get_binary_prob() calls. Change-Id: I4e14eb7203d1ace61bbddefd6b9b6326be83ba63	2013-11-01 15:09:43 -07:00
Yaowu Xu	333345cd26	Merge "Convert filter kernel choice to lookup"	2013-11-01 13:43:09 -07:00
Yaowu Xu	0f76ba5523	Convert filter kernel choice to lookup Also removed an unused declaration related to the 6-tap filter Change-Id: Ic17f516141d885157918505f4204081e4c951fad	2013-11-01 13:03:18 -07:00
Dmitry Kovalev	340b2b076e	Merge "Cleanup. Adding const to function pointer arguments."	2013-11-01 10:57:03 -07:00
Dmitry Kovalev	0e1756330b	Merge "Removing is_intra_mode() function."	2013-10-31 18:06:53 -07:00
Dmitry Kovalev	7c524bbef4	Cleanup. Adding const to function pointer arguments. Change-Id: I12c67c8c0fa1aa7fb3f7d6cc2ef65be29c4ea292	2013-10-31 14:34:21 -07:00
Yaowu Xu	d515716140	Merge "mb_lpf_horizontal_edge AVX2 optimization"	2013-10-31 10:43:57 -07:00
Yunqing Wang	d03b3cbdd7	Merge "Fix x_offset_q4/y_offset_q4 calculation"	2013-10-31 09:47:54 -07:00
Tamar Levy	54f9205653	mb_lpf_horizontal_edge AVX2 optimization This CL contains two AVX2 optimized loop filter functions, mb_lpf_horizontal_edge_w_avx2_8 and mb_lpf_horizontal_edge_w_avx2_16. Change-Id: I604e4fe6e99752b7800c2ea98721d97f7e0b931b	2013-10-31 10:26:15 -06:00
Parag Salasakar	d5a52edc11	Added optimized vp9_idct32x32_34_add_dspr2 Change-Id: I2ba9467525b87a8e4a58f0c546e63031b4e38a4e	2013-10-31 12:12:34 +05:30
Dmitry Kovalev	6761872e49	Replacing (SWITCHABLE_FILTERS + 1) with SWITCHABLE_FILTER_CONTEXTS. Change-Id: I9781a62bc1a4cd9176554d1271d87dbcafda9cb0	2013-10-30 14:40:34 -07:00
Yunqing Wang	9ed2d0a577	Fix x_offset_q4/y_offset_q4 calculation "<< SUBPEL_BITS" needs to be added in the calculation. Call set_scaled_offsets() to calculate x_offset_q4 and y_offset_q4. Change-Id: Ied130ea771510e918f51cd1dc3abe57f4c0962b5	2013-10-29 17:46:55 -07:00
Dmitry Kovalev	1bea58e4a8	Merge "Adding const to vp9_quantize_b_{32x32,} parameters."	2013-10-29 16:57:52 -07:00
Erik Niemeyer	27b8040c76	Merge "CL for adding AVX-AVX2 support in libvpx."	2013-10-29 15:55:54 -07:00
Dmitry Kovalev	065972f959	Adding const to vp9_quantize_b_{32x32,} parameters. Change-Id: I56f8c50ac382202f66040cd9cfaa05d889572fc7	2013-10-29 15:25:19 -07:00
Erik Niemeyer	e6863ef318	CL for adding AVX-AVX2 support in libvpx. Change-Id: Idc03f3fca4bf2d0afd33631ea1d3caf8fc34ec29	2013-10-29 15:11:16 -07:00
Dmitry Kovalev	e5956258dd	Merge "Making get_tx_counts() similar to get_tx_probs()."	2013-10-29 10:48:50 -07:00
Yunqing Wang	c634ec6a56	Merge "Rewrite loop_filter_info_n struct"	2013-10-29 09:49:36 -07:00
Dmitry Kovalev	aa76cd1e49	Removing is_intra_mode() function. It is enough to check just block type: intra or inter. Intra block implies intra prediction mode, and inter block implies inter mode. Change-Id: I3cf98731a3935f670a3cd8e2b2443483eb944be4	2013-10-28 20:00:55 -07:00
Dmitry Kovalev	fa1ac00aee	Making get_tx_counts() similar to get_tx_probs(). Change-Id: I5b17f40e515c4bcf9ebef5380270a214af4e0115	2013-10-28 19:52:38 -07:00
Dmitry Kovalev	19cf72eddc	Adding {read, write}_partition() instead of check_bsize_coverage(). Making partition read/write logic more clear. Change-Id: I1981e90327257d37095567c62d72a103cda1da33	2013-10-28 15:14:45 -07:00
James Zern	58a0f6dbdd	vp9: add TileInfo replaces use of cur_tile_mi_(row|col)_(start|end) by VP9_COMMON, making it less stateful and more reusable for parallel tile decoding Change-Id: I1df09382b4567a0e5f4434825d47c79afe2399be	2013-10-28 20:54:43 +01:00
James Zern	3ffa41aae3	Merge changes If9b16f7d,I75aab21c,I9cbb768c,If5cea3d3,I96940657,I025595d8,Ie0bc3935,I3ebb172d * changes: vp9: remove partition+entropy contexts from common vp9: add above/left_context to MACROBLOCKD vp9: add above/left_seg_context to MACROBLOCKD vp9: add above/left_context to encoder vp9: add above/left_seg_context to encoder vp9: pass entropy context directly to set_skip_context vp9: pass context directly to partition functions vp9/decode: add alloc_tile_storage()	2013-10-28 12:45:11 -07:00
Dmitry Kovalev	ded951793c	Merge "Replacing is_inter_mode with is_inter_block."	2013-10-28 10:07:06 -07:00
James Zern	7b9ca3caa7	vp9: remove partition+entropy contexts from common these are now handled separately by the encoder and decoder Change-Id: If9b16f7d734e992fb94a510a6d88f2690d7fb7cb	2013-10-28 11:34:20 +01:00
James Zern	e571d3badc	vp9: add above/left_context to MACROBLOCKD Change-Id: I75aab21c1692cbad717564cbb436578fddbc348d	2013-10-28 11:34:18 +01:00
James Zern	d9a317c8b2	vp9: add above/left_seg_context to MACROBLOCKD Change-Id: I9cbb768c5f857a096cf6c29d6755d0e5e6728435	2013-10-28 11:32:16 +01:00
Dmitry Kovalev	07502f1963	Merge "Adding get_frame_new_buffer() function to replace duplicated code."	2013-10-25 15:25:13 -07:00
Dmitry Kovalev	ddfc87c6f3	Merge "Making input pointer constant for all fdct/fht functions."	2013-10-25 15:14:49 -07:00
Yunqing Wang	00dbd369c7	Rewrite loop_filter_info_n struct Restructured the storing of loopfilter information. Deleted the loop_filter_info struct and reduced the copying that happened in every superblock. Tests showed a 0.5% ~ 0.8% decoder speed gain. Change-Id: Ie6a8e46bae71dc3a3cd8c6054f5de540b8e0ef5e	2013-10-25 14:56:28 -07:00
James Zern	d2bf696ee0	vp9: pass entropy context directly to set_skip_context this will allow for separate storage to be used in tile decoding Change-Id: I025595d83118bdc82a545dae69bc6602e8d2a6e3	2013-10-25 22:01:13 +02:00
James Zern	88d79eabdc	vp9: pass context directly to partition functions update_partition_context / partition_plane_context: this will allow for separate storage to be used in tile decoding Change-Id: Ie0bc393531ab7e9d2ce35c95111849b294aad4ed	2013-10-25 22:01:13 +02:00
Dmitry Kovalev	d5ac877f7f	Adding COLOR_SPACE enum. Change-Id: If5711eb166609cce0a88b3cb5b56b3afeebc4fb0	2013-10-25 12:35:20 -07:00
Yunqing Wang	47665452f0	Merge "Add 32x32 idct function for eob<=34 case"	2013-10-25 09:34:46 -07:00
Yunqing Wang	f88315cb29	Add 32x32 idct function for eob<=34 case When only the upper-left 8x8 area has non-zero dct coefficients, we can skip the 1D IDCT for the 9th to 32nd rows to save operations. This function is called when eob <= 34. Change-Id: I9684b75947bdde346cfe3720f08a953aa7a13fb5	2013-10-24 16:13:21 -07:00
Johann	35c4437bf5	Merge "mips dsp-ase r2 vp9 decoder idct module optimizations (rebase)"	2013-10-24 15:49:31 -07:00
Dmitry Kovalev	237ce8724a	Adding get_frame_new_buffer() function to replace duplicated code. Change-Id: I6e0e19231a48364c1de7dfab730b121ab227f111	2013-10-24 12:20:35 -07:00
Dmitry Kovalev	600a3860a4	Making input pointer constant for all fdct/fht functions. Change-Id: I78f7012f967a777ddd39bae6671eb501df6bbfe8	2013-10-24 11:48:25 -07:00
Dmitry Kovalev	7bb48e5e8e	Replacing is_inter_mode with is_inter_block. It should be only a check based on the block type (inter vs intra), not on the mode value. Change-Id: I0378cb4ba7c9a1631c1e870a537187b8650fa30a	2013-10-24 11:22:06 -07:00
Dmitry Kovalev	dfc7945d1e	Adding get_frame_ref_buffer() function + cleanup. Change-Id: Ib9ead216fc54b2df6f6f1fe82d2ea137197beebd	2013-10-24 11:05:35 -07:00
Dmitry Kovalev	8001ed71ed	Merge "Renaming vp9_short_fdct4x4 and vp9_short_walsh4x4."	2013-10-24 10:08:42 -07:00
Dmitry Kovalev	710ca1fe36	Merge changes I1868fb75,I9ff504c6 * changes: Renaming INTERPOLATIONFILTERTYPE to INTERPOLATION_TYPE. Adding VP9_FRAME_MARKER constant.	2013-10-24 10:08:19 -07:00
Dmitry Kovalev	153d70ca9b	Merge "Cleaning up {above, left}_block_mode functions."	2013-10-24 10:07:51 -07:00
Yunqing Wang	93ec31dff6	Merge "Improve scale_factors struct"	2013-10-24 09:13:41 -07:00
James Zern	eec622d178	Merge "vp9/extend_for_intra: avoid crossing tile boundary"	2013-10-24 06:04:10 -07:00
James Zern	3c038b6c40	vp9/extend_for_intra: avoid crossing tile boundary Change-Id: I0d8a71778aa3c73b8b1673e14053074bb866548b	2013-10-24 14:21:24 +02:00
Parag Salasakar	1699eb0bf6	mips dsp-ase r2 vp9 decoder idct module optimizations (rebase) Change-Id: Iedcdb8867084f328f4fce2fadb968e0984217308	2013-10-24 11:29:04 +05:30
Dmitry Kovalev	5d28b63687	Cleaning up {above, left}_block_mode functions. Making {above, left}_block_mode more clear and symmetric. Change-Id: Ie348a950fb9a5cf52861d0cba838a58010ff56ad	2013-10-23 17:54:13 -07:00
Dmitry Kovalev	ad867fe237	Renaming INTERPOLATIONFILTERTYPE to INTERPOLATION_TYPE. Change-Id: I1868fb75ed88bfa65c1c2ca24677d65f2894d713	2013-10-23 17:45:52 -07:00
Dmitry Kovalev	a53075f7c5	Adding VP9_FRAME_MARKER constant. Also renaming SYNC_CODE_* to VP9_SYNC_CODE_*. Change-Id: I9ff504c6ebce6cd6673d7df2085d597b818f5960	2013-10-23 17:24:17 -07:00
Dmitry Kovalev	fd724f13b0	Renaming vp9_short_fdct4x4 and vp9_short_walsh4x4. For consistency with idct function names. Renames: vp9_short_fdct4x4 -> vp9_fdct4x4 vp9_short_walsh4x4 -> vp9_fwht4x4 Change-Id: Id15497cc1270acca626447d846f0ce9199770f58	2013-10-23 14:28:39 -07:00
Dmitry Kovalev	a018988ce8	Renaming vp9_short_fdct32x32 to vp9_fdct32x32. For consistency with idct function names. Change-Id: Ie77b7178e0894c57cd5cb9243c949eb9224ece18	2013-10-23 13:41:40 -07:00
Dmitry Kovalev	5bdd4d9ccf	Merge "Renaming vp9_short_fdct16x16 to vp9_fdct16x16."	2013-10-23 13:37:09 -07:00
Dmitry Kovalev	a9c8251b9d	Merge "Renaming vp9_short_fdct8x8 to vp9_fdct8x8."	2013-10-23 11:38:55 -07:00
Jingning Han	9cc4935d7b	Merge "Make decode modules independent of tile index"	2013-10-23 11:08:12 -07:00
Dmitry Kovalev	02feb63684	Renaming vp9_short_fdct16x16 to vp9_fdct16x16. For consistency with idct function names. Change-Id: I5ca355ba99fdba04f09254be95cf79808b534f71	2013-10-23 10:57:12 -07:00
Dmitry Kovalev	fa143dbc8e	Renaming vp9_short_fdct8x8 to vp9_fdct8x8. For consistency with idct function names. Change-Id: I7b6af2f92c66eff56f84ed29edc3a66af8dc421f	2013-10-23 10:52:33 -07:00
Dmitry Kovalev	73fe696c91	Merge "Reordering probability tables for consistency."	2013-10-23 10:10:24 -07:00
Adrian Grange	2f58b813bb	Remove right_available member from VP9_COMP This member of VP9_COMP is no longer used, so I removed it. Change-Id: I3509f52756da4768a3e4581cec5ed5d2a70d5fb8	2013-10-22 16:53:37 -07:00
Jingning Han	bd23e084eb	Make decode modules independent of tile index Assign the pointer to mode_info stream per tile. Remove the use of tile_col in the decoding modules. Change-Id: I7df87086708a3d92c5e20e86bcfb04e458ff47a6	2013-10-22 15:22:59 -07:00
Yunqing Wang	175c313a12	Improve scale_factors struct The ref's scale_factors are set at frame level, and then copied for each partition block. Since the struct members are mostly constant, this patch separated the constant and non-constant members, and reduced struct copying. This gave 0.5% ~ 1.4% decoder speed gain. Change-Id: I94043bf5a6995c8042da52e5c661818dfa6f6d4c	2013-10-22 13:10:22 -07:00
Dmitry Kovalev	9f09618bd4	Merge "Using stride (# of elements) instead of pitch (bytes) in fdct4x4."	2013-10-22 13:05:24 -07:00
James Zern	64d94b4aa6	Merge "Revert "Merge "SVC improvements"""	2013-10-22 12:47:22 -07:00
Dmitry Kovalev	68c02593df	Reordering probability tables for consistency. Putting vp9_kf_y_mode_prob[] before vp9_kf_uv_mode_prob[]. Change-Id: I2404910e35de1ee24ce46337e00c07eb1446e50f	2013-10-22 12:21:37 -07:00
Dmitry Kovalev	fa57135b2c	Merge "Removing NUM_ prefix from constant names."	2013-10-22 11:34:28 -07:00
Dmitry Kovalev	a767d10fa5	Merge "Using stride (# of elements) instead of pitch (bytes) in fdct8x8."	2013-10-22 11:34:17 -07:00
Jingning Han	7b54556008	Merge "Prevent left_block_mode stepping into left tile"	2013-10-22 09:37:17 -07:00
Jingning Han	c807949408	Prevent left_block_mode stepping into left tile This commit uses left_available flag to decide if the left mode_info struct is available for left_block_mode. As discussed with James Zern (jzern@), this prevents the codec from fetching mode_info from blocks in the left tile, which although effectively not used might present concerns for multi-threaded tile decoding. This is NOT a bit-stream change. Change-Id: I1dc8cf1bcbf056688eee27c7bc5706ac4b4e0125	2013-10-22 09:02:41 -07:00
Abo Talib Mahfoodh	908a992d7f	Improve vp9_idct4x4_1_add_sse2 Simple modification to reduce number of cycles in the function. Original function number of cycles: 973 Modified function number of cycles: 835 Improvement factor: 1.165 Tested with: park_joy_420_720p50.y4m Change-Id: Ic5857272ea3aafe21d5ef9a69258d78c688f69bd	2013-10-22 09:35:36 -04:00
James Zern	cd74a901a7	Revert "Merge "SVC improvements"" This reverts commit `a82001b1cf`, reversing changes made to `f6d870f7ae`. This commit breaks windows builds and needs some work to fix those and some additional comments. Change-Id: Ic0b0228e36704b127e5e399ce59db26182cfffe7	2013-10-22 11:09:22 +02:00
Ivan Maltz	a82001b1cf	Merge "SVC improvements"	2013-10-21 16:28:31 -07:00
Dmitry Kovalev	190c2b4591	Using stride (# of elements) instead of pitch (bytes) in fdct4x4. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: I0ba3c52513a5fdd194f1e7e2901092671398985b	2013-10-21 15:27:35 -07:00
Ivan Maltz	663916cea7	SVC improvements These changes were originally made in the Stratacaster team-review repository commit e114bffcd82ad74c3696ec58e13c0ac895d6c82d Author: Charles 'Buck' Krasic <ckrasic@google.com> Date: Mon Oct 14 16:52:13 2013 -0700 Make dummy frame handling a bit more explicit, fixing bug with single layer encodes. Squashed commit of the following: commit 1ebbfd976c0fadb02bf1ea562a2d0e3f0206daad Merge: `ac468dd` 54e88b7 Author: Ivan Maltz <ivanmaltz@google.com> Date: Fri Oct 11 17:29:58 2013 -0700 Move SVC code from vp9_spatial_scalable_encoder to libvpx module accessible from ffmpeg commit 54e88b78b160becc9569fc3c6cb6b0a8c95dc357 Author: Ivan Maltz <ivanmaltz@google.com> Date: Tue Oct 8 09:08:40 2013 -0700 common svc encoding code for sample app and ffmpeg added svc_encodeframe.c, svc_context.h, svc_test.cc vp9_spatial_scalable_encoder uses vpx_svc_encode commit 5616ec8e2e3d3e8d277333d8a9242f6c70151162 Merge: 4528014 `e29137d` Author: Ivan Maltz <ivanmaltz@google.com> Date: Tue Oct 8 08:47:58 2013 -0700 Merge branch 'master' into stratacaster commit 45280148450b1f3d61e390df8aadedf85cd5bce1 Merge: bb2b675 `1ab60f7` Author: Sujeevan Rajayogam <sujee@google.com> Date: Fri Oct 4 10:22:31 2013 -0700 Merge branch 'master' into stratacaster commit bb2b675e595dc9bfc8551e963edf56800c3aea61 Author: Sujeevan Rajayogam <sujee@google.com> Date: Wed Oct 2 12:37:26 2013 -0700 Track individual frame sizes and psnrs instead of averages. 
commit c6d303b714795c81e7ceb4173967115c9f8ff5b7 Merge: fa87df9 `3583087` Author: Sujeevan Rajayogam <sujee@google.com> Date: Fri Sep 27 10:05:35 2013 -0700 Merge branch 'master' into stratacaster commit fa87df94fba923d9f7aeb8ae20c6e15f777e00b5 Merge: bf22d71 `3c465af` Author: Sujeevan Rajayogam <sujee@google.com> Date: Thu Sep 26 16:10:31 2013 -0700 Merge branch 'master' into stratacaster commit bf22d7144895a82e0c348ac177c8a261b9e2b88e Author: Sujeevan Rajayogam <sujee@google.com> Date: Thu Sep 26 11:10:34 2013 -0700 Parameterized quantizer, 16th scalefactors, more logging, enabled single layer encodes to generate baseline. commit ceffd7e6025b765f9886b5ea0f324248aa37e327 Author: Sujeevan Rajayogam <sujee@google.com> Date: Thu Sep 19 10:04:49 2013 -0700 - Include new mode for 3 layer I frame with 5 total layers. - Refactor svc api. Change-Id: Ie4d775e21e006fa597d884c59488dc999478e9b5	2013-10-21 14:34:37 -07:00
Dmitry Kovalev	a0be71c703	Inlining set_partition_seg_context function. We used set_partition_seg_context() only before calls to: 1. update_partition_context() 2. partition_plane_context() Moving these functions from vp9_blockd.h to vp9_onyxc_int.h and inlining set_partition_seg_context into them. After that it is not necessary to have {above, left}_seg_context fields in MACROBLOCKD structure, so removing them also. Change-Id: I4723f59e1c8f3788432b7f51185d8d747b3a97f9	2013-10-21 12:02:19 -07:00
Dmitry Kovalev	33a29f3c35	Merge "Moving allow_high_precision_mv from MACROBLOCKD to VP9_COMMON."	2013-10-21 10:55:02 -07:00
Yunqing Wang	4afc3a6542	Merge "Fix d207 intra prediction SSSE3 functions"	2013-10-21 10:45:20 -07:00
Dmitry Kovalev	d1b65c6bda	Moving allow_high_precision_mv from MACROBLOCKD to VP9_COMMON. This value is a global frame-level flag, not a macroblock-level. Change-Id: Ie8c5790a931150741c2167c00c3e3dd2cf26744d	2013-10-21 10:12:14 -07:00
Dmitry Kovalev	41ff8d7aaa	Merge "Removing unused struct member mvcount[MV_VALS]."	2013-10-21 09:46:07 -07:00
Dmitry Kovalev	6d2a0da7a7	Removing NUM_ prefix from constant names. Renames for consistency with other constants: NUM_FRAME_TYPES -> FRAME_TYPES NUM_PARTITION_CONTEXTS -> PARTITION_CONTEXTS Change-Id: I3db30acb2868eb0a424237c831087b2e264ec47f	2013-10-18 17:44:19 -07:00
Yunqing Wang	dd51042802	Fix d207 intra prediction SSSE3 functions This patch fixed a bug that caused 32bit PIC build mismatch. The stack pointer was modified after "GET_GOT". Loading left pointer from a hard-coded position gave wrong result. Change-Id: Iea0aec6f917b12a6b3393ffc986bad74510248cc	2013-10-18 17:00:18 -07:00
Yunqing Wang	997e19092e	Disable d207 intra prediction SSSE3 functions Commit "d207 intra prediction ssse3 using bytes" caused mismatch while building 32bit PIC code. Disabled these SSSE3 functions until we fix the bug. Change-Id: Ic444e531d3d4058092fe6eab09006b44fcb18e4c	2013-10-18 14:23:17 -07:00
James Zern	4e6c799e9f	Merge "vp9 dec/com: only update frame counts when necessary"	2013-10-18 13:56:11 -07:00
James Zern	68573c9d2b	Merge "vp9 com/dec: avoid reading unavailable above/left"	2013-10-18 13:22:19 -07:00
James Zern	7563dd4a8d	vp9 dec/com: only update frame counts when necessary don't update them when frame_parallel_mode is true Change-Id: I22ff131a6c6eea238415d10b729f195c7d6dc60d	2013-10-18 22:16:56 +02:00
Yaowu Xu	db1045f2c0	Merge "Use lookup table to simplify logic"	2013-10-18 12:55:24 -07:00
Dmitry Kovalev	5cb8cca9eb	Merge "Using stride (# of elements) instead of pitch (bytes) in fdct16x16."	2013-10-18 12:53:09 -07:00
James Zern	67e41fe2f6	vp9 com/dec: avoid reading unavailable above/left in most cases at least the left column was a harmless race as it was left unused later in the code. Change-Id: I43211df66fb157c6feecf08c681add4fcf18b644	2013-10-18 21:39:37 +02:00
Dmitry Kovalev	e5fa44c869	Using stride (# of elements) instead of pitch (bytes) in fdct8x8. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: Ibc944952a192e6c7b2b6a869ec2894c01da82ed1	2013-10-18 12:20:26 -07:00
Dmitry Kovalev	1f5d744742	Removing unused struct member mvcount[MV_VALS]. Change-Id: Iaaca88097904b889769901f2bd331f4fff0e5044	2013-10-18 11:56:55 -07:00
Dmitry Kovalev	1aa7fd5aef	Using stride (# of elements) instead of pitch (bytes) in fdct16x16. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: I2d95fdcbba96aaa0ed24a80870cb38f53487a97d	2013-10-18 11:49:33 -07:00
Dmitry Kovalev	a8ffa96e9b	Passing block index explicitly instead of using get_sb_index(). That makes decoder and encoder (only bitstream writing part) a little bit simpler and faster. Moving get_sb_index() function to the encoder. Change-Id: Ie91aaeefd69c84b085948267b33556a7666c6278	2013-10-18 11:02:32 -07:00
Yaowu Xu	30d1ec38a7	Use lookup table to simplify logic In deciding the transform size for a given block in a given TX_MODE. Change-Id: I1467da09853e69cd320695a24c04e19a2f3d04fb	2013-10-17 14:54:16 -07:00
Dmitry Kovalev	ab1e65b380	Merge "Using TREE_SIZE macro for vp9_segment_tree."	2013-10-17 14:46:08 -07:00
Dmitry Kovalev	631d216273	Merge "Removing last_kf_gf_q member from VP9Common structure."	2013-10-17 14:46:02 -07:00
Dmitry Kovalev	e05412fc23	Using stride (# of elements) instead of pitch (bytes) in fdct32x32. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: Id623c5113262655fa50f7c9d6cec9a91fcb20bb4	2013-10-17 13:02:28 -07:00
Dmitry Kovalev	01993f7d4a	Removing last_kf_gf_q member from VP9Common structure. It looks like we don't actually use this value. Change-Id: If21d52b597337e7755f7ea817824fc2b1e477a14	2013-10-16 18:01:48 -07:00
Dmitry Kovalev	1350f885f6	Using TREE_SIZE macro for vp9_segment_tree. Change-Id: I2965453135643d8f061b9fa9406fdca2db9c961e	2013-10-16 17:35:06 -07:00
Guillaume Martres	7fd2561d64	Merge changes I6226456d,I97925178,I766c4b74 * changes: Use a separate MODE_INFO stream for each tile column Get rid of "this_mi", use "mi_8x8[0]" everywhere instead Make the static_segmentation feature work again	2013-10-16 17:05:39 -07:00
Guillaume Martres	5b984b36ca	Use a separate MODE_INFO stream for each tile column This should make parallel tiles decoding easier to implement. Change-Id: I6226456dd11f275fa991e4a7a930549da6675915	2013-10-16 16:24:48 -07:00
Guillaume Martres	acf0d56f0b	Get rid of "this_mi", use "mi_8x8[0]" everywhere instead The only case where they were intentionally pointing to different structures was in mbgraph, and this didn't have the expected behavior because both of these pointers are used interchangeably through the code Change-Id: I979251782f90885fe962305bcc845bc05907f80c	2013-10-16 16:24:03 -07:00
Dmitry Kovalev	9deb614a57	Adding get_band_translate() function. Moving code that gets band_translate array from get_scan_and_band() function to get_band_translate() function. Renaming get_scan_and_band() to get_scan(). Change-Id: I43047c205a1ca2a6e24be44db39dc04b7a385008	2013-10-16 15:11:42 -07:00
Dmitry Kovalev	501a8c6b91	Merge "Removing print_prob_tree function and vp9_coeff_probs typedef."	2013-10-16 13:13:25 -07:00
Dmitry Kovalev	65583b14e0	Merge "Moving FILTER_BITS constant from vp9_convolve.h to vp9_filter.h."	2013-10-16 13:13:14 -07:00
Dmitry Kovalev	9e66515886	Merge "Using constants instead of plain numbers."	2013-10-16 13:13:04 -07:00
Adrian Grange	12b2c712ca	Merge "Updated encoder to handle intra-only frames"	2013-10-15 17:19:28 -07:00
Johann	e404db44ff	Merge "Remove Windows-style newlines using dos2unix"	2013-10-15 17:05:32 -07:00
Jingning Han	9b05f23e05	Merge "Make vp9_zero use cases of consistent format"	2013-10-15 16:49:05 -07:00
Alexander Voronov	d6a59fb12c	Updated encoder to handle intra-only frames Updated the encoder to handle frames that are coded intra-only. Intra-only frames must be non-showable, that is, the "show frame" flag must be set to 0 in the frame header. Tested by forcing the ARF frames to be coded intra-only. Note: The rate control code will need to be modified to account for intra-only frames better than they are currently handled. Change-Id: I6a9dd5337deddcecc599d3a44a7431909ed21079	2013-10-15 16:44:02 -07:00
Jingning Han	bf187d1b2d	Merge "Fix a few indent format issues in buffer defs"	2013-10-15 16:23:50 -07:00
Jingning Han	c8e48f4b02	Make vp9_zero use cases of consistent format Remove the semicolon in the definition of vp9_zero macro. Make all the use cases of vp9_zero of consistent format. Change-Id: Ibaf9751e8595872b12766381a93d185a4d90df8f	2013-10-15 16:12:21 -07:00
Guillaume Martres	67cf81b1c0	Remove Windows-style newlines using dos2unix Change-Id: I0a0f9c07e774450896abc9455728b97fd38ef00c	2013-10-15 15:49:52 -07:00
Jingning Han	0a66541619	Fix a few indent format issues in buffer defs Change-Id: Iac55891ac9e6f13718c9f822aa099b5ca491832a	2013-10-15 11:51:09 -07:00
Dmitry Kovalev	a4585285ed	Removing unused 8x4 transform from the encoder. Change-Id: Icbcf68b5b685a56f255ebc3859c9692accdadf9e	2013-10-15 11:27:28 -07:00
Dmitry Kovalev	77cd8db1bf	Moving FILTER_BITS constant from vp9_convolve.h to vp9_filter.h. Change-Id: Idd7bdb0c364d94c5a0d24c87bb8574292e4c840c	2013-10-14 21:15:40 -07:00
Dmitry Kovalev	6965e6f3d5	Removing print_prob_tree function and vp9_coeff_probs typedef. Change-Id: If14265084e9b4c85c75b43e8d33a6fafad468cbc	2013-10-14 21:08:21 -07:00
Dmitry Kovalev	a97fe89538	Using constants instead of plain numbers. Replacing 22 with TREE_SIZE(MAX_ENTROPY_TOKENS) 12 with MAX_ENTROPY_TOKENS Change-Id: If24919336e8ace9cf64991bd5ae33fa6656f7b93	2013-10-14 20:33:37 -07:00
Dmitry Kovalev	f36ba3da20	Merge "Making input pointer of any inverse transform constant."	2013-10-13 12:22:55 -07:00
Dmitry Kovalev	898c217cbc	Merge "Adding TREE_SIZE macro + cleanup."	2013-10-13 12:21:09 -07:00
Dmitry Kovalev	65f118d72f	Making input pointer of any inverse transform constant. Also renaming dest_stride to stride in some places. Change-Id: I75f602b623a5a7071d4922b747c45fa0b7d7a940	2013-10-11 18:27:12 -07:00
Johann	1ea04d980c	Merge "Get libvpx to compile on VS2013."	2013-10-11 17:26:29 -07:00
Dmitry Kovalev	860e467643	Adding TREE_SIZE macro + cleanup. Using TREE_SIZE for the following trees: vp9_intra_mode_tree vp9_inter_mode_tree vp9_partition_tree vp9_switchable_interp_tree vp9_mv_joint_tree vp9_mv_class_tree vp9_mv_class0_tree vp9_mv_fp_tree Change-Id: I0212bb4c1ee6648249f68517e28a67a56591ee1b	2013-10-11 16:25:50 -07:00
Dmitry Kovalev	ac468dde46	Consistent names for inverse hybrid transforms (2 of 2). Renames: vp9_iht_add -> vp9_iht4x4_add vp9_iht_add_8x8 -> vp9_iht8x8_add vp9_iht_add_16x16 -> vp9_iht16x16_add Change-Id: I8f1a2913e02d90d41f174f27e4ee2fad0dbd4a21	2013-10-11 15:49:05 -07:00
Dmitry Kovalev	107897cf05	Merge "Consistent names for inverse hybrid transforms (1 of 2)."	2013-10-11 15:33:00 -07:00
Scott Graham	3806bab283	Get libvpx to compile on VS2013. `round` is defined in the runtime library now. https://codereview.chromium.org/23922008/ Change-Id: I3852740058d32f63ce283579acbe284865e32dba	2013-10-11 14:27:00 -07:00
Dmitry Kovalev	e765aade0b	Merge "Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB."	2013-10-11 14:15:46 -07:00
Dmitry Kovalev	7ef573914d	Consistent names for inverse hybrid transforms (1 of 2). Renames: vp9_short_iht4x4_add -> vp9_iht4x4_16_add vp9_short_iht8x8_add -> vp9_iht8x8_64_add vp9_short_iht16x16_add_c -> vp9_iht16x16_256_add Change-Id: Ibca7a188fd062b196787ac5efc1ea545e7f166c0	2013-10-11 13:31:32 -07:00
Dmitry Kovalev	44195fda71	Adding const to the input argument of all 1D transforms. Also adding static to iadst16_1d and fadst16 functions. Change-Id: I13c7df3b776f0f8efc6e80099bdb0a2f6d29edaf	2013-10-11 11:19:58 -07:00
Dmitry Kovalev	4a0f9478ef	Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB. Values of MODE_UPDATE_PROB and VP9_COEF_UPDATE_PROB are equal, so replacing them with one constant. Inlining appropriate arguments for functions: vp9_cond_prob_diff_update (encoder) vp9_diff_update_prob (decoder) Change-Id: I1255a1cb477743b799b3bfbbcd8de6b32b067338	2013-10-11 10:47:22 -07:00
Dmitry Kovalev	6e21ca7635	Merge "Removing vp9_tree_p typedef."	2013-10-11 10:44:04 -07:00
Dmitry Kovalev	9c8f3063b1	Merge "Removing vp9_idct4_1d_sse2 function."	2013-10-11 10:43:56 -07:00
Yunqing Wang	57b97b56f6	Code cleanup Minor code cleanup. Change-Id: I47c1f794842d4570bb39cfd23b80f54f5606bba6	2013-10-11 09:08:41 -07:00
Yunqing Wang	3a0b59e3fd	Merge "SSE2 8-tap sub-pixel filter optimization"	2013-10-11 08:44:56 -07:00
Dmitry Kovalev	98400c1bc4	Removing vp9_tree_p typedef. It is used only two times and it is more clear to use real type instead of typedef. Change-Id: Idc25c16504c3da4d040e0cdb33a2987631bb6a5b	2013-10-10 17:16:20 -07:00
Dmitry Kovalev	ddf1b76205	Removing vp9_idct4_1d_sse2 function. We have two SSE2-optimized functions for idct4_1d: vp9_idct4_1d_sse2 <-- removing this one idct4_1d_sse2 vp9_idct4_1d_sse2 was used only by the following functions which already have SSE2 optimized variants: vp9_idct4x4_16_add_c -> vp9_idct4x4_16_add_sse2 idct8_1d -> vp9_idct8x8_{16, 10, 1}_sse2 vp9_short_iht4x4_add_c -> vp9_short_iht4x4_add_sse2 Change-Id: Ib0a7f6d1373dbaf7a4a41208cd9d0671fdf15edb	2013-10-10 16:50:43 -07:00
Scott LaVarnway	83936e8cd5	d207 intra prediction ssse3 using bytes byte version of ronalds d207 ssse3 optimizations (commit: f891f84d3ba9345b0074e682f0fea09b8ddf4f1e) Change-Id: If15f71a589ea16f78ac86a501b0c5c6231dc9af1	2013-10-10 15:50:31 -07:00
Dmitry Kovalev	2be3b84aed	Merge "Giving consistent names to IDCT 32x32 functions."	2013-10-10 15:31:25 -07:00
Yunqing Wang	86528586a3	Merge "d153 intra prediction (32x32) ssse3 using bytes"	2013-10-10 15:16:45 -07:00
Yunqing Wang	3fb728c749	SSE2 8-tap sub-pixel filter optimization To ensure fast encoding/decoding on devices without ssse3 support, SSE2 optimization of sub-pixel filters was done. Test using 1080p clip showed the decoder speeds were ~70fps with ssse3 filters, ~60fps with sse2 filters, and ~15fps with c filters. Change-Id: Ie2088f87d83a889fba80a613e4d0e287aadd785c	2013-10-10 14:12:47 -07:00
Dmitry Kovalev	1e766b50e2	Giving consistent names to IDCT 32x32 functions. Renames: vp9_short_idct32x32_add -> vp9_idct32x32_1024_add vp9_short_idct32x32_1_add -> vp9_idct32x32_1_add vp9_idct_add_32x32 -> vp9_idct32x32_add Change-Id: Id85306f5814bac6c47463a6b5901a93082510666	2013-10-10 11:27:39 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Dmitry Kovalev	9a1250e3e0	Merge "Moving all scan/iscan code into separate vp9_scan.{h, c} files."	2013-10-10 10:45:07 -07:00
Dmitry Kovalev	419c3f6fba	Merge "Giving consistent names to IDCT 16x16 functions."	2013-10-10 10:43:14 -07:00
Dmitry Kovalev	d9d7040e98	Adding const to several pointers. Change-Id: I7231589bda71d0d23c730283febd5bb58585a0da	2013-10-09 19:46:30 -07:00
Yaowu Xu	8a06cb55ee	Merge "Added #define of snprintf for MSVC"	2013-10-09 13:04:20 -07:00
Yaowu Xu	850a919640	Added #define of snprintf for MSVC snprintf is not supported by MSVC; this commit replaces it with the MSVC variant _snprintf to enable the build. Change-Id: I686943a78c289bae6b486a5e75effad5f86c24de	2013-10-09 12:16:53 -07:00
Parag Salasakar	eeb5b62dc1	mips dsp-ase r2 vp9 decoder bilinear convolve optimizations Change-Id: Ic31b4ef85e65070b4f8b9f26e068ccfaae00c4f0	2013-10-09 18:05:27 +05:30
James Zern	b4148c3a03	Merge "vp9_blockd.h: update get_tx_eob() signature"	2013-10-09 00:55:48 -07:00
Jingning Han	83b285e546	Merge "All zero coeff skip in IDCT 32x32"	2013-10-08 12:30:48 -07:00
Jingning Han	6594ca8897	All zero coeff skip in IDCT 32x32 When all coefficients are zeros, skip the corresponding 1-D inverse transform. This practice has been used in the SSE2 implementation of inverse 32x32 DCT. This commit imports this algorithm into the C code. Change-Id: I0f58bfcb183a569fab85d524d5d9cf8ae8653f86	2013-10-08 11:47:29 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Dmitry Kovalev	8d3ef287a2	Merge "Removing redundant vp9_pt_energy_class declarations."	2013-10-08 10:54:48 -07:00
Paul Wilkins	f9ec0433ad	Merge "Fix MSVC warning."	2013-10-08 10:19:49 -07:00
Jim Bankoski	56af13a1b1	cpplint issue with convolve resolved Change-Id: I38b2100f1a64cb067c63f4e1662c36914b3569df	2013-10-07 15:55:42 -07:00
Dmitry Kovalev	b096c5a336	Giving consistent names to IDCT 16x16 functions. Renames: vp9_short_idct16x16_add -> vp9_idct16x16_256_add vp9_short_idct16x16_10_add -> vp9_idct16x16_10_add vp9_short_idct16x16_1_add -> vp9_idct16x16_1_add vp9_idct_add_16x16 -> vp9_idct16x16_add Change-Id: Ief8a3904de78deab0f4ede944c4d0339c228cfc3	2013-10-07 14:31:10 -07:00
Dmitry Kovalev	2ae93a776b	Merge "Giving consistent names to IDCT 8x8 functions."	2013-10-07 14:19:50 -07:00
Dmitry Kovalev	23cc1cd8e6	Removing redundant vp9_pt_energy_class declarations. Declaring vp9_pt_energy_class in vp9_entropy.h instead of many external places. Change-Id: I66e8a3fc119a43f88d130d0dae4133c825a047a3	2013-10-07 14:11:01 -07:00
Dmitry Kovalev	e3597c6af7	Moving all scan/iscan code into separate vp9_scan.{h, c} files. Now we have entropy code separate from scan/iscan code. The next step in future is to move iscan code from common part to the encoder. Change-Id: Id9732f7d80aec00af35c1d58d1137c4c96c91451	2013-10-07 13:55:56 -07:00
Dmitry Kovalev	6d3db91d3b	Merge "Cleaning up foreach_predicted_block_in_plane() function."	2013-10-07 11:30:45 -07:00
Scott LaVarnway	a2a3b4a479	d153 intra prediction (32x32) ssse3 using bytes Change-Id: Ie2c0d84ff9f6294084d65f4380e1f30c09e681c9	2013-10-07 11:21:10 -04:00
James Zern	879e21ddfd	vp9_blockd.h: update get_tx_eob() signature as the name implies, the segmentation pointer can be const Change-Id: I945f01a077c112ec86c00e35a1e9395bc230c2d9	2013-10-07 11:45:16 +02:00
Paul Wilkins	950058765d	Fix MSVC warning. A new set of MSVC warnings were introduced by change I3f36d3f7cd8d15195a6e2fafd1777cdaf9ecb847 In particular MSVC does not like:- typedef const int16_t subpel_kernel[SUBPEL_TAPS]; struct subpix_fn_table { const subpel_kernel filter_x; const subpel_kernel filter_y; }; causes new warning in MSVC. warning C4114: same type qualifier used more than once Change-Id: Iae596fd13aadf36169faf00c68eabe9a32a9b156	2013-10-07 02:26:44 -07:00
Jim Bankoski	bf893e84bd	Merge changes I8a106dd6,Iec442603 * changes: d153 intra prediction (16x16) ssse3 using bytes d153 intra prediction ssse3 using bytes	2013-10-06 20:11:24 -07:00
Dmitry Kovalev	c6ad70d5f1	Giving consistent names to IDCT 8x8 functions. Renames: vp9_short_idct8x8_add -> vp9_idct8x8_64_add vp9_short_idct8x8_1_add -> vp9_idct8x8_1_add vp9_short_idct8x8_10_add -> vp9_idct8x8_10_add vp9_idct_add_8x8 -> vp9_idct8x8_add Change-Id: Ifb8d3a45b4c0397aa805b30463f3d14581bf72c1	2013-10-06 00:24:09 -07:00
Dmitry Kovalev	9dba044be2	Merge "Giving consistent names to IDCT/IWHT functions."	2013-10-05 23:44:05 -07:00
Dmitry Kovalev	ee74054e81	Cleaning up foreach_predicted_block_in_plane() function. Change-Id: Ibb3d9667eba56621667412f62097aa7a392659c2	2013-10-04 15:53:32 -07:00
Dmitry Kovalev	56acf7e528	Merge "Adding vp9_get_filter_kernel() function."	2013-10-04 15:21:39 -07:00
Dmitry Kovalev	3a0602578e	Giving consistent names to IDCT/IWHT functions. The idea is to have the following names for each transform size: vp9_idct4x4_add vp9_idct4x4_1_add vp9_idct4x4_10_add vp9_idct4x4_16_add vp9_idct8x8_add vp9_idct8x8_1_add vp9_idct8x8_10_add vp9_idct8x8_64_add etc for 16x16, 32x32 The actual list of renames in this patch: vp9_idct_add_lossless -> vp9_iwht4x4_add vp9_short_iwalsh4x4_add -> vp9_iwht4x4_16_add vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add vp9_idct_add -> vp9_idct4x4_add vp9_short_idct4x4_add -> vp9_idct4x4_16_add vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1	2013-10-04 14:17:06 -07:00
Dmitry Kovalev	042c475a8f	Merge "Moving all idct/iht functions in one place."	2013-10-04 12:01:42 -07:00
Dmitry Kovalev	9ec09700d6	Adding vp9_get_filter_kernel() function. Moving INTERPOLATIONFILTERTYPE enum and subpix_fn_table struct to vp9_filter.h. Adding convenient typedef for subpel kernels. Function vp9_setup_interp_filters() besides setting xd->subpix.filter_x & xd->subpix.filter_y has a side effect of also setting scale factors. This is not required inside decode_modes_b() because scale factors have been already set by set_ref() calls. That's why replacing vp9_setup_interp_filters() call with newly created vp9_get_filter_kernel() call. The behavior of vp9_setup_interp_filters() is unchanged (it is used from the encoder). Change-Id: I3f36d3f7cd8d15195a6e2fafd1777cdaf9ecb847	2013-10-03 18:55:21 -07:00
Jingning Han	4093192ec9	Change b_mode_info definition from union to struct This commit defines b_mode_info as a struct type. This will allow us to further remove the use of PARTITION_INFO in the encoding process. Change-Id: I975b0f7d557b5e0f66545a61b472def76b671cce	2013-10-03 12:34:11 -07:00
Yunqing Wang	134dfea878	Merge "Rewrite HORIZx4 and HORIZx8 in subpixel filter functions"	2013-10-03 12:17:47 -07:00
Dmitry Kovalev	8394f1a015	Merge "Making decode_modes_b function more straightforward."	2013-10-03 11:06:29 -07:00
Johann	fd6c4c71d6	Merge "mips dsp-ase r2 vp9 decoder convolve module optimizations"	2013-10-03 09:41:16 -07:00
Yunqing Wang	ed22179a82	Rewrite HORIZx4 and HORIZx8 in subpixel filter functions In subpixel filters, prefetched source data, unrolled loops, and interleaved instructions. In HORIZx4, integrated the idea in Scott's CL (commit: `d22a504d11`), which was suggested by Erik/Tamar from Intel. Further tweaking was done to combine row 0, 2, and row 1, 3 in registers to do more 2-row-in-1 operations until the last add. Test showed a ~2% decoder speedup. Change-Id: Ib53d04ede8166c38c3dc744da8c6f737ce26a0e3	2013-10-03 09:04:02 -07:00
Parag Salasakar	40edab5e39	mips dsp-ase r2 vp9 decoder convolve module optimizations Change-Id: I401536778e3c68ba2b3ae3955c689d005e1f1d59	2013-10-02 16:58:37 -07:00
Dmitry Kovalev	43e979db3b	Merge "Adding const to function arguments."	2013-10-02 16:26:20 -07:00
Dmitry Kovalev	7fa14f42c1	Merge "Removing unused vp9_coeff_stats_model typedef."	2013-10-02 16:26:09 -07:00
Dmitry Kovalev	a88a0e88a4	Merge "Moving get_token_alloc function from common to the encoder."	2013-10-02 16:26:00 -07:00
Dmitry Kovalev	be7eec79be	Moving all idct/iht functions in one place. Moving functions from vp9_idct_blk to vp9_idct because these functions are used from both encoder and decoder. Removing duplicated code from vp9_encodemb.c and reusing existing functions. Change-Id: Ia0a6782f8c4c409efb891651b871dd4bf22d5fe8	2013-10-02 14:13:33 -07:00
Scott LaVarnway	20a09d928a	d153 intra prediction (16x16) ssse3 using bytes Change-Id: I8a106dd61b0a2520fae792d87d6348e662649b2d	2013-10-02 16:34:05 -04:00
Jingning Han	6d3bd96607	BITSTREAM - CLARIFICATION OF MV SIZE RANGE The codec should effectively run with motion vector of range (-2048, 2047) in full pixels, for sequences of 1080p and below. Add assertions to clarify this behavior. Change-Id: Ia0cac28249f587d8f8882205228fa480263ab313	2013-10-02 10:29:45 -07:00
Dmitry Kovalev	3c4e9e341f	Adding SSE2 optimized vp9_short_idct32x32_1_add function. Change-Id: I4b1c6bb9ff615f5872b96ed07dbf0f5e18e63643	2013-10-01 18:34:36 -07:00
Dmitry Kovalev	aeb603f2af	Making decode_modes_b function more straightforward. Moving out decode_tokens function calls and adding decode_blocks boolean variable. We only have to decode if eobtotal > 0, i.e. we have at least one non-zero coefficient. Also inlining and remove vp9_set_pred_flag_mbskip function. Change-Id: I7be38b12ee8206faf0beea2bbf4d52be42575b03	2013-10-01 15:41:30 -07:00
Yunqing Wang	03698aa6d8	Merge "Modify HORIZx16 macro in subpixel filter functions"	2013-10-01 14:18:10 -07:00
Yunqing Wang	df8e156432	Modify HORIZx16 macro in subpixel filter functions Interleaved the instructions, reduced register dependency, and prefetched the source data. This improved the decoder speed by 0.6% - 2%. Change-Id: I568067aa0c629b2e58219326899c82aedf7eccca	2013-10-01 12:49:25 -07:00
Dmitry Kovalev	0a5e9ee054	Moving get_token_alloc function from common to the encoder. Also renaming mb_row -> mi_row, mb_col -> mi_col arguments and calculate mb_rows/mb_cols values from mi_rows/mi_cols. Change-Id: I6919a279f560648e23bc9a12f507d17c21ffd5d7	2013-10-01 11:54:10 -07:00
Scott LaVarnway	27b390e1a1	d153 intra prediction ssse3 using bytes byte version of ronalds d153 ssse3 optimizations for 4x4 and 8x8 (commit: fc91a2a112238a1aee568f3b840585de4e928fca) Change-Id: Iec4426032311483f615fd9e0dceba3ee85ddebd7	2013-10-01 09:05:20 -04:00
Dmitry Kovalev	c982a73b9f	Removing unused vp9_coeff_stats_model typedef. Change-Id: I6973e7121b6393379b5759f288632e8eab763d3e	2013-09-30 15:10:00 -07:00
Dmitry Kovalev	c64e23832f	Adding const to function arguments. Function list: tx_counts_to_branch_counts_32x32 tx_counts_to_branch_counts_8x8 tx_counts_to_branch_counts_8x8 update_ct update_ct2 update_mode_probs Change-Id: I120d8945a34378cf285d6bd415e23de1d522cf2f	2013-09-30 14:50:15 -07:00
Dmitry Kovalev	cd945c7bd9	Merge "Removing vp9_add_constant_residual_{8x8, 16x16, 32x32} functions."	2013-09-30 13:16:34 -07:00
Dmitry Kovalev	548671dd20	Removing vp9_add_constant_residual_{8x8, 16x16, 32x32} functions. We don't need these functions anymore. The only one which was actually used is vp9_add_constant_residual_32x32. Addition of vp9_short_idct32x32_1_add eliminates this single usage. SSE2 optimized version of vp9_short_idct32x32_1_add will be added in the next patch set, right now it is only C implementation. Now we have all idct functions implemented in a consistent manner. Change-Id: I63df79a13cf62aa2c9360a7a26933c100f9ebda3	2013-09-30 10:56:37 -07:00
Jim Bankoski	4906fe45e2	Merge "systemdependent lint issue resolved"	2013-09-30 10:55:07 -07:00
Jim Bankoski	fd09be0984	Merge changes I2b2af1dd,Id2cc5c82 * changes: fixed cpp lint issue in vp9_postproc_x86 nolintify intrinsic idct file	2013-09-30 10:53:30 -07:00
Jim Bankoski	e3c1f0880f	Merge "cpplint issues in vp9_loopfilter.h"	2013-09-30 10:53:13 -07:00
Jim Bankoski	509ba98938	Merge "treecoder lint issues resolved"	2013-09-30 10:43:22 -07:00
Jim Bankoski	7ddd9f7f27	Merge "cpplint issue with entropymv.h"	2013-09-30 10:43:16 -07:00
Jim Bankoski	c424c5e808	Merge "cpplint issue with vp9_loopfilter_filters.c"	2013-09-30 10:43:05 -07:00
Jim Bankoski	282704145d	Merge "cpplint issue in blockd.h"	2013-09-30 10:42:45 -07:00
Jim Bankoski	58a09c32c2	Merge "common_data.h lint issues resolved"	2013-09-30 10:42:35 -07:00
Jim Bankoski	9e056fa094	Merge "vp9_loopfilter.c cpplint issues resolved."	2013-09-30 10:42:27 -07:00
Jim Bankoski	d2a4ddf982	Merge "cpplint issue resolved in vp9_pred_common.h"	2013-09-30 10:42:19 -07:00
Jim Bankoski	cbdcc215b3	Merge "resolved lint issues in default_coef_probs"	2013-09-30 10:42:12 -07:00
Jim Bankoski	d35e9a0c53	Merge "lint issues in mvref_common.c"	2013-09-30 10:41:50 -07:00
Jim Bankoski	14916b0ca6	Merge "vp9 convolve lint issues"	2013-09-30 10:41:43 -07:00
Jim Bankoski	4e5d99ca72	Merge "vp9_rtcd.c lint issues"	2013-09-30 10:41:32 -07:00
Jim Bankoski	bc1b089372	Merge changes Id58e2176,I7efc74ef * changes: cpplint issues in vp9_filter.h cpplint issues with onyxc_int.h	2013-09-30 10:41:23 -07:00
Jim Bankoski	0f8805e086	Merge "vp9_entropy.c lint issues"	2013-09-30 10:34:11 -07:00
Jim Bankoski	7f13b33a78	Merge "cpplint issues resolved in vp9_postproc.c"	2013-09-30 08:26:00 -07:00
Jim Bankoski	1a2f4fd2f5	Merge "fix lint issues in quant common"	2013-09-30 08:26:00 -07:00
Jim Bankoski	88251c86dc	Merge "fix cpplint issue in reconintra"	2013-09-30 08:26:00 -07:00
Jim Bankoski	777460329b	vp9_entropy.c lint issues Change-Id: I4e163cc4ce9ec2f3a5a8b9da478049c71b08d71f	2013-09-29 20:29:43 -07:00
Jim Bankoski	7019e34c34	vp9 convolve lint issues Change-Id: I8b496191c6a60a60a52c929adca305db47058a84	2013-09-29 19:44:05 -07:00
Jim Bankoski	f6d7e3679c	resolved lint issues in default_coef_probs Change-Id: I97bf241c0d981721cc74a50be47c9db8a00f6be3	2013-09-29 19:41:31 -07:00
Jim Bankoski	c66bfc70d1	treecoder lint issues resolved Change-Id: I442609f689aa9381e1e208012305cf62a6b31eee	2013-09-29 19:37:11 -07:00
Jim Bankoski	a57912f893	systemdependent lint issue resolved Change-Id: I07fbb32d5cee0003d04b2369cfafcb03c371cd4f	2013-09-29 19:34:44 -07:00
Jim Bankoski	8f229caf87	lint issues in mvref_common.c Change-Id: If6a7a8c48fefc69349c792d8ed52a6e1d374e46e	2013-09-29 19:32:53 -07:00
Jim Bankoski	623e163f84	vp9_rtcd.c lint issues Change-Id: I58209ae96d21c56cbb8ef796940b6ca3b3ebfa72	2013-09-29 19:29:58 -07:00
Jim Bankoski	c288b94ab9	common_data.h lint issues resolved Change-Id: I1fd79093a5b9cb40c9e877b6b71c25a07a69b3ae	2013-09-29 19:28:32 -07:00
Jim Bankoski	03df17070b	vp9_loopfilter.c cpplint issues resolved. Change-Id: Idfa17d120ec4edf542e424fa0deb769951afbf4a	2013-09-29 19:04:21 -07:00
Jim Bankoski	6249a5b17e	cpplint issue with vp9_loopfilter_filters.c Change-Id: I13aa43df6bff340b5768d69125b473a52d1d59bd	2013-09-29 19:03:00 -07:00
Jim Bankoski	855d078f95	cpplint issue with entropymv.h Change-Id: I3556738d27def6a5bd71577728050a1e2bb1de63	2013-09-29 19:01:46 -07:00
Jim Bankoski	2b5bf7b8d8	cpplint issue in blockd.h Change-Id: Ia41e1966431652b839134a1c27feccb25c762539	2013-09-29 19:00:40 -07:00
Jim Bankoski	716d37f8bf	fixed cpplint issue with vp9_scale.h Change-Id: Ia7969baac7ffc6d7a0e8e8e83e9252d077a3c5b3	2013-09-29 18:58:58 -07:00
Jim Bankoski	2ecd0dae1e	vp9_entropymv.c cpplint issues resolved Change-Id: Ic5807152cc78127b3f84b5abb4c5f3ef6d06ce65	2013-09-29 18:57:35 -07:00
Jim Bankoski	7a59efe7f8	cpplint issues resolved in vp9_postproc.c Change-Id: If61380115163a02ecfe74b82e116001ac54e20e2	2013-09-29 18:52:29 -07:00
Jim Bankoski	152fd59964	fixed cpp lint issue in vp9_postproc_x86 Change-Id: I2b2af1dd9f5c29c05e28a4fd51fa58ccc4071477	2013-09-29 18:44:58 -07:00
Jim Bankoski	ec421b7810	nolintify intrinsic idct file Change-Id: Id2cc5c829399a2afdf7a8a82615a4e272c814986	2013-09-29 18:42:24 -07:00
Jim Bankoski	31ceb6b13c	cpplint issues in vp9_loopfilter.h Change-Id: Ib142f9c5130aa5f0e1fc76e1c4f51cd66c73dcc7	2013-09-29 18:36:42 -07:00
Jim Bankoski	11cf0c39c9	cpplint issues in vp9_filter.h Change-Id: Id58e21760c7948a2b020c9623c38cf007874d43e	2013-09-29 18:34:41 -07:00
Jim Bankoski	01d43aaa24	cpplint issue resolved in vp9_pred_common.h Change-Id: Ibacac91c2192fcfbd9e411ae141dd00445566efe	2013-09-29 18:17:06 -07:00
Jim Bankoski	ab03c00504	cpplint issues with onyxc_int.h Change-Id: I7efc74ef53139bbaa6ec4f01482d9d9b362be27b	2013-09-29 18:10:03 -07:00
Jim Bankoski	eb506a6590	cpplint fixes to debug modes Change-Id: I1c3943cd5db6cd8fc759116a3717dba3c030fa0d	2013-09-29 18:04:48 -07:00
Jim Bankoski	fb6e6cd24d	fix cpplint issue in reconintra Change-Id: I934f9cfb96ce4f5f266b025064237875dcd92b3a	2013-09-29 18:02:42 -07:00
Jim Bankoski	d052117319	fix lint issues in quant common Change-Id: I135ee6e8df91262f813c474b24f14381a4064e02	2013-09-29 17:59:43 -07:00
Jim Bankoski	efc8638890	cpplint issues in vp9_onyx.h Change-Id: I0b5af849833ac077bd4de71a24af8f8bd7ec06d6	2013-09-29 17:50:18 -07:00
Dmitry Kovalev	b927620231	Merge "Using is_inter_block and has_second_ref functions."	2013-09-29 12:14:41 -07:00
Dmitry Kovalev	b3d3578ee4	Merge "Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add."	2013-09-29 12:01:50 -07:00
Dmitry Kovalev	7343681675	Merge "Removing vp9_get_coef_neighbors_handle function."	2013-09-29 12:01:36 -07:00
Dmitry Kovalev	efbacc9f89	Merge "Removing vp9_subpelvar.h from common."	2013-09-29 12:00:46 -07:00
Dmitry Kovalev	b10e6b2943	Removing unnecessary function calls. Both vp9_init_mbmode_probs() and vp9_zero(cm->ref_frame_sign_bias) are called inside vp9_setup_past_independence(), which is called in any case for encoder/decoder after VP9_COMMON struct creation. Change-Id: I3724d1a4fb8060101ff0290dd6a158f0b5c57bb4	2013-09-27 17:42:05 -07:00
Dmitry Kovalev	3fab2125ff	Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add. Making name consistent with vp9_short_idct8x8 and vp9_short_idct8x8_1. Change-Id: I99e0be040ec893f9571dcf090e18f98dc58339f5	2013-09-27 15:26:27 -07:00
Christian Duvivier	b1b4ba1bdd	Properly save neon registers. Replace current code which corrupts the stack by duplicate of vp8 code to save and restore neon registers. Change-Id: Ibb0220b9aa985d10533befa0a455ebce57a2891a	2013-09-27 14:25:33 -07:00
Dmitry Kovalev	209c6cbf8f	Removing vp9_get_coef_neighbors_handle function. Change-Id: I6be72c8b048d1ccc7ef43764cf84c32360098970	2013-09-27 14:11:13 -07:00
Dmitry Kovalev	db60c02c9e	Merge "Renaming vp9_short_idct10_16x16 to vp9_short_idct16x16_10."	2013-09-27 13:08:52 -07:00
Scott LaVarnway	35830879db	Merge "d63 intra prediction ssse3 using bytes"	2013-09-27 07:21:08 -07:00
Dmitry Kovalev	15a36a0a0d	Renaming vp9_short_idct10_16x16 to vp9_short_idct16x16_10. Making function name consistent with vp9_short_idct16x16 and vp9_short_idct16x16_1. Change-Id: I70e54be9e6b9a1dddab0de470686591e96d05517	2013-09-26 14:01:25 -07:00
Christian Duvivier	5b1dc1515f	Fix a bunch of TODO from vp9_short_idct32x32_add_neon. - full ASM version, no more C gateway file. - integrate combine-add with last step of 2nd pass. - remove a few push/pop pairs. - some instruction reordering to hide latency. Change-Id: Ic9d9933c908b65d1bf7ba8fd47b524cda808c9c6	2013-09-25 21:15:19 -07:00
Dmitry Kovalev	eda4e24c0d	Using is_inter_block and has_second_ref functions. Change-Id: I60dee58a4fd24d3c4f3c101a49d30e217309f43a	2013-09-25 19:03:04 -07:00
Dmitry Kovalev	64eff7f360	Removing vp9_subpelvar.h from common. Moving all code from that file to vp9_variance_c.c in the encoder. Change-Id: Ic803d5b4c78d5191e4d25541b3df97337878fc3e	2013-09-25 16:10:43 -07:00
Dmitry Kovalev	49f5efa8d8	Removing unused SUBMVREF_COUNT constant. Change-Id: I302ab4603553352a84b57bc89bc9e3d037978d29	2013-09-25 15:33:05 -07:00
Scott LaVarnway	208658490c	d63 intra prediction ssse3 using bytes byte version of ronalds d63 ssse3 optimizations (commit: c5a1c8cf3541cf3665fee981b36d22c9fbd4191e) Change-Id: Ifd3e6d454a2246085f23eabb38518a930321e807	2013-09-25 16:16:44 -04:00
Dmitry Kovalev	d571e4e785	Replacing unsigned char* with uint8_t*. Change-Id: I99a1880aee015ae16311ba05a31aa307df89bef2	2013-09-24 14:57:42 -07:00
Yaowu Xu	71cfaaa689	Merge "Replace memcpy with vpx_memcpy"	2013-09-24 11:35:03 -07:00
Yaowu Xu	9be0bb19df	Replace memcpy with vpx_memcpy Also removed obsolete comment Change-Id: Iae1664777d76383639c637ee786e0d50fc45819a	2013-09-24 10:56:06 -07:00
Yaowu Xu	6037f17942	Rename defined constants The change is to better reflect the nature of the constants. Change-Id: Icabac6e9bceefbdb3f03f8218f88ef75943c30fb	2013-09-24 10:53:01 -07:00
Dmitry Kovalev	f24b9b4f87	Merge "Adding best_mv[2] array instead of two variables."	2013-09-24 10:17:53 -07:00
Johann	a6a00fc6a3	Use lowercase instruction in assembly The iOS compiler does not recognize BLE: bad instruction `BLE idct32_transpose_pair_loop' Change-Id: I7426694c66bc31caf939a2d5000968da1222c15b	2013-09-20 16:11:05 -07:00
Dmitry Kovalev	bb5e2bf86a	Adding best_mv[2] array instead of two variables. Change-Id: I584fe50f73879f6a72fada45714ef80893b6d549	2013-09-20 17:08:53 +04:00
Dmitry Kovalev	24df77e951	Merge "Adding get_scan_and_band function."	2013-09-20 00:15:06 -07:00
Yaowu Xu	014acfa2af	fix integer overflow errors Change-Id: I76f440a917832c02d7a727697b225bac66b99f56	2013-09-19 08:14:26 -07:00
Dmitry Kovalev	a23c2a9e7b	Adding get_scan_and_band function. Extracting get_scan_and_band function from get_entropy_context to remove duplicated code. Change-Id: I5da1f5a60263017e887da68bc834317b5f084cb2	2013-09-19 16:53:48 +04:00
Yunqing Wang	a7b7f94ae8	Merge "Fix x86inc.asm to build PIC code correctly"	2013-09-18 14:51:31 -07:00
Yunqing Wang	9d901217c6	Fix x86inc.asm to build PIC code correctly Current x86inc.asm didn't handle 32bit PIC build properly. TEXTRELs were seen in the built library. The PIC macros from libvpx's x86_abi_support.asm were used to fix this problem. The assembly code was modified to use the macros. Notes: We need this fix in for decoder building. Functions in encoder will be fixed later. Change-Id: Ifa548d37b1d0bc7d0528db75009cc18cd5eb1838	2013-09-18 13:45:46 -07:00
Yaowu Xu	85fd8bdb01	Merge "Silence a bunch of MSVC warnings"	2013-09-17 17:10:58 -07:00
Yaowu Xu	a783da80e7	Silence a bunch of MSVC warnings Change-Id: I16633269582a640809dca27572bbe99efa6369fc	2013-09-17 12:08:51 -07:00
Jingning Han	2b3bfaa9ce	Remove redundant argument in get_sub_block_mv The sub8x8 check can be directly inferred from block_idx, hence removed from the arguments of get_sub_block_mv. Change-Id: Ib766d57e81248fb92df0f6d9b163e6c77b933ccd	2013-09-17 12:08:45 -07:00
hkuang	23e1a29fc7	Speed up iht8x8 by rearranging instructions. Speed improves from 282% to 302% faster based on assembly-perf. Change-Id: I08c5c1a542d43361611198f750b725e4303d19e2	2013-09-16 14:23:26 -07:00
James Zern	2d58761993	Revert "Improved 8t filters" This is incompatible with most toolchains other than gcc. Revert "Deleted #include <inttypes.h>" This reverts commit `4d018be950`. This reverts commit `d22a504d11`. Change-Id: I1751dc6831f4395ee064e6748281418e967e1dcf	2013-09-13 15:13:06 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Paul Wilkins	9c9a3b2775	Merge "Deleted #include <inttypes.h>"	2013-09-13 01:05:31 -07:00
hkuang	86fb12b600	Merge "Add neon optimize iht8x8 which is 282% faster than C."	2013-09-12 15:42:44 -07:00
hkuang	182366c736	Add neon optimize iht8x8 which is 282% faster than C. Change-Id: I963dd4a6e8671957403ccbb9a16ea7de703e3530	2013-09-12 11:49:05 -07:00
Paul Wilkins	4d018be950	Deleted #include <inttypes.h> This seems not to be needed and is not supported in the Windows build. Change-Id: Iaca3bbf8cca283aee6bc336cb31ba9dd4610322b	2013-09-12 13:43:07 +01:00
Christian Duvivier	6a501462f8	First draft of vp9_short_idct32x32_add_neon. Lots of TODO which will be taken care of in upcoming changes. As is, about 6x faster than C version. Change-Id: Ie2557b72fd2d8edca376dbf400a4d173aa5e63e0	2013-09-11 15:19:38 -07:00
Scott LaVarnway	23845947c4	Merge "Improved 8t filters"	2013-09-11 14:34:54 -07:00
Scott LaVarnway	d22a504d11	Improved 8t filters Reformatted version of a patch submitted by Erik/Tamar from Intel. For the test clips used, the decoder performance improved by ~2%. Change-Id: Ifbc37ac6311bca9ff1cfefe3f2e9b7f13a4a511b	2013-09-11 13:56:32 -04:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now consists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), which eliminates unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Yunqing Wang	079183c1a8	code cleanup Removed unused function. Change-Id: Icb12a09e4d303968be6aec9fae1ef05935913a4f	2013-09-11 09:32:00 -07:00
hkuang	f4a6f936b5	Merge "Speed up idct16x16 by rearranging instructions."	2013-09-10 08:23:57 -07:00
hkuang	fc5ec206a7	Speed up idct16x16 by rearranging instructions. Speed improves from 376% to 400% faster based on assembly-perf. Change-Id: If0b2eccc39d5793dc101ce9feb7fcadf88396ea2	2013-09-09 18:00:13 -07:00
Ivan Maltz	20abe595ec	Merge "API extensions and sample app for spatial scalable encoder"	2013-09-09 16:57:01 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spatial scalable encoder Sample app: vp9_spatial_scalable_encoder vpx_codec_control extensions: VP9E_SET_SVC VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER VP9E_SET_MIN_Q, VP9E_SET_MAX_Q expanded buffer size for vp9_convolve modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS) Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
James Zern	c1913c9cf4	Merge "Revert "New mode_info_context storage""	2013-09-09 14:38:01 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Yaowu Xu	b19126b291	Merge "Reduce the amount of extension in src frames"	2013-09-09 08:09:56 -07:00
Yaowu Xu	65c2444e15	Reduce the amount of extension in src frames The commit changes the border pixel extension from 160 pixel each side to what is necessary in arnr filter or motion estimation portion, i.e. 16 pixel on top and left side. For right or bottom side, the extension is changed to either round up image size to multiple of 64 or at least 16 pixels. Change-Id: Ic05e19b94368c1ab4df568723aae5734e6c3d2c5	2013-09-08 15:51:54 -07:00
Jim Bankoski	e378566060	Merge "New mode_info_context storage"	2013-09-08 07:16:25 -07:00
Deb Mukherjee	e378a89bd6	Support a constant quality mode in VP9 Adds a new end-usage option for constant quality encoding in vpx. This first version implemented for VP9, encodes all regular inter frames using the quality specified in the --cq-level= option, while encoding all key frames and golden/altref frames at a quality better than that. The current performance on derfraw300 is +0.910% up from bitrate control, but achieved without multiple recode loops per frame. The decision for qp for each altref/golden/key frame will be improved in subsequent patches based on better use of stats from the first pass. Further, the qp for regular inter frames may also be varied around the provided cq-level. Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3	2013-09-06 10:30:53 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now consists of a pointer to a MODE_INFO struct and an "in the image" flag. The MODE_INFO structs are now stored as a stream, which eliminates unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Jim Bankoski	e4e864586c	Merge "fix loop filter setup_mask could reach out of bounds issue"	2013-09-06 06:21:28 -07:00
hkuang	3476404912	Merge "Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf."	2013-09-05 17:37:13 -07:00
Jim Bankoski	736114f44b	fix loop filter setup_mask could reach out of bounds issue Change-Id: Ic8446c4f26b6782a6dc482c19ea73c77646df418	2013-09-05 15:53:31 -07:00
Jingning Han	1c263d6918	Merge "Use saturated addition in SSSE3 of 32x32 quant"	2013-09-05 14:09:40 -07:00
Jim Bankoski	2156ccaa4a	Merge "resolve clang warnings : uninitialized vars in vp9_entropy.h"	2013-09-05 12:55:32 -07:00
Jingning Han	458c2833c0	Use saturated addition in SSSE3 of 32x32 quant The 32x32 forward transform can potentially reach a peak coefficient value close to 32700, while the rounding factor can go up to 610. This could cause an overflow issue in the SSSE3 implementation of the 32x32 quantization process. This commit resolves the issue by replacing the addition operations with saturated addition operations in 32x32 block quantization. Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70	2013-09-05 12:49:12 -07:00
Jim Bankoski	9fc3d32a50	Merge "faster accounting of inc_mv"	2013-09-05 12:38:56 -07:00
Jim Bankoski	2e4ca9d1a5	resolve clang warnings : uninitialized vars in vp9_entropy.h This helps clear out some of the warnings Change-Id: Ie7ccaca8fd92542386a7f1b257398e1bdf2f55dc	2013-09-04 18:38:41 -07:00
Jim Bankoski	e8feb2932f	Merge "wrap non420 loop filter code in macro"	2013-09-04 17:20:53 -07:00
hkuang	01c4e04424	Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf. Change-Id: I3e2cc818ec14b432204ff43732f39b6438db685d	2013-09-04 15:57:22 -07:00
hkuang	3c05bda058	Merge "Add neon optimize vp9_short_iht4x4_add."	2013-09-04 13:35:09 -07:00
hkuang	3b8614a8f6	Add neon optimize vp9_short_iht4x4_add. Change-Id: I42c497b68ae1ee645b59c9968ad805db0a43e37e	2013-09-04 12:37:58 -07:00
Jim Bankoski	872c6d85c0	Merge "speed up inc_mv_component"	2013-09-04 10:35:51 -07:00
Jim Bankoski	bb2313db28	Merge "make vp9 postproc a config option"	2013-09-04 10:35:26 -07:00
Jim Bankoski	c3c21e3c14	wrap non420 loop filter code in macro Change-Id: I62bca0e7a4bffc1a78b750dbb9df9d2378e92423	2013-09-04 10:24:42 -07:00
Jim Bankoski	79401542f7	make vp9 postproc a config option Vp9 postproc is disabled for now as it's not been shown to help and may be merged with vp8. Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057	2013-09-04 10:02:08 -07:00
Jim Bankoski	532179e845	faster accounting of inc_mv Moves counting of mv branches to where we have a new mv, instead of after the whole frame is summed. Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd	2013-09-04 09:47:57 -07:00
Jim Bankoski	5dda1d2394	speed up inc_mv_component Convert mv_class if statements to look up. re order to avoid ifs... Change-Id: I76966a21bf517bb1f9a7957c08c476c7bb3e9a63	2013-09-04 07:11:30 -07:00
James Zern	1cf2272347	Merge "Fix intermediate height in convolve_c"	2013-09-03 15:50:33 -07:00
Jingning Han	010c0ad0eb	Merge "Fix 32x32 forward transform SSE2 version"	2013-09-03 08:58:03 -07:00
Scott LaVarnway	948aaab4ca	Merge "Improved mb_lpf_horizontal_edge_w_sse2_8"	2013-09-03 05:44:01 -07:00
Jingning Han	3cf46fa591	Fix 32x32 forward transform SSE2 version This commit fixed the potential overflow issue in the SSE2 implementation of 32x32 forward DCT. It resolved the corrupted coded frames in the border of scenes. Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9	2013-08-31 18:47:08 -07:00
Tero Rintaluoma	e326cecf18	Fix intermediate height in convolve_c - Intermediate height was not correct i.e. when block size is 4 and y_step_q4 is 6. In this case intermediate height was (4*6) >> 4 = 1 and vertical interpolation needs two source pixels plus 7 extra pixels for taps. - Also if the current output block is 16x16 and we are using 4x upscaling we need only 12 rows after horizontal filtering instead of 16. Patch Set 2: Intermediate_height updated after CL 66723 "Fix bug in convolution functions (filter selection)" Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589	2013-08-30 10:31:21 +03:00
Jim Bankoski	1d44fc0c49	Merge "rework filter_block_plane"	2013-08-29 20:11:09 -07:00
Jim Bankoski	bc50961a74	rework filter_block_plane Change-Id: I55c3b60c4c0f4910d3dfb70e3edaae00cfa8dc4d	2013-08-29 17:00:05 -07:00
Jingning Han	c86c5443eb	Merge "Fix overflow issue in SSSE3 32x32 quantization"	2013-08-29 16:49:04 -07:00
James Zern	d765df2796	consistently name VP9_COMMON variables #3 stragglers Change-Id: Ib1e853f9a331b7b66639dc34d79568d84d1930f1	2013-08-29 13:27:41 -07:00
James Zern	aa05321262	consistently name VP9_COMMON variables #2 oci -> cm Change-Id: Ifd75c809d9cc99034d3c2fccc4653a78b3aec21f	2013-08-29 13:25:58 -07:00
James Zern	924d74516a	consistently name VP9_COMMON variables #1 pc -> cm Change-Id: If3e83404f574316fdd3b9aace2487b64efdb66f3	2013-08-29 13:25:57 -07:00
Jingning Han	abff678866	Fix overflow issue in SSSE3 32x32 quantization The 32x32 quantization process can potentially have the intermediate stacks over 16-bit range, thereby causing enc/dec mismatch. This commit fixes this overflow issue in the SSSE3 implementation, as well as the prototype, of 32x32 quantization. This fixes issue 607 from webm@googlecode. Change-Id: I85635e6ca236b90c3dcfc40d449215c7b9caa806	2013-08-29 11:00:54 -07:00
Scott LaVarnway	22dc946a7e	Improved mb_lpf_horizontal_edge_w_sse2_8 This patch is a reformatted version of optimizations done by engineers at Intel (Erik/Tamar) who have been providing performance feedback for VP9. For the test clips used (720p, 1080p), up to 1.2% performance improvement was seen. Change-Id: Ic1a7149098740079d5453b564da6fbfdd0b2f3d2	2013-08-29 08:30:17 -04:00
Dmitry Kovalev	851a2fd72c	Renaming txfm_size to tx_size. Change-Id: I752e374867d459960995b24d197301d65ad535e3	2013-08-27 19:47:53 -07:00
Dmitry Kovalev	1d3f94efe2	Merge "Adding get_entropy_context function."	2013-08-27 17:02:36 -07:00
Frank Galligan	7d058ef86c	Merge "Fix winodws warning."	2013-08-27 15:39:58 -07:00
Frank Galligan	f1560ce035	Fix winodws warning. Const is not needed on the function parameter. Change-Id: I38c2a7317cb6f42f70bbddfde9a2cd18d65ceb1c	2013-08-27 15:19:55 -07:00
Dmitry Kovalev	a93992e725	Adding get_entropy_context function. Moving common code from encoder and decoder to this function. Change-Id: I60fa643fb1ddf7ebbff5e83b6c4710137b0195ef	2013-08-27 14:17:53 -07:00
hkuang	3a679e56b2	Add neon optimize vp9_short_idct16x16_1_add. Change-Id: Ib9354c1d975d03e8081df20d50b6a77dfe2dc7e5	2013-08-27 14:00:27 -07:00
hkuang	ce04b1aa62	Merge "Add neon optimize vp9_short_idct8x8_1_add."	2013-08-27 12:10:07 -07:00
Dmitry Kovalev	7b95f9bf39	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the encoder. Change-Id: I62bb07c377f947cb72fac68add7a6b199e42c6b9	2013-08-27 11:05:08 -07:00
Dmitry Kovalev	12e5931a9a	Merge "Using existing functions instead of raw expressions."	2013-08-27 10:33:34 -07:00
Dmitry Kovalev	bfebe7e927	Merge "Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder."	2013-08-27 10:15:21 -07:00
Dmitry Kovalev	78e670fcf8	Merge "Renaming D27 to D207."	2013-08-27 10:03:57 -07:00
hkuang	36e9b82080	Add neon optimize vp9_short_idct8x8_1_add. Change-Id: I0b15d5e3b0eb97abb9ab5ec08e88b61f8723aaf4	2013-08-26 16:28:57 -07:00
hkuang	69384f4fad	Add neon optimize vp9_short_idct4x4_1_add. Change-Id: I6ecb5c4a1a472feb8e84e9f3352b536d5e28a4a5	2013-08-26 15:55:16 -07:00
Dmitry Kovalev	45870619f3	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder. Adding temporary "typedef BLOCK_SIZE BLOCK_SIZE_TYPE" which will go away after encoder's patch. Change-Id: I06ec6a6f079401439843ec981d1496234fd7775c	2013-08-26 11:33:16 -07:00
Jingning Han	4681197a58	Merge "Temporarily disable SSSE3 quant_32x32"	2013-08-26 11:19:53 -07:00
Jingning Han	166dc85bed	Temporarily disable SSSE3 quant_32x32 Make the current head working properly, while working on fixing an issue in the SSSE3 implementation of 32x32 quantization. Change-Id: Ic029da3fd7f1f5e58bc641341cbd226ec49a16bc	2013-08-26 10:45:59 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
Dmitry Kovalev	50ee61db4c	Renaming D27 to D207. I've already renamed d27_predictor to d207_predictor but forgot about the corresponding constant. Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857	2013-08-23 17:33:48 -07:00
Dmitry Kovalev	480dd8ffbe	Using existing functions instead of raw expressions. Change-Id: Ifa50b04bac1a6ff2abef989073cbf1f37a89eb50	2013-08-23 17:26:53 -07:00
Dmitry Kovalev	e6c435b506	Merge "Cleanup in mvref_common.{h, c}."	2013-08-23 17:09:49 -07:00
Yaowu Xu	13930cf569	Limit mv range to be based on partition size Previous change `c4048dbd` limits the mv search range assuming a max block size of 64x64; this commit changes the search range to use the actual block size instead. Change-Id: Ibe07ab02b62bf64bd9f8675d2b997af20a2c7e11	2013-08-23 15:43:57 -07:00
Adrian Grange	78debf246b	Merge "Fix bug in convolution functions (filter selection)"	2013-08-23 13:41:47 -07:00
Dmitry Kovalev	21d8e8590b	Cleanup in mvref_common.{h, c}. Making code more compact, adding consts, removing redundant arguments, adding do/while(0) for macros. Change-Id: Ic9ec0bc58cee0910a5450b7fb8cfbf35fa9d0d16	2013-08-23 12:00:30 -07:00
Adrian Grange	3f10831308	Fix bug in convolution functions (filter selection) (In response to Issue 604: https://code.google.com/p/webm/issues/detail?id=604) There were bugs in the convolution code for two cases: 1. Where the filter table was assumed to be aligned to a 256 byte boundary. The offset of the pixel in the source buffer was computed incorrectly. 2. Where no such alignment assumption was made. An incorrect address for the filter table base was used. To fix both problems, I now assume that the filter table is 256-byte aligned and modify the pixel offset calculation to match. A later patch should remove the restriction that the filter table is aligned to a 256-byte boundary. There was also a bug in the ConvolveTest unit test (convolve_test.cc). (Bug & initial fix suggestion submitted by Tero Rintaluoma and Sami Pietilä). Change-Id: I71985551e62846e55e40de9e7e3959d4805baa82	2013-08-23 11:16:08 -07:00
Dmitry Kovalev	1c159c470a	Merge "Checking scale factors on access."	2013-08-23 11:05:17 -07:00
hkuang	b85367a608	Merge "Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling."	2013-08-23 10:08:43 -07:00
James Zern	d843ac5132	Merge "rename LOG2_* defines to *_LOG2"	2013-08-22 18:02:42 -07:00
Dmitry Kovalev	53f6f8ac93	Merge "check_bsize_coverage cleanup."	2013-08-22 16:18:24 -07:00
hkuang	4205d79273	Merge "Add neon optimize vp9_short_idct10_16x16_add."	2013-08-22 15:57:28 -07:00
hkuang	4082bf9d7c	Add neon optimize vp9_short_idct10_16x16_add. vp9_short_idct10_16x16_add is used to handle blocks that only have valid data in the top-left 4x4 block. All other data are 0, so we can cut many unnecessary calculations to save instructions. Change-Id: I6e30a3fee1ece5af7f258532416d0bfddd1143f0	2013-08-22 15:53:22 -07:00
Dmitry Kovalev	335b1d360b	check_bsize_coverage cleanup. Change-Id: Ib7803857b35c00e317c9deb8630e777e25eb278f	2013-08-22 15:45:56 -07:00
Dmitry Kovalev	3c42657207	Checking scale factors on access. It is possible to have invalid scale factors and not access them during decoding. Error is reported if we really try to use invalid scale factors. Change-Id: Ie532d3ea7325ee0c7a6ada08269f804350c80fdf	2013-08-22 15:19:05 -07:00
James Zern	40ae02c247	rename LOG2_* defines to *_LOG2 gets rid of a mix of styles Change-Id: I3591d312157bc6f53a25438bf047765c671fd8a8	2013-08-22 14:45:24 -07:00
Dmitry Kovalev	640dea4d9d	Adding vp9_is_scaled function. Change-Id: Ieb7077ca3586b9491912027eed450a4f6fd38d30	2013-08-22 14:04:59 -07:00
hkuang	610642c130	Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling. Change-Id: I5ea881a6e419f9e8ed4b3b619406403b4de24134	2013-08-22 11:02:22 -07:00
Dmitry Kovalev	96a1a59d21	Merge "Using has_second_ref function to simplify the code."	2013-08-22 01:39:14 -07:00
Dmitry Kovalev	a33f178491	Merge "Cleaning up foreach_transformed_block_in_plane."	2013-08-22 01:37:21 -07:00
Dmitry Kovalev	359b571448	Merge "Cleaning up reset_skip_context function."	2013-08-22 01:36:25 -07:00
Dmitry Kovalev	596c51087b	Merge "Removing unused foreach_predicted_block function."	2013-08-22 01:35:41 -07:00
Dmitry Kovalev	4172d7c584	Cleaning up foreach_transformed_block_in_plane. Change-Id: I9f45af3894c57f35cb266c255e2b904295d39c34	2013-08-21 17:16:02 -07:00
Dmitry Kovalev	c43da352ab	Cleaning up reset_skip_context function. Change-Id: Ib3e72671eb8da6f2e9767a6de292ec7c7cde6bc7	2013-08-21 16:31:51 -07:00
Dmitry Kovalev	3286abd82e	Merge "Adding scale factor check."	2013-08-21 14:11:13 -07:00
James Zern	ac12f3926b	Merge "vp9 rtcd: remove non-existent sad functions"	2013-08-21 13:55:59 -07:00
Dmitry Kovalev	27a984fbd3	Removing a lot of duplicated code. Adding set_contexts contexts function and call it instead of set_contexts_on_border. Calling txfrm_block_to_raster_xy to get aoff and loff. Change-Id: I41897e344afd2cae1f923f4fdbe63daccf6fe80e	2013-08-21 11:55:12 -07:00
Dmitry Kovalev	a3ae4c87fd	Adding scale factor check. We support only [1/16, 2] scale factors, enforcing this now. Change-Id: I0822eb7cea51720df6814e42d3f35ff340963061	2013-08-21 11:24:47 -07:00
Adrian Grange	ce28d0ca89	Fix typos and minor stylistic cleanup Change-Id: I32e43474e8651ef2eb181d24860a8f118cfea7bf	2013-08-21 08:45:42 -07:00
Adrian Grange	5b63963573	Merge "Further correct bug in loopfilter initialization"	2013-08-21 07:17:43 -07:00
James Zern	ae455fabd8	vp9 rtcd: remove non-existent sad functions vp9_sad32x3, vp9_sad3x32 + remove unnecessary sad include from vp9_findnearmv.c Change-Id: Idef2a89cadc3fec64eff82ba9be60ffff50b3468	2013-08-20 18:07:53 -07:00
Dmitry Kovalev	90027be251	Removing unused foreach_predicted_block function. Moving foreach_predicted_block_in_plane function to vp9_reconinter.c because there is only one usage. Change-Id: I9852feae43fc3cf809b817fc541d043bc5496209	2013-08-20 17:20:47 -07:00
Dmitry Kovalev	7f814c6bf8	Merge "Passing plane_bsize to foreach_transformed_block_visitor."	2013-08-20 14:25:01 -07:00
Dmitry Kovalev	27de4fe922	Using has_second_ref function to simplify the code. Updating implementation of vp9_get_pred_context_single_ref_p2 using has_second_ref function to make code easier to read. Change-Id: I5ba642712f59861a48aab974e73aa01640d086fe	2013-08-20 14:09:56 -07:00
hkuang	62a2cd9ed2	Merge "Add neon optimize vp9_short_idct10_8x8_add."	2013-08-20 14:06:57 -07:00
Dmitry Kovalev	d19ac4b66d	vp9_filter.{h, c} cleanup + adding SUBPEL_TAPS constant. Change-Id: Ib394ea23f464591dad50b5c65c316701378d06d7	2013-08-20 12:29:57 -07:00
hkuang	37cda6dc4c	Add neon optimize vp9_short_idct10_8x8_add. vp9_short_idct10_8x8_add is used to handle blocks that only have valid data in the top-left 4x4 block. All other data are 0, so we can cut several unnecessary calculations to save instructions. Change-Id: I34fda95e29082b789aded97c2df193991c2d9195	2013-08-20 11:51:07 -07:00
Dmitry Kovalev	5826407f2a	Merge "Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c."	2013-08-20 10:06:22 -07:00
Dmitry Kovalev	5baf510f74	Merge "Adding has_second_ref function."	2013-08-20 10:06:14 -07:00
Dmitry Kovalev	039b0c4c9e	Merge "Adding VP9_FILTER_BITS constant."	2013-08-20 10:05:09 -07:00
Jim Bankoski	f167433d9c	fix the mv_ref_idx issue The following issue was reported : https://code.google.com/p/webm/issues/detail?id=601&q=jimbankoski&sort=-id&colspec=ID%20Pri%20mstone%20ReleaseBlock%20Type%20Component%20Status%20Owner%20Summary This code makes the choice and code cleaner and removes any question about whether the border needs to be checked. Change-Id: Ia7aecfb3168e340618805bd318499176c2989597	2013-08-20 08:14:52 -07:00
Dmitry Kovalev	2612b99cc7	Adding VP9_FILTER_BITS constant. Removing VP9_FILTER_WEIGHT, VP9_FILTER_SHIFT, BLOCK_WIDTH_HEIGHT constants. Using ROUND_POWER_OF_TWO for rounding. Change-Id: I2e8d6858dcd600a87096138209731137d7decc24	2013-08-20 00:42:25 -07:00
Dmitry Kovalev	d8286dd56d	Adding has_second_ref function. Updating implementation of vp9_get_pred_context_single_ref_p1 using has_second_ref function to make code easier to read. Change-Id: Ie8f60403a7195117ceb2c6c43176ca9a9e70b909	2013-08-19 18:39:34 -07:00
Yaowu Xu	c4048dbdd3	Change to limit the mv search range As the pixel values beyond image border are duplicates of pixels on edge, the change limits the mv search range, any mv beyond the limits no longer produce new/different prediction values as entire block with pixels used for subpel interpolation are outside image border. Change-Id: I4c6fdf06e33c1cef1489f5470ce0fb4e5e01fb79	2013-08-19 17:19:36 -07:00
Dmitry Kovalev	569ca37d09	Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c. Change-Id: Ib8af21f2e7f603c2fb407e5d15a3bba64b545b49	2013-08-19 16:44:10 -07:00
Dmitry Kovalev	82d4d9a008	Passing plane_bsize to foreach_transformed_block_visitor. Updating all foreach_transformed_block_visitor functions to work with plane block size instead of general block. Removing a lot of duplicated code. Change-Id: I6a9069e27528c611f5a648e1da0c5a5fd17f1bb4	2013-08-19 15:47:24 -07:00
Dmitry Kovalev	2e3478a593	Using plane_bsize instead of bsize. This change set is intermediate. The next one will remove all repetitive plane_bsize calculations, because it will be passed as argument to foreach_transformed_block_visitor. Change-Id: Ifc12e0b330e017c6851a28746b3a5460b9bf7f0b	2013-08-19 13:20:21 -07:00
Adrian Grange	5a1a269f67	Further correct bug in loopfilter initialization The intent was to initialize the deltas for the segment to the computed value, irrespective of mode and reference frame if (mode_ref_delta_enabled == 0). (In response to bug posted by Manjit Hota to codec-devel and webm-discuss lists) Change-Id: I10435cb63d0f88359bb4c14f22181878a1988e72	2013-08-19 11:58:52 -07:00
Dmitry Kovalev	26e5b5e25d	Removing unused or redundant arguments from *_args structures. Redundant dst, pre[2] from build_inter_predictors_args, unused cm from encode_b_args. Change-Id: I2c476cd328c5c0cca4c78ba451ca6ba2a2c37e2d	2013-08-16 12:51:20 -07:00
Dmitry Kovalev	367cb10fcf	Merge "Moving from ss_txfrm_size to tx_size."	2013-08-16 12:46:45 -07:00
Dmitry Kovalev	1462433370	Merge "Renaming d27 predictor to d207."	2013-08-16 12:07:24 -07:00
Johann	d514b778c4	Merge "Reduce the instructions of idct8x8. Also add the saving and restoring of D registers."	2013-08-16 11:30:21 -07:00
Johann	65aa89af1a	Merge "Reduce instructions of idct4x4."	2013-08-16 11:28:35 -07:00
Frank Galligan	bdc785e976	Merge "vp9: neon: optimise vp9_wide_mbfilter_neon"	2013-08-16 11:16:48 -07:00
hkuang	df0715204c	Reduce instructions of idct4x4. Change-Id: Ia26a2526804e7e2f656b0051618a615fca8fc79d	2013-08-16 10:54:56 -07:00
hkuang	60ecd60c9a	Reduce the instructions of idct8x8. Also add the saving and restoring of D registers. Change-Id: Id3630c90fcb160ef939fef55411342608af5f990	2013-08-16 10:32:12 -07:00
Johann	bba68342ce	Merge "vp9: neon: use aligned stores in convolve functions"	2013-08-16 10:29:59 -07:00
Adrian Grange	3e340880a8	Merge "Added resizing & initialization of last frame segment map"	2013-08-16 09:07:36 -07:00
Mans Rullgard	4fa93bcef4	vp9: neon: use aligned stores in convolve functions The destination is block-aligned so it is safe to use aligned stores. Change-Id: I38261e4fa40bc60e6472edffece59e372908da7e	2013-08-16 14:25:08 +01:00
Dmitry Kovalev	afd9bd3e3c	Moving from ss_txfrm_size to tx_size. Updating foreach_transformed_block_visitor and corresponding functions to accept tx_size instead of ss_txfrm_size. List of functions per file: vp9_decodframe.c decode_block decode_block_intra vp9_detokenize.c decode_block vp9_encodemb.c optimize_block vp9_xform_quant vp9_encode_block_intra vp9_rdopt.c dist_block rate_block block_yrd_txfm vp9_tokenize.c set_entropy_context_b tokenize_b is_skippable Change-Id: I351bf563eb36cf34db71c3f06b9bbc9a61b55b73	2013-08-15 17:03:03 -07:00
Adrian Grange	d5bec522da	Added resizing & initialization of last frame segment map When the frame size changes the last frame segment map must be resized to match and initialized to 0. Change-Id: Idc10de109f55dbe9af3a6caae355a2974712243d	2013-08-15 15:35:21 -07:00
Dmitry Kovalev	9451e8d37e	Merge "Converting code from using ss_txfrm_size to tx_size."	2013-08-15 15:21:09 -07:00
Dmitry Kovalev	939b1e4a8c	Merge "Moving segmentation struct from MACROBLOCKD to VP9_COMMON."	2013-08-15 15:14:32 -07:00
Johann	a9aa7d07d0	Merge "vp9: neon: add vp9_convolve_avg_neon"	2013-08-15 14:55:15 -07:00
Johann	63e140eaa7	Merge "vp9: neon: add vp9_convolve_copy_neon"	2013-08-15 14:55:08 -07:00
Dmitry Kovalev	bb3b817c1e	Converting code from using ss_txfrm_size to tx_size. Updated function signatures: txfrm_block_to_raster_block txfrm_block_to_raster_xy extend_for_intra vp9_optimize_b Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a	2013-08-15 11:44:57 -07:00
Dmitry Kovalev	81d7bd50f5	Renaming d27 predictor to d207. 27 degrees intra predictor is actually 207 degrees, so renaming it. Change-Id: Ife96a910437eb80ccdc0b7a5b7a62c77542ae5be	2013-08-15 11:09:49 -07:00
Mans Rullgard	67e53716e0	vp9: neon: optimise vp9_wide_mbfilter_neon Break up long dependency chains to improve instruction scheduling. Change-Id: I0e0cb66943df24af920767bb4167b25c38af9630	2013-08-15 19:07:22 +01:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place for the segmentation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Jingning Han	ec01f52ffa	Unify luma and chroma rd-cost estimation This commit unifies the rate-distortion cost calculation process of luma and chroma components. It allows early termination to be enabled later in the rd search loop of chroma components, consistent with luma pixels. Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d	2013-08-15 09:41:33 -07:00
Paul Wilkins	1a3641d91b	Merge "Renaming in MB_MODE_INFO"	2013-08-15 02:12:48 -07:00
hkuang	39f42c8713	Merge "Add neon optimize vp9_short_idct16x16_add."	2013-08-14 14:16:20 -07:00
hkuang	cf6beea661	Add neon optimize vp9_short_idct16x16_add. Change-Id: I27134b9a5cace2bdad53534562c91d829b48838d	2013-08-14 13:52:16 -07:00
Dmitry Kovalev	bb072000e8	foreach_transformed_block_in_plane cleanup, explicit tx_size var. Making foreach_transformed_block_in_plane more clear (it's not finished yet). Using explicit tx_size variable consistently instead of (ss_txfrm_size / 2) or (ss_txfrm_size >> 1) expression. Change-Id: I1b9bba2c0a9f817fca72c88324bbe6004766fb7d	2013-08-14 11:39:31 -07:00
Dmitry Kovalev	f2c073efaa	Adding const to arguments of intra prediction functions. Adding const to above and left pointers. Cleanup. Change-Id: I51e195fa2e2923048043fe68b4e38a47ee82cda1	2013-08-14 10:35:56 -07:00
Mans Rullgard	0f1deccf86	vp9: neon: add vp9_convolve_avg_neon Change-Id: I33cff9ac4f2234558f6f87729f9b2e88a33fbf58	2013-08-14 16:27:55 +01:00
Mans Rullgard	635ba269be	vp9: neon: add vp9_convolve_copy_neon Change-Id: I15adbbda15d1842e9f15f21878a5ffbb75c3c0c9	2013-08-14 16:27:55 +01:00
Paul Wilkins	26fead7ecf	Renaming in MB_MODE_INFO The macro block mode info context originally contained an entry for each 16x16 macroblock. In VP9 each entry refers to an 8x8 region not a macro block, so the naming is misleading. This first stage clean up changes the names of 3 entries in the structure to remove the mb_ prefix. TODO clean up the nomenclature more widely in respect of mbmi and bmi. Change-Id: Ia7305c6d0cb805dfe8cdc98dad21338f502e49c6	2013-08-14 12:47:52 +01:00
Dmitry Kovalev	8105ce6dce	Merge "Using is_inter_block() instead of repetitive code."	2013-08-13 10:00:01 -07:00
Jingning Han	39fe235032	Merge "SSE2 high precision 32x32 forward DCT"	2013-08-12 23:03:47 -07:00
Johann	4417c04531	Merge "vp9: neon: optimise convolve8_vert functions"	2013-08-12 17:54:47 -07:00
Johann	4cabbca4ce	Merge "vp9: neon: optimise convolve8_horiz functions"	2013-08-12 17:54:42 -07:00
Dmitry Kovalev	32006aadd8	Using is_inter_block() instead of repetitive code. Change-Id: If0b04c476c34fb8c102c9f750d7fe5669a86a532	2013-08-12 17:42:14 -07:00
Jingning Han	78136edcdc	SSE2 high precision 32x32 forward DCT Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56	2013-08-12 16:52:53 -07:00
Dmitry Kovalev	b89eef8f82	Merge "Simplifying vp9_mvref_common.c."	2013-08-12 16:24:22 -07:00
Dmitry Kovalev	b214cd0dab	Merge "Removing foreach_predicted_block_uv function."	2013-08-12 15:54:01 -07:00
Dmitry Kovalev	1a5e6ffb02	Simplifying vp9_mvref_common.c. Change-Id: I272df2e33fa05310466acf06c179728514dd7494	2013-08-12 15:52:08 -07:00
Dmitry Kovalev	c66320b3e4	Merge "Entropy context related cleanups."	2013-08-12 15:18:24 -07:00
Dmitry Kovalev	bd1bc1d303	Merge "Making scaling code more clear."	2013-08-12 15:17:26 -07:00
Dmitry Kovalev	9a31d05e24	Removing unused convolve_avg_c function + cleanup. Change-Id: Id2b126c6456627c25e4041a82e304d0151d951ba	2013-08-12 14:28:00 -07:00
Dmitry Kovalev	76d166e413	Removing foreach_predicted_block_uv function. Adding function build_inter_predictors_for_planes to build inter predictors for specified planes. This function allows to remove condition "#if CONFIG_ALPHA" and use MAX_MB_PLANE for general case. Renaming 'which_mv' local var to 'ref', and 'weight' argument to 'ref'. Change-Id: I1a97160c9263006929d38953f266bc68e9c56c7d	2013-08-12 13:54:13 -07:00
Dmitry Kovalev	a72e269318	Making scaling code more clear. Reusing existing functions, using constants instead of magic numbers. Change-Id: Idc689ffba52c9a8b203fcf26bd67110ecb5635f9	2013-08-12 13:30:26 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Mans Rullgard	ad7021dd6c	vp9: neon: optimise convolve8_vert functions Invert loops to operate vertically in the inner loop. This allows removing redundant loads. Also add preloading of data. Change-Id: I4fa85c0ab1735bcb1dd6ea58937efac949172bdc	2013-08-12 15:37:48 +01:00
Dmitry Kovalev	097046ae28	Merge "Removing redundant code and function arguments."	2013-08-11 12:20:58 -07:00
Mans Rullgard	b84dc949c8	vp9: neon: optimise convolve8_horiz functions Each iteration of the horizontal loop reuses 7 of the 11 source values. Loading only the 4 new values saves some time. Also add preload for source data. Overall 4% faster on Chromebook. Change-Id: I8f69e749f2b7f79e9734620dcee51dbfcd716b44	2013-08-11 16:21:55 +01:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Dmitry Kovalev	67fe9d17cb	Removing redundant code and function arguments. Change-Id: Ia5cdda0f755befcd1e64397452c42cb7031ca574	2013-08-09 17:24:40 -07:00
Dmitry Kovalev	f1559bdeaf	Inlining 16 as a stride for BLOCK_OFFSET macro. Change-Id: I7f23d174eb089e5500f268a10db09648634c1b82	2013-08-09 16:40:05 -07:00

2562 Commits