generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	0a5e9ee054	Moving get_token_alloc function from common to the encoder. Also renaming mb_row -> mi_row, mb_col -> mi_col arguments and calculate mb_rows/mb_cols values from mi_rows/mi_cols. Change-Id: I6919a279f560648e23bc9a12f507d17c21ffd5d7	2013-10-01 11:54:10 -07:00
Scott LaVarnway	27b390e1a1	d153 intra prediction ssse3 using bytes byte version of ronalds d153 ssse3 optimizations for 4x4 and 8x8 (commit: fc91a2a112238a1aee568f3b840585de4e928fca) Change-Id: Iec4426032311483f615fd9e0dceba3ee85ddebd7	2013-10-01 09:05:20 -04:00
Dmitry Kovalev	c982a73b9f	Removing unused vp9_coeff_stats_model typedef. Change-Id: I6973e7121b6393379b5759f288632e8eab763d3e	2013-09-30 15:10:00 -07:00
Dmitry Kovalev	c64e23832f	Adding const to function arguments. Function list: tx_counts_to_branch_counts_32x32 tx_counts_to_branch_counts_8x8 tx_counts_to_branch_counts_8x8 update_ct update_ct2 update_mode_probs Change-Id: I120d8945a34378cf285d6bd415e23de1d522cf2f	2013-09-30 14:50:15 -07:00
Dmitry Kovalev	cd945c7bd9	Merge "Removing vp9_add_constant_residual_{8x8, 16x16, 32x32} functions."	2013-09-30 13:16:34 -07:00
Dmitry Kovalev	548671dd20	Removing vp9_add_constant_residual_{8x8, 16x16, 32x32} functions. We don't need these functions anymore. The only one which was actually used is vp9_add_constant_residual_32x32. Addition of vp9_short_idct32x32_1_add eliminates this single usage. SSE2 optimized version of vp9_short_idct32x32_1_add will be added in the next patch set, right now it is only C implementation. Now we have all idct functions implemented in a consistent manner. Change-Id: I63df79a13cf62aa2c9360a7a26933c100f9ebda3	2013-09-30 10:56:37 -07:00
Jim Bankoski	4906fe45e2	Merge "systemdependent lint issue resolved"	2013-09-30 10:55:07 -07:00
Jim Bankoski	fd09be0984	Merge changes I2b2af1dd,Id2cc5c82 * changes: fixed cpp lint issue in vp9_postproc_x86 nolintify intrinsic idct file	2013-09-30 10:53:30 -07:00
Jim Bankoski	e3c1f0880f	Merge "cpplint issues in vp9_loopfilter.h"	2013-09-30 10:53:13 -07:00
Jim Bankoski	509ba98938	Merge "treecoder lint issues resolved"	2013-09-30 10:43:22 -07:00
Jim Bankoski	7ddd9f7f27	Merge "cpplint issue with entropymv.h"	2013-09-30 10:43:16 -07:00
Jim Bankoski	c424c5e808	Merge "cpplint issue with vp9_loopfilter_filters.c"	2013-09-30 10:43:05 -07:00
Jim Bankoski	282704145d	Merge "cpplint issue in blockd.h"	2013-09-30 10:42:45 -07:00
Jim Bankoski	58a09c32c2	Merge "common_data.h lint issues resolved"	2013-09-30 10:42:35 -07:00
Jim Bankoski	9e056fa094	Merge "vp9_loopfilter.c cpplint issues resolved."	2013-09-30 10:42:27 -07:00
Jim Bankoski	d2a4ddf982	Merge "cpplint issue resolved in vp9_pred_common.h"	2013-09-30 10:42:19 -07:00
Jim Bankoski	cbdcc215b3	Merge "resolved lint issues in default_coef_probs"	2013-09-30 10:42:12 -07:00
Jim Bankoski	d35e9a0c53	Merge "lint issues in mvref_common.c"	2013-09-30 10:41:50 -07:00
Jim Bankoski	14916b0ca6	Merge "vp9 convolve lint issues"	2013-09-30 10:41:43 -07:00
Jim Bankoski	4e5d99ca72	Merge "vp9_rtcd.c lint issues"	2013-09-30 10:41:32 -07:00
Jim Bankoski	bc1b089372	Merge changes Id58e2176,I7efc74ef * changes: cpplint issues in vp9_filter.h cpplint issues with onyxc_int.h	2013-09-30 10:41:23 -07:00
Jim Bankoski	0f8805e086	Merge "vp9_entropy.c lint issues"	2013-09-30 10:34:11 -07:00
Jim Bankoski	7f13b33a78	Merge "cpplint issues resolved in vp9_postproc.c"	2013-09-30 08:26:00 -07:00
Jim Bankoski	1a2f4fd2f5	Merge "fix lint issues in quant common"	2013-09-30 08:26:00 -07:00
Jim Bankoski	88251c86dc	Merge "fix cpplint issue in reconintra"	2013-09-30 08:26:00 -07:00
Jim Bankoski	777460329b	vp9_entropy.c lint issues Change-Id: I4e163cc4ce9ec2f3a5a8b9da478049c71b08d71f	2013-09-29 20:29:43 -07:00
Jim Bankoski	7019e34c34	vp9 convolve lint issues Change-Id: I8b496191c6a60a60a52c929adca305db47058a84	2013-09-29 19:44:05 -07:00
Jim Bankoski	f6d7e3679c	resolved lint issues in default_coef_probs Change-Id: I97bf241c0d981721cc74a50be47c9db8a00f6be3	2013-09-29 19:41:31 -07:00
Jim Bankoski	c66bfc70d1	treecoder lint issues resolved Change-Id: I442609f689aa9381e1e208012305cf62a6b31eee	2013-09-29 19:37:11 -07:00
Jim Bankoski	a57912f893	systemdependent lint issue resolved Change-Id: I07fbb32d5cee0003d04b2369cfafcb03c371cd4f	2013-09-29 19:34:44 -07:00
Jim Bankoski	8f229caf87	lint issues in mvref_common.c Change-Id: If6a7a8c48fefc69349c792d8ed52a6e1d374e46e	2013-09-29 19:32:53 -07:00
Jim Bankoski	623e163f84	vp9_rtcd.c lint issues Change-Id: I58209ae96d21c56cbb8ef796940b6ca3b3ebfa72	2013-09-29 19:29:58 -07:00
Jim Bankoski	c288b94ab9	common_data.h lint issues resolved Change-Id: I1fd79093a5b9cb40c9e877b6b71c25a07a69b3ae	2013-09-29 19:28:32 -07:00
Jim Bankoski	03df17070b	vp9_loopfilter.c cpplint issues resolved. Change-Id: Idfa17d120ec4edf542e424fa0deb769951afbf4a	2013-09-29 19:04:21 -07:00
Jim Bankoski	6249a5b17e	cpplint issue with vp9_loopfilter_filters.c Change-Id: I13aa43df6bff340b5768d69125b473a52d1d59bd	2013-09-29 19:03:00 -07:00
Jim Bankoski	855d078f95	cpplint issue with entropymv.h Change-Id: I3556738d27def6a5bd71577728050a1e2bb1de63	2013-09-29 19:01:46 -07:00
Jim Bankoski	2b5bf7b8d8	cpplint issue in blockd.h Change-Id: Ia41e1966431652b839134a1c27feccb25c762539	2013-09-29 19:00:40 -07:00
Jim Bankoski	716d37f8bf	fixed cpplint issue with vp9_scale.h Change-Id: Ia7969baac7ffc6d7a0e8e8e83e9252d077a3c5b3	2013-09-29 18:58:58 -07:00
Jim Bankoski	2ecd0dae1e	vp9_entropymv.c cpplint issues resolved Change-Id: Ic5807152cc78127b3f84b5abb4c5f3ef6d06ce65	2013-09-29 18:57:35 -07:00
Jim Bankoski	7a59efe7f8	cpplint issues resolved in vp9_postproc.c Change-Id: If61380115163a02ecfe74b82e116001ac54e20e2	2013-09-29 18:52:29 -07:00
Jim Bankoski	152fd59964	fixed cpp lint issue in vp9_postproc_x86 Change-Id: I2b2af1dd9f5c29c05e28a4fd51fa58ccc4071477	2013-09-29 18:44:58 -07:00
Jim Bankoski	ec421b7810	nolintify intrinsic idct file Change-Id: Id2cc5c829399a2afdf7a8a82615a4e272c814986	2013-09-29 18:42:24 -07:00
Jim Bankoski	31ceb6b13c	cpplint issues in vp9_loopfilter.h Change-Id: Ib142f9c5130aa5f0e1fc76e1c4f51cd66c73dcc7	2013-09-29 18:36:42 -07:00
Jim Bankoski	11cf0c39c9	cpplint issues in vp9_filter.h Change-Id: Id58e21760c7948a2b020c9623c38cf007874d43e	2013-09-29 18:34:41 -07:00
Jim Bankoski	01d43aaa24	cpplint issue resolved in vp9_pred_common.h Change-Id: Ibacac91c2192fcfbd9e411ae141dd00445566efe	2013-09-29 18:17:06 -07:00
Jim Bankoski	ab03c00504	cpplint issues with onyxc_int.h Change-Id: I7efc74ef53139bbaa6ec4f01482d9d9b362be27b	2013-09-29 18:10:03 -07:00
Jim Bankoski	eb506a6590	cpplint fixes to debug modes Change-Id: I1c3943cd5db6cd8fc759116a3717dba3c030fa0d	2013-09-29 18:04:48 -07:00
Jim Bankoski	fb6e6cd24d	fix cpplint issue in reconintra Change-Id: I934f9cfb96ce4f5f266b025064237875dcd92b3a	2013-09-29 18:02:42 -07:00
Jim Bankoski	d052117319	fix lint issues in quant common Change-Id: I135ee6e8df91262f813c474b24f14381a4064e02	2013-09-29 17:59:43 -07:00
Jim Bankoski	efc8638890	cpplint issues in vp9_onyx.h Change-Id: I0b5af849833ac077bd4de71a24af8f8bd7ec06d6	2013-09-29 17:50:18 -07:00
Dmitry Kovalev	b927620231	Merge "Using is_inter_block and has_second_ref functions."	2013-09-29 12:14:41 -07:00
Dmitry Kovalev	b3d3578ee4	Merge "Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add."	2013-09-29 12:01:50 -07:00
Dmitry Kovalev	7343681675	Merge "Removing vp9_get_coef_neighbors_handle function."	2013-09-29 12:01:36 -07:00
Dmitry Kovalev	efbacc9f89	Merge "Removing vp9_subpelvar.h from common."	2013-09-29 12:00:46 -07:00
Dmitry Kovalev	b10e6b2943	Removing unnecessary function calls. Both vp9_init_mbmode_probs() and vp9_zero(cm->ref_frame_sign_bias) are called inside vp9_setup_past_independence() which called in any case for encoder/decoder after VP9_COMMON struct creation. Change-Id: I3724d1a4fb8060101ff0290dd6a158f0b5c57bb4	2013-09-27 17:42:05 -07:00
Dmitry Kovalev	3fab2125ff	Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add. Making name consistent with vp9_short_idct8x8 and vp9_short_idct8x8_1. Change-Id: I99e0be040ec893f9571dcf090e18f98dc58339f5	2013-09-27 15:26:27 -07:00
Christian Duvivier	b1b4ba1bdd	Properly save neon registers. Replace current code which corrupts the stack by duplicate of vp8 code to save and restore neon registers. Change-Id: Ibb0220b9aa985d10533befa0a455ebce57a2891a	2013-09-27 14:25:33 -07:00
Dmitry Kovalev	209c6cbf8f	Removing vp9_get_coef_neighbors_handle function. Change-Id: I6be72c8b048d1ccc7ef43764cf84c32360098970	2013-09-27 14:11:13 -07:00
Dmitry Kovalev	db60c02c9e	Merge "Renaming vp9_short_idct10_16x16 to vp9_short_idct16x16_10."	2013-09-27 13:08:52 -07:00
Scott LaVarnway	35830879db	Merge "d63 intra prediction ssse3 using bytes"	2013-09-27 07:21:08 -07:00
Dmitry Kovalev	15a36a0a0d	Renaming vp9_short_idct10_16x16 to vp9_short_idct16x16_10. Making function name consistent with vp9_short_idct16x16 and vp9_short_idct16x16_1. Change-Id: I70e54be9e6b9a1dddab0de470686591e96d05517	2013-09-26 14:01:25 -07:00
Christian Duvivier	5b1dc1515f	Fix a bunch of TODO from vp9_short_idct32x32_add_neon. - full ASM version, no more C gateway file. - integrate combine-add with last step of 2nd pass. - remove a few push/pop pairs. - some instruction reordering to hide latency. Change-Id: Ic9d9933c908b65d1bf7ba8fd47b524cda808c9c6	2013-09-25 21:15:19 -07:00
Dmitry Kovalev	eda4e24c0d	Using is_inter_block and has_second_ref functions. Change-Id: I60dee58a4fd24d3c4f3c101a49d30e217309f43a	2013-09-25 19:03:04 -07:00
Dmitry Kovalev	64eff7f360	Removing vp9_subpelvar.h from common. Moving all code from that file to vp9_variace_c.c in the encoder. Change-Id: Ic803d5b4c78d5191e4d25541b3df97337878fc3e	2013-09-25 16:10:43 -07:00
Dmitry Kovalev	49f5efa8d8	Removing unused SUBMVREF_COUNT constant. Change-Id: I302ab4603553352a84b57bc89bc9e3d037978d29	2013-09-25 15:33:05 -07:00
Scott LaVarnway	208658490c	d63 intra prediction ssse3 using bytes byte version of ronalds d63 ssse3 optimizations (commit: c5a1c8cf3541cf3665fee981b36d22c9fbd4191e) Change-Id: Ifd3e6d454a2246085f23eabb38518a930321e807	2013-09-25 16:16:44 -04:00
Dmitry Kovalev	d571e4e785	Replacing unsigned char* with uint8_t*. Change-Id: I99a1880aee015ae16311ba05a31aa307df89bef2	2013-09-24 14:57:42 -07:00
Yaowu Xu	71cfaaa689	Merge "Replace memcpy with vpx_memcpy"	2013-09-24 11:35:03 -07:00
Yaowu Xu	9be0bb19df	Replace memcpy with vpx_memcpy Also removed obselete comment Change-Id: Iae1664777d76383639c637ee786e0d50fc45819a	2013-09-24 10:56:06 -07:00
Yaowu Xu	6037f17942	Rename defined constants The change is to better reflect the nature of the constants. Change-Id: Icabac6e9bceefbdb3f03f8218f88ef75943c30fb	2013-09-24 10:53:01 -07:00
Dmitry Kovalev	f24b9b4f87	Merge "Adding best_mv[2] array instead of two variables."	2013-09-24 10:17:53 -07:00
Johann	a6a00fc6a3	Use lowercase instruction in assembly The iOS compiler does not recognize BLE: bad instruction `BLE idct32_transpose_pair_loop' Change-Id: I7426694c66bc31caf939a2d5000968da1222c15b	2013-09-20 16:11:05 -07:00
Dmitry Kovalev	bb5e2bf86a	Adding best_mv[2] array instead of two variables. Change-Id: I584fe50f73879f6a72fada45714ef80893b6d549	2013-09-20 17:08:53 +04:00
Dmitry Kovalev	24df77e951	Merge "Adding get_scan_and_band function."	2013-09-20 00:15:06 -07:00
Yaowu Xu	014acfa2af	fix integer overflow errors Change-Id: I76f440a917832c02d7a727697b225bac66b99f56	2013-09-19 08:14:26 -07:00
Dmitry Kovalev	a23c2a9e7b	Adding get_scan_and_band function. Extracting get_scan_and_band function from get_entropy_context to remove duplicated code. Change-Id: I5da1f5a60263017e887da68bc834317b5f084cb2	2013-09-19 16:53:48 +04:00
Yunqing Wang	a7b7f94ae8	Merge "Fix x86inc.asm to build PIC code correctly"	2013-09-18 14:51:31 -07:00
Yunqing Wang	9d901217c6	Fix x86inc.asm to build PIC code correctly Current x86inc.asm didn't handle 32bit PIC build properly. TEXTRELs were seen in the library built. The PIC macros from libvpx's x86_abi_support.asm was used to fix this problem. The assembly code was modified to use the macros. Notes: We need this fix in for decoder building. Functions in encoder will be fixed later. Change-Id: Ifa548d37b1d0bc7d0528db75009cc18cd5eb1838	2013-09-18 13:45:46 -07:00
Yaowu Xu	85fd8bdb01	Merge "Silence a bunch of MSVC warnings"	2013-09-17 17:10:58 -07:00
Yaowu Xu	a783da80e7	Silence a bunch of MSVC warnings Change-Id: I16633269582a640809dca27572bbe99efa6369fc	2013-09-17 12:08:51 -07:00
Jingning Han	2b3bfaa9ce	Remove redundant argument in get_sub_block_mv The sub8x8 check can be directly inferred from block_idx, hence removed from the arguments if get_sub_block_mv. Change-Id: Ib766d57e81248fb92df0f6d9b163e6c77b933ccd	2013-09-17 12:08:45 -07:00
hkuang	23e1a29fc7	Speed up iht8x8 by rearranging instructions. Speed improves from 282% to 302% faster based on assembly-perf. Change-Id: I08c5c1a542d43361611198f750b725e4303d19e2	2013-09-16 14:23:26 -07:00
James Zern	2d58761993	Revert "Improved 8t filters" This is incompatible with most toolchains other than gcc. Revert "Deleted #include <inttypes.h>" This reverts commit `4d018be950`. This reverts commit `d22a504d11`. Change-Id: I1751dc6831f4395ee064e6748281418e967e1dcf	2013-09-13 15:13:06 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Paul Wilkins	9c9a3b2775	Merge "Deleted #include <inttypes.h>"	2013-09-13 01:05:31 -07:00
hkuang	86fb12b600	Merge "Add neon optimize iht8x8 which is 282% faster than C."	2013-09-12 15:42:44 -07:00
hkuang	182366c736	Add neon optimize iht8x8 which is 282% faster than C. Change-Id: I963dd4a6e8671957403ccbb9a16ea7de703e3530	2013-09-12 11:49:05 -07:00
Paul Wilkins	4d018be950	Deleted #include <inttypes.h> This seems not to be needed and is not supported in the Windows build. Change-Id: Iaca3bbf8cca283aee6bc336cb31ba9dd4610322b	2013-09-12 13:43:07 +01:00
Christian Duvivier	6a501462f8	First draft of vp9_short_idct32x32_add_neon. Lots of TODO which will be taken care in upcoming changes. As is, about 6x faster than C version. Change-Id: Ie2557b72fd2d8edca376dbf400a4d173aa5e63e0	2013-09-11 15:19:38 -07:00
Scott LaVarnway	23845947c4	Merge "Improved 8t filters"	2013-09-11 14:34:54 -07:00
Scott LaVarnway	d22a504d11	Improved 8t filters Reformatted version of a patch submitted by Erik/Tamar from Intel. For the test clips used, the decoder performance improved by ~2%. Change-Id: Ifbc37ac6311bca9ff1cfefe3f2e9b7f13a4a511b	2013-09-11 13:56:32 -04:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), eliminating unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Yunqing Wang	079183c1a8	code cleanup Removed unused function. Change-Id: Icb12a09e4d303968be6aec9fae1ef05935913a4f	2013-09-11 09:32:00 -07:00
hkuang	f4a6f936b5	Merge "Speed up idct16x16 by rearrange instructions."	2013-09-10 08:23:57 -07:00
hkuang	fc5ec206a7	Speed up idct16x16 by rearrange instructions. Speed improve from 376% to 400% faster base on assembly-perf. Change-Id: If0b2eccc39d5793dc101ce9feb7fcadf88396ea2	2013-09-09 18:00:13 -07:00
Ivan Maltz	20abe595ec	Merge "API extensions and sample app for spacial scalable encoder"	2013-09-09 16:57:01 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spacial scalable encoder Sample app: vp9_spatial_scalable_encoder vpx_codec_control extensions: VP9E_SET_SVC VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER VP9E_SET_MIN_Q, VP9E_SET_MAX_Q expanded buffer size for vp9_convolve modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS) Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
James Zern	c1913c9cf4	Merge "Revert "New mode_info_context storage""	2013-09-09 14:38:01 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Yaowu Xu	b19126b291	Merge "Reduce the amount of extension in src frames"	2013-09-09 08:09:56 -07:00
Yaowu Xu	65c2444e15	Reduce the amount of extension in src frames The commit changes the border pixel extension from 160 pixel each side to what is necessary in arnr filter or motion estimation portion, i.e. 16 pixel on top and left side. For right or bottom side, the extension is changed to either round up image size to multiple of 64 or at least 16 pixels. Change-Id: Ic05e19b94368c1ab4df568723aae5734e6c3d2c5	2013-09-08 15:51:54 -07:00
Jim Bankoski	e378566060	Merge "New mode_info_context storage"	2013-09-08 07:16:25 -07:00
Deb Mukherjee	e378a89bd6	Support a constant quality mode in VP9 Adds a new end-usage option for constant quality encoding in vpx. This first version implemented for VP9, encodes all regular inter frames using the quality specified in the --cq-level= option, while encoding all key frames and golden/altref frames at a quality better than that. The current performance on derfraw300 is +0.910% up from bitrate control, but achieved without multiple recode loops per frame. The decision for qp for each altref/golden/key frame will be improved in subsequent patches based on better use of stats from the first pass. Further, the qp for regular inter frames may also be varied around the provided cq-level. Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3	2013-09-06 10:30:53 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of a pointer to a MODE_INFO struct and a "in the image" flag. The MODE_INFO structs are now stored as a stream, eliminating unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Jim Bankoski	e4e864586c	Merge "fix loop filter setup_mask could reach out of bounds issue"	2013-09-06 06:21:28 -07:00
hkuang	3476404912	Merge "Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf."	2013-09-05 17:37:13 -07:00
Jim Bankoski	736114f44b	fix loop filter setup_mask could reach out of bounds issue Change-Id: Ic8446c4f26b6782a6dc482c19ea73c77646df418	2013-09-05 15:53:31 -07:00
Jingning Han	1c263d6918	Merge "Use saturated addition in SSSE3 of 32x32 quant"	2013-09-05 14:09:40 -07:00
Jim Bankoski	2156ccaa4a	Merge "resolve clang warnings : uninitialized vars in vp9_entropy.h"	2013-09-05 12:55:32 -07:00
Jingning Han	458c2833c0	Use saturated addition in SSSE3 of 32x32 quant The 32x32 forward transform can potentially reach peak coefficient value close to 32700, while the rounding factor can go upto 610. This could cause overflow issue in the SSSE3 implementation of 32x32 quantization process. This commit resolves this issue by replacing the addition operations with saturated addition operations in 32x32 block quantization. Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70	2013-09-05 12:49:12 -07:00
Jim Bankoski	9fc3d32a50	Merge "faster accounting of inc_mv"	2013-09-05 12:38:56 -07:00
Jim Bankoski	2e4ca9d1a5	resolve clang warnings : uninitialized vars in vp9_entropy.h This helps clear out some of the warnings Change-Id: Ie7ccaca8fd92542386a7f1b257398e1bdf2f55dc	2013-09-04 18:38:41 -07:00
Jim Bankoski	e8feb2932f	Merge "wrap non420 loop filter code in macro"	2013-09-04 17:20:53 -07:00
hkuang	01c4e04424	Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf. Change-Id: I3e2cc818ec14b432204ff43732f39b6438db685d	2013-09-04 15:57:22 -07:00
hkuang	3c05bda058	Merge "Add neon optimize vp9_short_iht4x4_add."	2013-09-04 13:35:09 -07:00
hkuang	3b8614a8f6	Add neon optimize vp9_short_iht4x4_add. Change-Id: I42c497b68ae1ee645b59c9968ad805db0a43e37e	2013-09-04 12:37:58 -07:00
Jim Bankoski	872c6d85c0	Merge "speed up inc_mv_component"	2013-09-04 10:35:51 -07:00
Jim Bankoski	bb2313db28	Merge "make vp9 postproc a config option"	2013-09-04 10:35:26 -07:00
Jim Bankoski	c3c21e3c14	wrap non420 loop filter code in macro Change-Id: I62bca0e7a4bffc1a78b750dbb9df9d2378e92423	2013-09-04 10:24:42 -07:00
Jim Bankoski	79401542f7	make vp9 postproc a config option Vp9 postproc is disabled for now as its not been shown to help and may be merged with vp8. Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057	2013-09-04 10:02:08 -07:00
Jim Bankoski	532179e845	faster accounting of inc_mv Moves counting of mv branches to where we have a new mv, instead of after the whole frame is summed. Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd	2013-09-04 09:47:57 -07:00
Jim Bankoski	5dda1d2394	speed up inc_mv_component Convert mv_class if statements to look up. re order to avoid ifs... Change-Id: I76966a21bf517bb1f9a7957c08c476c7bb3e9a63	2013-09-04 07:11:30 -07:00
James Zern	1cf2272347	Merge "Fix intermediate height in convolve_c"	2013-09-03 15:50:33 -07:00
Jingning Han	010c0ad0eb	Merge "Fix 32x32 forward transform SSE2 version"	2013-09-03 08:58:03 -07:00
Scott LaVarnway	948aaab4ca	Merge "Improved mb_lpf_horizontal_edge_w_sse2_8"	2013-09-03 05:44:01 -07:00
Jingning Han	3cf46fa591	Fix 32x32 forward transform SSE2 version This commit fixed the potential overflow issue in the SSE2 implementation of 32x32 forward DCT. It resolved the corrupted coded frames in the border of scenes. Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9	2013-08-31 18:47:08 -07:00
Tero Rintaluoma	e326cecf18	Fix intermediate height in convolve_c - Intermediate height was not correct i.e. when block size is 4 and y_step_q4 is 6. In this case intermediate height was (4*6) >> 4 = 1 and vertical interpolation needs two source pixels plus 7 extra pixels for taps. - Also if the current output block is 16x16 and we are using 4x upscaling we need only 12 rows after horizontal filtering instead of 16. Patch Set 2: Intermediate_height updated after CL 66723 "Fix bug in convolution functions (filter selection)" Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589	2013-08-30 10:31:21 +03:00
Jim Bankoski	1d44fc0c49	Merge "rework filter_block_plane"	2013-08-29 20:11:09 -07:00
Jim Bankoski	bc50961a74	rework filter_block_plane Change-Id: I55c3b60c4c0f4910d3dfb70e3edaae00cfa8dc4d	2013-08-29 17:00:05 -07:00
Jingning Han	c86c5443eb	Merge "Fix overflow issue in SSSE3 32x32 quantization"	2013-08-29 16:49:04 -07:00
James Zern	d765df2796	consistently name VP9_COMMON variables #3 stragglers Change-Id: Ib1e853f9a331b7b66639dc34d79568d84d1930f1	2013-08-29 13:27:41 -07:00
James Zern	aa05321262	consistently name VP9_COMMON variables #2 oci -> cm Change-Id: Ifd75c809d9cc99034d3c2fccc4653a78b3aec21f	2013-08-29 13:25:58 -07:00
James Zern	924d74516a	consistently name VP9_COMMON variables #1 pc -> cm Change-Id: If3e83404f574316fdd3b9aace2487b64efdb66f3	2013-08-29 13:25:57 -07:00
Jingning Han	abff678866	Fix overflow issue in SSSE3 32x32 quantization The 32x32 quantization process can potentially have the intermediate stacks over 16-bit range, thereby causing enc/dec mismatch. This commit fixes this overflow issue in the SSSE3 implementation, as well as the prototype, of 32x32 quantization. This fixes issue 607 from webm@googlecode. Change-Id: I85635e6ca236b90c3dcfc40d449215c7b9caa806	2013-08-29 11:00:54 -07:00
Scott LaVarnway	22dc946a7e	Improved mb_lpf_horizontal_edge_w_sse2_8 This patch is a reformatted version of optimizations done by engineers at Intel (Erik/Tamar) who have been providing performance feedback for VP9. For the test clips used (720p, 1080p), up to 1.2% performance improvement was seen. Change-Id: Ic1a7149098740079d5453b564da6fbfdd0b2f3d2	2013-08-29 08:30:17 -04:00
Dmitry Kovalev	851a2fd72c	Renaming txfm_size to tx_size. Change-Id: I752e374867d459960995b24d197301d65ad535e3	2013-08-27 19:47:53 -07:00
Dmitry Kovalev	1d3f94efe2	Merge "Adding get_entropy_context function."	2013-08-27 17:02:36 -07:00
Frank Galligan	7d058ef86c	Merge "Fix winodws warning."	2013-08-27 15:39:58 -07:00
Frank Galligan	f1560ce035	Fix winodws warning. Const is not needed on the function parameter. Change-Id: I38c2a7317cb6f42f70bbddfde9a2cd18d65ceb1c	2013-08-27 15:19:55 -07:00
Dmitry Kovalev	a93992e725	Adding get_entropy_context function. Moving common code from encoder and decoder to this function. Change-Id: I60fa643fb1ddf7ebbff5e83b6c4710137b0195ef	2013-08-27 14:17:53 -07:00
hkuang	3a679e56b2	Add neon optimize vp9_short_idct16x16_1_add. Change-Id: Ib9354c1d975d03e8081df20d50b6a77dfe2dc7e5	2013-08-27 14:00:27 -07:00
hkuang	ce04b1aa62	Merge "Add neon optimize vp9_short_idct8x8_1_add."	2013-08-27 12:10:07 -07:00
Dmitry Kovalev	7b95f9bf39	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the encoder. Change-Id: I62bb07c377f947cb72fac68add7a6b199e42c6b9	2013-08-27 11:05:08 -07:00
Dmitry Kovalev	12e5931a9a	Merge "Using existing functions instead of raw expressions."	2013-08-27 10:33:34 -07:00
Dmitry Kovalev	bfebe7e927	Merge "Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder."	2013-08-27 10:15:21 -07:00
Dmitry Kovalev	78e670fcf8	Merge "Renaming D27 to D207."	2013-08-27 10:03:57 -07:00
hkuang	36e9b82080	Add neon optimize vp9_short_idct8x8_1_add. Change-Id: I0b15d5e3b0eb97abb9ab5ec08e88b61f8723aaf4	2013-08-26 16:28:57 -07:00
hkuang	69384f4fad	Add neon optimize vp9_short_idct4x4_1_add. Change-Id: I6ecb5c4a1a472feb8e84e9f3352b536d5e28a4a5	2013-08-26 15:55:16 -07:00
Dmitry Kovalev	45870619f3	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder. Adding temporary "typedef BLOCK_SIZE BLOCK_SIZE_TYPE" which will go away after encoder's patch. Change-Id: I06ec6a6f079401439843ec981d1496234fd7775c	2013-08-26 11:33:16 -07:00
Jingning Han	4681197a58	Merge "Temporarily disable SSSE3 quant_32x32"	2013-08-26 11:19:53 -07:00
Jingning Han	166dc85bed	Temporarily disable SSSE3 quant_32x32 Make the current head working properly, while working on fixing an issue in the SSSE3 implementation of 32x32 quantization. Change-Id: Ic029da3fd7f1f5e58bc641341cbd226ec49a16bc	2013-08-26 10:45:59 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
Dmitry Kovalev	50ee61db4c	Renaming D27 to D207. I've already renamed d27_predictor to d207_predictor but forgot about the corresponding constant. Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857	2013-08-23 17:33:48 -07:00
Dmitry Kovalev	480dd8ffbe	Using existing functions instead of raw expressions. Change-Id: Ifa50b04bac1a6ff2abef989073cbf1f37a89eb50	2013-08-23 17:26:53 -07:00
Dmitry Kovalev	e6c435b506	Merge "Cleanup in mvref_common.{h, c}."	2013-08-23 17:09:49 -07:00
Yaowu Xu	13930cf569	Limit mv range to be based on partition size Previous change `c4048dbd` limits the mv search range assuming max block size of 64x64, this commit change the search range using actual block size instead. Change-Id: Ibe07ab02b62bf64bd9f8675d2b997af20a2c7e11	2013-08-23 15:43:57 -07:00
Adrian Grange	78debf246b	Merge "Fix bug in convolution functions (filter selection)"	2013-08-23 13:41:47 -07:00
Dmitry Kovalev	21d8e8590b	Cleanup in mvref_common.{h, c}. Making code more compact, adding consts, removing redundant arguments, adding do/while(0) for macros. Change-Id: Ic9ec0bc58cee0910a5450b7fb8cfbf35fa9d0d16	2013-08-23 12:00:30 -07:00
Adrian Grange	3f10831308	Fix bug in convolution functions (filter selection) (In response to Issue 604: https://code.google.com/p/webm/issues/detail?id=604) There were bugs in the convolution code for two cases: 1. Where the filter table was assumed to be aligned to a 256 byte boundary. The offset of the pixel in the source buffer was computed incorrectly. 2. Where no such alignment assumption was made. An incorrect address for the filter table base was used. To fix both problems, I now assume that the filter table is 256-byte aligned and modify the pixel offset calculation to match. A later patch should remove the restriction that the filter table is aligned to a 256-byte boundary. There was also a bug in the ConvolveTest unit test (convolve_test.cc). (Bug & initial fix suggestion submitted by Tero Rintaluoma and Sami Pietilä). Change-Id: I71985551e62846e55e40de9e7e3959d4805baa82	2013-08-23 11:16:08 -07:00
Dmitry Kovalev	1c159c470a	Merge "Checking scale factors on access."	2013-08-23 11:05:17 -07:00
hkuang	b85367a608	Merge "Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling."	2013-08-23 10:08:43 -07:00
James Zern	d843ac5132	Merge "rename LOG2_* defines to *_LOG2"	2013-08-22 18:02:42 -07:00
Dmitry Kovalev	53f6f8ac93	Merge "check_bsize_coverage cleanup."	2013-08-22 16:18:24 -07:00
hkuang	4205d79273	Merge "Add neon optimize vp9_short_idct10_16x16_add."	2013-08-22 15:57:28 -07:00
hkuang	4082bf9d7c	Add neon optimize vp9_short_idct10_16x16_add. vp9_short_idct10_16x16_add is used to handle the block that only have valid data at top left 4x4 block. All the other datas are 0. So we could cut many unnecessary calculations in order to save instructions. Change-Id: I6e30a3fee1ece5af7f258532416d0bfddd1143f0	2013-08-22 15:53:22 -07:00
Dmitry Kovalev	335b1d360b	check_bsize_coverage cleanup. Change-Id: Ib7803857b35c00e317c9deb8630e777e25eb278f	2013-08-22 15:45:56 -07:00
Dmitry Kovalev	3c42657207	Checking scale factors on access. It is possible to have invalid scale factors and not access them during decoding. Error is reported if we really try to use invalid scale factors. Change-Id: Ie532d3ea7325ee0c7a6ada08269f804350c80fdf	2013-08-22 15:19:05 -07:00
James Zern	40ae02c247	rename LOG2_* defines to *_LOG2 gets rid of a mix of styles Change-Id: I3591d312157bc6f53a25438bf047765c671fd8a8	2013-08-22 14:45:24 -07:00
Dmitry Kovalev	640dea4d9d	Adding vp9_is_scaled function. Change-Id: Ieb7077ca3586b9491912027eed450a4f6fd38d30	2013-08-22 14:04:59 -07:00
hkuang	610642c130	Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling. Change-Id: I5ea881a6e419f9e8ed4b3b619406403b4de24134	2013-08-22 11:02:22 -07:00
Dmitry Kovalev	96a1a59d21	Merge "Using has_second_ref function to simplify the code."	2013-08-22 01:39:14 -07:00
Dmitry Kovalev	a33f178491	Merge "Cleaning up foreach_transformed_block_in_plane."	2013-08-22 01:37:21 -07:00
Dmitry Kovalev	359b571448	Merge "Cleaning up reset_skip_context function."	2013-08-22 01:36:25 -07:00
Dmitry Kovalev	596c51087b	Merge "Removing unused foreach_predicted_block function."	2013-08-22 01:35:41 -07:00
Dmitry Kovalev	4172d7c584	Cleaning up foreach_transformed_block_in_plane. Change-Id: I9f45af3894c57f35cb266c255e2b904295d39c34	2013-08-21 17:16:02 -07:00
Dmitry Kovalev	c43da352ab	Cleaning up reset_skip_context function. Change-Id: Ib3e72671eb8da6f2e9767a6de292ec7c7cde6bc7	2013-08-21 16:31:51 -07:00
Dmitry Kovalev	3286abd82e	Merge "Adding scale factor check."	2013-08-21 14:11:13 -07:00
James Zern	ac12f3926b	Merge "vp9 rtcd: remove non-existent sad functions"	2013-08-21 13:55:59 -07:00
Dmitry Kovalev	27a984fbd3	Removing a lot of duplicated code. Adding set_contexts contexts function and call it instead of set_contexts_on_border. Calling txfrm_block_to_raster_xy to get aoff and loff. Change-Id: I41897e344afd2cae1f923f4fdbe63daccf6fe80e	2013-08-21 11:55:12 -07:00
Dmitry Kovalev	a3ae4c87fd	Adding scale factor check. We support only [1/16, 2] scale factors, enforcing this now. Change-Id: I0822eb7cea51720df6814e42d3f35ff340963061	2013-08-21 11:24:47 -07:00
Adrian Grange	ce28d0ca89	Fix typos and minor stylistic cleanup Change-Id: I32e43474e8651ef2eb181d24860a8f118cfea7bf	2013-08-21 08:45:42 -07:00
Adrian Grange	5b63963573	Merge "Further correct bug in loopfilter initialization"	2013-08-21 07:17:43 -07:00
James Zern	ae455fabd8	vp9 rtcd: remove non-existent sad functions vp9_sad32x3, vp9_sad3x32 + remove unnecessary sad include from vp9_findnearmv.c Change-Id: Idef2a89cadc3fec64eff82ba9be60ffff50b3468	2013-08-20 18:07:53 -07:00
Dmitry Kovalev	90027be251	Removing unused foreach_predicted_block function. Moving foreach_predicted_block_in_plane function to vp9_reconinter.c because there is only one usage. Change-Id: I9852feae43fc3cf809b817fc541d043bc5496209	2013-08-20 17:20:47 -07:00
Dmitry Kovalev	7f814c6bf8	Merge "Passing plane_bsize to foreach_transformed_block_visitor."	2013-08-20 14:25:01 -07:00
Dmitry Kovalev	27de4fe922	Using has_second_ref function to simplify the code. Updating implementation of vp9_get_pred_context_single_ref_p2 using has_second_ref function to make code easier to read. Change-Id: I5ba642712f59861a48aab974e73aa01640d086fe	2013-08-20 14:09:56 -07:00
hkuang	62a2cd9ed2	Merge "Add neon optimize vp9_short_idct10_8x8_add."	2013-08-20 14:06:57 -07:00
Dmitry Kovalev	d19ac4b66d	vp9_filter.{h, c} cleanup + adding SUBPEL_TAPS constant. Change-Id: Ib394ea23f464591dad50b5c65c316701378d06d7	2013-08-20 12:29:57 -07:00
hkuang	37cda6dc4c	Add neon optimize vp9_short_idct10_8x8_add. vp9_short_idct10_8x8_add is used to handle the block that only have valid data at top left 4x4 block. All the other datas are 0. So we could cut several unnecessary calculations in order to save instructions. Change-Id: I34fda95e29082b789aded97c2df193991c2d9195	2013-08-20 11:51:07 -07:00
Dmitry Kovalev	5826407f2a	Merge "Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c."	2013-08-20 10:06:22 -07:00
Dmitry Kovalev	5baf510f74	Merge "Adding has_second_ref function."	2013-08-20 10:06:14 -07:00
Dmitry Kovalev	039b0c4c9e	Merge "Adding VP9_FILTER_BITS constant."	2013-08-20 10:05:09 -07:00
Jim Bankoski	f167433d9c	fix the mv_ref_idx issue The following issue was reported : https://code.google.com/p/webm/issues/detail?id=601&q=jimbankoski&sort=-id&colspec=ID%20Pri%20mstone%20ReleaseBlock%20Type%20Component%20Status%20Owner%20Summary This code makes the choice and code cleaner and removes any question about whether the border needs to be checked. Change-Id: Ia7aecfb3168e340618805bd318499176c2989597	2013-08-20 08:14:52 -07:00
Dmitry Kovalev	2612b99cc7	Adding VP9_FILTER_BITS constant. Removing VP9_FILTER_WEIGHT, VP9_FILTER_SHIFT, BLOCK_WIDTH_HEIGHT constants. Using ROUND_POWER_OF_TWO for rounding. Change-Id: I2e8d6858dcd600a87096138209731137d7decc24	2013-08-20 00:42:25 -07:00
Dmitry Kovalev	d8286dd56d	Adding has_second_ref function. Updating implementation of vp9_get_pred_context_single_ref_p1 using has_second_ref function to make code easier to read. Change-Id: Ie8f60403a7195117ceb2c6c43176ca9a9e70b909	2013-08-19 18:39:34 -07:00
Yaowu Xu	c4048dbdd3	Change to limit the mv search range As the pixel values beyond image border are duplicates of pixels on edge, the change limits the mv search range, any mv beyond the limits no longer produce new/different prediction values as entire block with pixels used for subpel interpolation are outside image border. Change-Id: I4c6fdf06e33c1cef1489f5470ce0fb4e5e01fb79	2013-08-19 17:19:36 -07:00
Dmitry Kovalev	569ca37d09	Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c. Change-Id: Ib8af21f2e7f603c2fb407e5d15a3bba64b545b49	2013-08-19 16:44:10 -07:00
Dmitry Kovalev	82d4d9a008	Passing plane_bsize to foreach_transformed_block_visitor. Updating all foreach_transformed_block_visitor functions to work with plane block size instead of general block. Removing a lot of duplicated code. Change-Id: I6a9069e27528c611f5a648e1da0c5a5fd17f1bb4	2013-08-19 15:47:24 -07:00
Dmitry Kovalev	2e3478a593	Using plane_bsize instead of bsize. This change set is intermediate. The next one will remove all repetitive plane_bsize calculations, because it will be passed as argument to foreach_transformed_block_visitor. Change-Id: Ifc12e0b330e017c6851a28746b3a5460b9bf7f0b	2013-08-19 13:20:21 -07:00
Adrian Grange	5a1a269f67	Further correct bug in loopfilter initialization The intent was to initialize the deltas for the segment to the computed value, irrespective of mode and reference frame if (mode_ref_delta_enabled == 0). (In response to bug posted by Manjit Hota to codec-devel and webm-discuss lists) Change-Id: I10435cb63d0f88359bb4c14f22181878a1988e72	2013-08-19 11:58:52 -07:00
Dmitry Kovalev	26e5b5e25d	Removing unused or redundant arguments from *_args structures. Redundant dst, pre[2] from build_inter_predictors_args, unused cm from encode_b_args. Change-Id: I2c476cd328c5c0cca4c78ba451ca6ba2a2c37e2d	2013-08-16 12:51:20 -07:00
Dmitry Kovalev	367cb10fcf	Merge "Moving from ss_txfrm_size to tx_size."	2013-08-16 12:46:45 -07:00
Dmitry Kovalev	1462433370	Merge "Renaming d27 predictor to d207."	2013-08-16 12:07:24 -07:00
Johann	d514b778c4	Merge "Reduce the instructions of idct8x8. Also add the saving and restoring of D registers."	2013-08-16 11:30:21 -07:00
Johann	65aa89af1a	Merge "Reduce instructions of idct4x4."	2013-08-16 11:28:35 -07:00
Frank Galligan	bdc785e976	Merge "vp9: neon: optimise vp9_wide_mbfilter_neon"	2013-08-16 11:16:48 -07:00
hkuang	df0715204c	Reduce instructions of idct4x4. Change-Id: Ia26a2526804e7e2f656b0051618a615fca8fc79d	2013-08-16 10:54:56 -07:00
hkuang	60ecd60c9a	Reduce the instructions of idct8x8. Also add the saving and restoring of D registers. Change-Id: Id3630c90fcb160ef939fef55411342608af5f990	2013-08-16 10:32:12 -07:00
Johann	bba68342ce	Merge "vp9: neon: use aligned stores in convolve functions"	2013-08-16 10:29:59 -07:00
Adrian Grange	3e340880a8	Merge "Added resizing & initialization of last frame segment map"	2013-08-16 09:07:36 -07:00
Mans Rullgard	4fa93bcef4	vp9: neon: use aligned stores in convolve functions The destination is block-aligned so it is safe to use aligned stores. Change-Id: I38261e4fa40bc60e6472edffece59e372908da7e	2013-08-16 14:25:08 +01:00
Dmitry Kovalev	afd9bd3e3c	Moving from ss_txfrm_size to tx_size. Updating foreach_transformed_block_visitor and corresponding functions to accept tx_size instead of ss_txfrm_size. List of functions per file: vp9_decodframe.c decode_block decode_block_intra vp9_detokenize.c decode_block vp9_encodemb.c optimize_block vp9_xform_quant vp9_encode_block_intra vp9_rdopt.c dist_block rate_block block_yrd_txfm vp9_tokenize.c set_entropy_context_b tokenize_b is_skippable Change-Id: I351bf563eb36cf34db71c3f06b9bbc9a61b55b73	2013-08-15 17:03:03 -07:00
Adrian Grange	d5bec522da	Added resizing & initialization of last frame segment map When the frame size changes the last frame segment map must be resized to match and initialized to 0. Change-Id: Idc10de109f55dbe9af3a6caae355a2974712243d	2013-08-15 15:35:21 -07:00
Dmitry Kovalev	9451e8d37e	Merge "Converting code from using ss_txfrm_size to tx_size."	2013-08-15 15:21:09 -07:00
Dmitry Kovalev	939b1e4a8c	Merge "Moving segmentation struct from MACROBLOCKD to VP9_COMMON."	2013-08-15 15:14:32 -07:00
Johann	a9aa7d07d0	Merge "vp9: neon: add vp9_convolve_avg_neon"	2013-08-15 14:55:15 -07:00
Johann	63e140eaa7	Merge "vp9: neon: add vp9_convolve_copy_neon"	2013-08-15 14:55:08 -07:00
Dmitry Kovalev	bb3b817c1e	Converting code from using ss_txfrm_size to tx_size. Updated function signatures: txfrm_block_to_raster_block txfrm_block_to_raster_xy extend_for_intra vp9_optimize_b Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a	2013-08-15 11:44:57 -07:00
Dmitry Kovalev	81d7bd50f5	Renaming d27 predictor to d207. 27 degrees intra predictor is actually 207 degrees, so renaming it. Change-Id: Ife96a910437eb80ccdc0b7a5b7a62c77542ae5be	2013-08-15 11:09:49 -07:00
Mans Rullgard	67e53716e0	vp9: neon: optimise vp9_wide_mbfilter_neon Break up long dependency chains to improve instruction scheduling. Change-Id: I0e0cb66943df24af920767bb4167b25c38af9630	2013-08-15 19:07:22 +01:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place to segmentatation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Jingning Han	ec01f52ffa	Unify luma and chroma rd-cost estimation This commit unifies the rate-distortion cost calculation process of luma and chroma components. It allows early termination to be enabled later in the rd search loop of chroma components, in consistent with luma pixels. Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d	2013-08-15 09:41:33 -07:00
Paul Wilkins	1a3641d91b	Merge "Renaming in MB_MODE_INFO"	2013-08-15 02:12:48 -07:00
hkuang	39f42c8713	Merge "Add neon optimize vp9_short_idct16x16_add."	2013-08-14 14:16:20 -07:00
hkuang	cf6beea661	Add neon optimize vp9_short_idct16x16_add. Change-Id: I27134b9a5cace2bdad53534562c91d829b48838d	2013-08-14 13:52:16 -07:00
Dmitry Kovalev	bb072000e8	foreach_transformed_block_in_plane cleanup, explicit tx_size var. Making foreach_transformed_block_in_plane more clear (it's not finished yet). Using explicit tx_size variable consistently instead of (ss_txfrm_size / 2) or (ss_txfrm_size >> 1) expression. Change-Id: I1b9bba2c0a9f817fca72c88324bbe6004766fb7d	2013-08-14 11:39:31 -07:00
Dmitry Kovalev	f2c073efaa	Adding const to arguments of intra prediction functions. Adding const to above and left pointers. Cleanup. Change-Id: I51e195fa2e2923048043fe68b4e38a47ee82cda1	2013-08-14 10:35:56 -07:00
Mans Rullgard	0f1deccf86	vp9: neon: add vp9_convolve_avg_neon Change-Id: I33cff9ac4f2234558f6f87729f9b2e88a33fbf58	2013-08-14 16:27:55 +01:00
Mans Rullgard	635ba269be	vp9: neon: add vp9_convolve_copy_neon Change-Id: I15adbbda15d1842e9f15f21878a5ffbb75c3c0c9	2013-08-14 16:27:55 +01:00
Paul Wilkins	26fead7ecf	Renaming in MB_MODE_INFO The macro block mode info context originally contained an entry for each 16x16 macroblock. In VP9 each entry refers to an 8x8 region not a macro block, so the naming is misleading. This first stage clean up changes the names of 3 entries in the structure to remove the mb_ prefix. TODO clean up the nomenclature more widely in respect of mbmi and bmi. Change-Id: Ia7305c6d0cb805dfe8cdc98dad21338f502e49c6	2013-08-14 12:47:52 +01:00
Dmitry Kovalev	8105ce6dce	Merge "Using is_inter_block() instead of repetitive code."	2013-08-13 10:00:01 -07:00
Jingning Han	39fe235032	Merge "SSE2 high precision 32x32 forward DCT"	2013-08-12 23:03:47 -07:00
Johann	4417c04531	Merge "vp9: neon: optimise convolve8_vert functions"	2013-08-12 17:54:47 -07:00
Johann	4cabbca4ce	Merge "vp9: neon: optimise convolve8_horiz functions"	2013-08-12 17:54:42 -07:00
Dmitry Kovalev	32006aadd8	Using is_inter_block() instead of repetitive code. Change-Id: If0b04c476c34fb8c102c9f750d7fe5669a86a532	2013-08-12 17:42:14 -07:00
Jingning Han	78136edcdc	SSE2 high precision 32x32 forward DCT Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56	2013-08-12 16:52:53 -07:00
Dmitry Kovalev	b89eef8f82	Merge "Simplifying vp9_mvref_common.c."	2013-08-12 16:24:22 -07:00
Dmitry Kovalev	b214cd0dab	Merge "Removing foreach_predicted_block_uv function."	2013-08-12 15:54:01 -07:00
Dmitry Kovalev	1a5e6ffb02	Simplifying vp9_mvref_common.c. Change-Id: I272df2e33fa05310466acf06c179728514dd7494	2013-08-12 15:52:08 -07:00
Dmitry Kovalev	c66320b3e4	Merge "Entropy context related cleanups."	2013-08-12 15:18:24 -07:00
Dmitry Kovalev	bd1bc1d303	Merge "Making scaling code more clear."	2013-08-12 15:17:26 -07:00
Dmitry Kovalev	9a31d05e24	Removing unused convolve_avg_c function + cleanup. Change-Id: Id2b126c6456627c25e4041a82e304d0151d951ba	2013-08-12 14:28:00 -07:00
Dmitry Kovalev	76d166e413	Removing foreach_predicted_block_uv function. Adding function build_inter_predictors_for_planes to build inter predictors for specified planes. This function allows to remove condition "#if CONFIG_ALPHA" and use MAX_MB_PLANE for general case. Renaming 'which_mv' local var to 'ref', and 'weight' argument to 'ref'. Change-Id: I1a97160c9263006929d38953f266bc68e9c56c7d	2013-08-12 13:54:13 -07:00
Dmitry Kovalev	a72e269318	Making scaling code more clear. Reusing existing functions, using constants instead of magic numbers. Change-Id: Idc689ffba52c9a8b203fcf26bd67110ecb5635f9	2013-08-12 13:30:26 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Mans Rullgard	ad7021dd6c	vp9: neon: optimise convolve8_vert functions Invert loops to operate vertically in the inner loop. This allows removing redundant loads. Also add preloading of data. Change-Id: I4fa85c0ab1735bcb1dd6ea58937efac949172bdc	2013-08-12 15:37:48 +01:00
Dmitry Kovalev	097046ae28	Merge "Removing redundant code and function arguments."	2013-08-11 12:20:58 -07:00
Mans Rullgard	b84dc949c8	vp9: neon: optimise convolve8_horiz functions Each iteration of the horizontal loop reuses 7 of the 11 source values. Loading only the 4 new values saves some time. Also add preload for source data. Overall 4% faster on Chromebook. Change-Id: I8f69e749f2b7f79e9734620dcee51dbfcd716b44	2013-08-11 16:21:55 +01:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Dmitry Kovalev	67fe9d17cb	Removing redundant code and function arguments. Change-Id: Ia5cdda0f755befcd1e64397452c42cb7031ca574	2013-08-09 17:24:40 -07:00
Dmitry Kovalev	f1559bdeaf	Inlining 16 as a stride for BLOCK_OFFSET macro. Change-Id: I7f23d174eb089e5500f268a10db09648634c1b82	2013-08-09 16:40:05 -07:00
Dmitry Kovalev	125146034e	Merge "Using MV struct instead of int[2] array."	2013-08-09 15:33:08 -07:00
Dmitry Kovalev	cd0629fe68	Merge "Removing plane_block_{width, height}_log2by4 functions."	2013-08-09 15:26:51 -07:00
Dmitry Kovalev	ff7df102d9	Merge "Moving loopfilter struct to VP9_COMMON."	2013-08-09 15:23:00 -07:00
Dmitry Kovalev	816d6c989c	Moving loopfilter struct to VP9_COMMON. Loop filter configuration doesn't belong to macroblock, so moving it from MACROBLOCKD to VP9_COMMON. Also moving the declaration of loopfilter struct from vp9_blockd.h to vp9_loopfilter.h. Change-Id: I4b3e34be9623b47cda35f9b1f9951f8c5b1d5d28	2013-08-09 14:41:51 -07:00
Dmitry Kovalev	8ffe85ad00	Moving scale_factors and related code to separate files. Change-Id: I531829e5aee2a4a7a112d528ecccbddf052d0e74	2013-08-09 14:07:09 -07:00
Dmitry Kovalev	fa0cd61087	Merge "Using buf_2d struct instead of separate buffer and stride vars."	2013-08-09 11:50:58 -07:00
Adrian Grange	0eef1acbef	Merge "Correct bug in loopfilter initialization"	2013-08-09 09:51:58 -07:00
Adrian Grange	12eb2d0267	Correct bug in loopfilter initialization The memset sets 16 bytes rather than the correct size of the final array dimension (MAX_MODE_LF_DELTAS). (In response to bug posted by Manjit Hota to codec-devel and webm-discuss lists) Change-Id: I8980f5aa71ddc9d7ef57c5b4700bc28ddf8651c7	2013-08-09 09:21:15 -07:00
Yaowu Xu	6ec2b85bad	Added lpf level picking using partial frame Change-Id: I599ab1bd22b5f3f10d5962c609952abdef8ff67a	2013-08-09 07:37:08 -07:00
Dmitry Kovalev	6fd2407035	Using buf_2d struct instead of separate buffer and stride vars. Change-Id: Id5cc3566cc16d1e3030ddb4d1c58459320321dca	2013-08-08 21:25:48 -07:00
Dmitry Kovalev	6a8ec3eac2	General code cleanup. Removing redundant parenthesis and curly braces. Combining declarations with initializations. Adding useful intermediate variables instead of recalculating expressions every time. Change-Id: I00106f404afd60bfc189905b0fded881684f941a	2013-08-08 21:12:34 -07:00
Dmitry Kovalev	ee40e1a637	Merge "Cleanup inside vp9_reconinter.c."	2013-08-08 14:59:38 -07:00
Dmitry Kovalev	47fad4c2d7	Using MV struct instead of int[2] array. Change-Id: Iab951c555037e36b154f319f351c5e67f9abb931	2013-08-08 12:01:56 -07:00
Dmitry Kovalev	ac008f0030	Removing unneeded intermediate entropy_nodes_adapt var. Change-Id: I541a178d997b4541e0e2d4d5b854e2ed6b113c3a	2013-08-08 11:52:02 -07:00
Dmitry Kovalev	61c33d0ad5	Removing plane_block_{width, height}_log2by4 functions. Change-Id: I040b82b8e32aee272d10cbb021c7ba1c76343d7a	2013-08-07 17:06:33 -07:00
Dmitry Kovalev	a766d8918e	Cleanup inside vp9_reconinter.c. Using block width and block height instead of their logarithms. Using SUBPEL_BITS and SUBPEL_SHIFTS constants instead of magic numbers. Change-Id: I4e10e93c907c8a5e1cb27dfe74d1fcdcc4995448	2013-08-07 17:02:28 -07:00
Dmitry Kovalev	82d7c6fb3c	Merge "Using only one scale function in scale_factors struct."	2013-08-07 16:32:09 -07:00
Dmitry Kovalev	8db2675b97	Adding ss_size_lookup table. Removing the old one bsize_from_dim_lookup. Now we have a way to determine block size for plane using its subsampling values (ss_size_lookup). And then we can find the number of pixels in the block (num_pels_log2_lookup). Change-Id: I6fc981da2ae093de81741d3d78eaefed11015db9	2013-08-07 15:33:17 -07:00
Christian Duvivier	78182538d6	Neon version of vp9_short_idct4x4_add. Change-Id: Idec4cae0cb9b3a29835fd2750d354c1393d47aa4	2013-08-06 18:41:27 -07:00
Dmitry Kovalev	63ec0587c1	Merge "Motion vector code cleanup."	2013-08-06 16:00:01 -07:00
Dmitry Kovalev	1c552e79bd	Using only one scale function in scale_factors struct. Functions scale_mv_q4 and scale_mv_q3_to_q4 were almost identical except q3->q4 conversion in scale_mv_q3_to_q4. Now q3->q4 conversion happens directly in vp9_build_inter_predictor. Also adding useful constants: SUBPEL_BITS and SUBPEL_MASK. Change-Id: Ia0a6ad2ac07c45fdf95a5139ece6286c035e9639	2013-08-06 15:43:56 -07:00
Jim Bankoski	5b307886fb	variance x86inc guards also fixed bug in sad calcs Change-Id: I6571fcbe37556c16ae32be66dc0fd879852aac1d	2013-08-06 14:17:13 -07:00
Jim Bankoski	6eb1254b88	sse3 intrapred x86inc protected Change-Id: I4a3c83119cdf8a205920034c8019d855d5504605	2013-08-06 14:17:13 -07:00
Jim Bankoski	c9126e0b30	sad + miscellaneous updates Enable use_x86inc as a commandline option. Fix Bug with sse2 when x86inc is disabled. Adds Sad asm protection to x86inc protection Change-Id: Iee0f9dd235ea10e8ace512eb362ba9bebe8c9df6	2013-08-06 12:16:04 -07:00
Dmitry Kovalev	8725ca2ed2	Merge "Inlining vp9_get_pred_probs_switchable_interp function."	2013-08-06 11:57:45 -07:00
Dmitry Kovalev	0c80065694	Inlining vp9_get_pred_probs_switchable_interp function. There was no benefit having this function. For example, inside read_switchable_filter_type switchable filter context was calculated twice. Change-Id: I79cd5bf95cbc0f6d8bf91a2e32289e01b18dcff1	2013-08-06 11:04:31 -07:00
Jim Bankoski	efc94102f0	Merge "intrapred x86inc guards"	2013-08-06 10:39:19 -07:00
Dmitry Kovalev	a39abe2627	Motion vector code cleanup. Converting arguments of two functions (clamp_mv_ref, lower_mv_precision) from int_mv* to MV*. Rewriting is_inside function to make it much shorter. Change-Id: Ie4c4cf3eccd46707c7df099ec21fb1b61c72fc7a	2013-08-06 10:31:11 -07:00
Dmitry Kovalev	3e51acafec	Merge "Finally removing all old block size constants."	2013-08-06 10:30:37 -07:00
Dmitry Kovalev	4a692e4168	Merge "Changing the order switchable filter enum constants."	2013-08-06 10:30:26 -07:00
Jim Bankoski	25ec1375c9	intrapred x86inc guards Change-Id: If0399d8e11f4ebe75a5c91abb8d6a52a7709065b	2013-08-06 09:39:30 -07:00
Jim Bankoski	62c6aa884d	block error / x86inc mods Change-Id: Icb607745634e10b9bac5019d06661ece09fcdb40	2013-08-06 06:23:38 -07:00
Jim Bankoski	a93b115cd6	reworked config for use_x86_inc Support enabling it or disabling it. Moved read out to configure.sh so that its done once instead of in make and in config. Change-Id: I73a9190cf31de9f03e8a577f478fa522f8c01c8b	2013-08-05 17:35:25 -07:00
James Zern	d115cd8b12	Merge changes I082959ab,Ib6932640 * changes: vp9/decoder: threaded row-based loop filter vp9/decoder: add thread worker	2013-08-05 16:07:09 -07:00
Dmitry Kovalev	b9c7d04e95	Finally removing all old block size constants. Change-Id: I3aae21e88b876d53ecc955260479980ffe04ad8d	2013-08-05 15:23:49 -07:00
Jim Bankoski	f4837579d1	fixed script problem with config_force_x86_inc Change-Id: I226e5094d216b09dc47fa5511a66e2d314608000	2013-08-05 14:48:20 -07:00
Jim Bankoski	a5a7322459	Merge "Begin to restrict x86inc.asm usage"	2013-08-05 14:17:49 -07:00
Jim Bankoski	9f988a2edf	Merge "cleanups after bw bh code"	2013-08-05 14:02:02 -07:00
James Zern	a0ffa2794b	vp9/decoder: threaded row-based loop filter Currently the only threaded option for vp9 decode. Enabled when the decoder config thread count is > 1. Change-Id: I082959abac9e31aa4a38ed9fd68b94680e57f4df	2013-08-05 13:22:04 -07:00
Dmitry Kovalev	3f611555d7	Changing the order switchable filter enum constants. This changeset allows to remove vp9_switchable_interp and vp9_switchable_interp_map arrays and make code much clear. Actually we still have to use these mapping but only inside read_interp_filter_type and write_interp_filter_type functions. Change-Id: I4026c6f8c4acefba6c81421b7bacbaa52cc45f50	2013-08-05 12:26:15 -07:00
Jim Bankoski	5d2cb7ead0	cleanups after bw bh code Cons bw/bh parms that should have been const. Additional formatting. Change-Id: Icd36a5c9dc17dadd7284315ac0d6fef1a565ca16	2013-08-05 12:15:52 -07:00
Jim Bankoski	c3809f3de5	Begin to restrict x86inc.asm usage Chromium does not support 32bit builds for Mac which use x86inc.asm. Make the files which include it work if 64bit or not PIC enabled starting with vp9_copy_sse2.asm Consolidate these targets in vp9_rtcd_defs.sh Change-Id: If18f0b957a611efd085a3ee7d245cf1eb91e8248	2013-08-05 12:07:30 -07:00
Dmitry Kovalev	d007446b3f	Replacing long block size enum values with shorter ones (2). Change-Id: I428c4d42212b757112e3acfe5b81314cfbb5fd6b	2013-08-05 10:51:02 -07:00
Dmitry Kovalev	319867d71c	Merge "Cleaning up vp9_build_inter_predictor function."	2013-08-05 01:52:11 -07:00
Jim Bankoski	f703f98757	reworked find_mv_ref This is an attempt at rewriting vp9_find_mv_refs_idx. I believe that it gains about 1-2% decode speed Change-Id: Ia5359c94ce9bb43b32652890e605e9a385485c1b	2013-08-03 20:25:55 -07:00
Dmitry Kovalev	7b50333e8f	Merge "Adding is_inter_block function."	2013-08-02 16:54:32 -07:00
Dmitry Kovalev	5f0a52faaf	Cleaning up vp9_build_inter_predictor function. Change-Id: I94f6b4272b95ac101de6d10f048116ba065788b0	2013-08-02 16:53:18 -07:00
Dmitry Kovalev	603931e291	Merge "Changing function arg type from int_mv* to MV*."	2013-08-02 16:30:06 -07:00
Dmitry Kovalev	a6adc82e78	Merge "Cleanups around allow_high_precision_mv flag."	2013-08-02 16:27:05 -07:00

... 4 5 6 7 8 ...

1813 Commits