generic-library/vpx

Author	SHA1	Message	Date
Aℓex Converse	08d5cf226e	Merge "Remove branch in inner loop of foreach_transformed_block_in_plane()"	2015-07-28 21:59:33 +00:00
Jingning Han	d19033fa4e	Move DC only forward 2D-DCT functions to vpx_dsp This completes the forward transform functions layout refactoring. Change-Id: I996fb0fb795f41e2040f7b21db985774098aedbd	2015-07-28 14:52:30 -07:00
Hui Su	fe7cabe8b6	Merge "Move intra prediction functions from vp9/common/ to vpx_dsp/"	2015-07-28 20:41:01 +00:00
Jingning Han	a73f0f4170	Merge "Factor 32x32 fwd DCT to vpx_dsp folder"	2015-07-28 20:36:59 +00:00
Jingning Han	a6a4659bea	Factor 32x32 fwd DCT to vpx_dsp folder Move the 32x32 2D-DCT implementations from vp9/ to vpx_dsp/. Change-Id: Id3980696f8b69906ff7a59ff9fb2b9013d60047d	2015-07-28 11:13:41 -07:00
Frank Galligan	b1fb6e0365	Fix dspr2 build. Change-Id: I18895c29d6db872d033b3874de9dcd9501d0c10e	2015-07-28 09:05:41 -07:00
James Zern	ea990af7f5	add vp9_block_error_fp_neon ~60-70% faster depending on the block size Change-Id: Icdbaa9977a91a63cbcc6ead0cf19d5a2af7f27e1	2015-07-27 19:59:50 -07:00
hui su	4013645353	Replace prefix vp9_ with vpx_ for intra prediction functions Change-Id: I8ae6fb586f8d5d018ace228df11714f82b085076	2015-07-27 13:42:06 -07:00
hui su	7971846a5e	Move intra prediction functions from vp9/common/ to vpx_dsp/ Change-Id: I64edc26cf4aab050c83f2d393df6250628ad43b8	2015-07-27 13:38:16 -07:00
Jingning Han	5f214d6bca	Use common coefficient definition in neon idct implementations Replace the duplicate coefficient definition in neon implementations of inverse transform with those from vpx_dsp/txfm_common.h Change-Id: I4cd9bd9569ab1793dfdbb6f16d80bcb581599f0d	2015-07-27 12:12:31 -07:00
Jingning Han	a9a1d4e8e5	Replace vp9_idct.h for precise dependency This commit replaces vp9_idct.h with txfm_common.h in many SIMD implementation files for precise file dependency. Change-Id: If73dd726bb16537e7494f28538b0a169810f9756	2015-07-27 11:55:31 -07:00
Jingning Han	5ebc8febdc	Refactor vp9_idct.h file Separate the common coefficient constant into vpx_dsp/txfm_common.h. Move the SSE2 macro definitions to vpx_dsp/x86/txfm_common_sse2.h. This clears the use case of vp9_idct.h in vpx_dsp folder. Change-Id: I319735a2abf42888e5080ac14cfbcde34be7b121	2015-07-26 08:26:32 -07:00
Alex Converse	742021f026	Remove branch in inner loop of foreach_transformed_block_in_plane() Change-Id: Ib14d09376a9ce4fa5f541264e5c335aceb71380a	2015-07-24 11:14:33 -07:00
Jingning Han	d341f843e2	Refactor forward/inverse transform msa implementations This commit factors out common macro definitions from the forward and inverse transform implementations into vpx_dsp. It removes the duplicate macro definitions from encoder and decoder folders. Change-Id: I92301acbd3317075e9c5f03328a25abb123bca78	2015-07-23 11:20:30 -07:00
Jingning Han	b67821f37b	Factor forward 2D-DCT transforms into vpx_dsp This commit factors the 4x4, 8x8, and 16x16 2D-DCT forward transform operations into vpx_dsp folder. Change-Id: I084b117b79c0925edcbcabb93f62b9f4bf8dbe7d	2015-07-22 15:48:17 -07:00
Yaowu Xu	70ad668056	vpx_dsp/prob.h: vp9_ -> vpx_ change prefix vp9_ to vpx_ for non codec specific functions and data structures. Change-Id: I97c7e6422eceea99212b93f4942bc2187763a07c	2015-07-20 18:13:04 -07:00
Yaowu Xu	bf82514b54	vpx_dsp/bitreader.h: vp9_->vpx_ Replace vp9_ in names to vpx_ as they are not codec specific. Change-Id: I2e583aa63dee769353ada4b42417aa15c4074ebb	2015-07-20 18:06:31 -07:00
Jingning Han	e253eaa036	Unify the high bit-depth forward hybrid transforms The SSE2 version high bit-depth forward hybrid transforms are essentially using the C functions via cross referencing to 1-D functions in vp9_dct.c. This commit unifies the two versions and removes the unnecessary dependency. Change-Id: Ib4d0702a138f8daf7d0bd97c141ee7088f293765	2015-07-20 11:17:49 -07:00
Yaowu Xu	345ff1a2f2	Merge "Removed vp9_ prefix from vpx_dsp/bitreader file names"	2015-07-20 17:12:08 +00:00
Yunqing Wang	f65473c036	Merge "Migrate quantization functions from vp9/ to vpx_dsp/"	2015-07-20 16:20:07 +00:00
Yaowu Xu	87d2c3c063	Removed vp9_ prefix from vpx_dsp/bitreader file names Change-Id: I0426126d0a65f13f9250983e44cc366b1b1a9c4a	2015-07-20 08:57:35 -07:00
Yaowu Xu	b0e6811ace	Merge "Move bit reader files to vpx_dsp"	2015-07-20 14:52:50 +00:00
Yunqing Wang	38f1fbbb75	Migrate quantization functions from vp9/ to vpx_dsp/ The following quantization functions were moved: vp9_quantize_b vp9_quantize_b_32x32 vp9_highbd_quantize_b vp9_highbd_quantize_b_32x32 vp9_quantize_dc vp9_quantize_dc_32x32 vp9_highbd_quantize_dc vp9_highbd_quantize_dc_32x32 The purpose of doing that was to allow these functions to be shared by multiple codecs. Change-Id: Id8ab939f283353cdd07bd930d47db3d932a5d87f	2015-07-17 16:38:14 -07:00
Jingning Han	2992739b5d	Rename loop filter function from vp9_ to vpx_ Change-Id: I6f424bb8daec26bf8482b5d75dd9b0e45c11a665	2015-07-17 15:55:02 -07:00
Yaowu Xu	97279ed2e2	Move bit reader files to vpx_dsp Change-Id: Ib1cb1fbe92a39ff5312cee069559be6d3ea458d0	2015-07-17 15:38:40 -07:00
Jingning Han	4735edd00f	Migrate mips dspr2 loop filter implementation from vp9 to vpx This commit moves the loop filter dspr2 implementation from vp9 to vpx_dsp directory. It also fixes header file format issues. Change-Id: I09203ed4bd267d7fd76bb79a6ee84a37646206b2	2015-07-17 11:51:05 -07:00
Jingning Han	d0750d287f	Resolve dspr2 loop filter dependency complexity Narrow the scope of dependency required by the dspr2 implementation of loop filters. Change-Id: Ib8d99dc7d9c231f69dd31d02e0a89e5bd0545a28	2015-07-17 10:38:35 -07:00
Jingning Han	55e80a3cc6	Replace vp9_common_dspr2.h with common_dspr2.h Narrow the scope of dependency in dspr2 loop filter implementation. Change-Id: I30426d7e4d41575a82286f1d3c5881aeb99a3250	2015-07-17 10:31:38 -07:00
Jingning Han	b8ff84b7f8	Create common dspr2 header file in vpx_dsp Move the common prefetch_load/store in dspr2 to header file in vpx_dsp/mips. Change-Id: I8acc22970f2a0ef97d73061e39a3ae65c6955eac	2015-07-17 09:54:02 -07:00
Jingning Han	3590a4b437	Merge "Simplify dependencies in dspr2 related codes"	2015-07-17 16:12:52 +00:00
Jingning Han	845aad42b8	Merge "Migrate loop filter functions from vp9/ to vpx_dsp/"	2015-07-17 16:12:01 +00:00
Jingning Han	d190ad228f	Simplify dependencies in dspr2 related codes The common_dspr2.h should be independent of codec-specific data structures. Change-Id: I34ee1f9552c2d2d205fd7f1813cdf312c7ff5d2b	2015-07-16 18:22:48 -07:00
Jingning Han	50adfdf5ba	Migrate loop filter functions from vp9/ to vpx_dsp/ The various tap loop filter operations are common functions across codec. This commit moves them along with SIMD optimizations to vpx_dsp folder. Change-Id: Ia5fa0b2e5289cdb98467502a549c380b9c60e92c	2015-07-16 16:40:47 -07:00
Frank Galligan	8be1dcb4cb	Merge "Add vp9_int_pro_col_neon."	2015-07-16 05:45:17 +00:00
Jingning Han	db8e731b8d	Add vpx_dsp_common.h file Move the clamp functions to vpx_dsp_common.h file. Clear out the dependency of vp9_loopfilter_filters.c on vp9_common.h file. Change-Id: I9c4b928bcd7f597106b5aa96354356d3775a3431	2015-07-15 13:03:23 -07:00
Jingning Han	3fe83cdf81	Remove redundant header files in vp9_loopfilter_filters.c This cleans out the unnecessary dependency on vp9 codec-specific data structures. Change-Id: Iadbe431174a0f9bf9423f39ab854fc18be554bea	2015-07-15 12:44:47 -07:00
Frank Galligan	1c39998e39	Add vp9_int_pro_col_neon. BUG=https://code.google.com/p/webm/issues/detail?id=1023 Change-Id: I212a1d67b23ce3b5ce08800de369b25b9e375e7d	2015-07-15 09:04:28 -07:00
Alex Converse	fa94dbda81	Merge "Add an SSE2 version of vp9_iwht4x4_16_add"	2015-07-14 22:11:47 +00:00
Alex Converse	d8426d6f12	Add an SSE2 version of vp9_iwht4x4_16_add Roughly half as many cycles as plain C. Change-Id: I8c16c29940b76d54ee7e4fb874c328ce90bff5d4	2015-07-14 14:23:32 -07:00
Jingning Han	81452cf0b7	Refactor intra block prediction function This commit simplifies the intra block boundary condition logic. It removes the block index from the argument set. Change-Id: If00142512eb88992613d6609356dfd73ba390138	2015-07-13 15:20:47 -07:00
Yaowu Xu	0cdc85d8cf	Merge "Revert "Add an SSE2 version of vp9_iwht4x4_16_add.""	2015-07-13 16:27:10 +00:00
Yaowu Xu	ae5394b9e2	Revert "Add an SSE2 version of vp9_iwht4x4_16_add." This reverts commit `f8d3501640`. Change-Id: If8c7af403c091b7fb447a6f0c73fecdbccbc51b3	2015-07-13 16:26:27 +00:00
Yaowu Xu	f70c80289c	Merge "Clean out more MSVC warnings"	2015-07-09 17:49:08 +00:00
Scott LaVarnway	e8103f3676	Merge "Eliminate num_8x8 and num_4x4 width/height lookups"	2015-07-09 17:16:22 +00:00
Scott LaVarnway	13a4f14710	Eliminate num_8x8 and num_4x4 width/height lookups Also some log2 lookups. Pass in 8x8 block width/height and log2 num4x4s instead. Change-Id: I8ea9a1ec1e0bbab23f8ba556954a1b5433f4d613	2015-07-09 05:30:46 -07:00
Yaowu Xu	c369daf3ea	Clean out more MSVC warnings Change-Id: I1bab0c104df2ec4825d050cd516e26ab635a7b3e	2015-07-08 15:09:20 -07:00
Alex Converse	f8d3501640	Add an SSE2 version of vp9_iwht4x4_16_add. 80% fewer cycles than C Change-Id: I841bde1e268ddd33ae2ee75eee94737a400e2cde	2015-07-08 15:00:51 -07:00
Alex Converse	8bf791e7ef	Merge "Don't allocate dqcoeff in MACROBLOCKD."	2015-07-08 20:42:36 +00:00
Alex Converse	89090d8046	Don't allocate dqcoeff in MACROBLOCKD. The encoder gets its dqcoeff from the context tree. In the decoder move it to directly after MACROBLOCKD. Change-Id: I46c9b76f26956a360d17de0b26ecb994dae34ecb	2015-07-08 12:37:55 -07:00
Frank Galligan	b770def572	Merge "VP9_LPF_VERTICAL_16_DUAL_SSE2 optimization"	2015-07-08 18:15:39 +00:00
James Zern	892128f6ca	Merge "vp9_entropymv: remove vp9_get_mv_mag()"	2015-07-08 01:27:13 +00:00
Frank Galligan	5327fcf857	Merge "Add vp9_int_pro_row_neon."	2015-07-08 00:16:03 +00:00
Johann	6a82f0d7fb	Move sub pixel variance to vpx_dsp Change-Id: I66bf6720c396c89aa2d1fd26d5d52bf5d5e3dff1	2015-07-07 15:51:04 -07:00
Jingning Han	a652048efd	Add vp9_ prefix to init_macroblockd Change-Id: I202d4924e627eec94838741df004ed9259d38b88	2015-07-07 12:00:01 -07:00
Jingning Han	cccad1c5de	Reduce dqcoeff array size in decoder The decoding process handles detokenization and reconstruction per transform block sequentially. There is no need to offset the dqcoeff buffer according to the transform block index. This reduces memory spill and improves cache performance. Change-Id: Ibb8bfe532a7a08fcabaf6d42cbec1e986901d32d	2015-07-07 11:36:05 -07:00
James Zern	c6d90f0535	vp9_entropymv: remove vp9_get_mv_mag() inline the code directly in read_mv_component(), the only place where it was being used; this removes a function call in a hot function Change-Id: I66f99c0c9ce3bc310101dbca4a470f023cc6fb55	2015-07-06 22:30:21 -07:00
James Zern	1696114587	Merge "mips msa vp9 subpel variance optimization"	2015-07-06 22:43:01 +00:00
Jingning Han	fcb5a8692a	Merge "Move subtract functions from vp9 to vpx_dsp"	2015-07-06 22:39:26 +00:00
Parag Salasakar	fbe67d307a	mips msa vp9 subpel variance optimization Change-Id: If88401bf8c5d8ee58200278734d7a5058d1585d0	2015-07-06 14:59:01 -07:00
James Zern	91c412b6db	Merge "remove vp9_get_interp_kernel()"	2015-07-06 21:36:37 +00:00
James Zern	017253b7a3	remove vp9_get_interp_kernel() expose filter_kernels[] and do the table lookup directly Change-Id: I0b10bff0327c3e01a723736141a9ffd377cd3d20	2015-07-06 13:04:05 -07:00
Jingning Han	432cd4bfb7	Move subtract functions from vp9 to vpx_dsp Factor out the subtraction operator as common function. Change-Id: I526e703477c6a290e0e3e3c8898f8bb1ca82779b	2015-07-06 12:22:47 -07:00
Jingning Han	39f03bf9c6	Merge "Rename vpx_thread to vpx_util"	2015-07-06 17:01:30 +00:00
James Zern	3d4526322b	Merge "Revert "mips msa vp9 subpel variance optimization""	2015-07-02 21:07:32 +00:00
James Zern	4c5ac477cb	Merge "Revert "mips msa vp9 avg subpel variance optimization""	2015-07-02 21:07:24 +00:00
James Zern	97946622c0	Revert "mips msa vp9 subpel variance optimization" This reverts commit `a42df86c03`. this change causes MSA/VP9SubpelVarianceTest.Ref and MSA/VP9SubpelVarianceTest.ExtremeRef failures under mips32r5el-msa-linux-gnu and mips64r6el-msa-linux-gnu Change-Id: I40b71a0b774eaeb31f66f795733f95cf360909f7	2015-07-02 12:06:51 -07:00
James Zern	ced982640b	Revert "mips msa vp9 avg subpel variance optimization" This reverts commit `61774ad1c4`. this change causes MSA/VP9SubpelAvgVarianceTest.Ref failures under mips32r5el-msa-linux-gnu and mips64r6el-msa-linux-gnu Change-Id: I7fb520c12b2a3b212d5e84b7619a380a48e49bb0	2015-07-02 12:06:29 -07:00
levytamar82	3c5256d572	VP9_LPF_VERTICAL_16_DUAL_SSE2 optimization The vp9_lpf_vertical_16_dual function was optimized for the x86 32-bit target. The hot code in that function was caused by the call to transpose8x16. The gcc-generated assembly created unneeded fills and spills to the stack. By interleaving 2 loads and unpack instructions, in addition to hoisting the consumer instruction closer to the producer instructions, we eliminated most of the fills and spills and improved the function-level performance by 17%. Credit for writing the function as well as finding the root cause goes to Erik Niemeyer (erik.a.niemeyer@intel.com) Change-Id: I6173cf53956d52918a047d1c53d9a673f952ec46	2015-07-02 11:56:11 -07:00
Jingning Han	d1b30ceaa3	Rename vpx_thread to vpx_util Change the dir name to include more util tools. Change-Id: Id5b16062803ce5eed872fe2edb36d7e56b32eed8	2015-07-02 10:02:37 -07:00
Jingning Han	8565a1c99a	Merge "Use vpx prefix for codec independent threading functions"	2015-07-02 04:24:54 +00:00
Jingning Han	66cf8098e6	Merge "Move multi-threading module functions into vpx_thread folder"	2015-07-02 04:24:37 +00:00
James Zern	e757808429	Merge "vp9_pred_common: inline vp9_get_tx_size_context"	2015-07-02 01:52:40 +00:00
James Zern	0ea304620c	Merge "vp9_pred_common: inline vp9_get_segment_id"	2015-07-02 01:52:21 +00:00
Jingning Han	04d2e57425	Use vpx prefix for codec independent threading functions Replace vp9_ prefix with vpx_ for common multi-threading functions. Change-Id: I941a5ead9bfe8213fdad345511d2061b07797b55	2015-07-02 00:47:54 +00:00
Jingning Han	3a3b0be09a	Move multi-threading module functions into vpx_thread folder This commit moves the primitive multi-threading files from vp9 folder to vpx_thread, which will be accessible by all vpx codec. Change-Id: Ib51e66e9c69801c10631fab56d35a0c0aaed5883	2015-07-01 17:45:49 -07:00
Johann	79fcc56781	Merge "Fix --disable-use-x86inc when used with --enable-vp9-highbitdepth"	2015-07-01 21:14:41 +00:00
Johann	8d5389171f	Merge "Fix --disable-use-x86inc"	2015-07-01 21:14:17 +00:00
Johann	1c967f17bd	Fix --disable-use-x86inc when used with --enable-vp9-highbitdepth Change-Id: I0ed6de72dc0bb99fc9c5b1f6500399b16754ffb3	2015-07-01 13:17:01 -07:00
Johann	ff8505a54d	Fix --disable-use-x86inc Change-Id: I374fcd8fb45a6893dcdeac6896671be142a99f06	2015-07-01 13:15:51 -07:00
James Zern	4f7e7c4d49	Merge "mips msa vp9 avg subpel variance optimization"	2015-07-01 20:05:50 +00:00
Scott LaVarnway	dc6d954bd2	Merge "Move inter_predictor to vp9_reconinter.h"	2015-07-01 20:01:53 +00:00
Scott LaVarnway	d157742788	Merge "VP9: Move ref_mvs[][] and mode_context[] from MB_MODE_INFO"	2015-07-01 12:52:21 +00:00
Parag Salasakar	61774ad1c4	mips msa vp9 avg subpel variance optimization average improvement ~3x-5x Change-Id: Iefbcafc05daab77b38a4e63b551e427867a501a4	2015-07-01 13:46:41 +05:30
Parag Salasakar	a42df86c03	mips msa vp9 subpel variance optimization average improvement ~3x-5x Change-Id: I4cbba2711467b0e205904769ebbb4a1fcbb1a311	2015-07-01 07:51:34 +05:30
James Zern	fc5f3b8f4f	Merge "vp9_common_data: right-size tables"	2015-06-30 21:12:54 +00:00
Parag Salasakar	fc3c456053	Merge "mips msa vp9 common macro comments updated"	2015-06-30 06:25:31 +00:00
Scott LaVarnway	c06d56cc7d	VP9: Move ref_mvs[][] and mode_context[] from MB_MODE_INFO to MB_MODE_INFO_EXT. This saves 36 bytes per 8x8 area for both the decoder and encoder. (encoder has two MODE_INFO buffers) Change-Id: If006abb2224acaf326df3c2be09e77e967662107	2015-06-29 12:46:47 -07:00
Scott LaVarnway	437d033dbb	Merge "Remove tile param"	2015-06-29 18:04:56 +00:00
Parag Salasakar	3c353e58c0	mips msa vp9 common macro comments updated Cosmetic/Grammatical corrections in vp9 macro comments Change-Id: I774b983aff854feb69c7e4442e8731ce4c995645	2015-06-29 11:52:28 +05:30
Parag Salasakar	b92cc27b76	mips msa vp9 temporal filter optimization average improvement ~4x-5x Change-Id: Iad9c0a296dbc2ea96d000bd009077999ed58a3c5	2015-06-26 12:00:24 +05:30
Parag Salasakar	c040f96e4b	mips msa vp9 subtract block optimization average improvement ~3x-4x Change-Id: Idbe4d13a00d05ff8be6559b116f416e42c3b4097	2015-06-26 09:23:56 +05:30
Parag Salasakar	d017f5ba38	Merge "mips msa vp9 block error optimization"	2015-06-26 03:42:31 +00:00
Parag Salasakar	1543f2b60e	mips msa vp9 block error optimization average improvement ~3x-4x Change-Id: If0fdcc34b17437a7e3e7fb4caaf1067bc175f291	2015-06-26 09:04:00 +05:30
James Zern	28a8226350	vp9_common_data: right-size tables Change-Id: I2206ee148a46b234df58f2b623e9f32f26033e04	2015-06-25 20:20:40 -07:00
James Zern	d219f2b9d2	Merge "vp9_reconintra_neon: add d45 16x16"	2015-06-24 21:23:15 +00:00
Frank Galligan	944ad6cac9	Add vp9_int_pro_row_neon. BUG=https://code.google.com/p/webm/issues/detail?id=1022 Change-Id: I510c3b0a70158fa2e4da554f7c5d7558021a6ddf	2015-06-23 11:53:49 -07:00
James Zern	9db1f24c47	vp9_reconintra_neon: add d45 16x16 ~90% faster over 20M pixels Change-Id: I92d80f66e91e0a870a672cfb5dd29bf1a17cb11a	2015-06-22 21:00:07 -07:00
Parag Salasakar	7555e2b822	mips msa vp9 avg optimization average improvement ~2x-3x Change-Id: I76f7fc00c0ffdf2b4ba41bf3819f3b6044bcdeff	2015-06-23 07:32:25 +05:30
Parag Salasakar	7b71cdb0b4	Merge "mips msa vp9 fdct 4x4 optimization"	2015-06-23 01:46:54 +00:00
James Zern	c8b9658ecc	Merge "vp9_reconintra_neon: add d45 8x8"	2015-06-22 22:27:57 +00:00
Scott LaVarnway	86f4a3d8af	Remove tile param and added to MACROBLOCKD. Change-Id: I0e60aaa9f84bcc9f2376d71bd934f251baee38db	2015-06-22 06:09:38 -07:00
Parag Salasakar	bc94999148	mips msa vp9 fdct 4x4 optimization average improvement ~2x-3x Change-Id: Idf8be780b8b4228fc91f110a94e4ee1fd9af0163	2015-06-22 14:30:24 +05:30
Parag Salasakar	b6131a733d	Merge "mips msa vp9 fdct 8x8 optimization"	2015-06-20 02:58:10 +00:00
James Zern	12c6688e31	vp9_reconintra_neon: add d45 8x8 based on ssse3 implementation ~91% faster over 20M pixels Change-Id: I6d743a53352c2d6de0efe7899d7996e8b0f7fa29	2015-06-19 19:19:22 -07:00
Parag Salasakar	7ca84888c2	mips msa vp9 fdct 8x8 optimization average improvement ~4x-5x Change-Id: I37582efc2622bc20b2bf99617a76110ab24e9f6a	2015-06-20 07:48:35 +05:30
James Zern	714a46a63c	Merge "vp9_filter: make all filter tables static"	2015-06-19 03:32:24 +00:00
James Zern	a2c69af50e	Merge "vp9_reconintra_neon: add d45 4x4"	2015-06-19 03:27:23 +00:00
James Zern	5d1d72df16	Merge changes from topic 'vp9-intra-pred' * changes: vp9_reconintra_neon: add d135 4x4 vp9_reconintra: correct d135 4x4 signature	2015-06-19 03:24:58 +00:00
James Zern	ce88d74d34	vp9_reconintra_neon: add d45 4x4 based on webp's LD4() ~59% faster over 20M pixels Change-Id: I371eaed9ce8f470451046997e130b0ba1a2f7a9c	2015-06-18 15:25:07 -07:00
James Zern	337b221e00	vp9_reconintra_neon: add d135 4x4 based on webp's RD4() ~50% faster over 20M pixels Change-Id: Ifcb7bf7f7fc8eabf79d9e3b219ce1be67abc524a	2015-06-18 15:25:06 -07:00
James Zern	e8e3583fc7	vp9_reconintra: correct d135 4x4 signature add missing '_c' suffix Change-Id: I928d6cf8f90db0b8ca0b1f3bbf10b3d792062cec	2015-06-18 15:25:06 -07:00
James Zern	41d8545ab6	Merge "vp9_reconintra_neon: add DC 4x4 predictors"	2015-06-18 22:24:55 +00:00
James Zern	6e44bf20f7	vp9_reconintra_neon: add DC 4x4 predictors ~85-89% faster over 20M pixels Change-Id: I3812e8adfffe5255034da88dfe6546e12f4d10ee	2015-06-18 15:22:43 -07:00
James Zern	e77f859d72	Merge "vp9_reconintra_neon: add DC 32x32 predictors"	2015-06-18 22:17:51 +00:00
Parag Salasakar	d9fedf7832	mips msa vp9 fdct 32x32 optimization average improvement ~4x-6x Change-Id: Ibcac3ef8ed5e207cf8c121e696570e6b63d3c0f4	2015-06-17 07:58:34 +05:30
Parag Salasakar	fa53008fb7	Merge "mips msa vp9 fdct 16x16 optimization"	2015-06-17 01:21:59 +00:00
Scott LaVarnway	5fe0e55ca4	Merge "Eliminated frame_type check in get_partition_probs()"	2015-06-16 13:40:23 +00:00
Scott LaVarnway	b2658ec321	Eliminated frame_type check in get_partition_probs() Moved the frame_type check to the tile level and stored the prob ptr in MACROBLOCKD. Change-Id: I10b5a4abd58213dc7610e3ade1a1583c01526842	2015-06-16 05:37:54 -07:00
Scott LaVarnway	a41fe749a8	Merge "Update use_prev_frame_mvs flag in decoder."	2015-06-16 12:28:46 +00:00
Parag Salasakar	89b4b315aa	mips msa vp9 fdct 16x16 optimization average improvement ~4x-6x Change-Id: Id3b2243e5b3c7844c90c4231a5e75fa69911362c	2015-06-16 12:49:34 +05:30
James Zern	79fb3a013e	vp9_reconintra_neon: add DC 32x32 predictors ~84-85% faster over 20M pixels Change-Id: Ia67a7f4a342bf7b0a9280e05c25d81a774d90469	2015-06-15 20:57:28 -07:00
James Zern	3edd293dae	vp9_pred_common: inline vp9_get_tx_size_context + drop 'vp9_' prefix Change-Id: If3f3ec32d03026af78b8fcd82749e587a3f43059	2015-06-15 18:41:22 -07:00
James Zern	e6add6499f	vp9_pred_common: inline vp9_get_segment_id + drop 'vp9_' prefix Change-Id: Id5a3c8d416dbdf93d9f4f1bde662f7b2c2290168	2015-06-15 18:41:14 -07:00
James Zern	17c9678a3c	Merge "vp9_entropy: delete vp9_coefmodel_tree[]"	2015-06-15 23:02:42 +00:00
James Zern	e8d3491ec2	Merge "vp9_entropymode: make vp9_init_mode_probs private"	2015-06-15 23:02:36 +00:00
James Zern	98f0178611	enable vp9_d153_predictor_32x32_ssse3 unused since its initial commit ~91% faster over 20M pixels Change-Id: Ic8b5b3246bc97c8406be8bc4496601370403b70a	2015-06-12 19:48:22 -07:00
James Zern	ef75416ab7	vp9_entropy: delete vp9_coefmodel_tree[] it's been unused since: `4ac6a25` Moving vp9_tree_probs_from_distribution() to encoder. Change-Id: Ieae65864277fc3dbe993c5c08d75c6c5fcaa3a2d	2015-06-12 18:43:37 -07:00
James Zern	53b7f33f2d	vp9_entropymode: make vp9_init_mode_probs private rename to init_mode_probs Change-Id: Id451d7763b784ed37e43f2c35073a778078d3d0f	2015-06-12 18:25:23 -07:00
Parag Salasakar	ecbbef6b67	Merge "mips msa vp9 filter by weight optimization"	2015-06-12 18:30:11 +00:00
Parag Salasakar	fbac961b47	mips msa vp9 filter by weight optimization filter by weight - average improvement ~2x-3x Change-Id: I4832033335d339cdafdce697f07ce3e643920057	2015-06-12 12:06:42 +05:30
James Zern	e2b52f6f01	vp9_filter: make all filter tables static these are returned via vp9_get_interp_kernel() Change-Id: I45ed75e5b1515c4f5be9212759dcb50a456b5548	2015-06-11 15:15:52 -07:00
James Zern	33b3953c54	vp9_filter: restore vp9_bilinear_filters alignment the declaration containing the alignment in vp9_filter.h was removed in: `eb88b17` Make vp9 subpixel match vp8 fixes a crash in 32-bit builds Change-Id: I9a97e6b4e8e94698e43ff79d0d8bb85043b73c61	2015-06-11 15:15:25 -07:00
Scott LaVarnway	cca866f578	inline vp9_get_segdata() and change name. Change-Id: I706645cf9d9dc04f1b3b6ac80df80edb7f101854	2015-06-11 09:52:00 -07:00
Scott LaVarnway	a49c701529	Merge "inline vp9_segfeature_active()"	2015-06-11 12:29:45 +00:00
Scott LaVarnway	42c0b1b1f1	inline vp9_segfeature_active() and changed name. Change-Id: Ie023ca66cc2c823032f58d4faeb53fd1863c94f3	2015-06-11 04:20:55 -07:00
Parag Salasakar	c7489f4815	Merge "mips msa vp9 intra-pred optimization"	2015-06-11 03:31:49 +00:00
James Zern	44afbbb72d	Merge "vp9_reconintra/d45_predictor: remove temp storage"	2015-06-10 19:23:57 +00:00
Scott LaVarnway	97880c3324	Merge "Reducing size of MODE_INFO struct"	2015-06-10 13:15:19 +00:00
Scott LaVarnway	c9976b32b4	Update use_prev_frame_mvs flag in decoder. Added check to see if last frame was all intra. This will eliminate two checks in find_mv_refs_idx(). Also, do not update the frame mvs if the current frame is all intra. This improved performance on material with frequent intra-only frames. Change-Id: I44a4042c3670ab0d38439d565062a0e2a1ba9d1e	2015-06-08 03:38:13 -07:00
Parag Salasakar	a2288d274c	mips msa vp9 intra-pred optimization intra pred - average improvement ~2x-3x Change-Id: Ie3f7d6eded5ecb7ed7ee506ba8e4d98f93803b09	2015-06-06 22:29:32 +05:30
James Zern	9c6eea35b6	Merge "vp9_reconintra: simplify d63_predictor"	2015-06-05 21:49:13 +00:00
Frank Galligan	bfb6d48812	Add control to skip loop filter in VP9 decoder. This control allows the application to skip the loop filter in the decoder. This is an advanced control that should only be used in extreme circumstances as it may introduce and accumulate decode artifacts. Change-Id: I278c65c60826f84c9141ebe06c6eeed3c2335fa8	2015-06-05 10:07:09 -07:00
Parag Salasakar	d43fd99822	mips msa vp9 loopfilter 4, 8 optimization average improvement ~3x-4x Change-Id: I59279293ce4b2a1e99bd10579ac97740e943643f	2015-06-05 09:56:08 +05:30
James Zern	60d0b3364c	vp9_reconintra/d45_predictor: remove temp storage dst row 0 can be reused in the same way Change-Id: Id977da62545dcc4a89cebbcbad90ba84f8ff5d6b	2015-06-04 20:11:53 -07:00
James Zern	7012ba6395	vp9_reconintra: simplify d63_predictor calculate the averages needed for even and odd rows once; this removes a conditional from the inner loop the final average calculated currently relies on above[] being extended, it could be reduced to use above[block_size - 2] + 3 * above[block_size - 1] Change-Id: I70f5eac8d8a2a959c7114844a95826f445c3dd4d	2015-06-04 19:21:05 -07:00
Parag Salasakar	dc07cc6fed	Merge "mips msa vp9 loopfilter 16 optimization"	2015-06-05 02:15:26 +00:00
James Zern	c2cf347fe2	Merge "vp9_reconintra: use AVG[23] consistently"	2015-06-05 02:15:22 +00:00
James Zern	2b6d62140e	Merge "vp9_reconintra_neon_asm/tm4x4: simplify left load"	2015-06-05 01:46:39 +00:00
James Zern	6c3b691c49	Merge "vp9_reconintra: fix d45/d63 discrepancies"	2015-06-04 22:56:43 +00:00
James Zern	faea038f4f	vp9_reconintra: fix d45/d63 discrepancies the final index in rows 2, 3 differ from vp8 Change-Id: I0fcea907b4ab44e266c0f1fd77b290d2236b280a	2015-06-04 14:49:56 -07:00
Scott LaVarnway	baaaa57533	Reducing size of MODE_INFO struct Reduced size from 124 bytes to 104 bytes. For decode only builds, it is reduced to 68 bytes. Change-Id: If9e6b92285459425fa086ab5a743d0a598a69de3	2015-06-04 07:32:16 -07:00
Scott LaVarnway	8bb37dd069	Remove cm parameter from vp9_decode_block_tokens() part 2 Change-Id: Iee24b6bb095f748333223e6036fc5c9d9e7e5f1c	2015-06-04 07:13:19 -07:00
Scott LaVarnway	877fac122b	Merge "Remove counts param"	2015-06-04 13:46:42 +00:00
Parag Salasakar	914f8f9ee0	mips msa vp9 loopfilter 16 optimization average improvement ~3x-4x Change-Id: I8ef263da6ebcf8f20aabaefeccf25a84640ba048	2015-06-04 11:50:41 +05:30
Johann Koenig	c005792951	Merge "Make vp9 subpixel match vp8"	2015-06-04 06:16:13 +00:00
Parag Salasakar	fd891a9655	Merge "mips msa vp9 convolve8 avg hv optimization"	2015-06-04 05:44:24 +00:00
Johann	eb88b172fe	Make vp9 subpixel match vp8 The only difference between the two was that the vp9 function allowed for every step in the bilinear filter (16 steps) while vp8 only allowed for half of those. Since all the call sites in vp9 (<< 1) the input, it only ever used the same steps as vp8. This will allow moving the subpel variance to vpx_dsp with the rest of the variance functions. Change-Id: I6fa2509350a2dc610c46b3e15bde98a15a084b75	2015-06-03 22:10:51 -07:00
hkuang	ce5e17072d	Merge "Optimize the idct assembly code."	2015-06-04 04:32:11 +00:00
James Zern	4fcabf5169	vp9_reconintra: use AVG[23] consistently Change-Id: Iab7215f82be0c0c831cd81b6f8091afc3710dd54	2015-06-03 19:52:46 -07:00
Parag Salasakar	bdfbc3e876	mips msa vp9 convolve8 avg hv optimization average improvement ~4x-6x Change-Id: I7c8b4f2334491be8a859592606e568bc95d019aa	2015-06-04 08:11:01 +05:30
James Zern	2da8d24e8f	Merge "vp9_reconintra: simplify d45_predictor"	2015-06-04 01:59:10 +00:00
James Zern	a9f55e8324	Merge changes from topic 'vp9-intra-pred' * changes: vp9_reconintra: specialize d135 4x4 vp9_reconintra: specialize d117 4x4 vp9_reconintra: specialize d207 4x4 vp9_reconintra: specialize d153 4x4 vp9_reconintra: specialize d63 4x4 vp9_reconintra: specialize d45 4x4	2015-06-04 01:58:28 +00:00
James Zern	65d9599807	vp9_reconintra_neon_asm/tm4x4: simplify left load use vld1.8 {d0[]}, [r0] rather than ldrb+vdup; mildly faster Change-Id: Ia5ffc736bcb0f5497b7d9e55a93bf5a5f5f6928c	2015-06-03 18:51:13 -07:00
hkuang	98e88e6ad8	Optimize the idct assembly code. Change-Id: Ia0ff859ff1c813dbe100e2f27b1ef78167483f4e	2015-06-03 17:20:35 -07:00
Parag Salasakar	b8c1cdcd12	mips msa vp9 convolve8 avg horiz optimization average improvement ~5x-8x Change-Id: I179a69ec620fbd69979bd128f05d18113618aab4	2015-06-03 11:33:42 +05:30
Parag Salasakar	c543d38ac7	mips msa vp9 convolve8 avg vert optimization average improvement ~4x-6x Change-Id: Ia2e6f770da46416ebec31fdcea5cc7878879a9d9	2015-06-03 09:55:25 +05:30
Scott LaVarnway	f779dba405	Remove counts param Moved to MACROBLOCKD. Change-Id: Icce765b334f2755f4fe2a4c39fb2ae2d7660d004	2015-06-02 09:06:00 -07:00
Parag Salasakar	54a6f73958	mips msa vp9 idct4x4 and iwht4x4 optimization average improvement ~3x-4x moved assert to respective files Change-Id: I6c915059d456a00bdd76fab0dd2eede8b6c6ea58	2015-06-02 12:16:28 +05:30
Parag Salasakar	ebf7466cd8	mips msa vp9 updated convolve horiz, vert, hv, copy, avg module Updated sources according to improved version of common MSA macros. Enabled respective convolve MSA hooks and tests. Overall, this is just upgrading the code with styling changes. Change-Id: If5ad6ef8ea7ca47feed6d2fc9f34f0f0e8b6694d	2015-06-02 12:03:51 +05:30
Parag Salasakar	cf1c0ebc3a	Merge "mips msa vp9 updated idct 8x8, 16x16 and 32x32 module"	2015-06-02 04:48:02 +00:00
James Zern	71d923232c	Merge changes from topic 'vp9-intra-pred' * changes: vp9_reconintra_neon/tm: improve above_left load vp9_reconintra_neon: cosmetics: normalize fn params	2015-06-01 20:03:47 +00:00
James Zern	b601202905	Merge "vp9_reconintra_neon_asm/tm: simplify above_left load"	2015-06-01 20:01:38 +00:00
Parag Salasakar	6af9d7f2e2	mips msa vp9 updated idct 8x8, 16x16 and 32x32 module Updated sources according to improved version of common MSA macros. Enabled idct MSA hooks and tests. Overall, this is just upgrading the code with styling changes. Change-Id: I1f488ab2c741f6c622b7a855388a202168082209	2015-06-01 09:24:23 +05:30
James Zern	acc481eaae	vp9_reconintra: simplify d45_predictor only the immediate above right pixel is needed; this removes a conditional from the inner loop the final average calculated currently relies on above[] being extended, it could be reduced to use above[block_size - 2] + 3 * above_right Change-Id: Ica4f2b8d25eec3ca1d6fa52ef0d4adc228eeea3f	2015-05-30 13:30:59 -07:00
James Zern	6e068e51b5	vp9_reconintra: specialize d135 4x4 based on webp's RD4() Change-Id: I64c8f0a1325a8f201eaad39b396fae7a2d06efff	2015-05-30 13:29:40 -07:00
James Zern	b6782686f4	vp9_reconintra: specialize d117 4x4 based on webp's VR4() Change-Id: Ic8c0b8ed65a63772ca0a4321592880a5e8947db5	2015-05-30 13:29:02 -07:00
James Zern	c022dbc4d3	vp9_reconintra: specialize d207 4x4 based on webp's HU4() Change-Id: I2401ef307cd94e70cc7904f55954af04290c8af9	2015-05-30 13:28:22 -07:00
James Zern	2276eb16f3	vp9_reconintra: specialize d153 4x4 based on webp's HD4() Change-Id: Icba1e21ec4b8f5026dc92e49741a68b059c8b9b1	2015-05-30 13:27:50 -07:00
James Zern	102123821d	vp9_reconintra: specialize d63 4x4 based on webp's VL4() Change-Id: Ibab962053843eae8752b4e74b6481a53bb034ae9	2015-05-30 13:27:03 -07:00
James Zern	6051bcc3dc	vp9_reconintra: specialize d45 4x4 based on webp's LD4() Change-Id: I74855d23ce73e1c6988fe08bf7c959b7a69b4abf	2015-05-30 13:26:21 -07:00
Parag Salasakar	71e88f903d	Merge "mips msa vp9 updated macros and disable all MSA functions"	2015-05-30 02:52:27 +00:00
James Zern	7621b48a1c	vp9_reconintra_neon/tm: improve above_left load use vld1?_dup_u8 over vdup?_n_u8, reduces general register use; mildly faster Change-Id: Ie0e4e550849a207b34b378541196b553c9f12011	2015-05-29 19:18:43 -07:00
James Zern	f2d621e383	vp9_reconintra_neon: cosmetics: normalize fn params s/y_stride/stride/ Change-Id: Ie98c3fe241dc240b653849eda356a8862bdd52f4	2015-05-29 19:01:39 -07:00
James Zern	b337c54cc4	vp9_reconintra_neon_asm/tm: simplify above_left load use vld1.8 {d0[]}, [r0] rather than ldrb+vdup; mildly faster Change-Id: I5c24d49a90c2855c94395184774b289da8e9d5a7	2015-05-29 18:56:16 -07:00
James Zern	a2a13cbe5f	vp9_reconintra_neon: add DC 16x16 predictors 85-89% faster over 20M pixels Change-Id: I9b320ed6b9e67f27df738b84c8b43b65a93c50c2	2015-05-29 15:41:44 -07:00
James Zern	e97b849219	vp9_reconintra_neon: add DC 8x8 predictors ~90% faster over 20M pixels Change-Id: Iab791510cc57c8332c2f9a5da0ed50702e5f5763	2015-05-29 15:39:08 -07:00
Parag Salasakar	f9f078ebb6	mips msa vp9 updated macros and disable all MSA functions Done little restructuring/styling changes to the sources like generic macro definitions, their use to reduce code lines, better code alignments etc. Disabled all MSA hooks and tests Change-Id: Ic6f2dce0b501f46b80c06c46c0fe2043d557b190	2015-05-29 13:34:33 +05:30
Scott LaVarnway	bbea7c95d8	Merge "Re-worked header files"	2015-05-28 19:56:39 +00:00
Johann	3f2a06674a	Merge "Don't #define snprintf in VS 2015 or higher."	2015-05-28 19:38:57 +00:00
hkuang	5317185eb0	Merge "Add error handling when running out of free frame buffers."	2015-05-28 17:41:01 +00:00
Johann	cad0eca25c	Don't #define snprintf in VS 2015 or higher. In VS 2015 and higher snprintf is supplied and therefore vsnprintf doesn't need to be defined. This also avoids problems caused by _snprintf being different from snprintf. This fixes a build break with VS 2015 and improves security. Originally submitted via chromium by brucedawson@chromium.org https://codereview.chromium.org/1055603003 Additionally break this MSVC-specific tweak to a new file, which will become the home of all such MSVC-specific things. This requires adding a dependency on msvc.h to every example which uses args.c and tools_common.h Change-Id: I35b5f8e7ea00f6627403aabc9ea79b0412557a99	2015-05-27 18:28:25 -07:00
hkuang	131cab7c27	Add error handling when running out of free frame buffers. Change-Id: If28b59b9521204a6e3aecedcf75932d76a752567	2015-05-27 14:20:58 -07:00
Minghai Shang	cbdfdb947c	Merge "[decoder] Optimize context buffer re-allocation"	2015-05-27 20:24:30 +00:00
Johann	dee70d355f	Merge "Move variance functions to vpx_dsp"	2015-05-26 23:02:11 +00:00
Johann	c3bdffb0a5	Move variance functions to vpx_dsp subpel functions will be moved in another patch. Change-Id: Idb2e049bad0b9b32ac42cc7731cd6903de2826ce	2015-05-26 12:01:52 -07:00
Scott LaVarnway	89ca85dacd	Move inter_predictor to vp9_reconinter.h This function was originally static. Change-Id: I1922fa86711ace884d9f394210b6bb9ea2a0bfe3	2015-05-26 04:22:11 -07:00
James Zern	02fda6582c	Merge changes Ie15e301e,Ib070c79b * changes: vp9_reconintra_neon: cosmetics: reindent vp9_reconintra_neon: cosmetics: drop unneeded returns	2015-05-23 17:47:52 +00:00
James Zern	4e11f3ca6e	vp9_reconintra_neon: cosmetics: reindent Change-Id: Ie15e301e8f55cf928f42a03e53a8bb8b66d0e5d5	2015-05-22 21:04:30 -07:00
James Zern	ff683ab1da	vp9_reconintra_neon: cosmetics: drop unneeded returns Change-Id: Ib070c79bdbb9c1f4e25af693d7056ec9f964c789	2015-05-22 20:59:36 -07:00
James Zern	8c15ced172	vp9: move ssse3 convolve fns to intrinsics file + synchronize filter function signatures this makes any intrinsics filters available for inlining and has the side-effect of making those filters static, quieting missing-prototype warnings. Change-Id: I1908875caffa585bd4fc65aaf10d17a5e20cfb46	2015-05-22 20:14:16 -07:00
James Zern	2161e44025	vp9: move avx2 convolve fns to intrinsics file + synchronize filter function signatures this makes any intrinsics filters available for inlining and has the side-effect of making those filters static, quieting missing-prototype warnings. Change-Id: I1cd55c9d52547793ad65aa90c7620f0e426edaa2	2015-05-22 20:13:06 -07:00
James Zern	ef2b3cce50	add vp9/common/x86/convolve.h collect the vp9_convolve function definition macros there; this will allow some relocation of functions from vp9_asm_stubs.c Change-Id: Idadd117fa256dd48748379856973fd985b8204e8	2015-05-22 20:12:16 -07:00
James Zern	48d8291df4	vp9_subpixel_8t_intrin_ssse3: quiet vs9 warning reorder includes to avoid: warning C4985: 'ceil': attributes not present on previous declaration. this is the same workaround used in vp9/common/vp9_systemdependent.h Change-Id: Ia10dd63de24f96fa1507a6179220e9d6ec774db6	2015-05-22 12:05:02 -07:00
Scott LaVarnway	b962646fc5	Re-worked header files Various header/test files had to be re-worked in order to build "Remove cm parameter from vp9_decode_block_tokens()". This patch reverts the "Remove cm" part and only contains the re-worked header files. Change-Id: I520958a88d1991fee988a3c784d0eac40e117a32	2015-05-22 11:19:51 -07:00
James Zern	a492bcef87	vp9_mvref_common.c: fix compile warning string literal to int within an assert Change-Id: Ifd7acc717e01ee1bb3955ef830ec0d1645942459	2015-05-20 16:45:16 -07:00
Minghai Shang	48bfee8797	[decoder] Optimize context buffer re-allocation 1. Check existing buffer sizes when re-allocate context buffers. 2. Don't need to set mi buffers to 0 during setup_mi. Change-Id: I6b48b0e077a4d804312b605ad0dc34aec5795a6d	2015-05-20 11:05:22 -07:00
James Zern	97db651ce0	vp9: add some missing includes mostly: <file>.c should include <file>.h silences missing prototype warnings Change-Id: Ic05ec32c6f7b2224b78825904d96d73aacad6000	2015-05-15 10:43:47 -07:00
James Zern	330fba41e2	vp9 intrinsics: add vp9_rtcd include silences a missing declaration warning Change-Id: I59a34e1a1377cf3529b678d7ec0122bd43ab1bf1	2015-05-15 10:43:47 -07:00
James Zern	18b60af27c	vp9: correct some function signatures silences missing prototype warnings Change-Id: Idaf68d83d2cb03847f3ee002c4d00c2ac79da604	2015-05-15 10:43:47 -07:00
Frank Galligan	d610ead258	Merge "Move mc_buf to cut down size of MACROBLOCKD."	2015-05-15 15:20:39 +00:00
Frank Galligan	0a80164c94	Move mc_buf to cut down size of MACROBLOCKD. Change-Id: Icea64b9e5632b41aaa7cd7018c501d6add9b7a7f	2015-05-14 19:10:02 -07:00
Johann	cafae5b544	Merge "Relocate memory operations for common code"	2015-05-13 19:47:24 +00:00
Johann	1d7ccd5325	Relocate memory operations for common code With the sad functions, and hopefully the variance functions soon, moving to the vpx_dsp location, place the defines used in the reference C code in a common location. Change-Id: I4c8ce7778eb38a0a3ee674d2f1c488eda01cfeca	2015-05-13 11:41:15 -07:00
Parag Salasakar	686616a989	Merge "mips msa vp9 idct 8x8 optimization"	2015-05-13 04:36:34 +00:00
James Zern	a5e4ca8390	build_intra_predictors*: reduce above_data size currently this needs to be 2x (NEED_ABOVERIGHT) the size of the largest block (32) + 1 (for above_left). reduce the buffer size from 128 + 16 (alignment) to 64 + 16. Change-Id: Idaca1806c7e1214e9437de24e15edc2ebf18f95d	2015-05-08 20:17:20 -07:00
James Zern	6d22713722	Merge "build_intra_predictors*: reduce left_col size"	2015-05-09 00:53:55 +00:00
hkuang	d53fb0fda5	Fix clang ioc warning due to NULL mi pointer. The warning only happens in VP9 encoder's first pass due to src_mi is not set up yet. But it will not fail the encoder as left_mi and above_mi are not used in the first_pass and they will be set up again in the second pass. Change-Id: I0713b4660d71e229e196654cb0970ba6b1574f28	2015-05-08 15:42:50 -07:00
hkuang	f5574fb44c	Merge "Add more sse2 code for intra prediction."	2015-05-08 17:26:30 +00:00
Parag Salasakar	7c5f00f868	mips msa vp9 idct 8x8 optimization average improvement ~4x-6x Change-Id: I5edf713721b9e24c7e0ce2e69d8fc3ecab625d91	2015-05-08 12:23:27 +05:30
Parag Salasakar	a8a9c2bb45	Merge "mips msa vp9 idct 32x32 optimization"	2015-05-08 04:27:44 +00:00
James Zern	7e55ff1593	build_intra_predictors*: reduce left_col size this should only need to be the size of the largest block, i.e., 32, not 64. Change-Id: Ib8cb2424771fdd2a64c55379597248b2722a5ceb	2015-05-07 16:16:42 -07:00
James Zern	fd3658b0e4	replace DECLARE_ALIGNED_ARRAY w/DECLARE_ALIGNED this macro was used inconsistently and only differs in behavior from DECLARE_ALIGNED when an alignment attribute is unavailable. this macro is used with calls to assembly, while generic c-code doesn't rely on it, so in a c-only build without an alignment attribute the code will function as expected. Change-Id: Ie9d06d4028c0de17c63b3a27e6c1b0491cc4ea79	2015-05-07 11:55:08 -07:00
Johann	76a08210b6	Merge "Move shared SAD code to vpx_dsp"	2015-05-07 18:33:06 +00:00
hkuang	086934136b	Merge "Remove an unnecessary check."	2015-05-07 15:51:11 +00:00
Parag Salasakar	1601c1385a	mips msa vp9 idct 32x32 optimization average improvement ~4x-6x Change-Id: Idaba7e49fbd7f388caee0d73773ccf6e4807ef17	2015-05-07 12:42:23 +05:30
hkuang	7153b822ed	Add more sse2 code for intra prediction. vp9_dc_left_predictor_16x16 vp9_dc_top_predictor_32x32 vp9_dc_left_predictor_32x32 vp9_dc_128_predictor_32x32 Change-Id: Ib9861deefd01c3527235b92ff6b3d571ef6b4bc6	2015-05-06 17:17:00 -07:00
Johann	d5d9289800	Move shared SAD code to vpx_dsp Create a new component, vpx_dsp, for code that can be shared between codecs. Move the SAD code into the component. This reduces the size of vpxenc/dec by 36k on x86_64 builds. Change-Id: I73f837ddaecac6b350bf757af0cfe19c4ab9327a	2015-05-06 16:58:20 -07:00
hkuang	240767b29d	Remove an unnecessary check. Change-Id: Id0f224ac4667dd173363b0f05711678448291d4e	2015-05-06 14:15:00 -07:00
hkuang	623e6eed5e	Merge "Optimize the read_partition."	2015-05-06 17:29:52 +00:00
Parag Salasakar	d1cdda88bd	Merge "mips msa vp9 idct 16x16 optimization"	2015-05-06 06:40:56 +00:00
hkuang	4c1a8be29d	Optimize the read_partition. Change-Id: I5a796425ce5706824a2fc17c6f24f983c5b9e43b	2015-05-05 15:51:04 -07:00
James Zern	ccae5d99d2	fix and enable vp9_dc_128_predictor_16x16 widen the loads and stores to 128-bit. this was added, but not enabled in: `493a857` Add some sse2 code for intra prediction. Change-Id: I277d7db608a7db7d75cc0bde86f48fa66ad487e4	2015-05-05 11:40:13 -07:00
hkuang	e47811ef8f	Merge "Add some sse2 code for intra prediction."	2015-05-05 17:11:07 +00:00
Parag Salasakar	60052b618f	mips msa vp9 idct 16x16 optimization average improvement ~4x-6x Change-Id: I55e95b7f2ba403dff11813958dc7c73a900dd022	2015-05-05 12:37:06 +05:30
James Zern	670b2c09ce	vp9_idct_intrin_sse2: cosmetics: reindent + fix some whitespace Change-Id: Id61b739282014288a7e5d3c17a9d6448d9d4cda2	2015-05-01 16:07:54 -07:00
James Zern	c77b1f5acd	vp9: RECON_AND_STORE4X4: remove dest offset offsetting by a variable stride prevents instruction reordering, resulting in poor assembly Change-Id: Id62d6b3299cdd23f8c44f97b630abf4fea241446	2015-04-30 19:14:17 -07:00
James Zern	778845da05	vp9_idct_intrin_*: RECON_AND_STORE: remove dest offset offsetting by a variable stride prevents instruction reordering, resulting in poor assembly. additionally reroll 16x16/32x32 loops to reduce register spill with this new format Change-Id: I0635b8ba21ecdb88116e927dbdab53acdf256e11	2015-04-30 19:14:17 -07:00
Yaowu Xu	2061359fcf	Merge "Remove vp9_idct16x16_10_add_ssse3()"	2015-04-30 23:13:33 +00:00
hkuang	493a8579f1	Add some sse2 code for intra prediction. Change-Id: I16c0a62e52dab62837c547345df31e7518620ed4	2015-04-30 15:42:57 -07:00
Yaowu Xu	47767609fe	Remove vp9_idct16x16_10_add_ssse3() The rotation computation using 2X of cos(pi/16) has a potential to overflow 32 bit, this commit disable the function to allow further investigation and optimization. Change-Id: I4a9803bc71303d459cb1ec5bbd7c4aaf8968e5cf	2015-04-30 09:07:30 -07:00
Parag Salasakar	95cb130f32	Merge "mips msa vp9 copy and avg convolve optimization"	2015-04-30 04:39:13 +00:00
Yaowu Xu	d45870be8d	Merge "Disable ssse3 version idct16x16_256_add()"	2015-04-30 03:09:23 +00:00
Yaowu Xu	486a73a9ce	Disable ssse3 version idct16x16_256_add() The version is currently producing different result from c version for some input. Disable the use of it for now to allow time for investigation the source of mismatch. Change-Id: Id039455494ee531db4886a9f1fa4761174ef6df3	2015-04-29 16:58:59 -07:00
Parag Salasakar	2301d10f73	mips msa vp9 copy and avg convolve optimization average improvement ~3x-5x Change-Id: I422e4c33ea7e6d6783ba40029438ccf21b0e76bb	2015-04-29 12:28:17 +05:30
James Zern	f58011ada5	vpx_mem: remove vpx_memset vestigial. replace instances with memset() which they already were being defined to. Change-Id: Ie030cfaaa3e890dd92cf1a995fcb1927ba175201	2015-04-28 20:00:59 -07:00
James Zern	f274c2199b	vpx_mem: remove vpx_memcpy vestigial. replace instances with memcpy() which they already were being defined to. Change-Id: Icfd1b0bc5d95b70efab91b9ae777ace1e81d2d7c	2015-04-28 19:59:41 -07:00
Frank Galligan	2be50a1c9c	Merge "WIP: Use LUT for y_dequant/uv_dequant"	2015-04-28 16:12:10 +00:00
Scott LaVarnway	afcb62b414	WIP: Use LUT for y_dequant/uv_dequant instead of calculating every block. Change-Id: Ib19ff2546be8441f8755ae971ba2910f29412029	2015-04-28 07:52:06 -07:00
Yunqing Wang	297b2b99de	Fix debugmodes file to print modes and MVs correctly This patch fixed the issues in debugmodes file because of the recent changes in MODE_INFO struct. Change-Id: I4df83379ecc887c1f009d4a8329c9809c5b299d6	2015-04-27 17:09:38 -07:00
Parag Salasakar	1c9af9833d	Merge "mips msa vp9 convolve8 horiz optimization"	2015-04-21 22:08:25 -07:00
Johann	931c0a954f	Merge "Rename neon convolve avg file"	2015-04-21 15:45:29 -07:00
Johann	66b9933b8d	Rename neon convolve avg file Some build systems use just the basename for object files. Change-Id: I333e1107ee866f3906cc46476ef8d04c6200a8a0	2015-04-21 14:18:17 -07:00
Scott LaVarnway	8b17f7f4eb	Revert "Remove mi_grid_* structures." (see I3a05cf1610679fed26e0b2eadd315a9ae91afdd6) For the test clip used, the decoder performance improved by ~2%. This is also an intermediate step towards adding back the mode_info streams. Change-Id: Idddc4a3f46e4180fbebddc156c4bbf177d5c2e0d	2015-04-21 11:16:45 -07:00
Parag Salasakar	ca90d4fd96	mips msa vp9 convolve8 horiz optimization average improvement ~6x-8x Change-Id: I7c91eec41aada3b0a5231dda7869b3b968f3ad18	2015-04-21 12:31:26 +05:30
Parag Salasakar	ef51c1ab5b	mips msa vp9 convolve8 hv optimization average improvement ~5x-8x Change-Id: I3214734cb3716e742907ce0d2d7a042d953df82b	2015-04-21 09:17:49 +05:30
Parag Salasakar	2e36149ccd	Merge "mips msa vp9 convolve8 vert optimization"	2015-04-18 23:39:25 -07:00
Parag Salasakar	27d083c1b9	mips msa vp9 convolve8 vert optimization average improvement ~6x-10x Change-Id: Ie3f3ab3a9005be84935919701e56b404e420affa	2015-04-18 08:13:04 +05:30
Marco Paniconi	f76ccce5bc	Revert "Revert "Force_split on 16x16 blocks in variance partition."" This reverts commit `004b9d83e3` Change-Id: I2f2d0bdb9368c2c07f1d29a69cd461267a3a8743	2015-04-16 17:52:13 -07:00
Johann	14ef4aeafb	Reorganize *_rtcd() calling conventions Change-Id: Ib1e17d8aae9b713b87f560ab5e49952ee2bfdcc2	2015-04-15 11:12:05 -04:00
Yunqing Wang	004b9d83e3	Revert "Force_split on 16x16 blocks in variance partition." This reverts commit `eb8c667570`. The patch caused mismatch while using multi-threads. Change-Id: Icd646340af25b5d91e32f03ed3ea212e00e3e0be	2015-04-14 15:19:31 -07:00
Marco	eb8c667570	Force_split on 16x16 blocks in variance partition. Force split on 16x16 block (to 8x8) based on the minmax over the 8x8 sub-blocks. Also increase variance threshold for 32x32, and add exit condition in choose_partition (with very safe threshold) based on sad used to select reference frame. Some visual improvement near moving boundaries. Average gain in psnr/ssim: ~0.6%, some clips go up ~1 or 2%. Encoding time increase (due to more 8x8 blocks) from ~1-4%, depending on clip. Change-Id: I4759bb181251ac41517cd45e326ce2997dadb577	2015-04-13 12:05:07 -07:00
Parag Salasakar	2f693be8f8	Merge "mips msa vp9 common headers added"	2015-04-09 21:50:15 -07:00
Jingning Han	93d9c50419	Merge "SSSE3 assembly implementation of 8x8 Hadamard transform"	2015-04-09 11:16:11 -07:00
Parag Salasakar	481fb7640c	mips msa vp9 common headers added Change-Id: Ia31ada59172eb1818e1eb91009f83cbb1f581223	2015-04-09 15:35:12 +05:30
Jingning Han	7f629dfca4	SSSE3 assembly implementation of 8x8 Hadamard transform It uses about 10% less CPU cycles than the SSE2 intrinsic implementation. Change-Id: I91017c0c068679a214b98cdd4cff3a6facfb7499	2015-04-04 09:59:37 -07:00
James Zern	44e3640923	Merge "vp9: enable sse4 sad functions"	2015-04-03 14:57:52 -07:00
James Zern	b644384bb5	Merge "vp9: fix high-bitdepth NEON build"	2015-04-01 23:36:17 -07:00
Yaowu Xu	54210f706c	Merge "use MAX_MB_PLANE consistently"	2015-04-01 18:24:39 -07:00
Yaowu Xu	f26b8c84f8	use MAX_MB_PLANE consistently Change-Id: Ic416a7f145001a88f5a7f70dde9b1edbc1b69381	2015-04-01 15:21:20 -07:00
Jingning Han	1470529f62	Refactor block_yrd function for RTC coding mode This commit separates Hadamard transform/quantization operations from rate and distortion computation in block_yrd. This allows one to skip SATD computation when all transform blocks are quantized to zero. It also uses a new block error function that skips repeated computation of sum of squared residuals. It reduces the CPU cycles spent on block error calculation in block_yrd by 40%. Change-Id: I726acb2454b44af1c3bd95385abecac209959b10	2015-04-01 12:00:43 -07:00
James Zern	14e24a1297	vp9: enable sse4 sad functions sse4 isn't set by configure or used in rtcd, correct the sad entries to use sse4_1 without changing the signatures for now. this was done in vp8 post-vp9 branch. Change-Id: Ia9f1fff9f2476fdfa53ed022778dd2f708caa271	2015-03-31 21:00:55 -07:00
James Zern	8845334097	vp9: fix high-bitdepth NEON build remove incorrect specializations in rtcd and update a configuration check in partial_idct_test.cc Change-Id: I20f551f38ce502092b476fb16d3ca0969dba56f0	2015-03-31 17:45:25 -07:00
hui su	d4f2f1dd5b	Merge "Move vp9_coef_con_tree to common/"	2015-03-31 10:51:10 -07:00
Jingning Han	db5ec37edc	Merge "Enable 16x16 Hadamard transform in SATD based mode decision"	2015-03-31 09:55:41 -07:00
hui su	302e24cb3e	Move vp9_coef_con_tree to common/ This tree should be defined in common/, as it is needed for both encoder and decoder. Change-Id: I4f5cbc80025cf2ced14182c98f7c82dc7d0f87db	2015-03-31 09:20:46 -07:00
Jingning Han	26d3d3af6a	Enable 16x16 Hadamard transform in SATD based mode decision This commit replaces the 16x16 2D-DCT transform with Hadamard transform for RTC coding mode. It reduces the CPU cycles cost on 16x16 transform by 5X. Overall it makes the speed -6 encoding speed 1.5% faster without compromise on compression performance. Change-Id: If6c993831dc4c678d841edc804ff395ed37f2a1b	2015-03-30 15:43:31 -07:00
Jingning Han	f0ac5aaa08	Merge "Hadamard transform based coding mode decision process"	2015-03-30 15:43:15 -07:00
Jingning Han	8c411f74e0	Hadamard transform based coding mode decision process This commit uses Hadamard transform based rate-distortion cost estimate for rtc coding mode decision. It improves the compression performance of speed -6 for many hard clips at lower bit-rates. For example, 5.5% for jimredvga, 6.7% for mmmoving, 6.1% for niklas720p. This will introduce extra encoding cycle costs at this point. Change-Id: Iaf70634fa2417a705ee29f2456175b981db3d375	2015-03-30 14:46:05 -07:00
jackychen	68610ae568	vp9_postproc.c: eliminate -Wshadow build warnings. Change-Id: I6df525a9ad1ae3cfbba8710d21db8fee76e64dbb	2015-03-27 20:27:30 -07:00
Alex Converse	a1e20ec58f	Refactor fast loop filter code to handle 444. Change-Id: I921b1ebabdf617049f8fa26fbe462c3ff115c1ce	2015-03-24 11:17:50 -07:00
hkuang	9f4f98fdbd	Merge "Optimize the intra frame decode to skip some unnecessary copy."	2015-03-23 16:50:37 -07:00
hkuang	85107641a4	Optimize the intra frame decode to skip some unnecessary copy. This speeds up a normal YT style 1080P clip decode by ~1% on nexus 7. Change-Id: Ied7fa0d8bc941b2adb4db9382f549ee4d5654f3a	2015-03-23 10:11:49 -07:00
hkuang	b88dac8938	Safely free all the frame buffers after all the workers finish the work. Issue: 978 Change-Id: Ia7aa809095008f6819a44d7ecb0329def79b1117	2015-03-19 12:21:00 -07:00
Yaowu Xu	73508be364	Fix a typo introduced in #94401aff This fixes all test vector failures Change-Id: Ie1a9fe0f023f7a0c7e89eb55df1b40ff65302adc	2015-03-12 08:01:08 -07:00
hkuang	4a691aa209	Merge "Refactor the block decode code to make it simpler."	2015-03-11 16:19:14 -07:00
hkuang	94401aff5c	Refactor the block decode code to make it simpler. Change-Id: I0f983cb821ad7ec6fbefe7895cb8124a8fa39df6	2015-03-11 11:37:16 -07:00
Yunqing Wang	f0cf9719d0	Accumulate tx_totals counters in multi-threaded encoder Tx_totals counters weren't handled correctly in multi-thread case, which caused the mismatch while encoding using threads > 1. This patch fixed that. Change-Id: Ice9b0386f57175fb92a0bdcd5042686a3106246a	2015-03-10 10:02:49 -07:00
Hangyu Kuang	a1ef75bb63	Merge "Only wait for previous frame's motion vector if needed."	2015-03-06 10:27:26 -08:00
Hangyu Kuang	d5fa786b4f	Only wait for previous frame's motion vector if needed. Change-Id: Iecce685a33b64844446c0009f21bc85566d7469f	2015-03-05 16:09:44 -08:00
Johann	42eb97eb91	Declare function used by 'once' with 'void' parameters Visual Studio is exceptionally picky about this: vp9_reconintra.c(900): warning C4113: 'void (__cdecl )()' differs in parameter lists from 'void (__cdecl )(void)' [.build-x86_64-win64-vs10\vpx.vcxproj] Change-Id: I564c7415f4608fd962be8c699d6133a996b545f7	2015-03-04 15:34:55 -08:00
Adrian Grange	3807dd82ab	Make encoder buffer allocation dynamic Frame buffers are now allocated dynamically on-demand. Entries in the reference frame map, cm->ref_frame_map, may now be set to -1 (INVALID_IDX) to indicate that there is not a valid reference buffer in that "slot". All slots in the reference frame map are now initialized to the empty state (-1) and each buffer is initialized to have a reference count of 0. Change-Id: Id1afe98de98db4ae8b2dfefed7889c3b28c68582	2015-03-04 07:58:32 -08:00
Yunqing Wang	55639c383b	fix a race condition caused by intra function pointer initialization This patch fixed webm issue 962. (https://code.google.com/p/webm/issues/detail?id=962) The data races occurred when an encoder and a decoder were created at the same time, and the function pointers were initialized twice. Change-Id: I8851b753c4b4ad4767d6eea781b61f0ac9abb44b	2015-03-03 09:58:37 -08:00
Jingning Han	1790d45252	Use variance metric for integral projection vector match This commit replaces the SAD with variance as metric for the integral projection vector match. It improves the search accuracy in the presence of slight light change. The average speed -6 compression performance for rtc set is improved by 1.7%. No speed changes are observed for the test clips. Change-Id: I71c1d27e42de2aa429fb3564e6549bba1c7d6d4d	2015-03-01 10:42:56 -08:00
Jingning Han	c4cb8059ff	Merge "Fix high bit-depth loop-filter sse2 compiling issue - part 4"	2015-02-27 09:49:10 -08:00
Jingning Han	43bb97f7d0	Merge "Fix high bit-depth loop-filter sse2 compiling issue - part 3"	2015-02-27 09:49:00 -08:00
Jingning Han	4800b0e80d	Merge "Fix high bit-depth loop-filter sse2 compiling issue - part 2"	2015-02-27 09:48:51 -08:00
Jingning Han	8ec22296b3	Fix high bit-depth loop-filter sse2 compiling issue - part 3 Change-Id: Idb14b9a285f8098126f967c5e2750221d6a58f69	2015-02-26 15:21:22 -08:00
Jingning Han	14ff1cb74a	Fix high bit-depth loop-filter sse2 compiling issue - part 2 Change-Id: I6728b69bb3dff1daa64ff7142f691e80a089f1c4	2015-02-26 12:41:19 -08:00
Jingning Han	2080e4b206	Fix high bit-depth loop-filter sse2 compiling issue - part 1 The intrinsic statement _mm_subs_epi16() should take immediate. Feeding variable as its input argument will cause compile failure in older version gcc. Change-Id: I6a71efcc8d3b16b84715e0a9bcfa818494eea3f4	2015-02-25 09:59:50 -08:00
James Zern	044bfa3949	Merge "vp9_loopfilter: quiet integer constant size warnings"	2015-02-24 19:09:32 -08:00

3252 Commits