generic-library/vpx

Author	SHA1	Message	Date
Marco Paniconi	55c6a74bd4	Merge "Dynamic resize for real-time: reference scaling."	2015-07-24 22:23:10 +00:00
Jingning Han	48de07d882	Remove redundant function definitions in vp9_dct_sse2.h Change-Id: I283d364a4e65ca9bf6ff581da1d0b498433c5402	2015-07-24 21:12:06 +00:00
Jingning Han	c376fbc62e	Merge "Move msa implementations of 2D-DCT to vpx_dsp"	2015-07-24 21:11:33 +00:00
Jingning Han	9aaf523ace	Move msa implementations of 2D-DCT to vpx_dsp Refactor and clean up the msa transform related code layout. Change-Id: Ic5048bd3d62a6046589817da745370ea89448e44	2015-07-24 13:24:25 -07:00
Alex Converse	742021f026	Remove branch in inner loop of foreach_transformed_block_in_plane() Change-Id: Ib14d09376a9ce4fa5f541264e5c335aceb71380a	2015-07-24 11:14:33 -07:00
Alex Converse	d3b6062a13	Simplify is_skippable to point straight to eobs. Change-Id: If196d9e5c7a15ee7d988ee2ecbf155a54d59b480	2015-07-24 11:14:33 -07:00
Alex Converse	964058129f	Don't initialize extra context tree buffers for 4x8 and 8x4. Change-Id: Ib669d572654f24fd43410a9399a8b609e87f846a	2015-07-24 11:14:33 -07:00
Hui Su	a15edeb76d	Merge "Code cleanup in vp9_encode_block_intra"	2015-07-24 17:40:37 +00:00
Aℓex Converse	a60e0c15bc	Merge "Allocate four |zcoeff_blk| for sub8x8 contexts."	2015-07-24 17:38:45 +00:00
Aℓex Converse	b4297bb122	Merge "Allocate eobs array per txblock and not per pixel."	2015-07-24 17:38:32 +00:00
Marco	f01c769dc6	Dynamic resize for real-time: reference scaling. Avoid scaling the references if they have already been scaled. Change only affects 1 pass non-svc mode for now. Change-Id: I204f4079c026cba7adce7a7f855d072f6139ccec	2015-07-23 16:08:40 -07:00
Alex Converse	e905da6f9c	Allocate four |zcoeff_blk| for sub8x8 contexts. The RD and load save/code grabs it as groups of four. In practice there is no change to physical allocations because this is backed by a 16-byte memalign. Change-Id: I01e89769872300e23227e03dd24a6e229f482025	2015-07-23 15:43:48 -07:00
Alex Converse	fa84acb441	Allocate eobs array per txblock and not per pixel. Change-Id: I5368f5fc7283420c38d5bd85e3077b761d94ace6	2015-07-23 15:19:43 -07:00
Jingning Han	e8c6c00d80	Merge "Fix vp9_psnrhvs.c build error"	2015-07-23 21:19:40 +00:00
Jingning Han	598b083342	Fix vp9_psnrhvs.c build error Add vpx_dsp_rtcd.h to the header file list. The od_bin_fdct8x8() here depends on forward 8x8 2D-DCT. Change-Id: I1d71edc71f07069808823d2445c1cafd285e1b94	2015-07-23 13:00:15 -07:00
Jingning Han	d341f843e2	Refactor forward/inverse transform msa implementations This commit factors out common macro definitions from the forward and inverse transform implementations into vpx_dsp. It removes the duplicate macro definitions from encoder and decoder folders. Change-Id: I92301acbd3317075e9c5f03328a25abb123bca78	2015-07-23 11:20:30 -07:00
James Zern	33a9d53c10	Merge "mips/dspr2: fix vp9-highbitdepth build"	2015-07-23 02:08:50 +00:00
Jingning Han	97ec51233d	Take out VP9_ prefix from mips/msa macros The msa macros are locally used and should not be named with VP9 prefix. Change-Id: I2c9c746c4027383c16b9ab12b77b4e70e7e7d206	2015-07-22 16:47:42 -07:00
Jingning Han	b67821f37b	Factor forward 2D-DCT transforms into vpx_dsp This commit factors the 4x4, 8x8, and 16x16 2D-DCT forward transform operations into vpx_dsp folder. Change-Id: I084b117b79c0925edcbcabb93f62b9f4bf8dbe7d	2015-07-22 15:48:17 -07:00
James Zern	9a0a2193e4	mips/dspr2: fix vp9-highbitdepth build vp9_itrans*_dspr2.c aren't necessary for high bitdepth builds and notably vp9_itrans8_dspr2.c fails in various configurations using a codesourcery toolchain: vp9_itrans8_dspr2.c:31:5: can't find a register in class 'GR_REGS' while reloading 'asm' Change-Id: I2ac76203e65cc643cb835ab50e95701896d92a1a	2015-07-22 11:54:39 -07:00
hui su	e298d650cb	Code cleanup in vp9_encode_block_intra Change-Id: Ie4d958b26e586db218f8ee95d5df4bf11f2345a1	2015-07-22 10:53:12 -07:00
Jingning Han	2726023fc1	Merge "Clean up vp9_dct32x32_sse2_impl.h header files"	2015-07-21 16:31:50 +00:00
Jingning Han	aeee70f9dd	Merge "Arrange 1D forward transform order in vp9_dct.c"	2015-07-21 04:59:14 +00:00
Jingning Han	fe39f6cc9f	Merge "Remove redundant function definitions from vp9_dct.h"	2015-07-21 04:57:58 +00:00
Yaowu Xu	4110a27d66	Merge "vpx_dsp/bitwriter_buffer.h: vp9_ -> vpx_"	2015-07-21 04:10:23 +00:00
Yaowu Xu	987451d864	Merge "vpx_dsp/bitwriter.h: vp9_->vpx_"	2015-07-21 04:10:09 +00:00
Yaowu Xu	41c13ddbc9	Merge "vpx_dsp/prob.h: vp9_ -> vpx_"	2015-07-21 04:09:53 +00:00
Yaowu Xu	0fc4d4e1ef	Merge "vpx_dsp/bitreader_buffer.h: vp9_->vpx_"	2015-07-21 04:09:38 +00:00
Yaowu Xu	ac1e1b698f	Merge "vpx_dsp/bitreader.h: vp9_->vpx_"	2015-07-21 04:09:08 +00:00
Yaowu Xu	d41781560e	Merge "Fix bug in setting sf->use_square_partition_only."	2015-07-21 01:24:53 +00:00
Yaowu Xu	5f5091636e	vpx_dsp/bitwriter_buffer.h: vp9_ -> vpx_ Change-Id: I0ac7beaa160a6c2a60a019f6b8ce85e6537bed7d	2015-07-20 18:13:06 -07:00
Yaowu Xu	817be1d214	vpx_dsp/bitwriter.h: vp9_->vpx_ changes prefix vp9_ to vpx_ for non codec specific functions and data structures. Change-Id: I91a21548e39bd24d2c7caaaa223ae47240bb78c8	2015-07-20 18:13:05 -07:00
Yaowu Xu	70ad668056	vpx_dsp/prob.h: vp9_ -> vpx_ change prefix vp9_ to vpx_ for non codec specific functions and data structures. Change-Id: I97c7e6422eceea99212b93f4942bc2187763a07c	2015-07-20 18:13:04 -07:00
Yaowu Xu	cbce003712	vpx_dsp/bitreader_buffer.h: vp9_->vpx_ Replace vp9_ in names to vpx_ for non codec specific functions. Change-Id: Ib9e3b86cb0728d10b239f3493ceda18cc2c34e0f	2015-07-20 18:13:03 -07:00
Yaowu Xu	bf82514b54	vpx_dsp/bitreader.h: vp9_->vpx_ Replace vp9_ in names to vpx_ as they are not codec specific. Change-Id: I2e583aa63dee769353ada4b42417aa15c4074ebb	2015-07-20 18:06:31 -07:00
Jingning Han	07d5d538c2	Clean up vp9_dct32x32_sse2_impl.h header files Remove redundant file dependency. Change-Id: I4708218157617dabe00e2e33e237be2838c16603	2015-07-20 17:22:12 -07:00
Jingning Han	bcbd3c8fa2	Arrange 1D forward transform order in vp9_dct.c Remove the redundant function declarations therein. Change-Id: I27731fb70bb1abce63da761a5812f518c62f590f	2015-07-20 16:29:40 -07:00
Jingning Han	1279d3bac7	Remove redundant function definitions from vp9_dct.h Change-Id: I963f08f1023481712c6f9ed624ddf05e5bac6321	2015-07-20 16:26:36 -07:00
Jingning Han	b8c47a98b8	Merge "Make local functions in vp9_dct.c static"	2015-07-20 23:08:14 +00:00
Yaowu Xu	149822e399	Merge "Correctly report "Unsupported bitstream profile""	2015-07-20 22:49:54 +00:00
Jingning Han	f62805fae0	Make local functions in vp9_dct.c static This commit limits the scope of 1-D DCT and ADST functions within vp9_dct.c and makes them static. This largely clears out the cross referencing issue between vp9_dct.c and the SIMD optimizations. Change-Id: If7cac478b11bb32328ccf70a9f60b709dad43d7f	2015-07-20 15:15:27 -07:00
Yaowu Xu	add779e425	Merge "Remove vp9_ prefix from bit writer files"	2015-07-20 21:21:53 +00:00
Yaowu Xu	7a63e6446b	Merge "Move bit writer files to vpx_dsp/"	2015-07-20 21:21:41 +00:00
Jingning Han	f987e64476	Merge "Unify the high bit-depth forward hybrid transforms"	2015-07-20 20:19:03 +00:00
Jingning Han	9e23c6d534	Merge "Refactor highbd forward transform use case"	2015-07-20 20:18:22 +00:00
Yaowu Xu	1fcef81cb0	Remove vp9_ prefix from bit writer files Change-Id: I07647c7482b9ec498fbad3a9c9901f72b2336500	2015-07-20 11:20:03 -07:00
Yaowu Xu	c5ad31e518	Move bit writer files to vpx_dsp/ Change-Id: Id27e0007a0feac821ca66bcecbf3a723305da82d	2015-07-20 11:20:02 -07:00
Jingning Han	e253eaa036	Unify the high bit-depth forward hybrid transforms The SSE2 version high bit-depth forward hybrid transforms are essentially using the C functions via cross referencing to 1-D functions in vp9_dct.c. This commit unifies the two versions and removes the unnecessary dependency. Change-Id: Ib4d0702a138f8daf7d0bd97c141ee7088f293765	2015-07-20 11:17:49 -07:00
hui su	f744613be9	Fix uninitialized value warning Change-Id: Ib919a8ec2ec66d460d2f8a26d72aabc09dcbbd72	2015-07-20 11:13:00 -07:00
Jingning Han	389ed6da10	Refactor highbd forward transform use case Separate the hybrid transform case from 2D-DCT case. This will allow us to clear up cross dependency between c and SIMD implementations later. Change-Id: Iaa499e8b096850a1c5a0c50a3b6e63e15d0184bf	2015-07-20 10:31:17 -07:00
Yaowu Xu	345ff1a2f2	Merge "Removed vp9_ prefix from vpx_dsp/bitreader file names"	2015-07-20 17:12:08 +00:00
Yunqing Wang	f65473c036	Merge "Migrate quantization functions from vp9/ to vpx_dsp/"	2015-07-20 16:20:07 +00:00
Yaowu Xu	87d2c3c063	Removed vp9_ prefix from vpx_dsp/bitreader file names Change-Id: I0426126d0a65f13f9250983e44cc366b1b1a9c4a	2015-07-20 08:57:35 -07:00
Yaowu Xu	b0e6811ace	Merge "Move bit reader files to vpx_dsp"	2015-07-20 14:52:50 +00:00
Jingning Han	a2b623d467	Merge "Remove dspr2 loop filter files from vp9_common.mk"	2015-07-18 01:33:35 +00:00
Jingning Han	44925b4c17	Merge "Rename loop filter function from vp9_ to vpx_"	2015-07-18 01:33:15 +00:00
Jingning Han	fd15cd5ad9	Remove dspr2 loop filter files from vp9_common.mk These files have been moved to vpx_dsp directory. Clean the vp9_common make file accordingly. Change-Id: I9b1e820376421c801f705157e60cc7a55487f469	2015-07-17 16:38:53 -07:00
Yunqing Wang	38f1fbbb75	Migrate quantization functions from vp9/ to vpx_dsp/ The following quantization functions were moved: vp9_quantize_b vp9_quantize_b_32x32 vp9_highbd_quantize_b vp9_highbd_quantize_b_32x32 vp9_quantize_dc vp9_quantize_dc_32x32 vp9_highbd_quantize_dc vp9_highbd_quantize_dc_32x32 The purpose of doing that was to allow these functions to be shared by multiple codecs. Change-Id: Id8ab939f283353cdd07bd930d47db3d932a5d87f	2015-07-17 16:38:14 -07:00
Jingning Han	2992739b5d	Rename loop filter function from vp9_ to vpx_ Change-Id: I6f424bb8daec26bf8482b5d75dd9b0e45c11a665	2015-07-17 15:55:02 -07:00
Yaowu Xu	97279ed2e2	Move bit reader files to vpx_dsp Change-Id: Ib1cb1fbe92a39ff5312cee069559be6d3ea458d0	2015-07-17 15:38:40 -07:00
Marco	479c669a61	Merge "Dynamic resize 1 pass mode: fix buffer underflow threshold."	2015-07-17 21:31:56 +00:00
Jingning Han	4735edd00f	Migrate mips dspr2 loop filter implementation from vp9 to vpx This commit moves the loop filter dspr2 implementation from vp9 to vpx_dsp directory. It also fixes header file format issues. Change-Id: I09203ed4bd267d7fd76bb79a6ee84a37646206b2	2015-07-17 11:51:05 -07:00
Marco	7501de267c	Dynamic resize 1 pass mode: fix buffer underflow threshold. Remove the use of drop_frames_water_mark, as this is used for frame dropping control. Use fixed threshold for now on buffer underflow. Change-Id: If0ddda9f7f6fa96067cdcb0eccb42e17bda37c32	2015-07-17 11:25:15 -07:00
Yaowu Xu	7c0c62df1d	Correctly report "Unsupported bitstream profile" For vp9 decoder build without profile 2 and profile 3 support, this commit changes to report error "Unsupported bitstream profile" for input streams in profile 2 or 3, rather than other misleading error information. In addition, one of the invalid files in unit tests is actually coded profile 2, this commit makes it tested only when the decoder is built with vp9-highbitdepth. This fixes issue #1028. Change-Id: I8b6c1210787c8f89c703a546687dcf973ac20fc0	2015-07-17 10:51:02 -07:00
Jingning Han	d0750d287f	Resolve dspr2 loop filter dependency complexity Narrow the scope of dependency required by the dspr2 implementation of loop filters. Change-Id: Ib8d99dc7d9c231f69dd31d02e0a89e5bd0545a28	2015-07-17 10:38:35 -07:00
Jingning Han	55e80a3cc6	Replace vp9_common_dspr2.h with common_dspr2.h Narrow the scope of dependency in dspr2 loop filter implementation. Change-Id: I30426d7e4d41575a82286f1d3c5881aeb99a3250	2015-07-17 10:31:38 -07:00
Jingning Han	b8ff84b7f8	Create common dspr2 header file in vpx_dsp Move the common prefetch_load/store in dspr2 to header file in vpx_dsp/mips. Change-Id: I8acc22970f2a0ef97d73061e39a3ae65c6955eac	2015-07-17 09:54:02 -07:00
Jingning Han	3590a4b437	Merge "Simplify dependencies in dspr2 related codes"	2015-07-17 16:12:52 +00:00
Jingning Han	845aad42b8	Merge "Migrate loop filter functions from vp9/ to vpx_dsp/"	2015-07-17 16:12:01 +00:00
Jingning Han	d190ad228f	Simplify dependencies in dspr2 related codes The common_dspr2.h should be independent of codec-specific data structures. Change-Id: I34ee1f9552c2d2d205fd7f1813cdf312c7ff5d2b	2015-07-16 18:22:48 -07:00
Jingning Han	50adfdf5ba	Migrate loop filter functions from vp9/ to vpx_dsp/ The various tap loop filter operations are common functions across codec. This commit moves them along with SIMD optimizations to vpx_dsp folder. Change-Id: Ia5fa0b2e5289cdb98467502a549c380b9c60e92c	2015-07-16 16:40:47 -07:00
Marco	f83f9dbb3a	Merge "Dynamic resize for 1 pass: update of golden frame."	2015-07-16 19:38:27 +00:00
Marco	7ae1aa6b37	Dynamic resize for 1 pass: update of golden frame. In aq-mode=3 under a resizing action (i.e., resize_pending != 0), force an update of the golden reference frame. Change-Id: I14806f6db71b5f8c827678cc5e1fc913c138a9a4	2015-07-16 09:27:20 -07:00
paulwilkins	7d15444d07	Fix bug in setting sf->use_square_partition_only. Fix bug in setting this flag for animated content. The bug did cause quality to increase because far more frames are not boosted than boosted. However, the speed trade off to gain is a lot less favorable and the behavior was not as intended. Change-Id: I89fb70419c88b26f40b3534de0481730a1b3fcfa	2015-07-16 16:20:39 +01:00
Frank Galligan	8be1dcb4cb	Merge "Add vp9_int_pro_col_neon."	2015-07-16 05:45:17 +00:00
Jingning Han	b946e5ce0f	Merge "Add vpx_dsp_common.h file"	2015-07-15 22:41:54 +00:00
Jingning Han	de740b258b	Merge "Remove redundant header files in vp9_loopfilter_filters.c"	2015-07-15 22:41:11 +00:00
Marco	eaf1ffd837	Merge "Fix to resize logic for 1 pass mode."	2015-07-15 21:43:07 +00:00
Jingning Han	db8e731b8d	Add vpx_dsp_common.h file Move the clamp functions to vpx_dsp_common.h file. Clear out the dependency of vp9_loopfilter_filters.c on vp9_common.h file. Change-Id: I9c4b928bcd7f597106b5aa96354356d3775a3431	2015-07-15 13:03:23 -07:00
Jingning Han	3fe83cdf81	Remove redundant header files in vp9_loopfilter_filters.c This cleans out the unnecessary dependency on vp9 codec-specific data structures. Change-Id: Iadbe431174a0f9bf9423f39ab854fc18be554bea	2015-07-15 12:44:47 -07:00
Marco	2f66fdd375	Adjust some logic for dynamic_resize 1 pass mode. Use drop_frames_water_mark for threshold on buffer underflow, and change threshold for resize down. Change-Id: I2de19adce50abe9bcdc0b107528cec8cc1857fcc	2015-07-15 11:54:04 -07:00
Frank Galligan	1c39998e39	Add vp9_int_pro_col_neon. BUG=https://code.google.com/p/webm/issues/detail?id=1023 Change-Id: I212a1d67b23ce3b5ce08800de369b25b9e375e7d	2015-07-15 09:04:28 -07:00
Marco	7b756183aa	Fix to source scaling for dynamic_resize. The fast scaling for 1 pass mode was being used only on the first frame after resizing event (because resize_scale_num/den is set to 1 and only changed for first frame following resize event). Change-Id: I723b63e21823eb858f25f5662d2bbe4f1842e61f	2015-07-15 08:28:59 -07:00
Marco	dc7da005d7	Fix to resize logic for 1 pass mode. Proper use/update of resize_state and resize_pending to constrain the total amount of downsizing to be at most one scale down, for now. Change-Id: Id18fc32499f2fbdbec16728dcdc9e4eac09098f0	2015-07-14 16:23:57 -07:00
Alex Converse	fa94dbda81	Merge "Add an SSE2 version of vp9_iwht4x4_16_add"	2015-07-14 22:11:47 +00:00
Alex Converse	d8426d6f12	Add an SSE2 version of vp9_iwht4x4_16_add Roughly half as many cycles as plain C. Change-Id: I8c16c29940b76d54ee7e4fb874c328ce90bff5d4	2015-07-14 14:23:32 -07:00
paulwilkins	e11878c8e3	Merge "Add extra resize trigger for frames above maximum allowed size."	2015-07-14 18:24:13 +00:00
Debargha Mukherjee	3c5244886a	Fixes part of merge regression from adding arf parameters. From Change Ibf0c30b72074b3f71918ab278ccccc02a95a70a0 There is still an issue relating to one animated test clip with repeat patterns where this change effectively increases the default maximum arf interval by +1. This can be examined separately. Change-Id: Idd01d5480fc45202d8a059a0c3afc0997cc5bdd1	2015-07-14 18:32:38 +01:00
Jingning Han	cda17e12ed	Merge "Refactor intra block prediction and reconstruction process"	2015-07-14 16:22:42 +00:00
Jingning Han	d5975b733b	Merge "Refactor intra block prediction function"	2015-07-14 16:22:21 +00:00
Jingning Han	cb1e817c77	Refactor intra block prediction and reconstruction process Flatten the intra block decoding process. It removes the legacy foreach_transformed_block use in the decoder. This saves cycles spent on retrieving the transform block position. Change-Id: I21969afa50bb0a8ca292ef72f3569f33f663ef00	2015-07-13 22:24:17 +00:00
Jingning Han	81452cf0b7	Refactor intra block prediction function This commit simplifies the intra block boundary condition logic. It removes the block index from the argument set. Change-Id: If00142512eb88992613d6609356dfd73ba390138	2015-07-13 15:20:47 -07:00
Marco	e03b8b78b2	Merge "Dynamic resize for real-time: source scaling"	2015-07-13 19:06:26 +00:00
Yaowu Xu	0cdc85d8cf	Merge "Revert "Add an SSE2 version of vp9_iwht4x4_16_add.""	2015-07-13 16:27:10 +00:00
Yaowu Xu	ae5394b9e2	Revert "Add an SSE2 version of vp9_iwht4x4_16_add." This reverts commit `f8d3501640`. Change-Id: If8c7af403c091b7fb447a6f0c73fecdbccbc51b3	2015-07-13 16:26:27 +00:00
Jim Bankoski	c243835303	Merge "Revert "Fill buffer speed up""	2015-07-13 14:39:01 +00:00
Jim Bankoski	da9db83270	Revert "Fill buffer speed up" This reverts commit `9b4f9f45ee`. Change-Id: I23545ac8c7464127f7466fc6a58de517874fe0cf	2015-07-13 13:47:46 +00:00
Marco	4bbd95512a	Dynamic resize for real-time: source scaling Use faster scaling on source. Change-Id: I968df97239a86834c96126b86832d3d6d0875a53	2015-07-10 11:04:18 -07:00
Jim Bankoski	db50037ece	Merge "Fill buffer speed up"	2015-07-09 20:26:23 +00:00
Jim Bankoski	9b4f9f45ee	Fill buffer speed up Eliminates the byte by byte read from bool decoder, by reading in a size_t and then shifting it into place. Change-Id: Id89241977103fc3b973e4ed172a5cbf246998e5d	2015-07-09 11:41:30 -07:00
paulwilkins	4b44e46de0	Merge "Changes to use of rectangular partitions."	2015-07-09 18:34:41 +00:00
Yaowu Xu	49fa5276fe	Merge "Remove clamp operations."	2015-07-09 17:49:18 +00:00
Yaowu Xu	f70c80289c	Merge "Clean out more MSVC warnings"	2015-07-09 17:49:08 +00:00
Scott LaVarnway	e8103f3676	Merge "Eliminate num_8x8 and num_4x4 width/height lookups"	2015-07-09 17:16:22 +00:00
Alex Converse	74f869b962	Merge "Add an SSE2 version of vp9_iwht4x4_16_add."	2015-07-09 16:57:03 +00:00
paulwilkins	2d637ca36d	Merge "Change speed and rd features for formatting bars."	2015-07-09 16:38:38 +00:00
Scott LaVarnway	13a4f14710	Eliminate num_8x8 and num_4x4 width/height lookups Also some log2 lookups. Pass in 8x8 block width/height and log2 num4x4s instead. Change-Id: I8ea9a1ec1e0bbab23f8ba556954a1b5433f4d613	2015-07-09 05:30:46 -07:00
Yaowu Xu	b58c99eb71	Remove clamp operations. The clamp calls with INT32_MIN and INT32_MAX have no effect at all on int values passed in, therefore this commit removes those effectless clamps and also adds more const intermediate results to make the code more readable. Change-Id: I66d8811f58bb74ec31cbec9a6c441983a662352e	2015-07-08 17:44:19 -07:00
Jingning Han	535cc6d87f	Format fixes in vp9_encodeframe.c and vp9_encodemb.c Change-Id: Ib1303dac9043ab1b1f8fce54611cf4ea8a208038	2015-07-09 00:04:28 +00:00
Jingning Han	8783a8a97c	Refactor transform block loop for inter mode decoding Rework the inter mode transform block decoding loop. Replace the block index with the row and col index as the input argument. It saves function call to compute the row and col index according to the block index and overall block size, and many if statements associated with the transform block position relative to the coding block. For the test bit-stream pedestrian_area 1080p at 5 Mbps, the decoding speed goes up from 81.13 fps to 81.92 fps. Note that the intra coded block decoding needs more refactoring work than the inter ones. So keep it using foreach_transformed_block for now. Change-Id: I5622bdae7be28ed5af96693274057f55ba9b4fb4	2015-07-08 22:55:16 +00:00
Yaowu Xu	c369daf3ea	Clean out more MSVC warnings Change-Id: I1bab0c104df2ec4825d050cd516e26ab635a7b3e	2015-07-08 15:09:20 -07:00
Alex Converse	f8d3501640	Add an SSE2 version of vp9_iwht4x4_16_add. 80% fewer cycles than C Change-Id: I841bde1e268ddd33ae2ee75eee94737a400e2cde	2015-07-08 15:00:51 -07:00
Alex Converse	8bf791e7ef	Merge "Don't allocate dqcoeff in MACROBLOCKD."	2015-07-08 20:42:36 +00:00
Alex Converse	89090d8046	Don't allocate dqcoeff in MACROBLOCKD. The encoder gets its dqcoeff from the context tree. In the decoder move it to directly after MACROBLOCKD. Change-Id: I46c9b76f26956a360d17de0b26ecb994dae34ecb	2015-07-08 12:37:55 -07:00
Jingning Han	66da771040	Merge "Refactor inverse_transform_block argument list"	2015-07-08 19:28:25 +00:00
Jingning Han	0497d3a827	Merge "Reset dqcoeff[0] only if eob is 1"	2015-07-08 19:27:22 +00:00
Frank Galligan	b770def572	Merge "VP9_LPF_VERTICAL_16_DUAL_SSE2 optimization"	2015-07-08 18:15:39 +00:00
paulwilkins	a6f2a9619b	Add extra resize trigger for frames above maximum allowed size. Even if the recode loop is not enabled for the current frame type, trap the case where the projected size of a frame is above the maximum allowed in recode_loop_test(). Change-Id: I453004694b8f8699e3c2a83252e9f83adccdda4e	2015-07-08 18:15:10 +01:00
paulwilkins	8dd466edc8	Changes to use of rectangular partitions. Changes to allow more use of rectangular partitions at speeds 1 and 2 for content classed by the first pass as animation and for blocks near the active image edge. This has quite a big impact in quality for the animated test sequence but also hurts encode speed for speed 2. For other content types the impact on both speed and quality is small. Added some plumbing for detection of internal vertical image edges. Change-Id: I3fc48de2349f8cb87946caaf0b06dbb0ea261a9a	2015-07-08 18:14:12 +01:00
paulwilkins	a126b6ce7d	Change speed and rd features for formatting bars. Change speed features / behavior for split mode when there is an internal active edge (e.g. formatting bars). Remove some threshold constraints in rd code near the active edge of the image. Add some plumbing for left and right active edge detection. Patch set 5. Limit rd pass through for sub 8x8 to internal active edges. This takes away any speed penalty for most clips but keeps the enhanced edge coding for the more critical case of internal image edges Change-Id: If644e4762874de4fe9cbb0a66211953fa74c13a5	2015-07-08 17:51:42 +01:00
Jingning Han	7e0d0de211	Refactor inverse_transform_block argument list Replace block index with transform type in the argument list. This allows to save an extra fetch to the prediction mode. For pedestrian area 1080p coded at 5 Mbps with single tile, the average decoding speed goes up from 80.55 fps (before the refactoring series) to 81.13 fps. Change-Id: Icbebf84ce63c19c0c92f3690ed201f6c3eab7881	2015-07-08 09:26:02 -07:00
James Zern	892128f6ca	Merge "vp9_entropymv: remove vp9_get_mv_mag()"	2015-07-08 01:27:13 +00:00
Frank Galligan	5327fcf857	Merge "Add vp9_int_pro_row_neon."	2015-07-08 00:16:03 +00:00
Johann	ac7f403cbe	Merge "Move sub pixel variance to vpx_dsp"	2015-07-07 23:57:18 +00:00
Jingning Han	55c2646666	Merge "Rework scan order fetch logic for decoder"	2015-07-07 23:09:39 +00:00
Johann	6a82f0d7fb	Move sub pixel variance to vpx_dsp Change-Id: I66bf6720c396c89aa2d1fd26d5d52bf5d5e3dff1	2015-07-07 15:51:04 -07:00
Marco	155b9416b3	Merge "Update to speed 5 non-rd mode partition search."	2015-07-07 22:47:47 +00:00
Jingning Han	c2d0f9ddeb	Merge "Add vp9_ prefix to init_macroblockd"	2015-07-07 22:35:45 +00:00
Jingning Han	6e6c57da9a	Merge "Reduce dqcoeff array size in decoder"	2015-07-07 22:35:31 +00:00
Jingning Han	76ccba9ec8	Reset dqcoeff[0] only if eob is 1 If only the first dequantized coefficient is non-zero, reset dqcoeff[0] to zero directly. Change-Id: I0197ba72028a8ec436f0b1b9abcc1c0ae5d70abe	2015-07-07 15:20:34 -07:00
Jingning Han	97d1f1aaae	Rework scan order fetch logic for decoder Save redundant call for getting prediction mode to obtain scan order for detokenization. Change-Id: I0683ef119f1579d1261ed5d59052a1745b68ef6f	2015-07-07 15:03:21 -07:00
Jingning Han	a652048efd	Add vp9_ prefix to init_macroblockd Change-Id: I202d4924e627eec94838741df004ed9259d38b88	2015-07-07 12:00:01 -07:00
Marco	478fbc8f23	Update to speed 5 non-rd mode partition search. If the pre-selected partition size (from variance partition) is 32x32, also apply nonrd partition search for 32x32 and 16x16 size. Overall small positive gain in metrics, average ~1%. Some visual improvement, for lower resolutions. Change-Id: I69cb425bda94f7d13d34c451ab30e9276335a30e	2015-07-07 11:52:01 -07:00
Jingning Han	cccad1c5de	Reduce dqcoeff array size in decoder The decoding process handles detokenization and reconstruction per transform block sequentially. There is no need to offset the dqcoeff buffer according to the transform block index. This allows to reduce the memory spill and improve cache performance. Change-Id: Ibb8bfe532a7a08fcabaf6d42cbec1e986901d32d	2015-07-07 11:36:05 -07:00
Yaowu Xu	a8f8b83cef	Allows using optimized version vp9_fdct8x8 Change-Id: I59cecb7178a93cdee7ad535fa996ef0caa6e988c	2015-07-07 10:28:42 -07:00
paulwilkins	02b3b05278	Merge "Alter partition search at image edge."	2015-07-07 12:44:28 +00:00
paulwilkins	8051b6d256	Merge "Error score recalibration for inactive regions."	2015-07-07 08:44:35 +00:00
paulwilkins	00c0cbb445	Merge "ARF Boost correction for inactive regions."	2015-07-07 08:44:17 +00:00
James Zern	c6d90f0535	vp9_entropymv: remove vp9_get_mv_mag() inline the code directly in read_mv_component(), the only place where it was being used; this removes a function call in a hot function Change-Id: I66f99c0c9ce3bc310101dbca4a470f023cc6fb55	2015-07-06 22:30:21 -07:00
James Zern	8c6d5a874d	Merge "inline vp9_reader_has_error()"	2015-07-07 00:48:58 +00:00
James Zern	4ec8f9c5ae	Merge "vp9_variance*.c: make static tables const"	2015-07-06 22:52:39 +00:00
James Zern	1696114587	Merge "mips msa vp9 subpel variance optimization"	2015-07-06 22:43:01 +00:00
Jingning Han	fcb5a8692a	Merge "Move subtract functions from vp9 to vpx_dsp"	2015-07-06 22:39:26 +00:00
James Zern	cb4310fc58	vp9_variance*.c: make static tables const Change-Id: Ia5044d13c09685c401191fe87fbf90d36203aadd	2015-07-06 15:04:37 -07:00
Parag Salasakar	fbe67d307a	mips msa vp9 subpel variance optimization Change-Id: If88401bf8c5d8ee58200278734d7a5058d1585d0	2015-07-06 14:59:01 -07:00
Debargha Mukherjee	5256a4034b	Merge "Expose params min-gf-interval/max-gf-interval"	2015-07-06 21:36:40 +00:00
James Zern	91c412b6db	Merge "remove vp9_get_interp_kernel()"	2015-07-06 21:36:37 +00:00
James Zern	017253b7a3	remove vp9_get_interp_kernel() expose filter_kernels[] and do the table lookup directly Change-Id: I0b10bff0327c3e01a723736141a9ffd377cd3d20	2015-07-06 13:04:05 -07:00
Debargha Mukherjee	9852643373	Expose params min-gf-interval/max-gf-interval Adds two new vp9 parameters --min-gf-interval and --max-gf-interval to enable testing based on frequency of alt-ref frames. Also adds a unit-test to test enforcement of min-gf-interval. For both these parameters the default value is 0, which indicates they are picked by the encoder, based on resolution and framerate considerations. If they are greater than zero, the specified parameter is honored. (Additional note by paulwilkins) Note that there is a slight oddity in that key frames are also GFs and considered part of GF only group. However they are treated as not being part of an arf group because for arf groups the previous GF is assumed to be the terminal or overlay frame for the previous group. (end note) Change-Id: Ibf0c30b72074b3f71918ab278ccccc02a95a70a0	2015-07-06 12:24:59 -07:00
Jingning Han	432cd4bfb7	Move subtract functions from vp9 to vpx_dsp Factor out the subtraction operator as common function. Change-Id: I526e703477c6a290e0e3e3c8898f8bb1ca82779b	2015-07-06 12:22:47 -07:00
Jingning Han	39f03bf9c6	Merge "Rename vpx_thread to vpx_util"	2015-07-06 17:01:30 +00:00
James Zern	823a126d4c	Merge "Revert "Correct the inter prediction coordinate...""	2015-07-03 18:44:02 +00:00
hkuang	52e358f13e	Revert "Correct the inter prediction coordinate..." Change in `92b199061a` leads to frame parallel decode failure in extreme case. addresses issue #1010 Change-Id: I4fa488dac8e8c584f5eef4cae1640a579130d387	2015-07-03 11:05:28 -07:00
James Zern	3d4526322b	Merge "Revert "mips msa vp9 subpel variance optimization""	2015-07-02 21:07:32 +00:00
James Zern	4c5ac477cb	Merge "Revert "mips msa vp9 avg subpel variance optimization""	2015-07-02 21:07:24 +00:00
James Zern	97946622c0	Revert "mips msa vp9 subpel variance optimization" This reverts commit `a42df86c03`. this change causes MSA/VP9SubpelVarianceTest.Ref and MSA/VP9SubpelVarianceTest.ExtremeRef failures under mips32r5el-msa-linux-gnu and mips64r6el-msa-linux-gnu Change-Id: I40b71a0b774eaeb31f66f795733f95cf360909f7	2015-07-02 12:06:51 -07:00
James Zern	ced982640b	Revert "mips msa vp9 avg subpel variance optimization" This reverts commit `61774ad1c4`. this change causes MSA/VP9SubpelAvgVarianceTest.Ref failures under mips32r5el-msa-linux-gnu and mips64r6el-msa-linux-gnu Change-Id: I7fb520c12b2a3b212d5e84b7619a380a48e49bb0	2015-07-02 12:06:29 -07:00
levytamar82	3c5256d572	VP9_LPF_VERTICAL_16_DUAL_SSE2 optimization The vp9_lpf_vertical_16_dual function optimized for x86 32bit target. The hot code in that function was caused by the call to the transpose8x16. The gcc generated assembly created unneeded fills and spills to the stack. By interleaving 2 loads and unpack instructions, in addition to hoisting the consumer instruction closer to the producer instructions, we eliminated most of the fills and spills and improved the function-level performance by 17%. Credit for writing the function as well as finding the root cause goes to Erik Niemeyer (erik.a.niemeyer@intel.com) Change-Id: I6173cf53956d52918a047d1c53d9a673f952ec46	2015-07-02 11:56:11 -07:00
Jingning Han	d1b30ceaa3	Rename vpx_thread to vpx_util Change the dir name to include more util tools. Change-Id: Id5b16062803ce5eed872fe2edb36d7e56b32eed8	2015-07-02 10:02:37 -07:00
paulwilkins	99f8bd72cb	Alter partition search at image edge. Added code to reduce the minimum partition size searched for super blocks at or straddling the edge of the image. If the first pass has detected formatting bars the "active" edge may not be the real edge. Change-Id: I9c4bdd1477e60f162a75fac95ba6be7c3521e05c	2015-07-02 16:25:25 +01:00
paulwilkins	dc19f352af	Error score recalibration for inactive regions. Apply a correction to the frame error scores for frames with inactive regions. Change-Id: I217840f2efe7eafed3f5b8ddc7c468f1ca3d923c	2015-07-02 15:13:01 +01:00
paulwilkins	e4702deeec	ARF Boost correction for inactive regions. Correct the ARF boost calculations to partly discount inactive or very low energy regions of the image. Examples (formatting bars and 0 energy areas of animated clips). Change-Id: I241af058d10aba8c67a4deca36deb913047d4561	2015-07-02 14:15:46 +01:00
Jingning Han	8565a1c99a	Merge "Use vpx prefix for codec independent threading functions"	2015-07-02 04:24:54 +00:00
Jingning Han	66cf8098e6	Merge "Move multi-threading module functions into vpx_thread folder"	2015-07-02 04:24:37 +00:00
James Zern	1e0aa9497f	inline vp9_reader_has_error() this is tested for each block Change-Id: I229c6f0e9513fb206bdbce8be9699a4bf4008ca4	2015-07-01 19:10:43 -07:00
James Zern	e757808429	Merge "vp9_pred_common: inline vp9_get_tx_size_context"	2015-07-02 01:52:40 +00:00
James Zern	0ea304620c	Merge "vp9_pred_common: inline vp9_get_segment_id"	2015-07-02 01:52:21 +00:00
James Zern	95dc082168	Merge "vp9_dsubexp: replace some divides with shifts"	2015-07-02 01:51:25 +00:00
James Zern	b49de21d74	Merge "vp9/inv_remap_prob: simplify inv_map_table[]"	2015-07-02 01:51:06 +00:00
James Zern	f0b3b08fb4	Merge "vp9_dsubexp: remove clamp in inv_remap_prob()"	2015-07-02 01:50:46 +00:00
Jingning Han	04d2e57425	Use vpx prefix for codec independent threading functions Replace vp9_ prefix with vpx_ for common multi-threading functions. Change-Id: I941a5ead9bfe8213fdad345511d2061b07797b55	2015-07-02 00:47:54 +00:00
Jingning Han	3a3b0be09a	Move multi-threading module functions into vpx_thread folder This commit moves the primitive multi-threading files from vp9 folder to vpx_thread, which will be accessible by all vpx codec. Change-Id: Ib51e66e9c69801c10631fab56d35a0c0aaed5883	2015-07-01 17:45:49 -07:00
Johann	79fcc56781	Merge "Fix --disable-use-x86inc when used with --enable-vp9-highbitdepth"	2015-07-01 21:14:41 +00:00
Johann	8d5389171f	Merge "Fix --disable-use-x86inc"	2015-07-01 21:14:17 +00:00
Johann	1c967f17bd	Fix --disable-use-x86inc when used with --enable-vp9-highbitdepth Change-Id: I0ed6de72dc0bb99fc9c5b1f6500399b16754ffb3	2015-07-01 13:17:01 -07:00
Johann	ff8505a54d	Fix --disable-use-x86inc Change-Id: I374fcd8fb45a6893dcdeac6896671be142a99f06	2015-07-01 13:15:51 -07:00
James Zern	4f7e7c4d49	Merge "mips msa vp9 avg subpel variance optimization"	2015-07-01 20:05:50 +00:00
Scott LaVarnway	dc6d954bd2	Merge "Move inter_predictor to vp9_reconinter.h"	2015-07-01 20:01:53 +00:00
Scott LaVarnway	d157742788	Merge "VP9: Move ref_mvs[][] and mode_context[] from MB_MODE_INFO"	2015-07-01 12:52:21 +00:00
Parag Salasakar	61774ad1c4	mips msa vp9 avg subpel variance optimization average improvement ~3x-5x Change-Id: Iefbcafc05daab77b38a4e63b551e427867a501a4	2015-07-01 13:46:41 +05:30
James Zern	bd7162269f	vp9_dsubexp: replace some divides with shifts Change-Id: I24e10c37ea8f06600cd04b43512efa6170e23e5c	2015-06-30 20:09:00 -07:00
James Zern	5609858785	vp9/inv_remap_prob: simplify inv_map_table[] add one to each entry to remove the universal 'value + 1'. Change-Id: I8919b1d7fde8155d1728196c4d577db3064e2c1e	2015-06-30 19:58:08 -07:00
Parag Salasakar	a42df86c03	mips msa vp9 subpel variance optimization average improvement ~3x-5x Change-Id: I4cbba2711467b0e205904769ebbb4a1fcbb1a311	2015-07-01 07:51:34 +05:30
James Zern	8aaf5ec4c7	vp9_dsubexp: remove clamp in inv_remap_prob() the max value of the lookup in expanded form is: (((1 << 7) - 1) << 1) - 65 + 1 + 64 = 254 remove the clamp [0, 253] and add one table entry Change-Id: I0b5d0c66702fdb0b8f1cc9ab9b0dac66326e85a6	2015-06-30 15:49:29 -07:00
James Zern	fc5f3b8f4f	Merge "vp9_common_data: right-size tables"	2015-06-30 21:12:54 +00:00
Yaowu Xu	e943db045a	Merge "Fixed a variance calculation"	2015-06-30 19:48:33 +00:00
Parag Salasakar	fc3c456053	Merge "mips msa vp9 common macro comments updated"	2015-06-30 06:25:31 +00:00
Scott LaVarnway	c06d56cc7d	VP9: Move ref_mvs[][] and mode_context[] from MB_MODE_INFO to MB_MODE_INFO_EXT. This saves 36 bytes per 8x8 area for both the decoder and encoder. (encoder has two MODE_INFO buffers) Change-Id: If006abb2224acaf326df3c2be09e77e967662107	2015-06-29 12:46:47 -07:00
Scott LaVarnway	437d033dbb	Merge "Remove tile param"	2015-06-29 18:04:56 +00:00
Parag Salasakar	3c353e58c0	mips msa vp9 common macro comments updated Cosmetic/Grammatical corrections in vp9 macro comments Change-Id: I774b983aff854feb69c7e4442e8731ce4c995645	2015-06-29 11:52:28 +05:30
Yaowu Xu	9f14bbfd80	Fixed a variance calculation This commit fixed a mistake in variance calculation. Thanks to Xintong for spotting the error. Change-Id: Ia285fc0128c00f0234a73b0a7eba6adc88b8a7de	2015-06-26 15:54:43 -07:00
Parag Salasakar	b92cc27b76	mips msa vp9 temporal filter optimization average improvement ~4x-5x Change-Id: Iad9c0a296dbc2ea96d000bd009077999ed58a3c5	2015-06-26 12:00:24 +05:30
Parag Salasakar	c040f96e4b	mips msa vp9 subtract block optimization average improvement ~3x-4x Change-Id: Idbe4d13a00d05ff8be6559b116f416e42c3b4097	2015-06-26 09:23:56 +05:30
Parag Salasakar	d017f5ba38	Merge "mips msa vp9 block error optimization"	2015-06-26 03:42:31 +00:00
Parag Salasakar	1543f2b60e	mips msa vp9 block error optimization average improvement ~3x-4x Change-Id: If0fdcc34b17437a7e3e7fb4caaf1067bc175f291	2015-06-26 09:04:00 +05:30
James Zern	28a8226350	vp9_common_data: right-size tables Change-Id: I2206ee148a46b234df58f2b623e9f32f26033e04	2015-06-25 20:20:40 -07:00
Marco	1c7b1f9aec	Update to dynamic resize logic for 1pass CBR. Only do the check for resizing if the feature is selected (i.e., resize_mode = RESIZE_DYNAMIC). And modify condition for checking to be resize_count >= window, (since framerate can change). Change-Id: Idceb4e50956bb965a1492b4993b0dcb393c9be4d	2015-06-25 12:28:43 -07:00
Marco	3dd9cde2a5	Fix to unstable build from commit 517a66. Change-Id: I123db2d20ae65a10e2dec95eec61150e2f69546d	2015-06-24 17:28:57 -07:00
James Zern	d219f2b9d2	Merge "vp9_reconintra_neon: add d45 16x16"	2015-06-24 21:23:15 +00:00
Marco	517a662005	aq-mode=3: Reduce boost for segment#2 at low bitrates/low res. Reduce boost for segment#2 for low bitrates and low-res. This change is to reduce the rate overshoot at low bitrates. No change in behavior, except at the very low bitrates. Change-Id: I0dbd9d3b6356da5804de94adf10fca6a7a8f8948	2015-06-23 16:50:43 -07:00

8287 Commits