generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	1ad7c1f250	Renaming y1dc_delta_q, uvdc_delta_q, uvac_delta_q fields from VP9Common. New names are y_dc_delta_q, uv_dc_delta_q, uv_ac_delta_q. Change-Id: I4acae1fc23a4697ce2c5a5becb8dc28ef0a4b552	2013-04-16 15:05:52 -07:00
Ronald S. Bultje	94996b9d26	Slightly hackish workaround to support rectangles in directional intra predictors. Change-Id: I8a4da6925f2d58a426c4d122df8b97bb69452e49	2013-04-16 14:33:03 -07:00
John Koleszar	e3cfe4e89e	Remove the mb_no_coeff_skip flag This flag was added to VP8 to allow a mode where MB-level skipping was not allowed, saving a bit per mb. It was never used in practice, and hasn't been tested in VP9, so remove it. Change-Id: Id450ec6904c6d06c1919508e7efc52d05cde5631	2013-04-16 12:36:16 -07:00
Dmitry Kovalev	5953a98631	Merge "Code cleanup inside vp9_reconintra4x4.c file." into experimental	2013-04-16 10:24:32 -07:00
Dmitry Kovalev	b30182c733	Merge "Adding mv_joint_vertical and mv_joint_horizontal functions." into experimental	2013-04-16 10:24:01 -07:00
Yunqing Wang	e87c7f0930	Merge "Optimize the scaling calculation" into experimental	2013-04-16 09:14:22 -07:00
Scott LaVarnway	466f395148	Merge "Removing extra params from x_add_residual() functions" into experimental	2013-04-16 08:58:28 -07:00
Yunqing Wang	148eb803bb	Optimize the scaling calculation In decoder, the scaling calculation, such as (mv * x_num / x_den), is fairly time-consuming. In this patch, we check if the scaling happens or not at frame level, and then decide which function to call to skip scaling calculation when no scaling is needed. Tests showed a 3% decoder performance gain. Change-Id: I270901dd0331048e50368cfd51ce273dd82b8733	2013-04-16 08:52:40 -07:00
Scott LaVarnway	6f95d53e37	Removing extra params from x_add_residual() functions Now that the predictor is the dest, we do not need the extra parameters. Change-Id: I31e2c3d2015f4a1cd12e7f04536d8db478582a0a	2013-04-16 09:59:01 -04:00
John Koleszar	4054ff5da5	Merge "Removing TRUE and FALSE macro definitions." into experimental	2013-04-16 06:55:13 -07:00
John Koleszar	7f7d1357a2	Merge branch 'experimental' into master VP9 preview bitstream 2, commit '868ecb55a1528ca3f19286e7d1551572bf89b642' Conflicts: vp9/vp9_common.mk Change-Id: I3f0f6e692c987ff24f98ceafbb86cb9cf64ad8d3	2013-04-16 06:49:46 -07:00
Scott LaVarnway	5393379c84	Merge "Removing extra params in dequant functions" into experimental	2013-04-16 06:37:00 -07:00
Dmitry Kovalev	a0d9309eab	Removing TRUE and FALSE macro definitions. Using regular 0 and 1 constants now. Change-Id: Ie763503cbb727847cc8f1d6506cd6f2ee607f056	2013-04-15 15:24:39 -07:00
Ronald S. Bultje	f7d43d21bd	Merge "Add rectangular block size variance/sad functions." into experimental	2013-04-15 14:20:25 -07:00
Jingning Han	aaf33d7df5	Add rectangular block size variance/sad functions. With this, the RD loop properly supports rectangular blocks. Change-Id: Iece79048fb4e84741ee1ada982da129a7bf00470	2013-04-15 13:39:07 -07:00
Dmitry Kovalev	fd61b7ea10	Adding mv_joint_vertical and mv_joint_horizontal functions. Change-Id: Ieaec2c48f3752b8558ba051caaf4ba2ab0e9e84d	2013-04-15 12:07:26 -07:00
Dmitry Kovalev	64de375e1f	Code cleanup inside vp9_reconintra4x4.c file. Using ROUND_POWER_OF_TWO macro, using array initialization syntax for less code. Change-Id: I661453a6b29a9046fcff0a3f18fccb452b5eb39d	2013-04-15 11:15:56 -07:00
Scott LaVarnway	74610b1ae4	Removing extra params in dequant functions Now that the predictor is the dest, we do not need the extra parameters. Change-Id: I78db73d39b5aff62f15303f3d51ad2797eae74b6	2013-04-15 13:43:11 -04:00
Yaowu Xu	757e138a3b	Merge "Reorder enum i4X4 predcition modes" into experimental	2013-04-15 10:37:37 -07:00
Adrian Grange	4ee671a15c	Merge "Initial addition of multiple ARF frames" into experimental	2013-04-15 09:46:16 -07:00
Adrian Grange	c2876cf0fd	Initial addition of multiple ARF frames This is work-in-progress, it implements multiple ARF encoding behind an experimental flag. It adds the ability to insert multiple ARF frames into a single ARF group. This patch implements the reordering of the coded frames, and implements a fixed-length coding pattern. It applies a fixed quantizer strategy based on where the frame is in the coding sequence. Further work to modify the rate control strategy is ongoing and will be submitted via a set of future patches. In this first step, each ARF group is recursively bisected and an ARF frame added at that position in the sequence. The recursion continues until ARF frames are within MIN_GF_INTERVAL frames. The code sits behind the "multiple-arf" experimental flag ("CONFIG_MULTIPLE_ARF"). The experimental flag "oneshotq" ("CONFIG_ONESHOTQ") also needs to be enabled for this patch to work correctly. Change-Id: Ie473b05ebb43ac473c0cfb659b2b8042823085e2	2013-04-15 09:11:39 -07:00
Dmitry Kovalev	8ae091823d	Merge "Encoder code cleanup." into experimental	2013-04-14 10:58:44 -07:00
Dmitry Kovalev	ee9ce0e7d7	Merge "Intra code cleanup." into experimental	2013-04-14 04:34:16 -07:00
Dmitry Kovalev	399a6cbcde	Merge "Renaming vp9_token_struct to vp9_token and removing previous typedef." into experimental	2013-04-14 04:31:39 -07:00
Dmitry Kovalev	78ddf964cd	Intra code cleanup. Removing redundant code. Change-Id: I71bfc40a1fb06d8e3149ed5400aa4dfd87a51aac	2013-04-12 16:53:04 -07:00
Jingning Han	3ba9dd4165	Enable inter predictor for rectangular block size Combine superblock inter predictors into a unified function that allows configurable block width and height. The inter predictions of block sizes smaller than 16x16 are handled differently. To be continued on merging them later. Change-Id: I14075959dd5e221f00c205c99ca35c1c31ef728e	2013-04-12 11:51:58 -07:00
Yaowu Xu	c2ad69bcf4	Reorder enum i4X4 predcition modes To match the order of directional intra prediction modes for larger blocks, also renamed the i4x4 prediction modes to mirror the larger variants. Change-Id: I77cea4d0add6c7758460bf9c7a2fe59aca601f0b	2013-04-12 10:13:23 -07:00
Yaowu Xu	7de5edd14a	Rename B_PRED to I4X4_PRED So it is consistent with I8x8_PRED. Change-Id: Iefa65124b2419690d83e526c611129c0ede29d11	2013-04-12 09:23:58 -07:00
Jingning Han	815e95fbeb	Make intra predictor support rectangular blocks The intra predictor supports configurable block sizes. It can handle intra prediction down to 4x4 sizes, when enabled in BLOCK_SIZE_TYPE. Change-Id: I7399ec2512393aa98aadda9813ca0c83e19af854	2013-04-11 16:45:57 -07:00
John Koleszar	2f19cd03aa	Merge "Remove unused vp9_recon_mb{y,uv}_s" into experimental	2013-04-11 15:51:20 -07:00
Scott LaVarnway	cff266bbef	Merge "WIP: removing predictor buffer usage from decoder" into experimental	2013-04-11 15:24:33 -07:00
Ronald S. Bultje	56d01ee0a6	Merge "Remove unused macroblock versions of reconstruction functions." into experimental	2013-04-11 15:19:08 -07:00
Deb Mukherjee	7a97959f13	Merge "Turning model-based updates on with modelcoefprob" into experimental	2013-04-11 14:54:53 -07:00
Deb Mukherjee	66f413af4f	Turning model-based updates on with modelcoefprob This patch changes the default with the modelcoefprob experiment to use model-based forward updates with one-node pegged modeling. The maximum difference with fully trained tables is now less than 0.1%. Change-Id: I06b44322e10c6703f93f3c1d48d973b1136a0618	2013-04-11 14:45:26 -07:00
John Koleszar	4ba74ae81a	Merge "Remove unused vp9 ppc files" into experimental	2013-04-11 14:39:18 -07:00
John Koleszar	c382ed09f8	Remove unused vp9_recon_mb{y,uv}_s These functions now are handled through the common superblock code. Change-Id: Ib6688971bae297896dcec42fae1d3c79af7a611c	2013-04-11 14:05:59 -07:00
Scott LaVarnway	6189f2bcb1	WIP: removing predictor buffer usage from decoder This patch will use the dest buffer instead of the predictor buffer. This will allow us in future commits to remove the extra mem copy that occurs in the dequant functions when eob == 0. We should also be able to remove extra params that are passed into the dequant functions. Change-Id: I7241bc1ab797a430418b1f3a95b5476db7455f6a	2013-04-11 13:55:18 -07:00
John Koleszar	8bf6de725c	Merge changes I6721e42f,Iaffb1ae8 into experimental * changes: tokenize: convert skippable functions Add foreach_transformed_block	2013-04-11 13:36:25 -07:00
John Koleszar	633d9e7b4f	Remove unused vp9 ppc files Change-Id: I3fe8c529ddec658cfa2376cfc05d9c8a5366e978	2013-04-11 13:29:37 -07:00
Dmitry Kovalev	24f18e1c34	Renaming vp9_token_struct to vp9_token and removing previous typedef. Change-Id: If69c3d795f87af5cc7bfdfe70ef733c41b4d55c8	2013-04-11 13:01:52 -07:00
John Koleszar	c2bd46bf45	tokenize: convert skippable functions Use the common block walker to calculate skippability. Change-Id: I6721e42f065df237426c91c1d871ec226ba7cdcb	2013-04-11 12:27:37 -07:00
Ronald S. Bultje	13e41ba440	Remove unused macroblock versions of reconstruction functions. More specifically, remove vp9_quantize_mb, vp9_optimize_mb, vp9_inverse_transform_mb* and vp9_transform_mb. Instead, use the generic _sb functions that take a size argument, and call them with BLOCK_SIZE_MB16X16. Change-Id: I33024afea95d3a23ffbc1df7da426e4645110f29	2013-04-11 12:27:15 -07:00
John Koleszar	42471f6b72	Add foreach_transformed_block Adds a framework for doing arbitrary functions on each transform- sized block in the mb/sb. Change-Id: Iaffb1ae8db5ff2abfa8720c608c78376b42f2096	2013-04-11 11:42:19 -07:00
John Koleszar	c18b2617a4	Remove vp9_reset_mb_tokens_context Use sb-common version instead. Change-Id: If2552b5a39fd2e5272f66a41c5667dda85fd3939	2013-04-11 11:39:19 -07:00
Dmitry Kovalev	ec299e2092	Encoder code cleanup. Removing duplicated code from vp9_encodemv.c and reusing ROUND_POWER_OF_TWO macro definitions. Change-Id: I9caf0c17f761ada7905cb99a3e2a31f871fef0f9	2013-04-11 11:08:00 -07:00
Ronald S. Bultje	8fb5be48a6	Make usage of sb_type independent of literal values. Change-Id: I0d12f9ef9d960df0172a1377f8e5236eb6d90492	2013-04-10 17:38:57 -07:00
Ronald S. Bultje	b4f6098ef7	Make RD superblock mode search size-agnostic. Merge various super_block_yrd and super_block_uvrd versions into one common function that works for all sizes. Make transform size selection size-agnostic also. This fixes a slight bug in the intra UV superblock code where it used the wrong transform size for txsz > 8x8, and stores the txsz selection for superblocks properly (instead of forgetting it). Lastly, it removes the trellis search that was done for 16x16 intra predictors, since trellis is relatively expensive and should thus only be done after RD mode selection. Gives basically identical results on derf (+0.009%). Change-Id: If4485c6f0a0fe4038b3172f7a238477c35a6f8d3	2013-04-10 16:50:30 -07:00
Yaowu Xu	8e9819230d	Merge "Remove obselete code" into experimental	2013-04-10 14:56:28 -07:00
Yaowu Xu	2da90fddc2	Remove obselete code The strategy to run fast loop filter picking for encoder speed-up should be revisited at a later stage. Change-Id: I3b75e06d767cff41be952a42e63b3292f4eab996	2013-04-10 13:45:22 -07:00
Dmitry Kovalev	0cef7234e1	Merge "Fixing upper case names." into experimental	2013-04-10 13:29:38 -07:00
Dmitry Kovalev	20645ec4fb	Merge "Cleanup of set_offsets function." into experimental	2013-04-10 10:15:13 -07:00
Ronald S. Bultje	1932828d19	Merge "Make SB coding size-independent." into experimental	2013-04-10 08:51:58 -07:00
Ronald S. Bultje	9b46e30494	Merge "Don't use BLOCKD in vp9_invtrans.c." into experimental	2013-04-09 21:36:09 -07:00
Ronald S. Bultje	a3874850dd	Make SB coding size-independent. Merge sb32x32 and sb64x64 functions; allow for rectangular sizes. Code gives identical encoder results before and after. There are a few macros for rectangular block sizes under the sbsegment experiment; this experiment is not yet functional and should not yet be used. Change-Id: I71f93b5d2a1596e99a6f01f29c3f0a456694d728	2013-04-09 21:28:27 -07:00
John Koleszar	a3ec4cbd33	Merge "detokenize: use consistent structure for all block sizes" into experimental	2013-04-09 14:18:59 -07:00
Dmitry Kovalev	c34f6fcb54	Fixing upper case names. Renaming Y1dequant to y_dequant, UVdequant to uv_dequant, QIndex to qindex. Change-Id: I1c356e5f886deb3f8807dc212de9799b55b09d58	2013-04-09 10:46:57 -07:00
Dmitry Kovalev	df76a617b4	Cleanup of set_offsets function. Adding ALLOWED_REFS_PER_FRAME constant instead of hard coded number 3. Change-Id: I46146aa837896936f920c748c7d4aa4c27f026e4	2013-04-09 10:17:22 -07:00
Jingning Han	b3935e8348	Merge "Clamp inferred motion vectors only" into experimental	2013-04-09 09:24:08 -07:00
John Koleszar	e6deea4e60	detokenize: use consistent structure for all block sizes Restructure the code to avoid the majority of per-block-size switches, code duplication, etc. All block types (mb/sb32/sb64) can be handled by the same code. Change-Id: I4022718d66e31a15a7074e43f3b98cd0a5124ea7	2013-04-08 13:11:40 -07:00
Ronald S. Bultje	f42bee7edf	Don't use BLOCKD in vp9_invtrans.c. Change-Id: I40524170334109e2864b06e3c73c8b34e5aa8b0f	2013-04-08 11:37:29 -07:00
Jingning Han	12bf0796e6	Clamp inferred motion vectors only Clamp only the motion vectors inferred from neighboring reference macroblocks. The motion vectors obtained through motion search in NEWMV mode are constrained during the search process, which allows a relatively larger referencing region than the inferred mvs. Hence further clamping the best mv provided by the motion search may affect the efficacy of NEWMV mode. Synchronized the decoding process. The decoded mvs in NEWMV modes should be guaranteed to fit in the effective range. Put a mv range clamping function there for security purposes. This improves the coding performance of high motion sequences, e.g., derf set: foreman 0.233%, husky 0.175%, icd 0.135%, mother_daughter 0.337%, pamphlet 0.561%; stdhd set: blue_sky 0.408%, city 0.455%. Also saw sunflower go down (-0.469%). Change-Id: I3fcbba669e56dab779857a8126a91b926e899cb5	2013-04-08 11:37:03 -07:00
Ronald S. Bultje	aeefa6e194	Fix typo which breaks 4x4 splitmv compound prediction RD code. 0.15% quality increase on derf, particularly noticeable on hard clips at the higher bitrate end. Change-Id: I02415a96eb9bbc361cba923069625fae71844bc9	2013-04-08 09:17:52 -07:00
John Koleszar	0e7b7e47c2	Merge "Small cleanup inside setup_loopfilter function." into experimental	2013-04-05 16:13:46 -07:00
John Koleszar	8bbabbea70	Merge "Segmentation code cleanup." into experimental	2013-04-05 16:03:25 -07:00
John Koleszar	fa135d7b9e	Merge changes Ibbfa68d6,Idb76a0e2 into experimental * changes: Move EOB to per-plane data Move qcoeff, dqcoeff from BLOCKD to per-plane data	2013-04-05 15:56:50 -07:00
Ronald S. Bultje	36c3a67c20	Remove full-pixel-related code. This is a VP8-only feature (part of profile 3) that is unsupported in VP9. Change-Id: I78016eede8d9c834d44d4c517f3e8b8fc2a378b1	2013-04-05 12:50:19 -07:00
Dmitry Kovalev	421baef49e	Small cleanup inside setup_loopfilter function. Change-Id: If7fa8aea02f26c2c2bb5daf4e65c3e661d7031ca	2013-04-05 12:48:48 -07:00
Ronald S. Bultje	61834f7325	Remove some unused macros. Change-Id: Ic219e7878428128e4bb1b3995e8151f92b6bd9c3	2013-04-05 12:40:56 -07:00
Ronald S. Bultje	0732a61c37	Remove struct POS. It is never used. Change-Id: If7462357c0498ed05af2645f0c272124381d3aab	2013-04-05 12:38:40 -07:00
Ronald S. Bultje	1cb34c32ed	Remove unused vpx_log() function prototype. Change-Id: Icd6b4322841fefcc86f06645e6aaf1ea42fdfabd	2013-04-05 12:37:45 -07:00
Ronald S. Bultje	5cd235c6cd	Remove "tx_type" member from union b_mode_info. It is never used. Change-Id: Ibae898c52c766aabf65868611060f9c38fb85b35	2013-04-05 12:36:15 -07:00
Dmitry Kovalev	2c42499513	Segmentation code cleanup. Cleaning up the code, removing unused vp9_check_segref_inter function and useless comments. Change-Id: Ia0e1a3878dc0f9789cba84aeb507a83d9dccd26b	2013-04-05 11:55:52 -07:00
John Koleszar	05a79f2fbf	Move EOB to per-plane data Continue migrating data from BLOCKD/MACROBLOCKD to the per-plane structures. Change-Id: Ibbfa68d6da438d32dcbe8df68245ee28b0a2fa2c	2013-04-04 21:30:23 -07:00
John Koleszar	4c05a051ab	Move qcoeff, dqcoeff from BLOCKD to per-plane data Start grouping data per-plane, as part of refactoring to support additional planes, and chroma planes with other-than 4:2:0 subsampling. Change-Id: Idb76a0e23ab239180c818025bae1f36f1608bb23	2013-04-04 16:30:57 -07:00
John Koleszar	4d9dbb2ae8	Merge "Reimplementation of setup_frame_size." into experimental	2013-04-03 21:04:29 -07:00
Dmitry Kovalev	d5a017300c	General code cleanup. Making code more readable in different places. Change-Id: Iea92c9a35e64d257ee358879fc04fc926843d52e	2013-04-03 18:40:17 -07:00
Yunqing Wang	dcd3a5c055	Merge "Modify vp9_setup_interp_filters function" into experimental	2013-04-03 14:09:01 -07:00
Yunqing Wang	4ca882f32f	Modify vp9_setup_interp_filters function Took vp9_setup_scale_factors_for_frame() out from vp9_setup_interp_filters(), so that it is only called once per frame instead of per macroblock. Decoder tests showed a 1.5% performance gain. Change-Id: I770cb09eb2140ab85132f82aed388ac0bdd3a0aa	2013-04-03 13:49:55 -07:00
Dmitry Kovalev	da0232fd59	Reimplementation of setup_frame_size. General code cleanup in loopfilter code. Modification of setup_frame_size, so now VP9_COMMON is modified in one place after all width/height checks passed. Change-Id: Iedf32df43a912d7aae788ed276ac6c429973f6fe	2013-04-03 12:21:47 -07:00
John Koleszar	30d83c4159	Merge "Fix overlapping writes by copy_and_extend_plane" into experimental	2013-04-03 11:54:29 -07:00
John Koleszar	8b71b8a6de	Merge "Renaming sb32_coded and sb64_coded fields." into experimental	2013-04-02 21:49:03 -07:00
John Koleszar	dc12e6c0dc	Merge "Lower case names for struct members." into experimental	2013-04-02 21:27:32 -07:00
Dmitry Kovalev	dca8ad178c	Renaming sb32_coded and sb64_coded fields. Renaming sb32_coded to prob_sb32_coded and sb64_coded to prob_sb64_coded. Change-Id: I6de5cad00a57c3e066d53467f8c38cb6073dce11	2013-04-02 18:21:55 -07:00
John Koleszar	01247f67a7	Fix overlapping writes by copy_and_extend_plane Broken by refactoring commit `180cd5faa5` Change-Id: I307f6e54d93219a31e7336f1633103ecb25e4832	2013-04-02 14:58:10 -07:00
John Koleszar	42db454c7f	Merge branch 'master' into experimental Conflicts: vp9/vp9_common.mk Change-Id: I2cd5ab47dc31c4210cefc23a282102123d5e2221	2013-04-02 14:54:44 -07:00
Dmitry Kovalev	626635c271	Lower case names for struct members. Lower case member names inside VP9D_CONFIG and VP9D_COMP structs. Change-Id: I75af9ad2d929a35c357207a3fd9ebedddabf79c3	2013-04-02 13:34:20 -07:00
Johann	3db60c8c6c	Demux vp9_loopfilter_x86.c Allow more careful targeting of compiler flags. Change-Id: I963ab4a6479dedb165419310dfca52a58a9877b8	2013-04-02 12:49:04 -07:00
Johann	6c147b9d93	vp9_sadmxn_x86 only contains SSE2 functions Rename the file and clean up includes. In the future we would like to pattern match the files which need additional compiler flags. Change-Id: I2c76256467f392a78dd4ccc71e6e0a580e158e56	2013-04-02 11:20:55 -07:00
John Koleszar	49bc402a94	Merge "Code cleanup." into experimental	2013-04-01 21:12:56 -07:00
John Koleszar	a417a6e32c	Merge "Removing redundant function arguments." into experimental	2013-04-01 21:09:48 -07:00
Dmitry Kovalev	e71248addc	Code cleanup in block reconstruction code. Adding recon, recond_sby and recon_sbuv functions. Change-Id: I6050db233e792e73a3699d18b056eaef9c901d6d	2013-04-01 18:26:58 -07:00
Dmitry Kovalev	50e54c112d	Code cleanup. Adding multiple16 function, removing redundant code, better formatting. Change-Id: I50195b78ac8ab803e3d05c8fb05a7ca134fab386	2013-04-01 18:23:04 -07:00
Deb Mukherjee	e3955007df	Merge "Framework changes in nzc to allow more flexibility" into experimental	2013-03-29 15:57:27 -07:00
John Koleszar	edb1222acb	Merge "Extracting common motion vector prediction code." into experimental	2013-03-29 10:43:38 -07:00
John Koleszar	2e181c2d0b	Merge "General code cleanup." into experimental	2013-03-29 10:40:34 -07:00
Yaowu Xu	4b3e59ef0e	Merge "define a specific neighborhood for SB64 mv search" into experimental	2013-03-29 09:26:14 -07:00
Yaowu Xu	cbc7ec55a5	Merge "remove code not in use" into experimental	2013-03-29 08:40:29 -07:00
Deb Mukherjee	c5840a8d8e	Merge "Reoptimizing the interpolation filters" into experimental	2013-03-29 07:15:05 -07:00
Ronald S. Bultje	6cb2fcf601	Merge "Fix mix-up in pt token indexing." into experimental	2013-03-28 12:53:00 -07:00
Deb Mukherjee	fe9b5143ba	Framework changes in nzc to allow more flexibility The patch adds the flexibility to use standard EOB based coding on smaller block sizes and nzc based coding on larger blocksizes. The tx-sizes that use nzc based coding and those that use EOB based coding are controlled by a function get_nzc_used(). By default, this function uses nzc based coding for 16x16 and 32x32 transform blocks, which seem to bridge the performance gap substantially. All sets are now lower by 0.5% to 0.7%, as opposed to ~1.8% before. Change-Id: I06abed3df57b52d241ea1f51b0d571c71e38fd0b	2013-03-28 09:33:50 -07:00
Ronald S. Bultje	9eea9fa206	Fix mix-up in pt token indexing. This fixes uninitialized reads in the trellis, and probably makes the trellis do something again. Change-Id: Ifac8dae9aa77574bde0954a71d4571c5c556df3c	2013-03-28 09:24:29 -07:00
Yaowu Xu	48104f0dfa	define a specific neighborhood for SB64 mv search Change-Id: Ifda91d697c5970c65ce3ec1feac5562124f91782	2013-03-27 16:34:45 -07:00
Dmitry Kovalev	17cddb4e26	Removing redundant function arguments. Almost all arguments for vp9_build_inter32x32_predictors_sb and vp9_build_inter64x64_predictors_sb can be deduced from the first macroblock argument. Change-Id: I5d477a607586d05698d5b3b9b9bc03891dd3fe83	2013-03-27 16:19:27 -07:00
Dmitry Kovalev	52ccff4719	Extracting common motion vector prediction code. Adding b_mv_pred_row and b_mv_pred_col functions, updating mi_mv_pred_row and mi_mv_pred_col functions. Change-Id: I9af068442d4474478375943cc6fce1605d6fc0a5	2013-03-27 14:35:36 -07:00
Dmitry Kovalev	180cd5faa5	General code cleanup. Removing redundant code, lower case variable names, better indentation, better parameter names, adding const to readonly parameters. Change-Id: Ibfdee00f60316fdc5b3f024028c7aaa76a627483	2013-03-27 14:22:30 -07:00
John Koleszar	9ba8aed179	Merge "Extract setup_frame_size and update_frame_context functions." into experimental	2013-03-27 14:21:57 -07:00
Dmitry Kovalev	8c69c193b5	Extract setup_frame_size and update_frame_context functions. Extracting setup_frame_size and update_frame_context functions. Introducing vp9_read_prob function as shortcut for (vp9_prob)vp9_read_literal(r, 8). Change-Id: Ia5c68fd725b2d1b9c5eb20f69cacb62361b5a3dd	2013-03-27 14:04:35 -07:00
Yunqing Wang	c6c0657c60	Modify idct code to use macro Small modification of idct code. Change-Id: I5c4e3223944c68e4ccf762f6cf07c990250e4290	2013-03-27 12:36:08 -07:00
Yunqing Wang	0e91bec4b5	Merge "Optimize 32x32 idct function" into experimental	2013-03-27 11:30:48 -07:00
Yunqing Wang	21a718d9a7	Optimize 32x32 idct function Wrote sse2 version of vp9_short_idct_32x32 function. Compared to c version, the sse2 version is 5X faster. Change-Id: I071ab7378358346ab4d9c6e2980f713c3c209864	2013-03-27 11:05:42 -07:00
Ronald S. Bultje	513157e093	Scatter-based scantables. This gains about 0.2% on derf, 0.1% on hd and 0.4% on stdhd. I can put this under an experimental flag if wanted, just trying to get my patch queue in shape. Change-Id: Ibe1a30fe0e0b07bec4802e0f3ff0ba22e505f576	2013-03-27 09:44:45 -07:00
Ronald S. Bultje	7c70145914	Merge "Add col/row-based coefficient scanning patterns for 1D 8x8/16x16 ADSTs." into experimental	2013-03-26 19:17:08 -07:00
Ronald S. Bultje	3c77ab4c0f	Merge "Redo banding for all transforms." into experimental	2013-03-26 19:16:44 -07:00
Ronald S. Bultje	c6efbbcfe4	Merge "Use above/left (instead of previous in scan-order) as token context." into experimental	2013-03-26 19:16:24 -07:00
Deb Mukherjee	23144d2345	Implicit weighted prediction experiment Adds an experiment to use a weighted prediction of two INTER predictors, where the weight is one of (1/4, 3/4), (3/8, 5/8), (1/2, 1/2), (5/8, 3/8) or (3/4, 1/4), and is chosen implicitly based on consistency of the predictors to the already reconstructed pixels to the top and left of the current macroblock or superblock. Currently the weighting is not applied to SPLITMV modes, which default to the usual (1/2, 1/2) weighting. However, the code is in place, controlled by a macro. The same weighting is used for Y and UV components, where the weight is derived from analyzing the Y component only. Results (over compound inter-intra experiment): derf: +0.18%, yt: +0.34%, hd: +0.49%, stdhd: +0.23%. The experiment suggests bigger benefit for explicitly signaled weights. Change-Id: I5438539ff4485c5752874cd1eb078ff14bf5235a	2013-03-26 16:58:56 -07:00
Ronald S. Bultje	d9094d8fd3	Add col/row-based coefficient scanning patterns for 1D 8x8/16x16 ADSTs. These are mostly just for experimental purposes. I saw small gains (in the 0.1% range) when playing with this on derf. Change-Id: Ib21eed477bbb46bddcd73b21c5c708a5b46abedc	2013-03-26 16:46:13 -07:00
Ronald S. Bultje	3120dbddb1	Redo banding for all transforms. Now that the first AC coefficient in both directions use the same DC as their context, there no longer is a purpose in letting both have their own band. Merging these two bands allows us to split bands for some of the very high-frequency AC bands. In addition, I'm redoing the banding for the 1D-ADST col/row scans. I don't think the old banding made any sense at all (it merged the last coefficient of the first row/col in the same band as the first two of the second row/col), which was clearly an oversight from the band being applied in scan-order (rather than in their actual position). Now, coefficients at the same position will be in the same band, regardless what scan order is used. I think this makes most sense for the purpose of banding, which is basically "predict energy for this coefficient depending on the energy of context coefficients" (i.e. pt). After full re-training, together with previous patch, derf gains about 1.2-1.3%, and hd/stdhd gain about 0.9-1.0%. Change-Id: I7a0cc12ba724e88b278034113cb4adaaebf87e0c	2013-03-26 16:46:13 -07:00
Ronald S. Bultje	790fb13215	Use above/left (instead of previous in scan-order) as token context. Pearson correlation for above or left is significantly higher than for previous-in-scan-order (absolute values depend on position in scan, but in general, we gain about 0.1-0.2 by using either above or left; using both basically just makes this even better). For eob branch skipping, we continue to use the previous token in scan order. This helps about 0.9% on derf after re-training on a limited data set. Full re-training and results on larger-resolution clips are pending. Note that this commit breaks trellis, so we can probably get further gains out of it by fixing trellis at some later point. Change-Id: Iead68e296fc3a105cca746b5e3da9555d6010cfe	2013-03-26 16:46:09 -07:00
Deb Mukherjee	57c97e2a5b	Reoptimizing the interpolation filters Reoptimizes the 8-tap smooth filter. Results: derf: +0.101%, yt: +0.157%, hd: +0.791%, stdhd: +0.264%. The next step will be to reoptimize the other two filters. Change-Id: I3d256a510ad9c7c30c33fae4a70fb43dfc708ed0	2013-03-26 16:34:35 -07:00
Yaowu Xu	43df87e841	remove code not in use Change-Id: I4fa46f10e82aca36c563f7ea829e5a3177a0c740	2013-03-26 15:27:35 -07:00
Dmitry Kovalev	d7209b3a0a	Cleaning up loopfilter code. Lower case variable names, removing redundant variables, declaration and initialization on the same line. Change-Id: Ie0c6c95b14103990eb6a9d7784f8259c662e1251	2013-03-26 11:09:58 -07:00
John Koleszar	8e1c368486	Merge "Add an in-loop deringing experiment" into experimental	2013-03-26 08:36:55 -07:00
John Koleszar	7d9a7fb297	Merge "Code cleanup." into experimental	2013-03-26 08:34:06 -07:00
John Koleszar	f0923f3b01	Merge "Code cleanup." into experimental	2013-03-26 08:30:46 -07:00
John Koleszar	441e2eab1b	Add an in-loop deringing experiment Adds a per-frame, strength adjustable, in loop deringing filter. Uses the existing vp9_post_proc_down_and_across 5 tap thresholded blur code, with a brute force search for the threshold. Results almost strictly positive on the YT HD set, either having no effect or helping PSNR in the range of 1-3% (overall average 0.8%). Results more mixed for the CIF set, (-0.5 min, 1.4 max, 0.1 avg). This has an almost strictly negative impact to SSIM, so examining a different filter or a more balanced search heuristic is in order. Other test set results pending. Change-Id: I5ca6ee8fe292dfa3f2eab7f65332423fa1710b58	2013-03-26 08:23:24 -07:00
Deb Mukherjee	49dcc71493	Merge "Modeling default coef probs with distribution" into experimental	2013-03-26 07:13:13 -07:00
Deb Mukherjee	fd18d5dffe	Modeling default coef probs with distribution Replaces the default tables for single coefficient magnitudes with those obtained from an appropriate distribution. The EOB node is left unchanged. The model is represented as a 256-size codebook where the index corresponds to the probability of the Zero or the One node. Two variations are implemented corresponding to whether the Zero node or the One-node is used as the peg. The main advantage is that the default prob tables will become considerably smaller and more manageable. Besides, there is substantially less risk of over-fitting for a training set. Various distributions are tried and the one that gives the best results is the family of Generalized Gaussian distributions with shape parameter 0.75. The results are within about 0.2% of fully trained tables for the Zero peg variant, and within 0.1% of the One peg variant. The forward updates are optionally (controlled by a macro) model-based, i.e. restricted to only convey probabilities from the codebook. Backward updates can also be optionally (controlled by another macro) model-based, but this is turned off by default. Currently model-based forward updates work about the same as unconstrained updates, but there is a drop in performance with backward-updates being model based. The model based approach also allows the probabilities for the key frames to be adjusted from the defaults based on the base_qindex of the frame. Currently the adjustment function is a placeholder that adjusts the prob of EOB and Zero node from the nominal one at higher quality (lower qindex) or lower quality (higher qindex) ends of the range. The rest of the probabilities are then derived based on the model from the adjusted prob of zero. Change-Id: Iae050f3cbcc6d8b3f204e8dc395ae47b3b2192c9	2013-03-25 23:43:38 -07:00
Dmitry Kovalev	3644a5b632	Code cleanup. Fixing function arguments alignment, reusing MIN/MAX and clamp functions. Change-Id: I87dd5a40ffb65b521b8abbf0fccf2f50552c5309	2013-03-25 15:16:14 -07:00
Dmitry Kovalev	7cc14e598e	Code cleanup. Lower case variable names, code simplification by using already defined clamp and read_le16 functions. Change-Id: I8fd544365bd8d1daed86d7b2ae0843e4ef80df08	2013-03-25 14:24:26 -07:00
Yunqing Wang	f68350ca98	Merge "Optimize 16x16 idct10 function" into experimental	2013-03-22 11:17:32 -07:00
Paul Wilkins	52abaeca85	Merge "Remove TX size segment feature" into experimental	2013-03-22 10:39:22 -07:00
Yunqing Wang	869d6c0534	Optimize 16x16 idct10 function Wrote sse2 version of vp9_short_idct10_16x16 function. Compared to c version, the sse2 version is 2.3X faster. Change-Id: I314c4f09369648721798321eeed6f58e38857f26	2013-03-21 16:36:01 -07:00
Yunqing Wang	8a3233b54d	Merge "Optimize 16x16 idct function" into experimental	2013-03-21 11:54:20 -07:00
Yunqing Wang	ec3100661c	Optimize 16x16 idct function Wrote sse2 version of vp9_short_idct16x16 function. Compared to c version, the sse2 version is over 2.5X faster. Change-Id: I38536e2b846427a2cc5c5423aaf305fd0e605d61	2013-03-21 11:44:05 -07:00
Dmitry Kovalev	56f3a2c663	Code cleanup: lower case variable names. Renaming Width to width, Height to height and Version to version in several structs and function signatures. Change-Id: I084c3f7e747cb2ce3345aff27a3dff9b13a87543	2013-03-20 16:41:30 -07:00
Paul Wilkins	1c75e77b6d	Remove TX size segment feature Change-Id: I0d226e4cb240caced37230f46905bf69b46e0cce	2013-03-19 17:31:08 +00:00
Yunqing Wang	6344c84c82	Optimize 8x8 idct function Wrote sse2 functions of vp9_short_idct8x8 and vp9_short_idct10_8x8. Compared to c version, the sse2 version is 2X faster. The decoder test didn't show noticeable gain since 8x8 idct doesn't take much of decoding time (less than 1% in my test). Change-Id: I56313e18cd481700b3b52c4eda5ca204ca6365f3	2013-03-18 15:34:14 -07:00
John Koleszar	8a3f55f2d4	Replace scaling byte with explicit display size If the intended display size is different than the size the frame is coded at, then send that size explicitly in the bitstream. Adds a new bit to the frame header to indicate whether the extra size fields are present. Change-Id: I525c66f22d207efaf1e5f903c6a2a91b80245854	2013-03-18 12:02:20 -07:00
John Koleszar	c5b317057b	Merge "Fix pulsing issue with scaling" into experimental	2013-03-18 11:57:36 -07:00
John Koleszar	e5d7542447	Merge "Add VP9_GET_REFERENCE control" into experimental	2013-03-18 11:57:31 -07:00
Yaowu Xu	d29f5435df	Merge "put refmvselection under experiment" into experimental	2013-03-18 08:51:33 -07:00
Yaowu Xu	12ade55719	Merge "removed reference to "LLM" and "x8"" into experimental	2013-03-18 08:51:19 -07:00
Deb Mukherjee	bf7387f6b7	Merge "Context-pred fix to not use top/left on edges" into experimental	2013-03-16 19:09:25 -07:00
Deb Mukherjee	b1921b2f08	Context-pred fix to not use top/left on edges This fix resolves some of the mismatch issues being seen recently. While this is the right thing to do when tiling is used for this experiment, it is not the underlying cause of the mismatches. Something else is causing writing outside of the allowable frame area in the encoder leading to this mismatch. Change-Id: If52c6f67555aa18ab8762865384e323b47237277	2013-03-16 09:26:52 -07:00
Christian Duvivier	4418b790a7	Faster vp9_short_fdct16x16. Scalar path is about 1.5x faster (3.1% overall encoder speedup). SSE2 path is about 7.2x faster (7.8% overall encoder speedup). Change-Id: I06da5ad0cdae2488431eabf002b0d898d66d8289	2013-03-15 15:55:31 -07:00
Yaowu Xu	5d9ba7938e	Merge "Remove leftover reference to 2nd order dc/ac quant" into experimental	2013-03-14 19:05:11 -07:00
Yaowu Xu	f4d2ad6915	Remove leftover reference to 2nd order dc/ac quant Change-Id: Ib8dacf1d2797743569771b8f699e40e1aeb085cb	2013-03-14 10:46:15 -07:00
John Koleszar	9b7be88883	Fix pulsing issue with scaling Updates the YV12_BUFFER_CONFIG structure to be crop-aware. The existing width/height parameters are left unchanged, storing the width and height aligned to a 16 byte boundary. The cropped dimensions are added as new fields. This fixes a nasty visual pulse when switching between scaled and unscaled frame dimensions due to a mismatch between the scaling ratio and the 16-byte aligned sizes. Change-Id: Id4a3f6aea6b9b9ae38bdfa1b87b7eb2cfcdd57b6	2013-03-13 19:10:10 -07:00
John Koleszar	b3c350a1a9	Add VP9_GET_REFERENCE control This is like VP8_COPY_REFERENCE, but returns a pointer to the reference frame rather than a copy of it. This is useful when the application doesn't know what the size of the reference is, as is the case when scaling is in effect. Change-Id: I63667109f65510364d0e397ebe56217140772085	2013-03-13 19:08:06 -07:00
Jingning Han	76c12ab9c9	Support +/-2048 motion vector coding Enable entropy coding of motion vectors up to +/-2048. Also extend the motion search range accordingly. Change-Id: Iac2bb015e8934521cef83a19edbe967d9f097436	2013-03-13 14:08:27 -07:00
Yaowu Xu	88862c0454	put refmvselection under experiment and turn the experiment off by default. Change-Id: If9e684aa6cc49eacd39f36645a110a447e38d2de	2013-03-13 10:40:31 -07:00
Yaowu Xu	005552639b	removed reference to "LLM" and "x8" The commit changed the name of files and function to remove obsolete reference to LLM and x8. Change-Id: I973b20fc1a55149ed68b5408b3874768e6f88516	2013-03-13 08:35:46 -07:00
Ronald S. Bultje	8fc3ab7c62	Merge "Fix typo in comment for number of extra bits for cat6 tokens." into experimental	2013-03-12 10:45:12 -07:00
Ronald S. Bultje	516f7ac04e	Fix typo in comment for number of extra bits for cat6 tokens. Change-Id: I07ddf3be8bc5d6c2eb561d4241879777c315b183	2013-03-12 10:25:43 -07:00
John Koleszar	045c53f51e	fix an assumption about uv_stride Use the uv_stride from the framebuffer rather than deriving it from the y_stride. Change-Id: I94581cb741539d094ff062b3d008235556903b8c	2013-03-12 09:22:44 -07:00
Dmitry Kovalev	2891d70b23	Code cleanup. Removing redundant code, introducing new functions for better decomposition, adding 'clamp' function to vp9_common.h. Change-Id: Ic3b8ca13bbc38f60f0c9c43910b5802005e31aaf	2013-03-11 17:02:27 -07:00
John Koleszar	9b4095c537	Fix vp9_tree_probs_from_distribution with CONFIG_CODE_NONZEROCOUNT The automatic merge result was incomplete. Change-Id: I8976318bfc346d867660a013a302c80edb25fc29	2013-03-11 11:03:36 -07:00
John Koleszar	52fc4f8a78	Merge "Simplify vp9_adapt_nmv_probs" into experimental	2013-03-11 09:57:53 -07:00
John Koleszar	ee4649ded2	Simplify vp9_adapt_nmv_probs Remove the temporary branch count arrays and build the adapted probabilities while walking the tree. Gives an additional 1.5% or so on CIF. Change-Id: I875d61e5e0ec778e5d2f7f9d0837b989a91cf3a3	2013-03-11 09:44:22 -07:00
Deb Mukherjee	fad43d4249	Merge "Minor optimization in mv entropy adaptation" into experimental	2013-03-11 09:43:54 -07:00
John Koleszar	e6257342b1	Merge "Optimize vp9_tree_probs_from_distribution" into experimental	2013-03-11 09:32:11 -07:00
Deb Mukherjee	f74c55eb03	Minor optimization in mv entropy adaptation Adds a check to exit from the increment_nmv_count function when the increment is 0. Change-Id: I99c1e342d351f7800e23590f9c2419881bf1d708	2013-03-11 08:49:14 -07:00
John Koleszar	bd84685f78	Optimize vp9_tree_probs_from_distribution The previous implementation visited each node in the tree multiple times because it used each symbol's encoding to revisit the branches taken and increment its count. Instead, we can traverse the tree depth first and calculate the probabilities and branch counts as we walk back up. The complexity goes from somewhere between O(nlogn) and O(n^2) (depending on how balanced the tree is) to O(n). Only tested one clip (256kbps, CIF), saw 13% decoding perf improvement. Note that this optimization should port trivially to VP8 as well. In VP8, the decoder doesn't use this function, but it does routinely show up on the profile for realtime encoding. Change-Id: I4f2848e4f41dc9a7694f73f3e75034bce08d1b12	2013-03-10 13:39:30 -07:00
Deb Mukherjee	a28139c849	Continued experiment with nonzero count Adds probability updates for extra bits for the nzcs, code for getting nzc stats, plus some minor cleanups and fixes. Change-Id: If2814e7f04fb52f5025ad9f400f3e6c50a00b543	2013-03-08 16:37:08 -08:00
Yunqing Wang	cb7acbc0e1	Merge "Add vp9_idct4_1d_sse2" into experimental	2013-03-08 15:14:02 -08:00
Yunqing Wang	11ca81f8b6	Add vp9_idct4_1d_sse2 Added SSE2 idct4_1d which is called by vp9_short_iht4x4. Also, modified the parameter type passed to vp9_short_iht functions to make it work with rtcd prototype. Change-Id: I81ba7cb4db6738f1923383b52a06deb760923ffe	2013-03-08 15:04:22 -08:00
Dmitry Kovalev	3edbc77ae3	Merge "Consistent usage of ROUND_POWER_OF_TWO macro." into experimental	2013-03-08 11:35:22 -08:00
Yunqing Wang	2e0553227e	Merge "Optimize add_constant_residual function" into experimental	2013-03-08 10:18:52 -08:00
Jingning Han	2a5278bdbd	Extend diff MV limit from +/-256 to +/-1024 Increase the motion search range by 4x. Change MV_CLASS tree of the entropy coding to allow two additional mv classes to cover the extended motion vector limit. The codec determines the effective motion search range conditioned on the actual frame dimension. It provides coding gains: stdhd 0.39% yt 0.56% hd 0.47% Major coding performance gains are packed in several sequences with intense motion activities, e.g., ped_1080p gains 7% at high bit-rates, and on average 3%. TODO: Need to further tune the rate control and motion search units. Change-Id: Ib842540a6796fbee5a797809433ef6a477c6d78d	2013-03-08 10:04:36 -08:00
Yunqing Wang	f240782650	Optimize add_constant_residual function Optimized adding constant diff to predictor, which gave about 2% decoder performance gain. Change-Id: I47db20c31428e8c4a8f16214a85cbe386a6e9303	2013-03-07 15:49:07 -08:00
Dmitry Kovalev	3603dfb62c	Consistent usage of ROUND_POWER_OF_TWO macro. Change-Id: I44660975e9985310d8c654c158ee7a61291b5a08	2013-03-07 12:24:35 -08:00
Ronald S. Bultje	89e4ce20d0	Update ADST selection if tx_size < block_size. Change-Id: Ic9b336486774c95ffbb92adcb110cc0fc2a83cc5	2013-03-07 11:19:15 -08:00
Ronald S. Bultje	d3724abe9f	Re-add support for ADST in superblocks. This also changes the RD search to take account of the correct block index when searching (this is required for ADST positioning to work correctly in combination with tx_select). Change-Id: Ie50d05b3a024a64ecd0b376887aa38ac5f7b6af6	2013-03-07 11:19:10 -08:00
Deb Mukherjee	eb6ef2417f	Coding non-zero count rather than EOB for coeffs This patch revamps the entropy coding of coefficients to code first a non-zero count per coded block and correspondingly remove the EOB token from the token set. STATUS: Main encode/decode code achieving encode/decode sync - done. Forward and backward probability updates to the nzcs - done. Rd costing updates for nzcs - done. Note: The dynamic programming approach used in trellis quantization is not exactly compatible with nzcs. A suboptimal approach has been used instead where branch costs are updated to account for changes in the nzcs. TODO: Training the default probs/counts for nzcs Change-Id: I951bc1e22f47885077a7453a09b0493daa77883d	2013-03-07 07:20:30 -08:00
Dmitry Kovalev	a9961fa819	Merge "Code cleanup." into experimental	2013-03-06 16:57:34 -08:00
Yunqing Wang	f4e383f3d1	Merge "Optimize add_residual function" into experimental	2013-03-05 16:47:58 -08:00
Yunqing Wang	943c6d7172	Optimize add_residual function Optimized adding diff to predictor, which gave 0.8% decoder performance gain. Change-Id: Ic920f0baa8cbd13a73fa77b7f9da83b58749f0f8	2013-03-05 16:27:45 -08:00
Dmitry Kovalev	7f99c3c59a	Code cleanup. Removing redundant 'extern' keywords, fixing formatting and #include order, code simplification. Change-Id: I0e5fdc8009010f3f885f13b5d76859b9da511758	2013-03-05 14:12:16 -08:00
Ronald S. Bultje	4209bba462	Merge changes Ifacbf5a0,Ibad7c3dd into experimental * changes: vpxenc: actually report mismatch on stderr. Make superblocks independent of macroblock code and data.	2013-03-05 11:17:14 -08:00
Dmitry Kovalev	764be4f66f	Merge "Code cleanup and simplification of build_4x4uvmvs function." into experimental	2013-03-04 16:57:30 -08:00
Ronald S. Bultje	111ca42133	Make superblocks independent of macroblock code and data. Split macroblock and superblock tokenization and detokenization functions and coefficient-related data structs so that the bitstream layout and related code of superblock coefficients looks less like it's a hack to fit macroblocks in superblocks. In addition, unify chroma transform size selection from luma transform size (i.e. always use the same size, as long as it fits the predictor); in practice, this means 32x32 and 64x64 superblocks using the 16x16 luma transform will now use the 16x16 (instead of the 8x8) chroma transform, and 64x64 superblocks using the 32x32 luma transform will now use the 32x32 (instead of the 16x16) chroma transform. Lastly, add a trellis optimize function for 32x32 transform blocks. HD gains about 0.3%, STDHD about 0.15% and derf about 0.1%. There's a few negative points here and there that I might want to analyze a little closer. Change-Id: Ibad7c3ddfe1acfc52771dfc27c03e9783e054430	2013-03-04 16:34:36 -08:00
Yunqing Wang	37932d9168	Merge "Optimize vp9_short_idct4x4llm function" into experimental	2013-03-04 14:13:31 -08:00
Yunqing Wang	e8bc9f4220	Optimize vp9_short_idct4x4llm function Wrote a SSE2 vp9_short_idct4x4llm to improve the decoder performance. Change-Id: I90b9d48c4bf37aaf47995bffe7e584e6d4a2c000	2013-03-04 12:01:27 -08:00
Jingning Han	5957b2b514	Support 16K sequence coding Fixed a couple of variable/function definitions, as well as header handling to support 16K sequence coding at high bit-rates. The width and height are each specified by two bytes in the header. Use an extra byte to explicitly indicate the scaling factors in both directions, each ranging from 0 to 15. Tested coding up to 16400x16400 dimension. Change-Id: Ibc2225c6036620270f2c0cf5172d1760aaec10ec	2013-03-04 11:08:41 -08:00
John Koleszar	1cfc86ebe0	Add unit test for x4 multi-SAD functions Update the function prototypes to match between VP9 and VP8. Change-Id: If58965073989e87df3b62b67a030ec6ce23ca04f	2013-03-01 18:14:02 -08:00
Dmitry Kovalev	b5a9795d25	Code cleanup and simplification of build_4x4uvmvs function. Change-Id: Iab0176f058045181821ded95ff1cf423af1625f9	2013-03-01 17:50:55 -08:00
John Koleszar	69c67c9531	Merge master branch into experimental Picks up some build system changes, compiler warning fixes, etc. Change-Id: I2712f99e653502818a101a72696ad54018152d4e	2013-03-01 11:06:05 -08:00
Yunqing Wang	67dbc8fe55	Merge "Add eob<=10 case in idct32x32" into experimental	2013-03-01 08:58:19 -08:00
Yunqing Wang	c550bb3b09	Add eob<=10 case in idct32x32 Simplified idct32x32 calculation when there are only 10 or fewer non-zero coefficients in 32x32 block. This helps the decoder performance. Change-Id: If7f8893d27b64a9892b4b2621a37fdf4ac0c2a6d	2013-02-28 16:40:29 -08:00
John Koleszar	17c221687f	Merge "Fix use of uninitialized memory in CONFIG_ABOVESPREFMV" into experimental	2013-02-28 15:18:50 -08:00
Yunqing Wang	72b146690a	Merge "Refactor vp9_dequant_idct_add function" into experimental	2013-02-28 14:34:27 -08:00
Yunqing Wang	6193bc3ba8	Refactor vp9_dequant_idct_add function Provided a wrapper and removed duplicate code. Change-Id: Iaef842226ec348422e459202793b001d0983ea30	2013-02-28 14:18:46 -08:00
Scott LaVarnway	aa8fb070b8	Removed vp9_dequantize_b Change-Id: Ie89bd00d58e30bf4094cb748a282f1dfa81a31d8	2013-02-28 14:08:12 -08:00
John Koleszar	2eab4372fc	Fix use of uninitialized memory in CONFIG_ABOVESPREFMV The ABOVESPREFMV experiment uses four pixels to the left of the current block, which don't exist for the left-most column. Change-Id: I4cf0b42ae8f54c0b3e7b1ed8755704b74fafc39c	2013-02-28 13:48:58 -08:00
Jim Bankoski	714aa9f3c0	this commit converts all sad ptrs to uint32 sse4_1 code used uint16_t for returning sad, but that won't work for 32x32 or 64x64. This code fixes the assembly for those and also reenables sse4_1 on linux Change-Id: I5ce7288d581db870a148e5f7c5092826f59edd81	2013-02-28 08:46:35 -08:00
Christian Duvivier	c129203f7e	Faster vp9_short_fdct8x8. Scalar path is about 1.4x faster (4% overall encoder speedup). SSE2 path is about 7x faster (13% overall encoder speedup). Change-Id: I7e85d8225a914a74c61ea370210414696560094d	2013-02-27 17:23:08 -08:00
Dmitry Kovalev	347f3a0aa8	Code cleanup. Fixing code style, using array lookup instead of switch statements for forward hybrid transforms (in the same way as for their inverses). Consistent usage of ROUND_POWER_OF_TWO macro in appropriate places. Change-Id: I0d3822ae11f928905fdbfbe4158f91d97c71015f	2013-02-27 13:51:04 -08:00
John Koleszar	5ac141187a	Merge "Remove unused vp9_copy32xn" into experimental	2013-02-27 12:23:45 -08:00
Yunqing Wang	d6ff6fe2ed	Merge "Remove unused file" into experimental	2013-02-27 11:58:29 -08:00
Ronald S. Bultje	90932399b4	Merge "Move eob from BLOCKD to MACROBLOCKD." into experimental	2013-02-27 11:39:16 -08:00


630 Commits