generic-library/vpx

Author	SHA1	Message	Date
Yaowu Xu	9c44ce9f4b	Merge "Loopfilter: use the current block only for skip" into experimental	2013-06-09 21:17:54 -07:00
Yaowu Xu	2e1fd0a497	Merge "Modified loop filter edge skipping" into experimental	2013-06-09 21:17:47 -07:00
John Koleszar	140ac34e57	Loopfilter: Always filter intra edges Change-Id: Ifb1ce2bd52147981ca1aec9ec6cfea8738a23e45	2013-06-09 09:02:47 -07:00
Ronald S. Bultje	c3f9b070ca	Merge "New comp_inter defaults." into experimental	2013-06-09 06:40:02 -07:00
Ronald S. Bultje	3993d30922	Merge "Fix firstpass if framesize is not a multiple of 16." into experimental	2013-06-08 17:40:17 -07:00
Ronald S. Bultje	d30968c32a	Merge "New default tables" into experimental	2013-06-08 17:39:50 -07:00
Ronald S. Bultje	20760254f6	Merge "Align frame size to 8 instead of 16." into experimental	2013-06-08 17:39:41 -07:00
Ronald S. Bultje	99e10253b0	New comp_inter defaults. It seems like I inverted the meaning of the contexts by accident? Change-Id: Iafb2346d9933930949578342b84519b719dd5dd3	2013-06-08 15:13:57 -07:00
Ronald S. Bultje	073c7d5eec	Fix firstpass if framesize is not a multiple of 16. Change-Id: Iec41736c2b6140715f90f40de5ae6cf52497a9b8	2013-06-08 13:32:05 -07:00
Ronald S. Bultje	b64be43998	New default tables Change-Id: Ice8c73a2a843113877b8f8ed78737a1442c25ced	2013-06-08 13:29:14 -07:00
Deb Mukherjee	17da2cab78	TX_SIZE contexts simplification. Reduces TX_SIZE contexts to 2 for each kind. The code is cleaner and there is hardly any performance difference with more than two contexts. Results: almost neutral Change-Id: I17656bd6db76224ae2856adf882504560e7dbaa4	2013-06-08 12:32:26 -07:00
Deb Mukherjee	67cb1f093c	Minor fix in TX_SIZE contexts Change-Id: I9e81f84877e18ba7e55d66389ed60e64a5b7abcc	2013-06-08 07:14:58 -07:00
Yaowu Xu	b7da6d0c5a	Merge "Handle partition type coding of boundary blocks" into experimental	2013-06-07 18:16:16 -07:00
John Koleszar	f7e4b72df8	Loopfilter: use the current block only for skip Use the current block's skip flag to determine edge skipping. Change-Id: I4ba81f899286afbc3f6bb83eba2ef146a01b6fa4	2013-06-07 17:48:57 -07:00
Ronald S. Bultje	71701f3d40	Align frame size to 8 instead of 16. Change-Id: Ic606ef1b31e49963a779455a1e010a9ebb0f3f1f	2013-06-07 17:20:50 -07:00
Adrian Grange	07a5777bde	Frame header changes to support intra_only frames Made changes to the frame header to write the sync code in the frame header for a non-displayable, intra-only frame. Extended reset_frame_context to 2-bits. (Submitting on behalf of Dmitri) Change-Id: Ie836ae0df9ed572fb4f08aabe9351a555c4f3b96	2013-06-07 16:19:34 -07:00
Deb Mukherjee	21401942b0	Coding tx-size selection by use of spatial context Adds coding of transform size within a frame by use of context of transform sizes selected in left and above blocks. Also incorporates code for generating stats. TODO: generate and incorporate new default stats Change-Id: I6a7af099f6ad61d448521d9a51167aedaf638ed6	2013-06-07 16:07:58 -07:00
Deb Mukherjee	869a39ba60	Cleans up mbskip encoding Refactors mbskip coding to be compatible with coding of the rest of the symbols. Adds forward/backward adaptation and removes a lot of the legacy code. Results: fast50: +1.6% derfraw300: +0.317% Change-Id: I395a2976d15af044d3b8ded5acfa45f6f065f980	2013-06-07 16:00:26 -07:00
Jingning Han	78b8190cc7	Handle partition type coding of boundary blocks The partition types of blocks sitting on the frame boundary are constrained by the block size and the position of each sub-block relative to the frame. Hence we use truncated probability models to handle the coding of such information. 100 frames run: yt 0.138% Change-Id: I85d9b45665c15280069c0234ea6f778af586d87d	2013-06-07 14:19:40 -07:00
Ronald S. Bultje	28164eb962	Fix segment feature data size. Change-Id: I4331cfd99a717938f4f970cad81c468cbf287b00	2013-06-07 13:57:28 -07:00
Ronald S. Bultje	fb1f6f1db4	Fix segment feature data type. It has a range of -255,255, so should be int16_t, not int8_t. Change-Id: I5ef4b6aefb6212b0f35f4754f3c4d73fddbc52a0	2013-06-07 13:57:27 -07:00
Ronald S. Bultje	363dc6ceda	Don't crash if motion vector ref points to out-of-bounds area. This can only happen if partition is partly out-of-frame, in which case the referenced mv is either out-of-frame also (and thus has the same value as an already-read one), or it is actually uninitialized, in which case we don't want to use it. Change-Id: Icf39fa4d987c7abcbebb9bbdcdd6311e8fb9d3c9	2013-06-07 13:57:27 -07:00
Paul Wilkins	340c7a48e6	Change to segment ref frame feature. Simplify feature to only support a single reference frame instead of a mask. Change-Id: I5dd3a98c7a224aafb35708850ab82e2f220e68fb	2013-06-07 21:42:22 +01:00
Yaowu Xu	0bb6da3668	Merge "Remove two un-used entries in mode_lf_delta[]" into experimental	2013-06-07 10:10:45 -07:00
Yaowu Xu	254f46bc5b	Merge "Specify mv neighborhood for block larger than 8x8" into experimental	2013-06-07 10:09:35 -07:00
Yaowu Xu	b097a3ba82	Remove two un-used entries in mode_lf_delta[] With the removal of i4X4 and SPLIT_MV modes, the two entries for the modes are no longer used. This patch remove the coding of the deltas. Change-Id: Iea4eb500404ebe9706159380a03b8eca542fb4c3	2013-06-07 09:24:09 -07:00
Deb Mukherjee	78fbaf4d84	Merge "Coding updates for tx-size selection" into experimental	2013-06-07 09:19:36 -07:00
Ronald S. Bultje	def6bc765c	Merge "Revert "Align frame size to 8 instead of 16."" into experimental	2013-06-07 09:01:33 -07:00
Yaowu Xu	8b3ad75266	Specify mv neighborhood for block larger than 8x8 The new neighorbhood adapts to the shape and size of the block type cif +.16% stdhd +.13% Change-Id: I978db58278e9ae3fbd6726ef831bdfc5f5f37d02	2013-06-07 08:59:48 -07:00
Ronald S. Bultje	e7d306aae6	Revert "Align frame size to 8 instead of 16." This reverts commit `c2574414d4` Change-Id: Ie9013cb0bb43e639e01b4588f630b1da59295d38	2013-06-07 08:59:27 -07:00
Deb Mukherjee	3ee1a21a42	Coding updates for tx-size selection Changes to the coding of transform sizes, along with forward and backward probability updates. Results: derf300: +0.241% Context based coding of transform sizes will be in a separate patch. Change-Id: I97241d60a926f014fee2de21fa4446ca56495756	2013-06-07 08:54:00 -07:00
Janne Salonen	5c5223860a	Modified loop filter edge skipping Added condition to not to skip filtering of transform block edges when the edge is also a decoding block edge. Change-Id: Iaccb6206c4202b78e5dca3b89379556e0f4aba0c	2013-06-07 06:36:22 -07:00
Paul Wilkins	576c2bb021	Fix bug in segment skip. Wrong max data size (skip has no data) and use of vp9_get_segdata() when it should be vp9_segfeature_active(). Change-Id: I1eb97d33df6e2a42cc589049f704266fe3639902	2013-06-07 13:27:08 +01:00
Yaowu Xu	4df9e7883c	Merge "Removed rectangular intra prediction code" into experimental	2013-06-06 22:58:07 -07:00
Yaowu Xu	472669befb	Fix a merge conflict ref_frame in MB_Mode_Info was changed in the ref frame coding patch to be an array to handle first and second reference frame, this patch fix the loop filter code that use the pointer directly as reference frame. Change-Id: I71afa5a49deb50c1bc38029fd07470b984c6dfe9	2013-06-06 22:10:07 -07:00
Yaowu Xu	9470c1a2a1	Removed rectangular intra prediction code As all intra predictions happen on squared transform block now. Change-Id: I7ec91e3f0ad01383a03d2bd3099bbf32e87e3466	2013-06-06 21:35:10 -07:00
Jim Bankoski	fa9db8da15	Merge "Fix FIXME." into experimental	2013-06-06 20:50:51 -07:00
Jim Bankoski	686f437264	Merge "Align frame size to 8 instead of 16." into experimental	2013-06-06 20:49:59 -07:00
John Koleszar	736c7b804a	Merge "Reimplementation of loop filter" into experimental	2013-06-06 17:34:26 -07:00
Ronald S. Bultje	c2574414d4	Align frame size to 8 instead of 16. Change-Id: Ic22f416a33de558519d5c30a929f6a954546ade9	2013-06-06 17:28:11 -07:00
Ronald S. Bultje	bc41af00cf	Fix FIXME. Change-Id: I47a9857d35da1bff6153f8090c6b98b689b31a61	2013-06-06 17:28:11 -07:00
Ronald S. Bultje	6ef805eb9d	Change ref frame coding. Code intra/inter, then comp/single, then the ref frame selection. Use contextualization for all steps. Don't code two past frames in comp pred mode. Change-Id: I4639a78cd5cccb283023265dbcc07898c3e7cf95	2013-06-06 17:28:09 -07:00
Ronald S. Bultje	ad34368786	New intra mode and partitioning probabilities. Split partition probabilities between keyframes and non-keyframes, since they are fairly different. Also have per-blocksize interframe y intramode probabilities, since these vary heavily between different blocksizes. Lastly, replace default probabilities for partitioning and intra modes with new ones generated from current codec. Replace counts with actual probabilities also. Change-Id: I77ca996e25e4a28e03bdbc542f27a3e64ca1234f	2013-06-06 10:45:30 -07:00
John Koleszar	043d348aae	Reimplementation of loop filter This version of the loop filter supports non-4:2:0 subsampling and a fourth plane, as well as changing the filtering order to be more friendly to hardware implementations. The filters are applied first to all vertical edges within the 64x64 SB, followed by the top horizontal edge and any internal horizontal edges. Since filtering is applied on each 4x4 edge serially, a dependency is created from filtering one block edge to the next. It would be possible to remove this depencnecy by building all filtering decisions from the unfiltered reconstruction data. Change-Id: I08f3e9683eb7bded8a76651cbc50fc0dfdd05fa7	2013-06-06 08:45:45 -07:00
Jim Bankoski	5a88271b09	don't tokenize & encode tokens for blocks in UMV This avoids encoding tokens for blocks that are entirely in the UMV border. This changes the bitstream. Change-Id: I32b4df46ac8a990d0c37cee92fd34f8ddd4fb6c9	2013-06-06 06:10:25 -07:00
Dmitry Kovalev	28d31aed7f	Merge "Moving bits from compressed header to uncompressed one." into experimental	2013-06-06 01:15:44 -07:00
Jingning Han	61e6586230	Merge "Fix UV intra coding rd loop" into experimental	2013-06-05 21:47:00 -07:00
Jingning Han	f04b15486a	Fix UV intra coding rd loop This commit makes the coding/reconstruction operations of intra coding rate-distortion loop for UV components consistent with those of the encoding process. key frame coding gains: derf: 0.11% stdhd: 0.42% Change-Id: I8d49f83924a320e3689ef2d60096c49d7f0c7a40	2013-06-05 21:18:02 -07:00
Dmitry Kovalev	12345cb391	Moving bits from compressed header to uncompressed one. Bits moved: refresh_frame_flags, active_ref_idx[], ref_frame_sign_bias[], allow_high_precision_mv, mcomp_filter_type, ref_pred_probs[]. Derf results: +0.040% Change-Id: I011f43c7eac0371d533b255fd99aee5ed75b85a5	2013-06-05 20:56:37 -07:00
Deb Mukherjee	30226a658f	Cosmetic renaming VP9_MVREFS to VP9_INTER_MODES NO bitstream change Change-Id: I79f6146dac5fdd157051b6f8dc611c0b7b5e5f7f	2013-06-05 11:24:01 -07:00
Deb Mukherjee	83885235a7	Clean-ups on switchable interpolation and mv_ref Adds backward adaptation and differential forward updates of switchable interpolation filter probabilities. Also adds some cosmetic cleanups and minor fixes on mv_ref probabilities. derfraw300: +0.353% (with most coming from switchable interp changes) Change-Id: Ie2718be73528c945fd0d80cfd63ca2d9cb3032de	2013-06-05 10:11:52 -07:00
Yaowu Xu	0449ee0fec	Fix a off-by-one bug in the calculation of maximum number of tiles in log2 scale. Change-Id: Id283d6e51a8b926015fd3fc631cdbfb4b8268d4a	2013-06-03 14:25:28 -07:00
Paul Wilkins	6dd3a6320e	Merge "Replace scatter scan 32x32 with HW friendly scan." into experimental	2013-06-03 02:42:37 -07:00
Paul Wilkins	3f380d5252	Merge "vp9_find_mv_refs_idx change for last frame." into experimental	2013-06-03 02:34:46 -07:00
Dmitry Kovalev	317d832d38	Merge "Adding plane_block_width and plane_block_height functions." into experimental	2013-05-31 15:28:45 -07:00
Dmitry Kovalev	d771bba27e	Renaming 'motion_vector' to 'mv' for consistency. Change-Id: Ie869ea4992e26867caec46cb878fc86a646aeb9f	2013-05-31 12:32:53 -07:00
Dmitry Kovalev	120a878199	Adding plane_block_width and plane_block_height functions. Change-Id: I02c17fb733c0f3c22dc3167c3d3182797415f1ae	2013-05-31 12:31:49 -07:00
Ronald S. Bultje	a288cb3b10	Merge "Merge all various transform size data trackers into single variables." into experimental	2013-05-31 09:59:24 -07:00
Scott LaVarnway	1e025dbfd1	Merge "Moved use_prev_in_find_mv_refs check to frame level" into experimental	2013-05-31 09:35:51 -07:00
Ronald S. Bultje	e9d68a5e36	Merge all various transform size data trackers into single variables. Change-Id: I2dfc569106b29fbe4da20585a0e85e5e9ea6a4db	2013-05-31 09:18:59 -07:00
Paul Wilkins	cf61fae8ee	vp9_find_mv_refs_idx change for last frame. Restrict get_matching_candidate() to considering mvs at 8x8 and larger sizes for last frame case. This is to reduce the HW load of using vectors down to the 4x4 level from the previous frame. Change-Id: I6505e610fd63a4e22d67f136aec7905a01b893ba	2013-05-31 15:37:27 +01:00
Sami Pietila	0835a35347	Fix inter mode context adaptation. Change-Id: Ibaa47be878c1cd84d88d7518418d2d8d38224e70	2013-05-31 12:58:31 +03:00
Paul Wilkins	aaf61dfbca	Merge "Patch to remove implicit segmentation." into experimental	2013-05-31 02:56:20 -07:00
Yaowu Xu	7ca651a383	Merge "Changed to use a new variant of WHT" into experimental	2013-05-30 21:53:12 -07:00
Ronald S. Bultje	a4e7c6bd4d	Merge "Remove unused define." into experimental	2013-05-30 20:58:22 -07:00
Ronald S. Bultje	310bc1030a	Merge "Merge VP9_YMODES, VP9_UV_MODES, INTRA_MODE_COUNT and cousins." into experimental	2013-05-30 20:58:19 -07:00
Ronald S. Bultje	7d549870f7	Merge "Remove TX_SIZE_MAX_MB." into experimental	2013-05-30 20:58:16 -07:00
Ronald S. Bultje	6ea6f4d253	Merge "Remove one (unused) entry from mvref tables." into experimental	2013-05-30 20:58:13 -07:00
Jim Bankoski	21595f8e38	Merge "Creates a new speed 1:" into experimental	2013-05-30 20:36:05 -07:00
Jim Bankoski	ced21bd6a6	Creates a new speed 1: This speed 1 - uses variance threshold stolen from static-thresh to determine split. Any superblock with greater than the variance set by static thresh * quantizer index squared is split. In addition transform size is set to largest size less than or equal to partition size, sub pixel filter is set to normal, and only 12 modes are used at all. Change-Id: If7a2858ee70f96d1eb989c04fd87a332b147abef	2013-05-30 19:53:00 -07:00
Ronald S. Bultje	16482bddf7	Merge "Remove splitmv." into experimental	2013-05-30 19:07:12 -07:00
Ronald S. Bultje	d2205f92c3	Merge changes I98c18fe5,I80c37cff into experimental * changes: Remove i4x4_pred. Remove unused table.	2013-05-30 19:06:44 -07:00
Ronald S. Bultje	117282a690	Remove unused define. Change-Id: Ic6555128206d61f47a46c550cb3dcaf3b4ec6374	2013-05-30 17:21:06 -07:00
Ronald S. Bultje	a433abbcad	Merge VP9_YMODES, VP9_UV_MODES, INTRA_MODE_COUNT and cousins. These are now merged in a new define called VP9_INTRA_MODES. Change-Id: I0890f895756a7395d84c92f98f43e43f4cf9050d	2013-05-30 17:21:06 -07:00
Ronald S. Bultje	4d3d00b195	Remove TX_SIZE_MAX_MB. Change-Id: I715870513d1fef8471bfd0f5218a79360a1ef126	2013-05-30 17:21:06 -07:00
Ronald S. Bultje	580d29bdbb	Remove one (unused) entry from mvref tables. Change-Id: Ieb4669ae564bec9f3051485ecdf186cb4e00decb	2013-05-30 17:21:06 -07:00
Ronald S. Bultje	e6485581fe	Remove splitmv. We leave it in rdopt.c as a local define for now - this can be removed later. In all other places, we remove it, thereby slightly decreasing the size of some arrays in the bitstream. Change-Id: Ic2a9beb97a4eda0b086f62c039d994b192f99ca5	2013-05-30 17:21:01 -07:00
Ronald S. Bultje	1efa79d32f	Remove i4x4_pred. It remains as a local define in rdopt.c so we can distinguish between split and non-split modes in the RD loop, but disappears outside that scope in the codec. Change-Id: I98c18fe5ab7e4fbd1d6620ec5695e2ea20513ce9	2013-05-30 16:44:58 -07:00
Ronald S. Bultje	9175082c4e	Remove unused table. Change-Id: I80c37cffa176bac942ab3051abdfd585ed5555e1	2013-05-30 16:44:56 -07:00
Yaowu Xu	042e70e45e	Changed to use a new variant of WHT The commit changed to use a new variant of Walsh-Hadamard Transform by Tim Terriberry. This new variant has the best compression among a number of variants that developed by Tim. Change-Id: Icb3a88515463cfc644b17ca046fcd139db2557e9	2013-05-30 15:37:52 -07:00
Ronald S. Bultje	f5827699bf	Merge "Merge all intra mode coding trees into a single one." into experimental	2013-05-30 11:27:51 -07:00
Adrian Grange	6f361f5841	Merge "Add intra_only and reset_frame_context flags" into experimental	2013-05-30 10:56:25 -07:00
Ronald S. Bultje	98c192ae83	Merge all intra mode coding trees into a single one. Also merge all counters. This removes a few unused probability updates from the bitstream. Change-Id: I20f58853e9dac84d8c0d9703ae012c55917516eb	2013-05-30 09:58:53 -07:00
Deb Mukherjee	c98bfcfbbb	Merge "Balancing coef-tree to reduce bool decodes" into experimental	2013-05-30 08:10:47 -07:00
Sami Pietila	5700b4ea42	Replace scatter scan 32x32 with HW friendly scan. The first 240 coeff positions (15 top-left blocks) are scanned in the same order as in scatter scan, after that the coeffs are scanned in "block bands", each band at a time, all coeffs in one band before moving on to the next band. This brings down the amount of 4x4 coeff blocks that need to be buffered while scanning, from 15 blocks to 8 blocks. Change-Id: I478a991d63c48bd5e64d36e59fed7a00c9a651ba	2013-05-30 15:32:46 +03:00
Paul Wilkins	1b103f250f	Patch to remove implicit segmentation. This patch removes the implicit segmentation experiment from the code base as the benefits were still unproven as of the bitstream deadline. Change-Id: I273b99d8d621d1853eac4182f97982cb5957247e	2013-05-30 11:06:29 +01:00
Ronald S. Bultje	17544d1478	Merge "Remove some unused code related to macroblock/splitmv coding." into experimental	2013-05-29 17:35:05 -07:00
Ronald S. Bultje	7873de1481	Merge "Remove unused and outdated debug code." into experimental	2013-05-29 17:33:32 -07:00
Adrian Grange	9e5bb9598c	Add intra_only and reset_frame_context flags Added two flags to the frame header: intra_only: Signals that the frame is encoded using only INTRA coding modes. reset_frame_context: Indicates that the coding context specified in the frame header should be reset to default values before the frame is encoded/decoded. Change-Id: I182d46f1f84fb67a13c46ad767f246a38d7861a2	2013-05-29 17:16:00 -07:00
Deb Mukherjee	b8b3f1a46d	Balancing coef-tree to reduce bool decodes This patch changes the coefficient tree to move the EOB to below the ZERO node in order to save number of bool decodes. The advantages of moving EOB one step down as opposed to two steps down in the other parallel patch are: 1. The coef modeling based on the One-node becomes independent of the tree structure above it, and 2. Fewer conext/counter increases are needed. The drawback is that the potential savings in bool decodes will be less, but assuming that 0s are much more predominant than 1's the potential savings is still likely to be substantial. Results on derf300: -0.237% Change-Id: Ie784be13dc98291306b338e8228703a4c2ea2242	2013-05-29 16:25:52 -07:00
Dmitry Kovalev	38cb616fbf	Merge "Compressed/uncompressed frame header changes." into experimental	2013-05-29 15:29:44 -07:00
Scott LaVarnway	353642bc53	Moved use_prev_in_find_mv_refs check to frame level This patch checks at the frame level to see if the previous mode info context can be used. This patch eliminates the flag check that was done for every mode and removes another check that was done prior to every vp9_find_mv_refs(). Change-Id: I9da5e18b7e7e28f8b1f90d527cad087073df2d73	2013-05-29 16:42:23 -04:00
Jingning Han	6c97bba403	Merge "further clean-ups on intra4x4 coding" into experimental	2013-05-29 10:55:14 -07:00
Sami Pietila	88a4d4c510	Residual coding to cache energy class of tokens. Proposal for tuning the residual coding by changing how the context from previous tokens is calculated. Storing the energy class of previous tokens instead of the token itself eases the critical path of HW implementations. Change-Id: I6d71d856b84518f6c88de771ddd818436f794bab	2013-05-29 15:21:01 +01:00
Ronald S. Bultje	4487f5a690	Remove some unused code related to macroblock/splitmv coding. Change-Id: Ic40d56fb162f4e201547dfae33e62ccd9e865889	2013-05-29 06:29:56 -07:00
Ronald S. Bultje	2afc3422c6	Remove unused and outdated debug code. Change-Id: I0e789bdeaed60f920f7a470e56a8d4ea374233fc	2013-05-28 19:15:57 -07:00
Dmitry Kovalev	18c83b3714	Compressed/uncompressed frame header changes. Adding API to read/write uncompressed frame header bits (it is not final yet). Separate functions to read/write uncompressed header. Moving clr_type, error_resilient_mode, refresh_frame_context, frame_parallel_decoding_mode, frame_context_idx from compressed partition to uncompressed frame header. Change-Id: Id3ed8a387980c652ae147549412f4ec24a0a5bd0	2013-05-28 18:07:54 -07:00
Deb Mukherjee	3d4e032e16	Merge "Clean up related to coefficient modeling" into experimental	2013-05-28 16:55:02 -07:00
Deb Mukherjee	d8c0989d56	Clean up related to coefficient modeling Uses reduced arrays for probabilities and branch counts in the encoder. No change in bitstream. Change-Id: Iec605446f44db4cd325eb45fa12a3003a6ee29db	2013-05-28 16:32:03 -07:00
Jingning Han	4729a6f389	further clean-ups on intra4x4 coding Removed one 4x4 prediction step that was unnessary in the rd loop. Removed a unused modecosts estimate from encoder side. Change-Id: I65221a52719d6876492996955ef04142d2752d86	2013-05-28 11:19:05 -07:00
Paul Wilkins	245a11553a	Merge "Remove loop dering experiment." into experimental	2013-05-28 05:34:14 -07:00
Dmitry Kovalev	1a24011469	Revert "Adding API to read/write uncompressed frame header bits." because of bitstream mismatches. This reverts commit `df037b615f` Change-Id: I1a529f2590df7bc912f5035d22311268933e3dd6	2013-05-28 02:24:52 -07:00
Yaowu Xu	2b96ffe025	a few clean-ups 1. remove prediction mode conversion 2. unified bmode, same for key and non-key frame 3. set I4X4_PRED count for pdf to 0, as I4X4_PRED is no longer coded ever. It is determined by ref_frame and block partition Change-Id: If5b282957c24339b241acdb9f2afef85658fe47d	2013-05-27 13:53:56 -07:00
Timothy B. Terriberry	95339d6825	Reduce WHT complexity. Saves 1 add, 3 shifts (and a shift bias) per 1-D transform. Change-Id: I1104bb1679fe342b2f9677df8a9cdc0cb9699e7d	2013-05-27 13:23:52 -07:00
Jingning Han	de735929cf	Reduce bmi buffer length from 16 to 4 This commit removes the use of bmi_ in the first-pass encoding by forcing encode_intra4x4block_ to use DC_PRED, followed by DCT_DCT only, as John suggested. This makes the need for bmi buffer only up to 4 entries, instead of 16. Change-Id: I3410007dfae789ee46a09ae20c39d3ce3c7954aa	2013-05-27 08:59:15 -07:00
Ronald S. Bultje	5cac66078e	Remove splitmv. Also do per-partition motion vector referencing in <sb8x8 partitions, and adjust mvref finding for sub8x8 partitions. Change-Id: Id3ed1ed4d2a8910d11d327db6cc63b8eb79f941f	2013-05-26 14:40:49 -07:00
Paul Wilkins	845bc13ba9	Remove loop dering experiment. Change-Id: I1a979bf74c286b157c31bab6bdcba0494acb4918	2013-05-25 10:09:23 +01:00
Dmitry Kovalev	0b2b81249b	Merge "Adding API to read/write uncompressed frame header bits." into experimental	2013-05-24 13:43:19 -07:00
Paul Wilkins	e41fd6e3e2	Fix bug in 4x4 band definition. Also some unused data structures/references removed. Change-Id: I295809e887173543e794250cb60ddaf1475ffd24	2013-05-23 17:51:02 -07:00
Yaowu Xu	22694ca1ad	Change txfm_type decision The changing in intra coding to base on transform block, i.e. pred-> txfm->quant->dequant-itxfm->recon, made all blocks within a prediction unit behave consistently, there is no longer a need to handle blocks differently based on the position within a predicitn block. So this commit simplifies the decision of transform type to be based on prediction mode only. Change-Id: If96cb72386f2e9186126ace88afa35ef085b6c96	2013-05-23 17:46:28 -07:00
Paul Wilkins	33ecd6ad54	Merge Scatter Scan experiment. Removal from under configure flag. A bit renaming Change-Id: I2213229dfe852001dfec16b149f47c52ce88f3aa	2013-05-23 13:09:27 +01:00
Jingning Han	7ac5ac52f9	Merge 4x4 block level partition into codebase Move 4x4/4x8/8x4 partition coding out of experimental list. This commit fixed the unit test failure issues. It also resolved the merge conflicts between 4x4 block level partition and iterative motion search for comp_inter_inter. Change-Id: I898671f0631f5ddc4f5cc68d4c62ead7de9c5a58	2013-05-23 11:58:50 +01:00
Yunqing Wang	0812c121e7	Merge "Optimize variance functions" into experimental	2013-05-22 13:41:04 -07:00
Deb Mukherjee	ddb2309568	Merge "Using 128 entry look up table for coef models" into experimental	2013-05-22 10:38:35 -07:00
Yunqing Wang	f4fcfe3075	Optimize variance functions Added SSE2 version of variance functions for super blocks. Change-Id: Ibeaae8771ca21c99d41dd74067574a51e97b412d	2013-05-22 10:29:38 -07:00
Jingning Han	d2cacdc530	Merge "Make the intra rd search support 8x4/4x8" into experimental	2013-05-22 10:00:15 -07:00
Deb Mukherjee	de4d682ca4	Using 128 entry look up table for coef models Reverts to using 128 bit LUT for the coef models rather than 48 to ease hardware implementation. Also incorporates some cleanups including removing various hooks to support different lookup tables based on block_type and ref_type. Change-Id: I54100c120cca07a2ebd3a7776bc4630fa6a153f6	2013-05-22 08:44:31 -07:00
Paul Wilkins	4e08fa96f3	Merge "changes intra coding to be based on txfm block" into experimental	2013-05-22 06:53:12 -07:00
Paul Wilkins	22d7f0703a	Merge "Generalized intra 4x4 encoding for all sizes" into experimental	2013-05-22 06:52:32 -07:00
Scott LaVarnway	d679fdf7b0	Merge "Removed unused idct functions" into experimental	2013-05-22 05:36:36 -07:00
Yaowu Xu	8ba92a0bed	changes intra coding to be based on txfm block This commit changed the encoding and decoding of intra blocks to be based on transform block. In each prediction block, the intra coding iterates thorough each transform block based on raster scan order. This commit also fixed a bug in D135 prediction code. TODO next: The RD mode/txfm_size selection should take this into account when computing RD values. Change-Id: I6d1be2faa4c4948a52e830b6a9a84a6b2b6850f6	2013-05-22 11:53:19 +01:00
Yaowu Xu	232d90d8fd	Generalized intra 4x4 encoding for all sizes Change-Id: I1b86744fa247233c8df031b3f4b87b212c8dd094	2013-05-22 11:44:12 +01:00
Jingning Han	f153a5d063	Make the intra rd search support 8x4/4x8 This commit allows the rate-distortion optimization of intra coding capable of supporting 8x4 and 4x8 partition settings. It enables the entropy coding of intra modes in key frame using a unified contextual probability model conditioned on its above/left prediction modes. Coding performance: derf 0.464% Change-Id: Ieed055084e11fcb64d5d5faeb0e706d30268ba18	2013-05-21 21:03:00 -07:00
John Koleszar	ddf13be8ef	Merge "Initial version of alpha channel support" into experimental	2013-05-21 17:29:51 -07:00
Dmitry Kovalev	df037b615f	Adding API to read/write uncompressed frame header bits. The API is not final yet and can be changed. Actual layout of uncompressed frame part will be finalized later. Right now moving clr_type, error_resilient_mode, refresh_frame_context, frame_parallel_decoding_mode from first compressed partition to uncompressed frame part. Change-Id: I3afc5d4ea92c5a114f4c3d88f96858cccc15b76e	2013-05-21 15:31:32 -07:00
Scott LaVarnway	a143152600	Removed unused idct functions No longer used. Change-Id: Id28c9247cebba183c6fa786dff96824ae100132c	2013-05-21 17:59:54 -04:00
Deb Mukherjee	7a645e4e12	Merging the model coef prob experiment Merges the experiment. Change-Id: I4eb19af6de6df6aa3a96a2e82f231d47ed9b3ae9	2013-05-21 14:44:38 -07:00
Deb Mukherjee	90a7723f8c	Merge "Refinements on modelcoef expt to reduce storage" into experimental	2013-05-21 13:41:54 -07:00
Scott LaVarnway	3d0110fd8e	Removed diff from macroblockd_plane No longer used. Change-Id: I171c5fa33a7600ad45b9466af23a46ccbdfe0480	2013-05-21 14:22:09 -04:00
Scott LaVarnway	0c3f3bf1d5	Removed vp9_recon functions No longer used. Change-Id: Ica5166f7117f4693dffdf7633dcfc1b263103d0d	2013-05-21 13:57:50 -04:00
Scott LaVarnway	1db6373267	Merge "WIP: 4x4 idct/recon merge" into experimental	2013-05-21 10:45:53 -07:00
Dmitry Kovalev	060d93d704	Merge "Removing clamp_type from the bitstream." into experimental	2013-05-21 10:35:27 -07:00
Deb Mukherjee	07443f1589	Refinements on modelcoef expt to reduce storage Uses more aggrerssive interpolation to reduce storage for the model tables by almost more than half. Only 48 lists of probs are stored (as opposed to 128 before), corresponding to ONE_NODE probabilities of: 1, 3, 7, 11, ..., 115, 119, 127, 135, ..., 247, 255. Besides, only 1 table is used as opposed to 2 before. So the overall memory needed for the tables is just 48 * 8 = 384 bytes. The table currently used is based on a new Pareto distribution with heavier tail than a generalized Gaussian - which improves results on derf by about 0.1% over a single table Generaized Gaussian. Results overall on derfraw300 is -0.14%. Change-Id: I19bd03559cbf5894a9f8594b8023dcc3e546f6bd	2013-05-21 10:06:56 -07:00
Deb Mukherjee	39a90bc8e8	Updating the model coef experiment Cleans up the experiment. Actually uses reduced counts for backward updates, and reduced number of probabilities in the context. No change in bitstream when the experiment is on. Between expt on and off: derfraw300 is down only -0.062% (which is better than when expts were run previously). Change-Id: I55285a049a0c22810bdb42914212ab5a4f8521b5	2013-05-20 12:46:36 -07:00
Scott LaVarnway	ba48a11130	WIP: 4x4 idct/recon merge This patch eliminates the intermediate diff buffer usage by combining the short idct and the add residual into one function. The encoder can use the same code as well. Change-Id: I296604bf73579c45105de0dd1adbcc91bcc53c22	2013-05-20 13:03:17 -04:00
Jingning Han	810b612c23	Enable bit-stream support to 8x4 and 4x8 partition The recursive partition type search is enabled down to 4x4, 4x8 and 8x4, followed by the corresponding rate-distortion optimization for the per-partition encoding mode decisions. The bit-stream writing/reading synchronized in supporting the rectangular partition of 8x8 block. This provides above 1% coding performance gains on derf. To do next: 1. re-design the rate-distortion loop for inter prediction below 8x8. 2. re-design the rate-distortion loop for intra prediction below 4x4. 3. make the loop-filter aware of rectangular partition of 8x8 block. 4. clean the unused probability models. 5. update default probability values. Change-Id: Idd41a315b16879db08f045a322241f46f1d53f20	2013-05-19 14:59:04 -07:00
Dmitry Kovalev	498b6460a1	Removing clamp_type from the bitstream. Change-Id: Ica75bdd4905c4a04b7f92795d0b8ce6836a99ef4	2013-05-17 15:50:26 -07:00
Jim Bankoski	d9705b200f	Merge "holds utility debugging functions" into experimental	2013-05-17 09:23:36 -07:00
Paul Wilkins	6f0c8e82c0	Merge "Replace default counts with default probs." into experimental	2013-05-17 08:58:53 -07:00
Paul Wilkins	61e2eac61d	Merge "New inter mode context." into experimental	2013-05-17 06:59:09 -07:00
Paul Wilkins	99c4b1eea1	Replace default counts with default probs. Replace vp9_kf_default_bmode_counts structure with direct default probabilities. The probability structure is smaller and it removes the need to specify in the bitstream how to convert the counts to probabilities. Note that I have concerns still about the size and value of the large intra mode context. This may cause problems for HW but it also means we rely heavily on reverse update as forwards update of a structure this size is problematic. I intend to review this more generally in the next few days to see if we can come up with a competitive solution that does not rely on such a large context. Change-Id: I0a36071079d5d26a57ab0e9fbf91af4199aa7984	2013-05-17 14:54:52 +01:00
Jim Bankoski	b67e46b33c	holds utility debugging functions This one prints out a visual version of the partitioning for human eyes to follow... Change-Id: Iba434589a2f55eb069484686d99a382db93b9548	2013-05-17 06:29:28 -07:00
Paul Wilkins	51bc4bf4a0	Remove MODE_STATS flag and code Change-Id: I6c70a8a8a4633399842ac74792003ae5f7859ffa	2013-05-17 12:34:10 +01:00
John Koleszar	679e4abdd5	Initial version of alpha channel support This is a mostly-working implementation of an extra channel in the bitstream. Configure with --enable-alpha to test. Notable TODOs: - Add extra channel to all mismatch tests, PSNR, SSIM, etc - Configurable subsampling - Variable number of planes (currently always uses all 4) - Loop filtering - Per-plane lossless quantizer - ARNR support This implementation just uses the same contents as the Y channel for the A channel, due to lack of content and general pain in playing back 4 channel content. A later patch will use the actual alpha channel passed in from outside the codec. Change-Id: Ibf81f023b1c570bd84b3064e9b4b8ae52e087592	2013-05-16 22:21:09 -07:00
John Koleszar	f07602e403	Merge "Remove vp9_extend_mb_row()" into experimental	2013-05-16 15:22:38 -07:00
Scott LaVarnway	9aa37a51b2	Merge "WIP: 8x8 idct/recon merge" into experimental	2013-05-16 14:28:30 -07:00
Yaowu Xu	e3869e9cfc	Removed Q threshold in the usage of ADST Test on cif set showed small but consistent compression gain for almost all encodings with overall impact of .08%. The gains average aournd .12% combined with D63 adst change. Test encoding on std-hd set is ongoing.. Change-Id: If4d94799cf0486fb9c770b193e5c386d13d99d59	2013-05-16 13:33:07 -07:00
John Koleszar	16ac5a5cde	Remove vp9_extend_mb_row() This code is no longer needed for correct intra prediction. Change-Id: I822d1a8b0ad0a00e7c4c6e7b2931790c39d1267d	2013-05-16 11:56:00 -07:00
Scott LaVarnway	794a7bedbd	WIP: 8x8 idct/recon merge This patch eliminates the intermediate diff buffer usage by combining the short idct and the add residual into one function. The encoder can use the same code as well. Change-Id: Iacfd57324fbe2b7beca5d7f3dcae25c976e67f45	2013-05-16 13:52:15 -04:00
Jingning Han	8e3d0e4d7d	Add building blocks for 4x8/8x4 rd search These building blocks enable rate-distortion optimization search over block sizes of 8x4 and 4x8. Need to convert them into mmx/sse forms. Change-Id: I570ea2d22d14ceec3fe3575128d7dfa172a577de	2013-05-16 10:41:29 -07:00
Jingning Han	c0f70cca40	Merge "Fix the transform type selection in 4x4 partition" into experimental	2013-05-16 10:14:49 -07:00
Paul Wilkins	6ff3eb1647	New inter mode context. This patch creates a new inter mode contest that avoids a dependence on the reconstructed motion vectors from neighboring blocks. This was a change requested by a hardware vendor to improve decode performance. As part of this change I have also made some modifications to stats output code (under a flag) to allow accumulation of inter mode context flags over multiple clips Some further changes will be required to accommodate the deprecation of the split mv mode over the next few days. Performance as stands is around -0.25% on derf and std-hd but up on the YT and YT-HD sets. With further tuning or some adjustment to the context criteria it should be possible to make this change broadly neutral. Change-Id: Ia15cb4470969b9e87332a59c546ae0bd40676f6c	2013-05-16 12:09:19 +01:00
Paul Wilkins	18e07420a2	Merge "Further Implicit Segmentation Changes" into experimental	2013-05-16 03:03:44 -07:00
John Koleszar	7addafb5b1	Merge "Fix vp9_build_intra_predictors_sbuv_s for non-4:2:0" into experimental	2013-05-15 20:58:15 -07:00
John Koleszar	501ae3484c	Fix vp9_build_intra_predictors_sbuv_s for non-4:2:0 Remove an assumption about chroma size, and the number of planes. Change-Id: I286a7fac296ec334c6a8ad847f663f3adbb9f43e	2013-05-15 17:57:08 -07:00
Dmitry Kovalev	5c5582242d	Moving the same code to new function vp9_setup_scale_factors. Change-Id: I2408ad22717784a40e23701ccb9d978265440e4f	2013-05-15 16:33:36 -07:00
Jingning Han	8468a5c1a0	Fix the transform type selection in 4x4 partition This commit allows proper transform type (DCT/ADST) selection in the settings of partition 4x4 level. Change-Id: Iec6f922a46480d777e7ca9142a99e8c131f0077b	2013-05-15 16:09:58 -07:00
Dmitry Kovalev	cd16fe9160	Merge "Preparing vp9_deblock and vp9_denoise to alpha support." into experimental	2013-05-15 15:40:52 -07:00
Dmitry Kovalev	80c0715375	Merge "Moving several static functions from vp9_reconinter.h to vp9_reconinter.c." into experimental	2013-05-15 15:39:43 -07:00
Scott LaVarnway	a272ff25cd	WIP: 16x16 idct/recon merge This patch eliminates the intermediate diff buffer usage by combining the short idct and the add residual into one function. The encoder can use the same code as well. Change-Id: Iea7976b22b1927d24b8004d2a3fddae7ecca3ba1	2013-05-15 13:16:02 -04:00
Paul Wilkins	7d7e5b5131	Further Implicit Segmentation Changes Trial use of a combination of reference frame, prediction block size and mv to define segmentation. Change-Id: Ie8946a0446dbad777fdcf7626f89e5af0994db50	2013-05-15 16:00:06 +01:00
Dmitry Kovalev	6706e674b1	Moving several static functions from vp9_reconinter.h to vp9_reconinter.c. Change-Id: I5da9c16bab26f6ff0c9d3a2a29ef6c84f5093161	2013-05-14 17:49:41 -07:00
Scott LaVarnway	2cf0d4be12	WIP: 32x32 idct/recon merge This patch eliminates the intermediate diff buffer usage by combining the short idct and the add residual into one function. The encoder can use the same code as well. Change-Id: I4ea09df0e162591e420d869b7431c2e7f89a8c1a	2013-05-14 15:54:17 -07:00
Jingning Han	1f26840fbf	Enable recursive partition down to 4x4 This commit allows the rate-distortion optimization recursion at encoder to go down to 4x4 block size. It deprecates the use of I4X4_PRED and SPLITMV syntax elements from bit-stream writing/reading. Will remove the unused probability models in the next patch. The partition type search and bit-stream are now capable of supporting the rectangular partition of 8x8 block, i.e., 8x4 and 4x8. Need to revise the rate-distortion parts to get these two partition tested in the rd loop. Change-Id: I0dfe3b90a1507ad6138db10cc58e6e237a06a9d6	2013-05-14 12:39:56 -07:00
Dmitry Kovalev	7bbf716f04	Preparing vp9_deblock and vp9_denoise to alpha support. Change-Id: I299feefa64b93bd62263aea1ff1e41e85faeb6ca	2013-05-14 11:01:57 -07:00
Yaowu Xu	8be35347a8	Merge "changed to use adst for D63_PRED" into experimental	2013-05-14 09:30:17 -07:00
John Koleszar	d31a632dcd	Merge "Revert "Preparing vp9_deblock and vp9_denoise to alpha support."" into experimental	2013-05-14 06:47:44 -07:00
John Koleszar	56efb73be3	Revert "Preparing vp9_deblock and vp9_denoise to alpha support." This reverts commit `a933311131` Change-Id: I2321f88011178381adbcffeda1bcc6a430ab8f1d	2013-05-14 06:46:11 -07:00
Yaowu Xu	da3aec6c8a	changed to use adst for D63_PRED To be consistent with other prediciton modes Change-Id: If9e1464e5c807f0b36047a046c4ac59d91b1b868	2013-05-13 22:09:38 -07:00
Dmitry Kovalev	571aa44606	Merge "Preparing vp9_deblock and vp9_denoise to alpha support." into experimental	2013-05-13 21:48:37 -07:00
Dmitry Kovalev	ce70ad5967	Merge "Code cleanup inside vp9_firstpass.c." into experimental	2013-05-13 21:46:14 -07:00
Dmitry Kovalev	2149852935	Merge "Removing simple loopfilter and code duplication from loopfilter code." into experimental	2013-05-13 18:25:15 -07:00
Dmitry Kovalev	1c43e643b7	Removing simple loopfilter and code duplication from loopfilter code. Change-Id: Ib19352e391408507f2237985501406900a355964	2013-05-13 18:09:11 -07:00
Dmitry Kovalev	a933311131	Preparing vp9_deblock and vp9_denoise to alpha support. Change-Id: Id1cc1c2663b9c2219cb830ffb4b0c6ab3468dc04	2013-05-13 14:03:29 -07:00
Jingning Han	089ed30d07	Merge "Use consistent partition context setup in enc/dec" into experimental	2013-05-13 10:51:51 -07:00
Jingning Han	fc31ae479d	Merge "Move get_sb_index to vp9_blockd.h" into experimental	2013-05-13 10:51:29 -07:00
Paul Wilkins	e5f715201a	Change to band calculation. Change band calculation back to simpler model based on the order in which coefficients are coded in scan order not the absolute coefficient positions. With the scatter scan experiment enabled the results were appear broadly neutral on derf (-0.028) but up a little on std-hd +0.134). Without the scatterscan experiment on the results were up derf as well. Change-Id: Ie9ef03ce42a6b24b849a4bebe950d4a5dffa6791	2013-05-13 17:21:49 +01:00
Jingning Han	6910f178f1	Use consistent partition context setup in enc/dec Move set_partition_seg_context_ to common file. Use consistent context setup conditions for partition probability model update at encoder and decoder. Change-Id: I24b7ed3b1c48e3d2568191a46b70136b99b67b1a	2013-05-11 15:22:13 -07:00
Jingning Han	2117d4ee96	Move get_sb_index to vp9_blockd.h Use common function to fetch/assign sb_index in rd loop, bit-stream writing and reading. Change-Id: I1d8a214a57ed9cbcd026040436ef33e5e39d65b7	2013-05-11 13:26:59 -07:00
John Koleszar	5f221e7ac4	Merge "Fix token allocation for non-4:2:0" into experimental	2013-05-10 16:57:09 -07:00
John Koleszar	998b540fe2	Merge "Fix non-4:2:0 chroma MV calculation for SPLITMV" into experimental	2013-05-10 16:57:03 -07:00
John Koleszar	64667d5af7	Merge "Subsampling aware allocs and bitstream" into experimental	2013-05-10 16:55:00 -07:00
Dmitry Kovalev	4a559d3448	Merge "Removing unused simple loopfilter code." into experimental	2013-05-10 12:14:34 -07:00
Dmitry Kovalev	effaa3263d	Removing unused simple loopfilter code. Change-Id: Ic11dc052fb641687c015e1bbc37181b9babcd43e	2013-05-10 11:04:43 -07:00
Yunqing Wang	9f5811c2da	Add joint motion search in comp_inter_inter mode(experiment) In current code, motion vectors got from single prediction mode are used in compound prediction mode directly. These motion vectors may not give accurate prediction since they are searched independently. In this patch, we took Pascal's suggestion, and did joint motion search in compound prediction mode to find better motion vectors in this situation. Test results: Overall PSNR: 0.570%(derf), 0.918%(stdhd); SSIM: 0.572%(derf), 1.009%(stdhd); The encoder is a little slower. This can be improved since some c code is used in motion search. Change-Id: Ib30c9240f6c56c9b070867b4ca89412a76d9f3c6	2013-05-10 10:15:43 -07:00
John Koleszar	da5054c5af	Fix token allocation for non-4:2:0 Increase the allocated size of the token array to support 4:4:4. Change-Id: I7766a7bedc74b819dcc1f3622d634f340fd3186d	2013-05-09 20:22:59 -07:00
John Koleszar	a37ee9d2e8	Fix non-4:2:0 chroma MV calculation for SPLITMV The previous code was somewhat vestigial for 16x16 MI units, but was incorrect when called with chroma blocks larger than 4x4 because the block index caused a reference to a non-existent BMI. This patch uses the same MV for all chroma subblocks in SPLITMV mode, which is suboptimal for non-4:2:0 subsamplings, but as SPLITMV may be removed in the near future, will use this as a stop gap. Change-Id: I3211cee5ccf1cfb426e5eef5353b0ce5bb92b4cd	2013-05-09 20:14:39 -07:00
John Koleszar	da58436f43	Subsampling aware allocs and bitstream Make framebuffer allocations according to the chroma subsamping factors in use. A bit is placed in the raw part of the frame header for each of the two subsampling factors. This will be moved in a future commit to make them part of the TBD feature set bits, probably only set on keyframes, etc. Change-Id: I59ed38d3a3c0d4af3c7c277617de28d04a001853	2013-05-09 17:50:12 -07:00
John Koleszar	eab6a421ea	Merge "Use common get_uv_tx_size()" into experimental	2013-05-09 12:18:10 -07:00
Dmitry Kovalev	eb93893bee	Updating comments for prediction modes. Change-Id: If4063184f7b37dc011ec6a7a3e75260f4251e984	2013-05-09 11:37:51 -07:00
John Koleszar	1fec23bef6	Use common get_uv_tx_size() Use a single method for calculating the transform size of non-luma planes. Change-Id: I16ebd10e7944d7b9075ab79d15e6a5b5f9bab775	2013-05-08 20:48:32 -07:00
Dmitry Kovalev	dc5418050a	Code cleanup inside vp9_firstpass.c. Change-Id: Ia2814402e3c2ec97c24c536c05f0f526fe1a431c	2013-05-08 18:13:46 -07:00
Dmitry Kovalev	f66320abff	Removing LOOPFILTER_TYPE and corresponding bit in bitstream. We don't have two loopfilter types anymore. Change-Id: I53c0137361342c7d00887ad03be3490f0dfa3532	2013-05-08 16:44:08 -07:00
Dmitry Kovalev	267e9331e2	Merge "Using 4-iteration loop for extra_mb_col inside loopfilter function." into experimental	2013-05-08 16:37:34 -07:00
Dmitry Kovalev	7b602cba65	Merge "Eliminating several YV12_BUFFER_CONFIG usages." into experimental	2013-05-08 16:36:24 -07:00
Jingning Han	944ad130b6	Merge "Extend left/above partition context to per mi(8x8)" into experimental	2013-05-08 16:33:25 -07:00
Dmitry Kovalev	81f33bc091	Eliminating several YV12_BUFFER_CONFIG usages. Change-Id: Ia85b987c935d545920dcae5a6f44136b1a08a008	2013-05-08 14:11:47 -07:00
Dmitry Kovalev	673cc21dfc	Using loop to iterate through YV12_BUFFER_CONFIG planes. Change-Id: I22f1066eb0022c8d75f65a78435ee4ffecdfe0c9	2013-05-08 13:39:16 -07:00
Dmitry Kovalev	a0b6b8a7d4	Merge "Removing unused code + little cleanup." into experimental	2013-05-08 11:23:14 -07:00
Jingning Han	4a88ad89fd	Extend left/above partition context to per mi(8x8) Update and buffer left/above partition information context per 8x8 block. This allows to further enable recursive partition down to 4x4 block size, and hence deprecating I4X4_PRED and SPLITMV. This commit also fixes a context buffer swap/restore issue in 32x32 partition type search. This gives 0.1% performance gain for derf/yt. Will refactor the superblock partition type search into recursion form. Change-Id: Ib61975aca5f12b78d8018481d7fa1393d085689b	2013-05-08 10:20:34 -07:00
John Koleszar	7465f52f81	Merge "Make setup_pred_block subsampling-aware." into experimental	2013-05-07 21:53:31 -07:00
Dmitry Kovalev	8e39295934	Using 4-iteration loop for extra_mb_col inside loopfilter function. Change-Id: I3a4f456035628a9397bdc57c19cdb03439ab1ed3	2013-05-07 17:18:57 -07:00
John Koleszar	d6c490cb15	Merge "Deprecate code_zerogroup experiment." into experimental	2013-05-07 17:09:38 -07:00
Dmitry Kovalev	9cd5406c32	Merge "Removing vp9_swap_yv12_buffer function and corresponding files." into experimental	2013-05-07 17:02:38 -07:00
Dmitry Kovalev	cba0a5db2b	Removing unused code + little cleanup. Change-Id: I81c19a8f19cfb5c7183609656ade833d72feb500	2013-05-07 16:56:22 -07:00
Paul Wilkins	a14ae84749	Deprecate code_zerogroup experiment. Delete code under the CONFIG_CODE_ZEROGROUP flag. Change-Id: I5fe6c7b42a5da9b73118e33594301da4129f320a	2013-05-07 16:52:55 -07:00
Dmitry Kovalev	b05247df95	Removing vp9_swap_yv12_buffer function and corresponding files. Adding static swap_yv12 function to vp9_firstpass.c. Change-Id: I7da9caab9720498db4a74c627901bf37816ed06c	2013-05-07 16:49:22 -07:00
Paul Wilkins	1ed57a6a62	Deprecate comp_interintra_pred experiment. Delete code under the CONFIG_COMP_INTERINTRA_PRED flag. Change-Id: I3d1079cf46305c08f7e11d738596ea112e7b547f	2013-05-07 16:24:08 -07:00
Paul Wilkins	0bfcd30768	Remove enable_6tap filter experiment. Clean out code under CONFIG_ENABLE_6TAP flag. Change-Id: Ic45b624081181027d6ba24d55dd644c3197f9830	2013-05-07 16:13:02 -07:00
Paul Wilkins	8c1b516d10	Deprecate the newbintramode experiment. Clean out code relating to newbintramode. Change-Id: Ie91f4f156cdf60ce0da8ca407c1c9cb00c7d0705	2013-05-07 16:00:59 -07:00
Paul Wilkins	9afb6700c2	Adjust q range Skip Q values between the q.0 mode and a real q of 2.0 as these are not valuable from an RD perspective. Change-Id: I110c4858c57f97315953f4d88a2596d4764360df	2013-05-07 15:34:17 -07:00
Jingning Han	b0cd64f189	Merge "Add building blocks for partition down to 4x4" into experimental	2013-05-07 15:33:20 -07:00
Dmitry Kovalev	847e184011	Merge "General code cleanup inside treewriter-related files." into experimental	2013-05-07 15:04:28 -07:00
Jingning Han	cf8b5a09ed	Add building blocks for partition down to 4x4 Macro ab4x4 contains experiments for recursive partition down to 4x4 block size. Change-Id: Ic727842fa98a4df9fd51e0025a545dc76a5c76c1	2013-05-07 12:11:51 -07:00
John Koleszar	e559e14fa6	Make setup_pred_block subsampling-aware. Code previously set up the pointers by scaling by MI_UV_SIZE, which is 4:2:0 only. Change-Id: Ic13a92895cff018ec1345736746ed84cb31e6e31	2013-05-07 11:47:45 -07:00
Jingning Han	c0504a9b24	Merge "Merge SB8X8 into the codebase" into experimental	2013-05-07 09:23:47 -07:00
Jingning Han	776c1482a3	Merge SB8X8 into the codebase Pull sb8x8 out of experimental list. verified via borg run tests. Fixed unit test failures. Change-Id: I12a4bbd17395930580c048ab68becad1ffe46e76	2013-05-07 09:08:25 -07:00
Scott LaVarnway	cb7955d83e	Removed vp9_setup_intra_recon() This setup is now handled by vp9_build_intra_predictors() when left_available and/or up_available is zero. Change-Id: I59cec0ab95f8be69ce885fd20727510e4deef8a0	2013-05-06 16:13:06 -04:00
Ronald S. Bultje	f7fa367094	Fix first-pass intra4x4 for sb8x8 experiment. Change-Id: I1df17f45721c690d157800daa6a0b377e3d32bc2	2013-05-04 15:49:41 -07:00
John Koleszar	acc9c125dd	Remove old_block_idx_4x4 Removes several instances where the old block numbering was still in use. Change-Id: Id35130591455a4abe6844613e45c0b70c1220c08	2013-05-03 17:19:13 -07:00
John Koleszar	6c622e2783	Merge "Separate transform and quant from vp9_encode_sb" into experimental	2013-05-03 17:19:01 -07:00
John Koleszar	4529c68b3b	Separate transform and quant from vp9_encode_sb This allows removing a large number of transform size specific functions, as well as supporting 444/alpha by routing all code through the subsampling-aware path. Change-Id: Ieb085cebe9f37f24fc24de179898b22abfda08a4	2013-05-03 12:14:50 -07:00
Adrian Grange	7aae782c37	Merge "Extend number of reference buffers to 8." into experimental	2013-05-03 09:59:54 -07:00
Adrian Grange	d7eea782f2	Extend number of reference buffers to 8. The number of reference buffers is extended to 8 and a reference sign-bias added for the LAST_FRAME. Whilst the number of reference buffers used by an individual frame remains unchanged at 3, these may now be selected from 8 possible buffers. Change-Id: I2d247b9c1c2b3a339d6c9fac125e81ba373f75a7	2013-05-03 09:17:18 -07:00
Scott LaVarnway	3041cf8c8b	Merge "Reduced y_dequant, uv_dequant size" into experimental	2013-05-03 07:30:31 -07:00
Dmitry Kovalev	519d9f3e16	Merge "Using treed_read/treed_write functions for segment ids." into experimental	2013-05-02 10:40:58 -07:00
Ronald S. Bultje	704fb4866e	Fix right-edge availability for intra prediction in sb8x8. Fixes valgrind uninitialized value use warnings. Change-Id: Ie9314d684e2ad194f8aca5bde1729fb9b7c0221d	2013-05-02 10:16:48 -07:00
Ronald S. Bultje	ec6cf519d1	Merge "Fix some more offset errors in sb8x8." into experimental	2013-05-02 09:08:33 -07:00
Jingning Han	73a4824c34	Merge "Fix bug in sb8x8 partition context" into experimental	2013-05-02 09:04:28 -07:00
Ronald S. Bultje	3e345cd4d8	Fix some more offset errors in sb8x8. Change-Id: I83677227f7610fdf2db9f15f87fecd4d8e072427	2013-05-02 07:54:18 -07:00
Ronald S. Bultje	dd1e6b8e6f	Merge "Fix block reconstruction with sb8x8 enabled." into experimental	2013-05-02 07:11:36 -07:00
Jingning Han	ba24a28f69	Fix bug in sb8x8 partition context Fix the issue that causes array bound excess in getting partition context. Change-Id: I66166f047f0bcaefebb0bcf441c5b1f777d8da44	2013-05-01 22:34:27 -07:00
Ronald S. Bultje	ff37688a91	Fix block reconstruction with sb8x8 enabled. The encoder reconstruction is now correct. Decoder to follow shortly. Change-Id: Iedf98cdaebb4ca1256c7714cad7024a75853ad6a	2013-05-01 19:28:17 -07:00
Jingning Han	b8decb0313	Fix bugs in sb8x8 experiment/context prob update Fix bugs occur in contextual partition probability update, when sb8x8 is enabled. Change-Id: I19e2cec8a54c2dafd2be2803bbfde7337a2ae45f	2013-05-01 15:16:50 -07:00
Ronald S. Bultje	b6c2d872f0	Fix some crashes in sb8x8 experiment. Change-Id: I390bb1cedc835f439fd5dd6cda6572b29cbb139c	2013-05-01 14:45:27 -07:00
Jingning Han	650e632400	Merge "Enable bit-stream support to SB8X8" into experimental	2013-05-01 13:48:14 -07:00
Scott LaVarnway	94ed11d89d	Reduced y_dequant, uv_dequant size Currently, only two values are used. Removed the unused values. Change-Id: Idc5b8be354d84ffc68df39ea3e45f9f50d977b35	2013-05-01 16:25:10 -04:00
Jingning Han	2bf1dc2e23	Enable bit-stream support to SB8X8 This commit enables bit-stream writing and reading for recursive partition down to block 8x8. Change-Id: I163cd48d191cc94ead49cbb7fc91374f6bf204e2	2013-05-01 13:03:54 -07:00
Dmitry Kovalev	6e4ed2f0fe	Merge "Adding vp9_get_qindex function." into experimental	2013-05-01 12:04:21 -07:00
John Koleszar	d139655b14	Merge "Make vp9_optimize_sb* common" into experimental	2013-04-30 21:43:26 -07:00
John Koleszar	1f80a568d2	Make vp9_optimize_sb* common Unify the various vp9_optimize_sb functions into one that handles all transform sizes. Change-Id: I48b642fbfb3e72cc2e0bcf1d0317a80a80547882	2013-04-30 21:34:58 -07:00
Dmitry Kovalev	79590f186c	Merge "Cleaning up encoder segmentation code." into experimental	2013-04-30 17:49:55 -07:00
Ronald S. Bultje	ff2d69573e	Use more restrictive block radius for 8x8 block MV references. Change-Id: If02e006aa8a89da9de23da92362bd2e7718ea07c	2013-04-30 17:34:02 -07:00
Dmitry Kovalev	aea29cd278	General code cleanup inside treewriter-related files. Change-Id: Ifaa40612a9c054d96112ba350c6f4adb46b1bd5b	2013-04-30 16:39:07 -07:00
Ronald S. Bultje	d068d869b9	sb8x8 integration in rd loop. Work-in-progress, not yet ready for review. TODO items: - bitstream writing (encoder) and reading (decoder) - decoder reconstruction Change-Id: I5afb7284e7e0480847b47cd0097cb469433c9081	2013-04-30 16:13:20 -07:00
Dmitry Kovalev	b5364d4f3b	Using treed_read/treed_write functions for segment ids. Changing the order of probabilities inside mb_segment_tree_probs in order to use treed_read/treed_write function instead of custom code. Change-Id: I843487d5057913b9358db73da270893eefecc6c8	2013-04-30 14:06:49 -07:00
Dmitry Kovalev	3f6c6ffc86	Adding vp9_get_qindex function. Moving common code from encoder and decoder to vp9_get_qindex function. Also moving quant-related constants from vp9_onyxc_int.h to vp9_quant_common.h. Change-Id: I70c5bfbaa1c8bf00fde0bfc459d077f88b6d46c8	2013-04-30 13:39:50 -07:00
Yaowu Xu	ad6890316c	Merge "Removed code no longer being used." into experimental	2013-04-30 12:26:09 -07:00
Yaowu Xu	df6f82c3e8	Removed code no longer being used. Change-Id: Iab9a88f250614a790b6ad96bf3150a74210910df	2013-04-30 12:09:27 -07:00
Dmitry Kovalev	15b5e465f2	Adding vp9_update_frame_size function. Moving common code from encoder and decoder to vp9_update_frame_size. Change-Id: I6ca758b7d05ffd52821bd3f7ad68089da11e4165	2013-04-30 11:14:27 -07:00
Dmitry Kovalev	70d12c3a75	Merge "Renaming refresh_entropy_probs to refresh_frame_context." into experimental	2013-04-30 10:21:24 -07:00
Dmitry Kovalev	51a73fbba2	Merge "Consistent names for quant-related functions and variables." into experimental	2013-04-30 10:19:48 -07:00
Jingning Han	7492edac93	Merge "Separate I4X4_PRED coding from macroblock modules" into experimental	2013-04-29 21:51:59 -07:00
Jingning Han	94191b5c82	Separate I4X4_PRED coding from macroblock modules Separate the functionality of I4X4_PRED from decode_mb. Use decode_atom_intra instead, to enable recursive partition of superblock down to 8x8. Change-Id: Ifc89a3be82225398954169d0a839abdbbfd8ca3b	2013-04-29 18:59:36 -07:00
Yaowu Xu	0b48548eeb	Merge "fixed new intra code for rectanglar blocks" into experimental	2013-04-29 16:16:27 -07:00
Dmitry Kovalev	ee97da2c03	Cleaning up encoder segmentation code. Moving code from vp9_pack_bitstream to new function encode_segmentation. Change-Id: I1f1e59a1f038618ad95162b7db4b6f8164850ea8	2013-04-29 16:07:17 -07:00
Yaowu Xu	caea860a0f	Merge "Enabled i4x4 to use right above pixels" into experimental	2013-04-29 16:05:19 -07:00
Yaowu Xu	7ea12f2c5f	Merge "Use same intra prediction for all block size" into experimental	2013-04-29 16:02:35 -07:00
Yaowu Xu	4747c6ed90	fixed new intra code for rectanglar blocks Also fixed two minor subtle boundary conditions in intra prediction code, and replaced memcpy/memset with vpx_ prefixed version. Change-Id: I9cddff3be831228b628f1f2f065a61feacbcbee6	2013-04-29 16:02:00 -07:00
Yaowu Xu	e388251d5d	Enabled i4x4 to use right above pixels Change-Id: I7442b4600b6812bed13e655ccf68f9ea56cc83a2	2013-04-29 15:16:59 -07:00
Yaowu Xu	3d655805f2	Use same intra prediction for all block size The commmit changed to use same intra prediction function for all block sizes. Some details on the changes: 1. All directional modes except DC/TM/V/H now have built-in filtering for all pixels with filter taps either (1, 2, 1)/4 or (1, 1)/2. 2. Above edge get automatic extended to double width (bw*2), which makes a lot of the prediciton mode computation simpler. 3. Same intra prediction function is called with different size for i4x4_pred and all other larger size. Overall, the change helped keyframe only coding for both cif size and std-hd size test sets by .5% consistently on all encodings. For normal coding with single/auto key frame, the change now also is consistently net positive for all encodings. The overall gains is about .15% on std-hd set. Change-Id: I01ceb31fbc73d49776262e6bdc06853b03bbd1d1	2013-04-29 15:15:30 -07:00
Ronald S. Bultje	f5ad774814	Merge "Change above/left_context to use an 8x8 basis." into experimental	2013-04-29 11:28:25 -07:00
Ronald S. Bultje	2dbaa4f4f4	Change above/left_context to use an 8x8 basis. Output changes slightly because of a minor bug in (at least) the sb32x16 block2above tx16x16 tables that previously existed in vp9_blockd.c. Change-Id: I624af28ac200a8322d64454cf05c79e9502968cc	2013-04-29 10:37:25 -07:00
Deb Mukherjee	040eeed9d0	Turning model based reverse update on for coefs Turns model based reverse updates on for coefficients in an effort to reduce the memory requirement for counters. With this patch the counters needed will be reduced by about 75% since only 3 counts are needed instead of 12. The impact in performance is: derf300: -0.252% stdhd250: -0.046% However retraining should alleviate some of the drop in performance. Change-Id: I6f2b3e13f6d5520aa3400b0b228fb5e8b4a43caa	2013-04-29 10:09:57 -07:00
Paul Wilkins	fb3e4ed9eb	Merge "Minor tweak to implicit segmentation experiment." into experimental	2013-04-27 11:58:13 -07:00
Ronald S. Bultje	7f8cbda333	Merge "Grow MODE_INFO array to use an 8x8 basis." into experimental	2013-04-26 14:46:50 -07:00
Dmitry Kovalev	9713a68719	Renaming refresh_entropy_probs to refresh_frame_context. Change-Id: I5429c02246d198eb1b6aadbc3313b26bf3436062	2013-04-26 14:39:58 -07:00
Johann	9e23bd5df5	Merge "Merge branch 'master' into experimental" into experimental	2013-04-26 13:35:28 -07:00
Johann	32a5c52856	Merge branch 'master' into experimental Conflicts: vp9/common/vp9_findnearmv.c vp9/common/vp9_rtcd_defs.sh vp9/decoder/vp9_decodframe.c vp9/decoder/x86/vp9_dequantize_sse2.c vp9/encoder/vp9_rdopt.c vp9/vp9_common.mk Resolve file name changes in favor of master. Resolve rdopt changes in favor of experimental, preserving the newer experiments. Change-Id: If51ed8f457470281c7b20a5c1a2f4ce2cf76c20f	2013-04-26 12:57:10 -07:00
Dmitry Kovalev	5a5a1f25a8	Consistent names for quant-related functions and variables. Change-Id: I3a6d601e90e8740b9c26dd0afbfe9d467b75d367	2013-04-26 12:30:20 -07:00
Ronald S. Bultje	1a46b30ebe	Grow MODE_INFO array to use an 8x8 basis. Change-Id: I087e08e7909a406b71715b8525c104208daa6889	2013-04-26 11:57:17 -07:00
John Koleszar	bb41ab4a0c	Remove BLOCKD structure All members can be referenced from their per-plane counterparts, and removes assumptions about 24 blocks per macroblock. Change-Id: I7ff2fa72d22c29163eb558981c8193765a8113d9	2013-04-26 10:35:54 -07:00
John Koleszar	4f55c5618a	Remove destination pointers from BLOCKD Access these members from MACROBLOCKD instead. Change-Id: I7907230dd473ff12ebe182b9280d8b7f12a888c4	2013-04-26 10:14:07 -07:00
Scott LaVarnway	57f180b388	Removed bmi from blockd This originally was "Removed update_blockd_bmi()". Now, this patch removed bmi from blockd and uses the bmi found in mode_info_context. Eliminates unnecessary bmi copies between blockd and mode_info_context. Change-Id: I287a4972974bb363f49e528daa9b2a2293f4bc76	2013-04-26 10:19:43 -04:00
Ronald S. Bultje	8d028402d7	Remove implicit assumption that mode_info_stride == mb_cols + 1. Change-Id: I3030d7adac73109aeaa1ecc0f78ac968c092d9aa	2013-04-25 14:21:01 -07:00
Ronald S. Bultje	c849eaca59	Use b_width/height_log2 instead of mb_ where appropriate. Basic assumption: when talking about transform units, use b_; when talking about macroblock indices, use mb_. Change-Id: Ifd163f595d4924ff892de4eb0401ccd56dc81884	2013-04-25 14:20:59 -07:00
John Koleszar	a99e1aa8ca	Remove predictor pointers from BLOCKD Access these members from MACROBLOCKD instead. Change-Id: I2574622e577bb9feede47f6b7ccbb11f3e928ca8	2013-04-25 12:04:07 -07:00
John Koleszar	6c0c6b86c1	Remove diff from BLOCKD The underlying storage for these buffers is in the per-plane MACROBLOCKD area, so read it from there directly. Change-Id: Id6bd835117fdd9dea07db95ad06eff9f12afaaf7	2013-04-25 11:57:22 -07:00
John Koleszar	15255eef82	Move dequant from BLOCKD to per-plane MACROBLOCKD This data can vary per-plane, but not per-block. Change-Id: I1971b0b2c2e697d2118e38b54ef446e52f63c65a	2013-04-25 11:57:20 -07:00
John Koleszar	4bd0f4f646	Remove BLOCK structure All members can be referenced from their per-plane counterparts, and removes assumptions about 24 blocks per macroblock. Change-Id: I593fb0715e74cd84b48facd1c9b18c3ae1185d4b	2013-04-25 11:33:17 -07:00
Johann	c5b127afea	Rename vp9_idct_x86.c Remove similarly named header file. It is obsolete. Move file to match naming style. Adjust make file to include the file correctly and remove extra unnecessary #if guard. Change-Id: Ifba07ba9938a5df08a9f4eda54a3ac4d6983f7bf	2013-04-25 11:13:02 -07:00
Dmitry Kovalev	61a47da869	Adding is_inter_mode function. Change-Id: I2d32d46002cb92c63050c2b8328865c406103621	2013-04-25 10:23:00 -07:00
Dmitry Kovalev	2cf0675a52	Merge "Removing unused mi_mv_pred_row and mi_mv_pred_col functions." into experimental	2013-04-25 10:18:07 -07:00
Dmitry Kovalev	9b081e6807	Merge "Using ROUND_POWER_OF_TWO macro inside vp9_loopfilter_filters.c." into experimental	2013-04-25 10:17:54 -07:00
Dmitry Kovalev	22c6ce03fa	Merge "Handling frame references and scale factors in one for loop." into experimental	2013-04-25 10:17:34 -07:00
Jingning Han	b42b41c856	Merge "Move sbsegment out of experimental list" into experimental	2013-04-25 09:18:01 -07:00
Scott LaVarnway	a426c7f343	Merge "Moved dequantization into the token decoder" into experimental	2013-04-25 08:53:42 -07:00
Dmitry Kovalev	994c79cccf	Handling frame references and scale factors in one for loop. Using ALLOWED_REFS_PER_FRAME constants instead of hard coded 3, replacing memcpy with plain struct assignment. Change-Id: Ibc86f5d175fcb3f3a3eddacf593525370f1f854c	2013-04-24 17:20:53 -07:00
Yaowu Xu	bcf82cf503	Merge two similar functions into one Function set_mb_row() and set_mb_col() do similar work and are always called together, this commit merged them into a single function for clarity and easy maintainence. This was a TODO item. Change-Id: I956bd9ed6afb8b2b0469b20fd8bc893b26f8a0f3	2013-04-24 15:58:03 -07:00
Dmitry Kovalev	a2d46434b5	Merge "Fixing PRED_SWITCHABLE_INTERP case in vp9_get_pred_context function." into experimental	2013-04-24 15:44:39 -07:00
Jingning Han	b0e3b3df18	Move sbsegment out of experimental list Move rectangular superblock coding out of experimental list. Change-Id: I96c37547d122330d666a67b4bf577ae54547857f	2013-04-24 15:19:17 -07:00
Jingning Han	ff2b8aa2c9	Contextual entropy coding of partition syntax This commit enables selecting probability models for recursive block partition information syntax, depending on its above/left partition information, as well as the current block size. These conditional probability models are reasonably stationary and consistent across frames, hence the backward adaptive approach is used to maintain and update the contextual models. It achieves coding performance gains (on top of enabling rectangular block sizes): derf: 0.242% yt: 0.391% hd: 0.376% stdhd: 0.645% Change-Id: Ie513d9673337f0d27abd65fb566b711d0844ec2e	2013-04-24 14:23:14 -07:00
Dmitry Kovalev	2e3f3e4fbb	Using ROUND_POWER_OF_TWO macro inside vp9_loopfilter_filters.c. Change-Id: Icb671cd011f645a3361684207840d14330ca7488	2013-04-24 11:50:49 -07:00
Ronald S. Bultje	41a8a95bd1	Merge "Change chroma loopfilter to skip inner SB edges for tx16x16 also." into experimental	2013-04-24 11:45:26 -07:00
Ronald S. Bultje	4149ba1cbf	Merge "Minor indent changes in loopfilter code." into experimental	2013-04-24 11:45:19 -07:00
Ronald S. Bultje	cc7ce53140	Merge "Add basic building blocks for 8x8 superblocks experiment." into experimental	2013-04-24 11:45:13 -07:00
Dmitry Kovalev	bd994ed42d	Fixing PRED_SWITCHABLE_INTERP case in vp9_get_pred_context function. Adding xd->up_available as additional check for above context. Change-Id: If5654e4cae184b9c369b7b2e08076cb2951d00ed	2013-04-24 10:45:32 -07:00
Ronald S. Bultje	5b57580cd9	Change chroma loopfilter to skip inner SB edges for tx16x16 also. Change-Id: I6ea9e110b5c5b07ab7d092886dbd51a6eccc0217	2013-04-24 09:45:43 -07:00
Paul Wilkins	6579720e6a	Merge "Extension of segmentation to 8 segments." into experimental	2013-04-24 09:44:48 -07:00
Paul Wilkins	e307e924b5	Merge "Simplify Segment Coding" into experimental	2013-04-24 09:43:01 -07:00
Deb Mukherjee	0e905e03f2	Merge "Fix in token allocation with zerogroup expt" into experimental	2013-04-24 08:57:25 -07:00
Paul Wilkins	da04312f79	Minor tweak to implicit segmentation experiment. This minor tweak makes segment 0 neutral and used by key frames and also extends beyond 4 segments. Change-Id: Ife4744602aba66ac9432746db3113cc5cd88a482	2013-04-24 16:43:01 +01:00
Paul Wilkins	31ee193a9c	Extension of segmentation to 8 segments. Also some further simplification following removal of top node code. There is an issue in regards to the shared file vp8cx.h in regard to the roi_map as this interface assumes that there are only 4 segments. I have left the value here as 4 for now meaning that the roi_map interface is broken for VP9. Note that this change would have been easier if I hadn't had to search for hard wire instances of the number 4 and <= 3. Change-Id: Ia8b6deea4be4dbd20deb1656e689dd43a5f190e8	2013-04-24 16:36:47 +01:00
Paul Wilkins	c77aff1286	Simplify Segment Coding Remove top node optimization. The improvement this gives is not sufficient to justify the extra complexity. Change-Id: I2bb4a12a50ffd52cacfa4a3e8acbb2e522066905	2013-04-24 10:47:12 +01:00
Paul Wilkins	27bb4777cd	Simple implicit segmentation experiment. Change-Id: Iaef16122732c2a81e0927f9862b51b68dc788712	2013-04-24 10:04:27 +01:00
Dmitry Kovalev	156d912025	Merge "Code cleanup inside vp9_get_pred_context function." into experimental	2013-04-23 18:03:29 -07:00
Dmitry Kovalev	97ac785e65	Merge "Simple cleanup inside vp9_decodframe.c and vp9_entropymode.c." into experimental	2013-04-23 18:02:46 -07:00
Dmitry Kovalev	afeff1acd1	Merge "Removing redundant code in vp9_entropymode.c." into experimental	2013-04-23 18:01:37 -07:00
Jingning Han	adbbd26517	Merge "Enable rectangular support for comp inter-intra" into experimental	2013-04-23 17:16:16 -07:00
Dmitry Kovalev	9828a33ebb	Removing unused mi_mv_pred_row and mi_mv_pred_col functions. Change-Id: If8ba37bf0b86e8dea88c27d911e8ddb0f6d5a3c5	2013-04-23 16:34:22 -07:00
John Koleszar	4f35e3e1c1	Merge "Move src_diff to per-plane MACROBLOCK data" into experimental	2013-04-23 16:24:08 -07:00
Dmitry Kovalev	de7c25c9f0	Code cleanup inside vp9_get_pred_context function. Change-Id: Id06b7a299a26ed944a401faae51907537f722a7e	2013-04-23 16:18:09 -07:00
Dmitry Kovalev	d811558f63	Removing redundant code in vp9_entropymode.c. Change-Id: Ia7266b8d3aa3d5cff2db0c3b2f014def045759af	2013-04-23 15:56:27 -07:00
Dmitry Kovalev	144f49c6aa	Simple cleanup inside vp9_decodframe.c and vp9_entropymode.c. Change-Id: I62dde981f5201c5fbc22001609ee4b5fd0a9bdf5	2013-04-23 15:50:56 -07:00
Jingning Han	a26c1edbb4	Enable rectangular support for comp inter-intra This commit enables rectangular block prediction of compound inter-intra mode. It combines the mb/sb32/sb64 prediction functions into a unified version with configurable block width and height. This fixes the enc/dec mismatch of the codebase when comp-interintra-pred is enabled. Change-Id: I1d0db2f1f184007802df04fcd12b9dadb3189ff0	2013-04-23 15:39:19 -07:00
Ronald S. Bultje	276a1106e6	Merge changes I54acef34,I72d42971 into experimental * changes: Make some sb_type comparisons independent of literal enum values. Make loopfilter aware of rectangular blocks.	2013-04-23 15:29:19 -07:00
Dmitry Kovalev	d0d1094a05	Merge "Adding get_scan_{4x4, 8x8, 16x16} functions." into experimental	2013-04-23 12:44:51 -07:00
Johann	7af58d4338	Resolve declaration and implementation. Clean Windows build warnings: warning C4028: formal parameter <N> different from declaration This was fixed independently in master and experimental but the fixes were in opposite directions. One added const to the declaration and the other removed it from the implementation. Also update the variable names. This doesn't modify the data so call it ref, matching the functions in the vicinity, rather than dst. Change-Id: I2ffc6b4a874cb98c26487b909d20a5e099b5582c	2013-04-23 12:42:31 -07:00
Ronald S. Bultje	e0fc9201fe	Minor indent changes in loopfilter code. Change-Id: I0cdc951558e4d7748f63df8c03b1c9dce086acb0	2013-04-23 12:37:05 -07:00
Ronald S. Bultje	94297863bf	Add basic building blocks for 8x8 superblocks experiment. Change-Id: I274a1d2e461e6ffdb106bac4ad6951692ace314e	2013-04-23 12:34:37 -07:00
Ronald S. Bultje	5ba98ebcf1	Make some sb_type comparisons independent of literal enum values. Change-Id: I54acef342b8e787e05af0febd7cf0d7d10288383	2013-04-23 12:34:32 -07:00
Ronald S. Bultje	0db636619f	Make loopfilter aware of rectangular blocks. Also use explicitely named enum values in sb_type comparisons, rather than relying on absolute integer values, because enum values may change in the future. Change-Id: I72d42971a98157af93413a25ac2c7e6f9b369cec	2013-04-23 12:34:32 -07:00
John Koleszar	cbd1315ac4	Move src_diff to per-plane MACROBLOCK data First in a series of commits making certain MACROBLOCK members addressable per-plane. This commit also refactors the block subtraction functions vp9_subtract_b, vp9_subtract_sby_c, etc to be loops-over-planes and variable subsampling aware. Change-Id: I371d092b914ae0a495dfd852ea1a3d2467be6ec3	2013-04-23 12:18:51 -07:00
Ronald S. Bultje	c4cae4cd5d	Remove unused corner_predictor and log2_minus_1 functions. Change-Id: Ic659544ca12b1bc191b93ddfa378964bd133bfc9	2013-04-23 11:19:39 -07:00
Scott LaVarnway	e3167b8c23	Merge "Eliminated prev_mip memsets/memcpys" into experimental	2013-04-23 10:22:13 -07:00
Deb Mukherjee	64e7f017ce	Fix in token allocation with zerogroup expt Fix to increase allocation of tokens at very high rates. Change-Id: Ia27aa0316b0fab664230800f9c9947b5c68ecd58	2013-04-23 08:51:31 -07:00
Deb Mukherjee	735febf1ce	Removing the implicit compound inter experiment Removing this experiment for now, since it has been broken with the latest code changes. Change-Id: I1be2181b56de490fcb577f5905b5e147a8ed82d8	2013-04-22 16:46:54 -07:00
Scott LaVarnway	e732bc298c	Moved dequantization into the token decoder Mostly for cleanup purposes. Now we should be able to rework the encoder/decoder to use a common idct/add function. Change-Id: I1597cc59812f362ecec0a3493b6101a6cc6fa7ff	2013-04-22 17:53:07 -04:00
Dmitry Kovalev	5de7e16ca2	Adding get_scan_{4x4, 8x8, 16x16} functions. Change-Id: Id4306ef6d65d4a3984aed50b775bdf48d4f6c438	2013-04-22 14:08:41 -07:00
John Koleszar	01e41a531b	Remove vp9_recon_intra_mbuv Use common vp9_recon_sbuv instead. Change-Id: I146f79adfdfda2b52257a52fa783727f12afa246	2013-04-22 12:05:24 -07:00
John Koleszar	c2c15e8eb3	Rewrite vp9_recon_sb* Rewrite vp9_recon_sb{,y,uv} to be a loop over planes. Change-Id: Ica2bbbb3105a1d29b2ff2ead07b76cde9683154c	2013-04-22 12:05:24 -07:00
John Koleszar	a443447b8b	Move pre, second_pre to per-plane MACROBLOCKD data Continue moving framebuffers to per-plane data. Change-Id: I237e5a998b364c4ec20316e7249206c0bff8631a	2013-04-22 12:05:24 -07:00
Deb Mukherjee	f12509f640	Merge "Removes the code_nonzerocount experiment" into experimental	2013-04-22 11:53:14 -07:00
Deb Mukherjee	0aa79be7d5	Removes the code_nonzerocount experiment This patch does not seem to give any benefits. Change-Id: I9d2b4091d6af3dfc0875f24db86c01e2de57f8db	2013-04-22 10:58:49 -07:00
Deb Mukherjee	6ce718eb18	Merge "End of orientation zero group experiment" into experimental	2013-04-22 10:33:12 -07:00
Deb Mukherjee	70d9f116fd	End of orientation zero group experiment Adds an experiment that codes an end-of-orientation symbol for every eligible zero encountered in scan order. This cleans out various other sub-experiments that were part of the origiinal patch, which will be later included if found useful. Results are slightly positive on all sets (0.1 - 0.2% range). Change-Id: I57765c605fefc7fb9d1b57f1b356843602abefaf	2013-04-22 09:27:59 -07:00
John Koleszar	6d5ac8f2e1	reconinter: remove unnecessary functions, params Removes the redundant dst pointers from vp9_build_inter_predictors_sb{y,uv} and the remaining mb specific functions. Change-Id: I7b6bf439d9394b85ea79b4fe61a3ffc1025720da	2013-04-22 08:20:54 -07:00
Paul Wilkins	f82c61b886	Merge "make DC_PRED for i4x4 to use real pixels only" into experimental	2013-04-22 05:10:36 -07:00
Dmitry Kovalev	c7a38f77ef	Merge "Removing get_segment_id function and using existing vp9_get_pred_mb_segid." into experimental	2013-04-20 11:05:50 -07:00
Dmitry Kovalev	5c632dbb19	Merge "Renaming vp9_extra_bit_struct to vp9_extra_bit." into experimental	2013-04-20 11:03:25 -07:00
John Koleszar	fa8ddbd2a6	Merge "Move dst to per-plane MACROBLOCKD data" into experimental	2013-04-19 16:33:45 -07:00
John Koleszar	588c3cb02e	Merge "Remove vp9_recon_mb{,y}" into experimental	2013-04-19 16:33:09 -07:00
Yaowu Xu	e3465a63d7	make DC_PRED for i4x4 to use real pixels only Wherever there are real pixels available before falling back to use assumed values 127 and 129. This also make DC_PRED for i4x4 consistent with DC_PRED for larger blocks. Change-Id: I54372924826118da023f402c802ac6ce0caa70c3	2013-04-19 16:22:07 -07:00
John Koleszar	95c6c13ce6	Merge "Remove redundant pointers from void vp9_recon_sb{y,uv}" into experimental	2013-04-19 16:17:42 -07:00
John Koleszar	d12376aa2c	Move dst to per-plane MACROBLOCKD data First in a series of commits moving the framebuffers pointers to per-plane data, so that they can be indexed numerically rather than by name. Change-Id: I6e0d60fd4d51e6375c384eb7321776564df21775	2013-04-19 16:16:10 -07:00
Scott LaVarnway	9662531d77	Eliminated prev_mip memsets/memcpys For 1080 material, this buffer is currently 2,270,928 bytes. This patch swaps ptrs instead of copying and uses the last show_frame flag instead of setting the entire buffer to zero. For the test clip used, the decoder improved by up to 1%. Change-Id: I686825712ad56043e09ada9808dc489f875a6ce0	2013-04-19 18:38:10 -04:00
Dmitry Kovalev	c09f652590	Removing get_segment_id function and using existing vp9_get_pred_mb_segid. Change-Id: Iff35d4b2f8f65511f80c594958c01fb4673fa033	2013-04-19 14:25:32 -07:00
Paul Wilkins	fb754fd37e	Merge "Mv ref candidates cut to 2." into experimental	2013-04-19 14:09:44 -07:00
John Koleszar	9ec0f658a1	Remove vp9_recon_mb{,y} Use the common sb functions instead. Change-Id: I4fa0a8ee3c6ada56271dd09bf895b97642f55858	2013-04-19 12:12:00 -07:00
John Koleszar	d747986d29	Remove redundant pointers from void vp9_recon_sb{y,uv} Remove the unnecessary _s_ from their names, and add a new vp9_recon_sb() that calls the y and uv variants. Change-Id: I7ffaa5ff5605a8472cac2a53de8cf889353039a6	2013-04-19 12:06:07 -07:00
Dmitry Kovalev	684ddc61ea	Renaming vp9_extra_bit_struct to vp9_extra_bit. Change-Id: Ie4713da125e954c1d30e1d4cbeb38666fce90ccc	2013-04-19 11:14:33 -07:00
John Koleszar	17313c408f	Move diff to MACROBLOCKD per-plane data. Change-Id: Ic27af09e38af8317ac4743241883d577a44f1490	2013-04-19 11:11:54 -07:00
John Koleszar	0053b46d51	make build_inter_predictors block size agnostic (split) All build_inter_predictors can now be serviced by the same inner function. Change-Id: I40b08bee8f047286db4b1aad9dcae37b879c3f2a	2013-04-19 10:29:42 -07:00
John Koleszar	e0df9b213d	Removing rounding from UV MV calculation for SPLITMV Similar to the prior change that removed the rounding from non-SPLITMV modes. Improves quality by a similar amount (Additional +0.087% on derf) Change-Id: I39d80b4a3037a3aa7e285eb2320346ddaf646f52	2013-04-19 10:23:26 -07:00
John Koleszar	48b2e43470	Merge "make buid_inter_predictors block size agnostic (chroma)" into experimental	2013-04-19 10:23:04 -07:00
John Koleszar	6e5d2ac54c	Merge "Use SSSE3 for 2d filters larger than 16" into experimental	2013-04-19 10:22:54 -07:00
John Koleszar	5b8a7d6e25	Use SSSE3 for 2d filters larger than 16 The C code was being used as a fallback for the >16 case, but only for 2D. Change-Id: I1e2e6da9e4b28bd88bde9ba4dd32724ce466cf6f	2013-04-19 09:51:16 -07:00
Paul Wilkins	de80da39dc	Mv ref candidates cut to 2. Further simplification of mvref search to return only the top two candidates. Distance weights removed as the test order reflects distance anyway. Change-Id: I0518cab7280258fec2058670add4f853fab7b855	2013-04-19 16:13:53 +01:00
Paul Wilkins	aa76bf3d28	Removal of CONFIG_NEW_MVREF experiment. This experiment has failed to give much benefit but does add complexity so deprecated. Change-Id: Ic7b929ba706390b9907ef0b4f965bd401ca799a4	2013-04-19 11:54:02 +01:00
Paul Wilkins	92e8a3f514	Simplification of MVref search. As we are no longer able to sort the candidate mvrefs in both encoder and decode and given that the cost of explicit signalling has proved prohibitive, it no longer makes sense to find more than 2 candidates. This patch: Modifies and simplifies add_candidate_mv() Removes the forced addition of a 0 vector in the MAX_MV_REF_CANDIDATES-1 position (in preparation to reducing MAX_MV_REF_CANDIDATES to 2). Re-orders the addition of candidates slightly. This actually gives small gains (circa 0.2% on std-hd) A subsequent patch will remove NEW_MVREF experiment, reduce MAX_MV_REF_CANDIDATES to 2 and remove distance weights as these are implicit now in the order. Change-Id: I3dbe1a6f8a1a18b3c108257069c22a1141a207a4	2013-04-19 11:19:59 +01:00
John Koleszar	fc49a377d7	make buid_inter_predictors block size agnostic (chroma) Updates to make non-SPLITMV inter predictors work for all plane types. Change-Id: I25dbef40b7ffcac30254b43eed1e22fc732378ae	2013-04-18 17:50:22 -07:00
John Koleszar	2987fa1dc1	Removing rounding from UV MV calculation Consider the previous behavior for the MV 1 3/8 (11/8 pel). In the existing code, the fractional part of the MV is considered separately, and rounded is applied, giving a result of 6/8. Rounding is not required in this case, as we're increasing the precision from a q3 to a q4, and the correct value 11/16 can be represented exactly. Slight gain observed (+.033 average on derf) Change-Id: I320e160e8b12f1dd66aa0ce7966b5088870fe9f8	2013-04-18 17:47:17 -07:00
John Koleszar	4924934d2b	make buid_inter_predictors block size agnostic (luma) This commit converts the luma versions of vp9_build_inter_predictors_sb to use a common function. Update the convolution functions to support block sizes larger than 16x16, and add a foreach_predicted_block walker. Next step will be to calculate the UV motion vector and implement SBUV, then fold in vp9_build_inter16x16_predictors_mb and SPLITMV. At the 16x16, 32x32, and 64x64 levels implemented in this commit, each plane is predicted with only a single call to vp9_build_inter_predictor. This is not yet called for SPLITMV. If the notion of SPLITMV/I8X8/I4X4 goes away, then the prediction block walker can go away, since we'll always predict the whole bsize in a single step. Implemented using a block walker at this stage for SPLITMV, as a 4x4 "prediction block size" within the BLOCK_SIZE_MB16X16 macroblock. It would also support other rectangular sizes too, if the blocks smaller than 16x16 remain implemented as a SPLITMV-like thing. Just using 4x4 for now. There's also a potential to combine with the foreach_transformed_block walker if the logic for calculating the size of the subsampled transform is made more straightforward, perhaps as a consequence of supporing smaller macroblocks than 16x16. Will watch what happens there. Change-Id: Iddd9973398542216601b630c628b9b7fdee33fe2	2013-04-18 17:42:55 -07:00
Dmitry Kovalev	b27edc67d2	Merge "Code cleanup inside findnearmv code." into experimental	2013-04-18 15:29:44 -07:00
Jingning Han	f0b065e946	Merge "Make the use of pred buffers consistent in MB/SB" into experimental	2013-04-18 15:24:55 -07:00
Dmitry Kovalev	19e9714572	Code cleanup inside findnearmv code. Using predefined clamp function, removing redundant variables, declare and init on the same line. Change-Id: I14636eb242194bac33f8a9d4a273a416d32856fc	2013-04-18 15:07:36 -07:00
Jingning Han	6f43ff5824	Make the use of pred buffers consistent in MB/SB Use in-place buffers (dst of MACROBLOCKD) for macroblock prediction. This makes the macroblock buffer handling consistent with those of superblock. Remove predictor buffer MACROBLOCKD. Change-Id: Id1bcd898961097b1e6230c10f0130753a59fc6df	2013-04-18 14:59:36 -07:00
Dmitry Kovalev	8726752cb6	Merge "Adding DEFAULT_PRED_PROB_{0, 1, 2} constants." into experimental	2013-04-18 14:39:14 -07:00
Dmitry Kovalev	3fe7b64722	Merge "Motion vector decoding code cleanup." into experimental	2013-04-18 14:38:38 -07:00
Dmitry Kovalev	bef4e474e7	Merge "Changing argument type of vp9_get_mv_joint from MV to MV*." into experimental	2013-04-18 14:27:44 -07:00
John Koleszar	66c0d1100b	Merge "convolve: support larger blocks, fix asm saturation bug" into experimental	2013-04-18 14:27:16 -07:00
Dmitry Kovalev	a8d903e539	Merge "Replacing VP9_COMBINEENTROPYCONTEXTS macro with function." into experimental	2013-04-18 14:26:34 -07:00
Dmitry Kovalev	8b20aa2337	Merge "Renaming y1dc_delta_q, uvdc_delta_q, uvac_delta_q fields from VP9Common." into experimental	2013-04-18 14:26:06 -07:00
John Koleszar	a9ebbcc338	convolve: support larger blocks, fix asm saturation bug Updates the common convoloution code to support blocks larger than 16x16, and rectangular blocks. This uncovered a bug in the SSSE3 filtering routines due to the order of application of saturation. This commit fixes that bug, adjusts the unit test to bias its random values towards the extremes, and adds a test to ensure that all filters conform to the expected pairwise addition structure. Change-Id: I81f69668b1de0de5a8ed43f0643845641525c8f0	2013-04-18 13:57:59 -07:00
Dmitry Kovalev	eae38910ce	Motion vector decoding code cleanup. Change-Id: I9790baedbd4acb7113575efc6f228b2656c42ff7	2013-04-18 11:05:34 -07:00
John Koleszar	38f6232118	Merge "Use BLOCK_SIZE_TYPE in foreach_ walker" into experimental	2013-04-17 21:02:58 -07:00
Ronald S. Bultje	d49df319ab	Merge "Fix edge bug in recent merge of 64x64 and 32x32 inter predictors." into experimental	2013-04-17 16:30:42 -07:00
Ronald S. Bultje	d63826ac12	Fix edge bug in recent merge of 64x64 and 32x32 inter predictors. Change-Id: I83aa188d414922db19cccb210c4001c02d5a404c	2013-04-17 16:12:02 -07:00
John Koleszar	ff3f93639c	Use BLOCK_SIZE_TYPE in foreach_ walker Change-Id: I655305c9e22bdd9abc893d3c40d4bc6616aa1d35	2013-04-17 15:08:37 -07:00
Yaowu Xu	acfc5981c3	Merge "clean out experiments" into experimental	2013-04-17 14:53:00 -07:00
Yaowu Xu	c8606a241f	Merge "make lf_deltas dependent on filter_lvl" into experimental	2013-04-17 14:51:55 -07:00
Ronald S. Bultje	1cf31428ff	Merge "Remove unused file vp9_context.c." into experimental	2013-04-17 13:49:48 -07:00
Ronald S. Bultje	0a20625bd8	Remove unused file vp9_context.c. Change-Id: Id268ccaf1aefee6a3ed3e31486d4370f1c25e8cb	2013-04-17 13:40:31 -07:00
Dmitry Kovalev	ecff8d71ab	Adding DEFAULT_PRED_PROB_{0, 1, 2} constants. Also using ALLOWED_REFS_PER_FRAME instead of 3. Change-Id: I810dd8521d8138edb9dbd78edede49b62f706554	2013-04-17 11:45:35 -07:00
Ronald S. Bultje	88192546cf	Merge "Remove BLOCK_SIZE_LG2." into experimental	2013-04-17 11:22:44 -07:00
Ronald S. Bultje	0bb49c4e30	Merge "Add SSE2 versions for rectangular sad and sad4d functions." into experimental	2013-04-17 11:22:32 -07:00
Dmitry Kovalev	0db175ffed	Changing argument type of vp9_get_mv_joint from MV to MV*. Change-Id: I28c3026946fc1bde7074e6e0198da93bb0d75dfe	2013-04-17 11:21:28 -07:00
Yaowu Xu	642ac924ab	Merge "replace hev_thr_lut[][] with simpler logic" into experimental	2013-04-17 11:08:36 -07:00
Yaowu Xu	421ad3f1b1	clean out experiments that are related to using reconstructed pixel for selecting reference motion vectors. Change-Id: I048dfae39ca7385e344b57d46347ecc6e753e1bb	2013-04-17 11:00:46 -07:00
Ronald S. Bultje	213fe85da3	Remove BLOCK_SIZE_LG2. It is unused. Change-Id: Ied3269ffacf9b6303bc9d85f996384c3575ef812	2013-04-17 11:00:30 -07:00
Yaowu Xu	888d0c82da	make lf_deltas dependent on filter_lvl Change-Id: Idb0d11e3ae9afabe667a9f327bf4d3aa84f63649	2013-04-17 10:59:48 -07:00
Yaowu Xu	0d310de97b	replace hev_thr_lut[][] with simpler logic Using filter_level/16 instead. Change-Id: I73a7e83a785d6aa6f9b5d22cf66e22f0a39ed078	2013-04-17 10:54:30 -07:00
Ronald S. Bultje	c17c440233	Merge "Fairly basic integration of rectangular blocks in encoding RD loop." into experimental	2013-04-17 10:46:45 -07:00
Ronald S. Bultje	0c481f4d18	Add SSE2 versions for rectangular sad and sad4d functions. About 11% overall encoder speedup with the sbsegment experiment enabled. Change-Id: Iffb1bdba6932d9f11a6c791cda8697ccf9327183	2013-04-17 10:31:59 -07:00
Yaowu Xu	cb3192b72c	Change to do LPF in SB64 order Change-Id: I41b3f5932ecd6256e8207369ad19aa81e7987be1	2013-04-17 10:15:02 -07:00
Ronald S. Bultje	e693472236	Fairly basic integration of rectangular blocks in encoding RD loop. Adds RD integration for 32x16, 16x32, 64x32 and 32x64 rectangular blocks. Derf almost +0.6%, HD a little over +1.0%, STDHD +1.3%. Change-Id: Id651fdb6a655fdbb5c47009757e63317acfb88a5	2013-04-17 09:25:06 -07:00
Jingning Han	90a91cc683	Recursive partition syntax coding Enable recursive partition information coding from SB64X64 down to MB16X16. The bit-stream syntax is now supporting rectangular block sizes. It starts from SB64X64 and recursively describes the partition type of the current block. If the partition type is PARTITION_NONE, the block is coded as a single unit; if it is PARTITION_HORZ or PARTITION_VERT, the block is segmented into two independently coded rectangular units, with no further partition needed; otherwise, the block is segmented into 4 square blocks. i.e., PARTITION_SPLIT case, each can be potentially further partitioned. Forward adaptive probability modeling is used for the partition information coding, conditioned on the current block size. Change-Id: I499365fb547839d555498e3bcc0387d8a3587d87	2013-04-16 18:41:26 -07:00
Ronald S. Bultje	c0a1b5bc7e	Merge "Slightly hackish workaround to support rectangles in directional intra predictors." into experimental	2013-04-16 17:05:20 -07:00
Christian Duvivier	5b6d33f9af	Faster vp9_short_fdct4x4 and vp9_short_fdct8x4. Scalar path is about 1.3x faster (2.1% overall encoder speedup). SSE2 path is about 5.0x faster (8.4% overall encoder speedup). Change-Id: I360d167b5ad6f387bba00406129323e2fe6e7dda	2013-04-16 16:38:30 -07:00
Christian Duvivier	f13b69d07c	Faster vp9_short_fdct4x4 and vp9_short_fdct8x4. Scalar path is about 1.3x faster (2.1% overall encoder speedup). SSE2 path is about 5.0x faster (8.4% overall encoder speedup). Change-Id: I360d167b5ad6f387bba00406129323e2fe6e7dda	2013-04-16 16:11:56 -07:00
Dmitry Kovalev	9087d6d470	Replacing VP9_COMBINEENTROPYCONTEXTS macro with function. Change-Id: I3bbc31840af69481e1d9bb4427c9ee25abf82946	2013-04-16 15:30:28 -07:00
Dmitry Kovalev	1ad7c1f250	Renaming y1dc_delta_q, uvdc_delta_q, uvac_delta_q fields from VP9Common. New names are y_dc_delta_q, uv_dc_delta_q, uv_ac_delta_q. Change-Id: I4acae1fc23a4697ce2c5a5becb8dc28ef0a4b552	2013-04-16 15:05:52 -07:00
Ronald S. Bultje	94996b9d26	Slightly hackish workaround to support rectangles in directional intra predictors. Change-Id: I8a4da6925f2d58a426c4d122df8b97bb69452e49	2013-04-16 14:33:03 -07:00
John Koleszar	e3cfe4e89e	Remove the mb_no_coeff_skip flag This flag was added to VP8 to allow a mode where MB-level skipping was not allowed, saving a bit per mb. It was never used in practice, and hasn't been tested in VP9, so remove it. Change-Id: Id450ec6904c6d06c1919508e7efc52d05cde5631	2013-04-16 12:36:16 -07:00
Dmitry Kovalev	5953a98631	Merge "Code cleanup inside vp9_reconintra4x4.c file." into experimental	2013-04-16 10:24:32 -07:00
Dmitry Kovalev	b30182c733	Merge "Adding mv_joint_vertical and mv_joint_horizontal functions." into experimental	2013-04-16 10:24:01 -07:00
Yunqing Wang	e87c7f0930	Merge "Optimize the scaling calculation" into experimental	2013-04-16 09:14:22 -07:00
Scott LaVarnway	466f395148	Merge "Removing extra params from x_add_residual() functions" into experimental	2013-04-16 08:58:28 -07:00
Yunqing Wang	148eb803bb	Optimize the scaling calculation In decoder, the scaling calculation, such as (mv * x_num / x_den), is fairly time-consuming. In this patch, we check if the scaling happens or not at frame level, and then decide which function to call to skip scaling calculation when no scaling is needed. Tests showed a 3% decoder performance gain. Change-Id: I270901dd0331048e50368cfd51ce273dd82b8733	2013-04-16 08:52:40 -07:00
Scott LaVarnway	6f95d53e37	Removing extra params from x_add_residual() functions Now that the predictor is the dest, we do not need the extra parameters. Change-Id: I31e2c3d2015f4a1cd12e7f04536d8db478582a0a	2013-04-16 09:59:01 -04:00
John Koleszar	4054ff5da5	Merge "Removing TRUE and FALSE macro definitions." into experimental	2013-04-16 06:55:13 -07:00
John Koleszar	7f7d1357a2	Merge branch 'experimental' into master VP9 preview bitstream 2, commit '868ecb55a1528ca3f19286e7d1551572bf89b642' Conflicts: vp9/vp9_common.mk Change-Id: I3f0f6e692c987ff24f98ceafbb86cb9cf64ad8d3	2013-04-16 06:49:46 -07:00
Scott LaVarnway	5393379c84	Merge "Removing extra params in dequant functions" into experimental	2013-04-16 06:37:00 -07:00
Dmitry Kovalev	a0d9309eab	Removing TRUE and FALSE macro definitions. Using regular 0 and 1 constants now. Change-Id: Ie763503cbb727847cc8f1d6506cd6f2ee607f056	2013-04-15 15:24:39 -07:00
Ronald S. Bultje	f7d43d21bd	Merge "Add rectangular block size variance/sad functions." into experimental	2013-04-15 14:20:25 -07:00
Jingning Han	aaf33d7df5	Add rectangular block size variance/sad functions. With this, the RD loop properly supports rectangular blocks. Change-Id: Iece79048fb4e84741ee1ada982da129a7bf00470	2013-04-15 13:39:07 -07:00
Dmitry Kovalev	fd61b7ea10	Adding mv_joint_vertical and mv_joint_horizontal functions. Change-Id: Ieaec2c48f3752b8558ba051caaf4ba2ab0e9e84d	2013-04-15 12:07:26 -07:00
Dmitry Kovalev	64de375e1f	Code cleanup inside vp9_reconintra4x4.c file. Using ROUND_POWER_OF_TWO macro, using array initialization syntax for less code. Change-Id: I661453a6b29a9046fcff0a3f18fccb452b5eb39d	2013-04-15 11:15:56 -07:00
Scott LaVarnway	74610b1ae4	Removing extra params in dequant functions Now that the predictor is the dest, we do not need the extra parameters. Change-Id: I78db73d39b5aff62f15303f3d51ad2797eae74b6	2013-04-15 13:43:11 -04:00
Yaowu Xu	757e138a3b	Merge "Reorder enum i4X4 predcition modes" into experimental	2013-04-15 10:37:37 -07:00
Adrian Grange	4ee671a15c	Merge "Initial addition of multiple ARF frames" into experimental	2013-04-15 09:46:16 -07:00
Adrian Grange	c2876cf0fd	Initial addition of multiple ARF frames This is work-in-progress, it implements multiple ARF encoding behind an experimental flag. It adds the ability to insert multiple ARF frames into a single ARF group. This patch implements the reordering of the coded frames, and implements a fixed-length coding pattern. It applies a fixed quantizer strategy based on where the frame is in the coding sequence. Further work to modify the rate control strategy is ongoing and will be submitted via a set of future patches. In this first step, each ARF group is recursively bisected and an ARF frame added at that position in the sequence. The recursion continues until ARF frames are within MIN_GF_INTERVAL frames. The code sits behind the "multiple-arf" experimental flag ("CONFIG_MULTIPLE_ARF"). The experimental flag "oneshotq" ("CONFIG_ONESHOTQ") also needs to be enabled for this patch to work correctly. Change-Id: Ie473b05ebb43ac473c0cfb659b2b8042823085e2	2013-04-15 09:11:39 -07:00
Dmitry Kovalev	8ae091823d	Merge "Encoder code cleanup." into experimental	2013-04-14 10:58:44 -07:00
Dmitry Kovalev	ee9ce0e7d7	Merge "Intra code cleanup." into experimental	2013-04-14 04:34:16 -07:00
Dmitry Kovalev	399a6cbcde	Merge "Renaming vp9_token_struct to vp9_token and removing previous typedef." into experimental	2013-04-14 04:31:39 -07:00
Dmitry Kovalev	78ddf964cd	Intra code cleanup. Removing redundant code. Change-Id: I71bfc40a1fb06d8e3149ed5400aa4dfd87a51aac	2013-04-12 16:53:04 -07:00
Jingning Han	3ba9dd4165	Enable inter predictor for rectangular block size Combine superblock inter predictors into a unified function that allows configurable block width and height. The inter predictions of block sizes smaller than 16x16 are handled differently. To be continued on merging them later. Change-Id: I14075959dd5e221f00c205c99ca35c1c31ef728e	2013-04-12 11:51:58 -07:00
Yaowu Xu	c2ad69bcf4	Reorder enum i4X4 predcition modes To match the order of directional intra prediction modes for larger blocks, also renamed the i4x4 prediction modes to mirror the larger variants. Change-Id: I77cea4d0add6c7758460bf9c7a2fe59aca601f0b	2013-04-12 10:13:23 -07:00
Yaowu Xu	7de5edd14a	Rename B_PRED to I4X4_PRED So it is consistent with I8x8_PRED. Change-Id: Iefa65124b2419690d83e526c611129c0ede29d11	2013-04-12 09:23:58 -07:00
Jingning Han	815e95fbeb	Make intra predictor support rectangular blocks The intra predictor supports configurable block sizes. It can handle intra prediction down to 4x4 sizes, when enabled in BLOCK_SIZE_TYPE. Change-Id: I7399ec2512393aa98aadda9813ca0c83e19af854	2013-04-11 16:45:57 -07:00
John Koleszar	2f19cd03aa	Merge "Remove unused vp9_recon_mb{y,uv}_s" into experimental	2013-04-11 15:51:20 -07:00
Scott LaVarnway	cff266bbef	Merge "WIP: removing predictor buffer usage from decoder" into experimental	2013-04-11 15:24:33 -07:00
Ronald S. Bultje	56d01ee0a6	Merge "Remove unused macroblock versions of reconstruction functions." into experimental	2013-04-11 15:19:08 -07:00
Deb Mukherjee	7a97959f13	Merge "Turning model-based updates on with modelcoefprob" into experimental	2013-04-11 14:54:53 -07:00
Deb Mukherjee	66f413af4f	Turning model-based updates on with modelcoefprob This patch changes the default with the modecoefprob expt to use mode-based forward updates with one-node pegged modeling. The maximum difference with fully trained tables is now less that 0.1%. Change-Id: I06b44322e10c6703f93f3c1d48d973b1136a0618	2013-04-11 14:45:26 -07:00
John Koleszar	4ba74ae81a	Merge "Remove unused vp9 ppc files" into experimental	2013-04-11 14:39:18 -07:00
John Koleszar	c382ed09f8	Remove unused vp9_recon_mb{y,uv}_s These functions now are handled through the common superblock code. Change-Id: Ib6688971bae297896dcec42fae1d3c79af7a611c	2013-04-11 14:05:59 -07:00
Scott LaVarnway	6189f2bcb1	WIP: removing predictor buffer usage from decoder This patch will use the dest buffer instead of the predictor buffer. This will allow us in future commits to remove the extra mem copy that occurs in the dequant functions when eob == 0. We should also be able to remove extra params that are passed into the dequant functions. Change-Id: I7241bc1ab797a430418b1f3a95b5476db7455f6a	2013-04-11 13:55:18 -07:00
John Koleszar	8bf6de725c	Merge changes I6721e42f,Iaffb1ae8 into experimental * changes: tokenize: convert skippable functions Add foreach_transformed_block	2013-04-11 13:36:25 -07:00
John Koleszar	633d9e7b4f	Remove unused vp9 ppc files Change-Id: I3fe8c529ddec658cfa2376cfc05d9c8a5366e978	2013-04-11 13:29:37 -07:00
Dmitry Kovalev	24f18e1c34	Renaming vp9_token_struct to vp9_token and removing previous typedef. Change-Id: If69c3d795f87af5cc7bfdfe70ef733c41b4d55c8	2013-04-11 13:01:52 -07:00
John Koleszar	c2bd46bf45	tokenize: convert skippable functions Use the common block walker to calculate skippability. Change-Id: I6721e42f065df237426c91c1d871ec226ba7cdcb	2013-04-11 12:27:37 -07:00
Ronald S. Bultje	13e41ba440	Remove unused macroblock versions of reconstruction functions. More specifically, remove vp9_quantize_mb, vp9_optimize_mb, vp9_inverse_transform_mb* and vp9_transform_mb. Instead, use the generic _sb functions that take a size argument, and call them with BLOCK_SIZE_MB16X16. Change-Id: I33024afea95d3a23ffbc1df7da426e4645110f29	2013-04-11 12:27:15 -07:00
John Koleszar	42471f6b72	Add foreach_transformed_block Adds a framework for doing arbitrary functions on each transform- sized block in the mb/sb. Change-Id: Iaffb1ae8db5ff2abfa8720c608c78376b42f2096	2013-04-11 11:42:19 -07:00
John Koleszar	c18b2617a4	Remove vp9_reset_mb_tokens_context Use sb-common version instead. Change-Id: If2552b5a39fd2e5272f66a41c5667dda85fd3939	2013-04-11 11:39:19 -07:00
Dmitry Kovalev	ec299e2092	Encoder code cleanup. Removing duplicated code from vp9_encodemv.c and reusing ROUND_POWER_OF_TWO macro definitions. Change-Id: I9caf0c17f761ada7905cb99a3e2a31f871fef0f9	2013-04-11 11:08:00 -07:00
Ronald S. Bultje	8fb5be48a6	Make usage of sb_type independent of literal values. Change-Id: I0d12f9ef9d960df0172a1377f8e5236eb6d90492	2013-04-10 17:38:57 -07:00
Ronald S. Bultje	b4f6098ef7	Make RD superblock mode search size-agnostic. Merge various super_block_yrd and super_block_uvrd versions into one common function that works for all sizes. Make transform size selection size-agnostic also. This fixes a slight bug in the intra UV superblock code where it used the wrong transform size for txsz > 8x8, and stores the txsz selection for superblocks properly (instead of forgetting it). Lastly, it removes the trellis search that was done for 16x16 intra predictors, since trellis is relatively expensive and should thus only be done after RD mode selection. Gives basically identical results on derf (+0.009%). Change-Id: If4485c6f0a0fe4038b3172f7a238477c35a6f8d3	2013-04-10 16:50:30 -07:00
Yaowu Xu	8e9819230d	Merge "Remove obselete code" into experimental	2013-04-10 14:56:28 -07:00
Yaowu Xu	2da90fddc2	Remove obselete code The strategy to run fast loop filter picking for encoder speed-up should be revisited at a later stage. Change-Id: I3b75e06d767cff41be952a42e63b3292f4eab996	2013-04-10 13:45:22 -07:00
Dmitry Kovalev	0cef7234e1	Merge "Fixing upper case names." into experimental	2013-04-10 13:29:38 -07:00
Dmitry Kovalev	20645ec4fb	Merge "Cleanup of set_offsets function." into experimental	2013-04-10 10:15:13 -07:00
Ronald S. Bultje	1932828d19	Merge "Make SB coding size-independent." into experimental	2013-04-10 08:51:58 -07:00
Ronald S. Bultje	9b46e30494	Merge "Don't use BLOCKD in vp9_invtrans.c." into experimental	2013-04-09 21:36:09 -07:00
Ronald S. Bultje	a3874850dd	Make SB coding size-independent. Merge sb32x32 and sb64x64 functions; allow for rectangular sizes. Code gives identical encoder results before and after. There are a few macros for rectangular block sizes under the sbsegment experiment; this experiment is not yet functional and should not yet be used. Change-Id: I71f93b5d2a1596e99a6f01f29c3f0a456694d728	2013-04-09 21:28:27 -07:00
John Koleszar	a3ec4cbd33	Merge "detokenize: use consistent structure for all block sizes" into experimental	2013-04-09 14:18:59 -07:00
Dmitry Kovalev	c34f6fcb54	Fixing upper case names. Renaming Y1dequant to y_dequant, UVdequant to uv_dequant, QIndex to qindex. Change-Id: I1c356e5f886deb3f8807dc212de9799b55b09d58	2013-04-09 10:46:57 -07:00
Dmitry Kovalev	df76a617b4	Cleanup of set_offsets function. Adding ALLOWED_REFS_PER_FRAME constant instead of hard coded number 3. Change-Id: I46146aa837896936f920c748c7d4aa4c27f026e4	2013-04-09 10:17:22 -07:00
Jingning Han	b3935e8348	Merge "Clamp inferred motion vectors only" into experimental	2013-04-09 09:24:08 -07:00
John Koleszar	e6deea4e60	detokenize: use consistent structure for all block sizes Restructure the code to avoid the majority of per-block-size switches, code duplication, etc. All block types (mb/sb32/sb64) can be handled by the same code. Change-Id: I4022718d66e31a15a7074e43f3b98cd0a5124ea7	2013-04-08 13:11:40 -07:00
Ronald S. Bultje	f42bee7edf	Don't use BLOCKD in vp9_invtrans.c. Change-Id: I40524170334109e2864b06e3c73c8b34e5aa8b0f	2013-04-08 11:37:29 -07:00
Jingning Han	12bf0796e6	Clamp inferred motion vectors only Clamp only the motion vectors inferred from neighboring reference macroblocks. The motion vectors obtained through motion search in NEWMV mode are constrained during the search process, which allows a relatively larger referencing region than the inferred mvs. Hence further clamping the best mv provided by the motion search may affect the efficacy of NEWMV mode. Synchronized the decoding process. The decoded mvs in NEWMV modes should be guaranteed to fit in the effective range. Put a mv range clamping function there for security purpose. This improves the coding performance of high motion sequences, e.g., derf set: foreman 0.233% husky 0.175% icd 0.135% mother_daughter 0.337% pamphlet 0.561% stdhd set: blue_sky 0.408% city 0.455% also saw sunflower goes down by -0.469%. Change-Id: I3fcbba669e56dab779857a8126a91b926e899cb5	2013-04-08 11:37:03 -07:00
Ronald S. Bultje	aeefa6e194	Fix typo which breaks 4x4 splitmv compound prediction RD code. 0.15% quality increase on derf, particularly noticeable on hard clips at the higher bitrate end. Change-Id: I02415a96eb9bbc361cba923069625fae71844bc9	2013-04-08 09:17:52 -07:00
John Koleszar	0e7b7e47c2	Merge "Small cleanup inside setup_loopfilter function." into experimental	2013-04-05 16:13:46 -07:00
John Koleszar	8bbabbea70	Merge "Segmentation code cleanup." into experimental	2013-04-05 16:03:25 -07:00
John Koleszar	fa135d7b9e	Merge changes Ibbfa68d6,Idb76a0e2 into experimental * changes: Move EOB to per-plane data Move qcoeff, dqcoeff from BLOCKD to per-plane data	2013-04-05 15:56:50 -07:00
Ronald S. Bultje	36c3a67c20	Remove full-pixel-related code. This is a VP8-only feature (part of profile 3) that is unsupported in VP9. Change-Id: I78016eede8d9c834d44d4c517f3e8b8fc2a378b1	2013-04-05 12:50:19 -07:00
Dmitry Kovalev	421baef49e	Small cleanup inside setup_loopfilter function. Change-Id: If7fa8aea02f26c2c2bb5daf4e65c3e661d7031ca	2013-04-05 12:48:48 -07:00
Ronald S. Bultje	61834f7325	Remove some unused macros. Change-Id: Ic219e7878428128e4bb1b3995e8151f92b6bd9c3	2013-04-05 12:40:56 -07:00
Ronald S. Bultje	0732a61c37	Remove struct POS. It is never used. Change-Id: If7462357c0498ed05af2645f0c272124381d3aab	2013-04-05 12:38:40 -07:00
Ronald S. Bultje	1cb34c32ed	Remove unused vpx_log() function prototype. Change-Id: Icd6b4322841fefcc86f06645e6aaf1ea42fdfabd	2013-04-05 12:37:45 -07:00
Ronald S. Bultje	5cd235c6cd	Remove "tx_type" member from union b_mode_info. It is never used. Change-Id: Ibae898c52c766aabf65868611060f9c38fb85b35	2013-04-05 12:36:15 -07:00
Dmitry Kovalev	2c42499513	Segmentation code cleanup. Cleaning up the code, removing unused vp9_check_segref_inter function and useless comments. Change-Id: Ia0e1a3878dc0f9789cba84aeb507a83d9dccd26b	2013-04-05 11:55:52 -07:00
John Koleszar	05a79f2fbf	Move EOB to per-plane data Continue migrating data from BLOCKD/MACROBLOCKD to the per-plane structures. Change-Id: Ibbfa68d6da438d32dcbe8df68245ee28b0a2fa2c	2013-04-04 21:30:23 -07:00
John Koleszar	4c05a051ab	Move qcoeff, dqcoeff from BLOCKD to per-plane data Start grouping data per-plane, as part of refactoring to support additional planes, and chroma planes with other-than 4:2:0 subsampling. Change-Id: Idb76a0e23ab239180c818025bae1f36f1608bb23	2013-04-04 16:30:57 -07:00
John Koleszar	4d9dbb2ae8	Merge "Reimplementation of setup_frame_size." into experimental	2013-04-03 21:04:29 -07:00
Dmitry Kovalev	d5a017300c	General code cleanup. Making code more readable in different places. Change-Id: Iea92c9a35e64d257ee358879fc04fc926843d52e	2013-04-03 18:40:17 -07:00
Yunqing Wang	dcd3a5c055	Merge "Modify vp9_setup_interp_filters function" into experimental	2013-04-03 14:09:01 -07:00
Yunqing Wang	4ca882f32f	Modify vp9_setup_interp_filters function Took vp9_setup_scale_factors_for_frame() out from vp9_setup_interp_filters(), so that it is only called once per frame instead of per macroblock. Decoder tests showed a 1.5% performance gain. Change-Id: I770cb09eb2140ab85132f82aed388ac0bdd3a0aa	2013-04-03 13:49:55 -07:00
Dmitry Kovalev	da0232fd59	Reimplementation of setup_frame_size. General code cleanup in loopfilter code. Modification of setup_frame_size, so now VP9_COMMON is modified in one place after all width/height checks passed. Change-Id: Iedf32df43a912d7aae788ed276ac6c429973f6fe	2013-04-03 12:21:47 -07:00
John Koleszar	30d83c4159	Merge "Fix overlapping writes by copy_and_extend_plane" into experimental	2013-04-03 11:54:29 -07:00
John Koleszar	8b71b8a6de	Merge "Renaming sb32_coded and sb64_coded fields." into experimental	2013-04-02 21:49:03 -07:00
John Koleszar	dc12e6c0dc	Merge "Lower case names for struct members." into experimental	2013-04-02 21:27:32 -07:00
Dmitry Kovalev	dca8ad178c	Renaming sb32_coded and sb64_coded fields. Renaming sb32_coded to prob_sb32_coded and sb64_coded to prob_sb64_coded. Change-Id: I6de5cad00a57c3e066d53467f8c38cb6073dce11	2013-04-02 18:21:55 -07:00
John Koleszar	01247f67a7	Fix overlapping writes by copy_and_extend_plane Broken by refactoring commit `180cd5faa5` Change-Id: I307f6e54d93219a31e7336f1633103ecb25e4832	2013-04-02 14:58:10 -07:00
John Koleszar	42db454c7f	Merge branch 'master' into experimental Conflicts: vp9/vp9_common.mk Change-Id: I2cd5ab47dc31c4210cefc23a282102123d5e2221	2013-04-02 14:54:44 -07:00
Dmitry Kovalev	626635c271	Lower case names for struct members. Lower case member names inside VP9D_CONFIG and VP9D_COMP structs. Change-Id: I75af9ad2d929a35c357207a3fd9ebedddabf79c3	2013-04-02 13:34:20 -07:00
Johann	3db60c8c6c	Demux vp9_loopfilter_x86.c Allow more careful targeting of compiler flags. Change-Id: I963ab4a6479dedb165419310dfca52a58a9877b8	2013-04-02 12:49:04 -07:00
Johann	6c147b9d93	vp9_sadmxn_x86 only contains SSE2 functions Rename the file and clean up includes. In the future we would like to pattern match the files which need additional compiler flags. Change-Id: I2c76256467f392a78dd4ccc71e6e0a580e158e56	2013-04-02 11:20:55 -07:00
John Koleszar	49bc402a94	Merge "Code cleanup." into experimental	2013-04-01 21:12:56 -07:00
John Koleszar	a417a6e32c	Merge "Removing redundant function arguments." into experimental	2013-04-01 21:09:48 -07:00
Dmitry Kovalev	e71248addc	Code cleanup in block reconstruction code. Adding recon, recond_sby and recon_sbuv functions. Change-Id: I6050db233e792e73a3699d18b056eaef9c901d6d	2013-04-01 18:26:58 -07:00
Dmitry Kovalev	50e54c112d	Code cleanup. Adding multiple16 function, removing redundant code, better formatting. Change-Id: I50195b78ac8ab803e3d05c8fb05a7ca134fab386	2013-04-01 18:23:04 -07:00
Deb Mukherjee	e3955007df	Merge "Framework changes in nzc to allow more flexibility" into experimental	2013-03-29 15:57:27 -07:00
John Koleszar	edb1222acb	Merge "Extracting common motion vector prediction code." into experimental	2013-03-29 10:43:38 -07:00
John Koleszar	2e181c2d0b	Merge "General code cleanup." into experimental	2013-03-29 10:40:34 -07:00
Yaowu Xu	4b3e59ef0e	Merge "define a specific neighborhood for SB64 mv search" into experimental	2013-03-29 09:26:14 -07:00
Yaowu Xu	cbc7ec55a5	Merge "remove code not in use" into experimental	2013-03-29 08:40:29 -07:00
Deb Mukherjee	c5840a8d8e	Merge "Reoptimizing the interpolation filters" into experimental	2013-03-29 07:15:05 -07:00
Ronald S. Bultje	6cb2fcf601	Merge "Fix mix-up in pt token indexing." into experimental	2013-03-28 12:53:00 -07:00
Deb Mukherjee	fe9b5143ba	Framework changes in nzc to allow more flexibility The patch adds the flexibility to use standard EOB based coding on smaller block sizes and nzc based coding on larger blocksizes. The tx-sizes that use nzc based coding and those that use EOB based coding are controlled by a function get_nzc_used(). By default, this function uses nzc based coding for 16x16 and 32x32 transform blocks, which seem to bridge the performance gap substantially. All sets are now lower by 0.5% to 0.7%, as opposed to ~1.8% before. Change-Id: I06abed3df57b52d241ea1f51b0d571c71e38fd0b	2013-03-28 09:33:50 -07:00
Ronald S. Bultje	9eea9fa206	Fix mix-up in pt token indexing. This fixes uninitialized reads in the trellis, and probably makes the trellis do something again. Change-Id: Ifac8dae9aa77574bde0954a71d4571c5c556df3c	2013-03-28 09:24:29 -07:00
Yaowu Xu	48104f0dfa	define a specific neighborhood for SB64 mv search Change-Id: Ifda91d697c5970c65ce3ec1feac5562124f91782	2013-03-27 16:34:45 -07:00
Dmitry Kovalev	17cddb4e26	Removing redundant function arguments. Almost all arguments for vp9_build_inter32x32_predictors_sb and vp9_build_inter64x64_predictors_sb can be deduced from the first macroblock argument. Change-Id: I5d477a607586d05698d5b3b9b9bc03891dd3fe83	2013-03-27 16:19:27 -07:00
Dmitry Kovalev	52ccff4719	Extracting common motion vector prediction code. Adding b_mv_pred_row and b_mv_pred_col functions, updating mi_mv_pred_row and mi_mv_pred_row functions. Change-Id: I9af068442d4474478375943cc6fce1605d6fc0a5	2013-03-27 14:35:36 -07:00
Dmitry Kovalev	180cd5faa5	General code cleanup. Removing redundant code, lower case variable names, better indentation, better parameter names, adding const to readonly parameters. Change-Id: Ibfdee00f60316fdc5b3f024028c7aaa76a627483	2013-03-27 14:22:30 -07:00
John Koleszar	9ba8aed179	Merge "Extract setup_frame_size and update_frame_context functions." into experimental	2013-03-27 14:21:57 -07:00
Dmitry Kovalev	8c69c193b5	Extract setup_frame_size and update_frame_context functions. Extracting setup_frame_size and update_frame_context functions. Introducing vp9_read_prob function as shortcut for (vp9_prob)vp9_read_literal(r, 8). Change-Id: Ia5c68fd725b2d1b9c5eb20f69cacb62361b5a3dd	2013-03-27 14:04:35 -07:00
Yunqing Wang	c6c0657c60	Modify idct code to use macro Small modification of idct code. Change-Id: I5c4e3223944c68e4ccf762f6cf07c990250e4290	2013-03-27 12:36:08 -07:00
Yunqing Wang	0e91bec4b5	Merge "Optimize 32x32 idct function" into experimental	2013-03-27 11:30:48 -07:00
Yunqing Wang	21a718d9a7	Optimize 32x32 idct function Wrote sse2 version of vp9_short_idct_32x32 function. Compared to c version, the sse2 version is 5X faster. Change-Id: I071ab7378358346ab4d9c6e2980f713c3c209864	2013-03-27 11:05:42 -07:00
Ronald S. Bultje	513157e093	Scatter-based scantables. This gains about 0.2% on derf, 0.1% on hd and 0.4% on stdhd. I can put this under an experimental flag if wanted, just trying to get my patch queue in shape. Change-Id: Ibe1a30fe0e0b07bec4802e0f3ff0ba22e505f576	2013-03-27 09:44:45 -07:00
Ronald S. Bultje	7c70145914	Merge "Add col/row-based coefficient scanning patterns for 1D 8x8/16x16 ADSTs." into experimental	2013-03-26 19:17:08 -07:00
Ronald S. Bultje	3c77ab4c0f	Merge "Redo banding for all transforms." into experimental	2013-03-26 19:16:44 -07:00
Ronald S. Bultje	c6efbbcfe4	Merge "Use above/left (instead of previous in scan-order) as token context." into experimental	2013-03-26 19:16:24 -07:00
Deb Mukherjee	23144d2345	Implicit weighted prediction experiment Adds an experiment to use a weighted prediction of two INTER predictors, where the weight is one of (1/4, 3/4), (3/8, 5/8), (1/2, 1/2), (5/8, 3/8) or (3/4, 1/4), and is chosen implicitly based on consistency of the predictors to the already reconstructed pixels to the top and left of the current macroblock or superblock. Currently the weighting is not applied to SPLITMV modes, which default to the usual (1/2, 1/2) weighting. However the code is in place controlled by a macro. The same weighting is used for Y and UV components, where the weight is derived from analyzing the Y component only. Results (over compound inter-intra experiment) derf: +0.18% yt: +0.34% hd: +0.49% stdhd: +0.23% The experiment suggests bigger benefit for explicitly signaled weights. Change-Id: I5438539ff4485c5752874cd1eb078ff14bf5235a	2013-03-26 16:58:56 -07:00
Ronald S. Bultje	d9094d8fd3	Add col/row-based coefficient scanning patterns for 1D 8x8/16x16 ADSTs. These are mostly just for experimental purposes. I saw small gains (in the 0.1% range) when playing with this on derf. Change-Id: Ib21eed477bbb46bddcd73b21c5c708a5b46abedc	2013-03-26 16:46:13 -07:00
Ronald S. Bultje	3120dbddb1	Redo banding for all transforms. Now that the first AC coefficient in both directions use the same DC as their context, there no longer is a purpose in letting both have their own band. Merging these two bands allows us to split bands for some of the very high-frequency AC bands. In addition, I'm redoing the banding for the 1D-ADST col/row scans. I don't think the old banding made any sense at all (it merged the last coefficient of the first row/col in the same band as the first two of the second row/col), which was clearly an oversight from the band being applied in scan-order (rather than in their actual position). Now, coefficients at the same position will be in the same band, regardless what scan order is used. I think this makes most sense for the purpose of banding, which is basically "predict energy for this coefficient depending on the energy of context coefficients" (i.e. pt). After full re-training, together with previous patch, derf gains about 1.2-1.3%, and hd/stdhd gain about 0.9-1.0%. Change-Id: I7a0cc12ba724e88b278034113cb4adaaebf87e0c	2013-03-26 16:46:13 -07:00
Ronald S. Bultje	790fb13215	Use above/left (instead of previous in scan-order) as token context. Pearson correlation for above or left is significantly higher than for previous-in-scan-order (absolute values depend on position in scan, but in general, we gain about 0.1-0.2 by using either above or left; using both basically just makes this even better). For eob branch skipping, we continue to use the previous token in scan order. This helps about 0.9% on derf after re-training on a limited data set. Full re-training and results on larger-resolution clips are pending. Note that this commit breaks trellis, so we can probably get further gains out of it by fixing trellis at some later point. Change-Id: Iead68e296fc3a105cca746b5e3da9555d6010cfe	2013-03-26 16:46:09 -07:00
Deb Mukherjee	57c97e2a5b	Reoptimizing the interpolation filters Reoptimizes the 8-tap smooth filter. Results: derf: +0.101% yt: +0.157% hd: +0.791% stdhd: +0.264% The next step will be to reoptimize the other two filters. Change-Id: I3d256a510ad9c7c30c33fae4a70fb43dfc708ed0	2013-03-26 16:34:35 -07:00
Yaowu Xu	43df87e841	remove code not in use Change-Id: I4fa46f10e82aca36c563f7ea829e5a3177a0c740	2013-03-26 15:27:35 -07:00
Dmitry Kovalev	d7209b3a0a	Cleaning up loopfilter code. Lower case variable names, removing redundant variables, declaration and initialization on the same line. Change-Id: Ie0c6c95b14103990eb6a9d7784f8259c662e1251	2013-03-26 11:09:58 -07:00
John Koleszar	8e1c368486	Merge "Add an in-loop deringing experiment" into experimental	2013-03-26 08:36:55 -07:00
John Koleszar	7d9a7fb297	Merge "Code cleanup." into experimental	2013-03-26 08:34:06 -07:00
John Koleszar	f0923f3b01	Merge "Code cleanup." into experimental	2013-03-26 08:30:46 -07:00
John Koleszar	441e2eab1b	Add an in-loop deringing experiment Adds a per-frame, strength adjustable, in loop deringing filter. Uses the existing vp9_post_proc_down_and_across 5 tap thresholded blur code, with a brute force search for the threshold. Results almost strictly positive on the YT HD set, either having no effect or helping PSNR in the range of 1-3% (overall average 0.8%). Results more mixed for the CIF set, (-0.5 min, 1.4 max, 0.1 avg). This has an almost strictly negative impact to SSIM, so examining a different filter or a more balanced search heuristic is in order. Other test set results pending. Change-Id: I5ca6ee8fe292dfa3f2eab7f65332423fa1710b58	2013-03-26 08:23:24 -07:00
Deb Mukherjee	49dcc71493	Merge "Modeling default coef probs with distribution" into experimental	2013-03-26 07:13:13 -07:00
Deb Mukherjee	fd18d5dffe	Modeling default coef probs with distribution Replaces the default tables for single coefficient magnitudes with those obtained from an appropriate distribution. The EOB node is left unchanged. The model is represeted as a 256-size codebook where the index corresponds to the probability of the Zero or the One node. Two variations are implemented corresponding to whether the Zero node or the One-node is used as the peg. The main advantage is that the default prob tables will become considerably smaller and manageable. Besides there is substantially less risk of over-fitting for a training set. Various distributions are tried and the one that gives the best results is the family of Generalized Gaussian distributions with shape parameter 0.75. The results are within about 0.2% of fully trained tables for the Zero peg variant, and within 0.1% of the One peg variant. The forward updates are optionally (controlled by a macro) model-based, i.e. restricted to only convey probabilities from the codebook. Backward updates can also be optionally (controlled by another macro) model-based, but is turned off by default. Currently model-based forward updates work about the same as unconstrained updates, but there is a drop in performance with backward-updates being model based. The model based approach also allows the probabilities for the key frames to be adjusted from the defaults based on the base_qindex of the frame. Currently the adjustment function is a placeholder that adjusts the prob of EOB and Zero node from the nominal one at higher quality (lower qindex) or lower quality (higher qindex) ends of the range. The rest of the probabilities are then derived based on the model from the adjusted prob of zero. Change-Id: Iae050f3cbcc6d8b3f204e8dc395ae47b3b2192c9	2013-03-25 23:43:38 -07:00
Dmitry Kovalev	3644a5b632	Code cleanup. Fixing function arguments alignment, reusing MIN/MAX and clamp functions. Change-Id: I87dd5a40ffb65b521b8abbf0fccf2f50552c5309	2013-03-25 15:16:14 -07:00
Dmitry Kovalev	7cc14e598e	Code cleanup. Lower case variable names, code simplification by using already defined clamp and read_le16 functions. Change-Id: I8fd544365bd8d1daed86d7b2ae0843e4ef80df08	2013-03-25 14:24:26 -07:00
Yunqing Wang	f68350ca98	Merge "Optimize 16x16 idct10 function" into experimental	2013-03-22 11:17:32 -07:00
Paul Wilkins	52abaeca85	Merge "Remove TX size segment feature" into experimental	2013-03-22 10:39:22 -07:00
Yunqing Wang	869d6c0534	Optimize 16x16 idct10 function Wrote sse2 version of vp9_short_idct10_16x16 function. Compared to c version, the sse2 version is 2.3X faster. Change-Id: I314c4f09369648721798321eeed6f58e38857f26	2013-03-21 16:36:01 -07:00
Yunqing Wang	8a3233b54d	Merge "Optimize 16x16 idct function" into experimental	2013-03-21 11:54:20 -07:00
Yunqing Wang	ec3100661c	Optimize 16x16 idct function Wrote sse2 version of vp9_short_idct16x16 function. Compared to c version, the sse2 version is over 2.5X faster. Change-Id: I38536e2b846427a2cc5c5423aaf305fd0e605d61	2013-03-21 11:44:05 -07:00
Dmitry Kovalev	56f3a2c663	Code cleanup: lower case variable names. Renaming Width to width, Height to height and Version to version in several structs and function signatures. Change-Id: I084c3f7e747cb2ce3345aff27a3dff9b13a87543	2013-03-20 16:41:30 -07:00
Paul Wilkins	1c75e77b6d	Remove TX size segment feature Change-Id: I0d226e4cb240caced37230f46905bf69b46e0cce	2013-03-19 17:31:08 +00:00
Yunqing Wang	6344c84c82	Optimize 8x8 idct function Wrote sse2 functions of vp9_short_idct8x8 and vp9_short_idct10_8x8. Compared to c version, the sse2 version is 2X faster. The decoder test didn't show noticeable gain since 8x8 idct doesn't take much of decoding time (less than 1% in my test). Change-Id: I56313e18cd481700b3b52c4eda5ca204ca6365f3	2013-03-18 15:34:14 -07:00
John Koleszar	8a3f55f2d4	Replace scaling byte with explicit display size If the intended display size is different than the size the frame is coded at, then send that size explicitly in the bitstream. Adds a new bit to the frame header to indicate whether the extra size fields are present. Change-Id: I525c66f22d207efaf1e5f903c6a2a91b80245854	2013-03-18 12:02:20 -07:00
John Koleszar	c5b317057b	Merge "Fix pulsing issue with scaling" into experimental	2013-03-18 11:57:36 -07:00
John Koleszar	e5d7542447	Merge "Add VP9_GET_REFERENCE control" into experimental	2013-03-18 11:57:31 -07:00
Yaowu Xu	d29f5435df	Merge "put refmvselection under experiment" into experimental	2013-03-18 08:51:33 -07:00
Yaowu Xu	12ade55719	Merge "removed reference to "LLM" and "x8"" into experimental	2013-03-18 08:51:19 -07:00
Deb Mukherjee	bf7387f6b7	Merge "Context-pred fix to not use top/left on edges" into experimental	2013-03-16 19:09:25 -07:00
Deb Mukherjee	b1921b2f08	Context-pred fix to not use top/left on edges This fix resolves some of the mismatch issues being seen recently. While this is the right thing to do when tiling is used for this experiment, it is not the underlying cause of the the mismatches. Something else is causing writing outside of the allowable frame area in the encoder leading to this mismatch. Change-Id: If52c6f67555aa18ab8762865384e323b47237277	2013-03-16 09:26:52 -07:00
Christian Duvivier	4418b790a7	Faster vp9_short_fdct16x16. Scalar path is about 1.5x faster (3.1% overall encoder speedup). SSE2 path is about 7.2x faster (7.8% overall encoder speedup). Change-Id: I06da5ad0cdae2488431eabf002b0d898d66d8289	2013-03-15 15:55:31 -07:00
Yaowu Xu	5d9ba7938e	Merge "Remove leftover reference to 2nd order dc/ac quant" into experimental	2013-03-14 19:05:11 -07:00
Yaowu Xu	f4d2ad6915	Remove leftover reference to 2nd order dc/ac quant Change-Id: Ib8dacf1d2797743569771b8f699e40e1aeb085cb	2013-03-14 10:46:15 -07:00
John Koleszar	9b7be88883	Fix pulsing issue with scaling Updates the YV12_BUFFER_CONFIG structure to be crop-aware. The exiting width/height parameters are left unchanged, storing the width and height algined to a 16 byte boundary. The cropped dimensions are added as new fields. This fixes a nasty visual pulse when switching between scaled and unscaled frame dimensions due to a mismatch between the scaling ratio and the 16-byte aligned sizes. Change-Id: Id4a3f6aea6b9b9ae38bdfa1b87b7eb2cfcdd57b6	2013-03-13 19:10:10 -07:00
John Koleszar	b3c350a1a9	Add VP9_GET_REFERENCE control This is like VP8_COPY_REFERENCE, but returns a pointer to the reference frame rather than a copy of it. This is useful when the application doesn't know what the size of the reference is, as is the case when scaling is in effect. Change-Id: I63667109f65510364d0e397ebe56217140772085	2013-03-13 19:08:06 -07:00
Jingning Han	76c12ab9c9	Support +/-2048 motion vector coding Enable entropy coding of motion vectors up to +/-2048. Also extend the motion search range accordingly. Change-Id: Iac2bb015e8934521cef83a19edbe967d9f097436	2013-03-13 14:08:27 -07:00
Yaowu Xu	88862c0454	put refmvselection under experiment and turn the experiment off by default. Change-Id: If9e684aa6cc49eacd39f36645a110a447e38d2de	2013-03-13 10:40:31 -07:00
Yaowu Xu	005552639b	removed reference to "LLM" and "x8" The commit changed the name of files and function to remove obselete reference to LLM and x8. Change-Id: I973b20fc1a55149ed68b5408b3874768e6f88516	2013-03-13 08:35:46 -07:00
Ronald S. Bultje	8fc3ab7c62	Merge "Fix typo in comment for number of extra bits for cat6 tokens." into experimental	2013-03-12 10:45:12 -07:00
Ronald S. Bultje	516f7ac04e	Fix typo in comment for number of extra bits for cat6 tokens. Change-Id: I07ddf3be8bc5d6c2eb561d4241879777c315b183	2013-03-12 10:25:43 -07:00
John Koleszar	045c53f51e	fix an assumption about uv_stride Use the uv_stride from the framebuffer rather than deriving it from the y_stride. Change-Id: I94581cb741539d094ff062b3d008235556903b8c	2013-03-12 09:22:44 -07:00
Dmitry Kovalev	2891d70b23	Code cleanup. Removing redundant code, introducing new functions for better decomposition, adding 'clamp' function to vp9_common.h. Change-Id: Ic3b8ca13bbc38f60f0c9c43910b5802005e31aaf	2013-03-11 17:02:27 -07:00
John Koleszar	9b4095c537	Fix vp9_tree_probs_from_distribution with CONFIG_CODE_NONZEROCOUNT The automatic merge result was incomplete. Change-Id: I8976318bfc346d867660a013a302c80edb25fc29	2013-03-11 11:03:36 -07:00
John Koleszar	52fc4f8a78	Merge "Simplify vp9_adapt_nmv_probs" into experimental	2013-03-11 09:57:53 -07:00
John Koleszar	ee4649ded2	Simplify vp9_adapt_nmv_probs Remove the temporary branch count arrays and build the adapted probabilities while walking the tree. Gives an additional 1.5% or so on CIF. Change-Id: I875d61e5e0ec778e5d2f7f9d0837b989a91cf3a3	2013-03-11 09:44:22 -07:00
Deb Mukherjee	fad43d4249	Merge "Minor optimization in mv entropy adaptation" into experimental	2013-03-11 09:43:54 -07:00
John Koleszar	e6257342b1	Merge "Optimize vp9_tree_probs_from_distribution" into experimental	2013-03-11 09:32:11 -07:00
Deb Mukherjee	f74c55eb03	Minor optimization in mv entropy adaptation Adds a check to exit from the increment_nmv_count function when the increment is 0. Change-Id: I99c1e342d351f7800e23590f9c2419881bf1d708	2013-03-11 08:49:14 -07:00
John Koleszar	bd84685f78	Optimize vp9_tree_probs_from_distribution The previous implementation visited each node in the tree multiple times because it used each symbol's encoding to revisit the branches taken and increment its count. Instead, we can traverse the tree depth first and calculate the probabilities and branch counts as we walk back up. The complexity goes from somewhere between O(nlogn) and O(n^2) (depending on how balanced the tree is) to O(n). Only tested one clip (256kbps, CIF), saw 13% decoding perf improvement. Note that this optimization should port trivially to VP8 as well. In VP8, the decoder doesn't use this function, but it does routinely show up on the profile for realtime encoding. Change-Id: I4f2848e4f41dc9a7694f73f3e75034bce08d1b12	2013-03-10 13:39:30 -07:00
Deb Mukherjee	a28139c849	Continued experiment with nonzero count Adds probability updates for extra bits for the nzcs, code for getting nzc stats, plus some minor cleanups and fixes. Change-Id: If2814e7f04fb52f5025ad9f400f3e6c50a00b543	2013-03-08 16:37:08 -08:00
Yunqing Wang	cb7acbc0e1	Merge "Add vp9_idct4_1d_sse2" into experimental	2013-03-08 15:14:02 -08:00
Yunqing Wang	11ca81f8b6	Add vp9_idct4_1d_sse2 Added SSE2 idct4_1d which is called by vp9_short_iht4x4. Also, modified the parameter type passed to vp9_short_iht functions to make it work with rtcd prototype. Change-Id: I81ba7cb4db6738f1923383b52a06deb760923ffe	2013-03-08 15:04:22 -08:00
Dmitry Kovalev	3edbc77ae3	Merge "Consistent usage of ROUND_POWER_OF_TWO macro." into experimental	2013-03-08 11:35:22 -08:00
Yunqing Wang	2e0553227e	Merge "Optimize add_constant_residual function" into experimental	2013-03-08 10:18:52 -08:00
Jingning Han	2a5278bdbd	Extend diff MV limit from +/-256 to +/-1024 Increase the motion search range by 4x. Change MV_CLASS tree of the entropy coding to allow two additional mv classes to cover the extended motion vector limit. The codec determines the effective motion search range conditioned on the actual frame dimension. It provides coding gains: stdhd 0.39% yt 0.56% hd 0.47% Major coding performance gains are packed in several sequences with intense motion activities, e.g., ped_1080p gains 7% at high bit-rates, and on average 3%. TODO: Need to further tune the rate control and motion search units. Change-Id: Ib842540a6796fbee5a797809433ef6a477c6d78d	2013-03-08 10:04:36 -08:00
Yunqing Wang	f240782650	Optimize add_constant_residual function Optimized adding constant diff to predictor, which gave about 2% decoder performance gain. Change-Id: I47db20c31428e8c4a8f16214a85cbe386a6e9303	2013-03-07 15:49:07 -08:00
Dmitry Kovalev	3603dfb62c	Consistent usage of ROUND_POWER_OF_TWO macro. Change-Id: I44660975e9985310d8c654c158ee7a61291b5a08	2013-03-07 12:24:35 -08:00
Ronald S. Bultje	89e4ce20d0	Update ADST selection if tx_size < block_size. Change-Id: Ic9b336486774c95ffbb92adcb110cc0fc2a83cc5	2013-03-07 11:19:15 -08:00
Ronald S. Bultje	d3724abe9f	Re-add support for ADST in superblocks. This also changes the RD search to take account of the correct block index when searching (this is required for ADST positioning to work correctly in combination with tx_select). Change-Id: Ie50d05b3a024a64ecd0b376887aa38ac5f7b6af6	2013-03-07 11:19:10 -08:00
Deb Mukherjee	eb6ef2417f	Coding con-zero count rather than EOB for coeffs This patch revamps the entropy coding of coefficients to code first a non-zero count per coded block and correspondingly remove the EOB token from the token set. STATUS: Main encode/decode code achieving encode/decode sync - done. Forward and backward probability updates to the nzcs - done. Rd costing updates for nzcs - done. Note: The dynamic progrmaming apporach used in trellis quantization is not exactly compatible with nzcs. A suboptimal approach has been used instead where branch costs are updated to account for changes in the nzcs. TODO: Training the default probs/counts for nzcs Change-Id: I951bc1e22f47885077a7453a09b0493daa77883d	2013-03-07 07:20:30 -08:00
Dmitry Kovalev	a9961fa819	Merge "Code cleanup." into experimental	2013-03-06 16:57:34 -08:00
Yunqing Wang	f4e383f3d1	Merge "Optimize add_residual function" into experimental	2013-03-05 16:47:58 -08:00
Yunqing Wang	943c6d7172	Optimize add_residual function Optimized adding diff to predictor, which gave 0.8% decoder performance gain. Change-Id: Ic920f0baa8cbd13a73fa77b7f9da83b58749f0f8	2013-03-05 16:27:45 -08:00
Dmitry Kovalev	7f99c3c59a	Code cleanup. Removing redundant 'extern' keywords, fixing formatting and #include order, code simplification. Change-Id: I0e5fdc8009010f3f885f13b5d76859b9da511758	2013-03-05 14:12:16 -08:00
Ronald S. Bultje	4209bba462	Merge changes Ifacbf5a0,Ibad7c3dd into experimental * changes: vpxenc: actually report mismatch on stderr. Make superblocks independent of macroblock code and data.	2013-03-05 11:17:14 -08:00
Dmitry Kovalev	764be4f66f	Merge "Code cleanup and simplification of build_4x4uvmvs function." into experimental	2013-03-04 16:57:30 -08:00
Ronald S. Bultje	111ca42133	Make superblocks independent of macroblock code and data. Split macroblock and superblock tokenization and detokenization functions and coefficient-related data structs so that the bitstream layout and related code of superblock coefficients looks less like it's a hack to fit macroblocks in superblocks. In addition, unify chroma transform size selection from luma transform size (i.e. always use the same size, as long as it fits the predictor); in practice, this means 32x32 and 64x64 superblocks using the 16x16 luma transform will now use the 16x16 (instead of the 8x8) chroma transform, and 64x64 superblocks using the 32x32 luma transform will now use the 32x32 (instead of the 16x16) chroma transform. Lastly, add a trellis optimize function for 32x32 transform blocks. HD gains about 0.3%, STDHD about 0.15% and derf about 0.1%. There's a few negative points here and there that I might want to analyze a little closer. Change-Id: Ibad7c3ddfe1acfc52771dfc27c03e9783e054430	2013-03-04 16:34:36 -08:00
Yunqing Wang	37932d9168	Merge "Optimize vp9_short_idct4x4llm function" into experimental	2013-03-04 14:13:31 -08:00
Yunqing Wang	e8bc9f4220	Optimize vp9_short_idct4x4llm function Wrote a SSE2 vp9_short_idct4x4llm to improve the decoder performance. Change-Id: I90b9d48c4bf37aaf47995bffe7e584e6d4a2c000	2013-03-04 12:01:27 -08:00
Jingning Han	5957b2b514	Support 16K sequence coding Fixed a couple of variable/function definitions, as well as header handling to support 16K sequence coding at high bit-rates. The width and height are each specified by two bytes in the header. Use an extra byte to explicitly indicate the scaling factors in both directions, each ranging from 0 to 15. Tested coding up to 16400x16400 dimension. Change-Id: Ibc2225c6036620270f2c0cf5172d1760aaec10ec	2013-03-04 11:08:41 -08:00
John Koleszar	1cfc86ebe0	Add unit test for x4 multi-SAD functions Update the function prototypes to match between VP9 and VP8. Change-Id: If58965073989e87df3b62b67a030ec6ce23ca04f	2013-03-01 18:14:02 -08:00
Dmitry Kovalev	b5a9795d25	Code cleanup and simplification of build_4x4uvmvs function. Change-Id: Iab0176f058045181821ded95ff1cf423af1625f9	2013-03-01 17:50:55 -08:00
John Koleszar	69c67c9531	Merge master branch into experimental Picks up some build system changes, compiler warning fixes, etc. Change-Id: I2712f99e653502818a101a72696ad54018152d4e	2013-03-01 11:06:05 -08:00
Yunqing Wang	67dbc8fe55	Merge "Add eob<=10 case in idct32x32" into experimental	2013-03-01 08:58:19 -08:00
Yunqing Wang	c550bb3b09	Add eob<=10 case in idct32x32 Simplified idct32x32 calculation when there are only 10 or less non-zero coefficients in 32x32 block. This helps the decoder performance. Change-Id: If7f8893d27b64a9892b4b2621a37fdf4ac0c2a6d	2013-02-28 16:40:29 -08:00
John Koleszar	17c221687f	Merge "Fix use of uninitialized memory in CONFIG_ABOVESPREFMV" into experimental	2013-02-28 15:18:50 -08:00
Yunqing Wang	72b146690a	Merge "Refactor vp9_dequant_idct_add function" into experimental	2013-02-28 14:34:27 -08:00
Yunqing Wang	6193bc3ba8	Refactor vp9_dequant_idct_add function Provided a wrapper and removed duplicate code. Change-Id: Iaef842226ec348422e459202793b001d0983ea30	2013-02-28 14:18:46 -08:00
Scott LaVarnway	aa8fb070b8	Removed vp9_dequantize_b Change-Id: Ie89bd00d58e30bf4094cb748a282f1dfa81a31d8	2013-02-28 14:08:12 -08:00
John Koleszar	2eab4372fc	Fix use of uninitialized memory in CONFIG_ABOVESPREFMV The ABOVESPREFMV experiment uses four pixels to the left of the current block, which don't exist for the left-most column. Change-Id: I4cf0b42ae8f54c0b3e7b1ed8755704b74fafc39c	2013-02-28 13:48:58 -08:00
Jim Bankoski	714aa9f3c0	this commit converts all sad ptrs to uint32 sse4_1 code used uint16_t for returning sad, but that won't work for 32x32 or 64x64. This code fixes the assembly for those and also reenables sse4_1 on linux Change-Id: I5ce7288d581db870a148e5f7c5092826f59edd81	2013-02-28 08:46:35 -08:00
Christian Duvivier	c129203f7e	Faster vp9_short_fdct8x8. Scalar path is about 1.4x faster (4% overall encoder speedup). SSE2 path is about 7x faster (13% overall encoder speedup). Change-Id: I7e85d8225a914a74c61ea370210414696560094d	2013-02-27 17:23:08 -08:00
Dmitry Kovalev	347f3a0aa8	Code cleanup. Fixing code style, using array lookup instead of switch statements for forward hybrid transforms (in the same way as for their inverses). Consistent usage of ROUND_POWER_OF_TWO macro in appropriate places. Change-Id: I0d3822ae11f928905fdbfbe4158f91d97c71015f	2013-02-27 13:51:04 -08:00
John Koleszar	5ac141187a	Merge "Remove unused vp9_copy32xn" into experimental	2013-02-27 12:23:45 -08:00
Yunqing Wang	d6ff6fe2ed	Merge "Remove unused file" into experimental	2013-02-27 11:58:29 -08:00
Ronald S. Bultje	90932399b4	Merge "Move eob from BLOCKD to MACROBLOCKD." into experimental	2013-02-27 11:39:16 -08:00
Yunqing Wang	8092aaf9ec	Merge "Optimize vp9_dc_only_idct_add_c function" into experimental	2013-02-27 11:38:45 -08:00
John Koleszar	09be534f13	Merge "give vp9 variance struct a unique name"	2013-02-27 11:22:36 -08:00
Yunqing Wang	5ef694cfb8	Remove unused file Removed vp9_idctllm_mmx.asm Change-Id: I7152756f23a5a09ed69e8fb40edb2ab3237290fe	2013-02-27 11:00:58 -08:00
Ronald S. Bultje	e8c74e2b70	Move eob from BLOCKD to MACROBLOCKD. Consistent with VP8. Change-Id: I8c316ee49f072e15abbb033a80e9c36617891f07	2013-02-27 11:00:55 -08:00
John Koleszar	7ad8dbe417	Remove unused vp9_copy32xn This function was part of an optimization used in VP8 that required caching two macroblocks. This is unused in VP9, and might not survive refactoring to support superblocks, so removing it for now. Change-Id: I744e585206ccc1ef9a402665c33863fc9fb46f0d	2013-02-27 10:24:56 -08:00
John Koleszar	d8e68bd14b	Merge changes I922f8602,I0ac3343d into experimental * changes: Use 256-byte aligned filter tables Set scale factors consistently for SPLITMV	2013-02-27 10:08:53 -08:00
Jan Kratochvil	82ed3f9a41	Fix --as=nasm compatibility for new asm code. s/movd/movq/ Change-Id: Id1a56de91551f8dc796f14f1056c565dfc1ba626	2013-02-27 09:55:38 -08:00
John Koleszar	350ba5f30e	Merge "Combined motion compensation with scaled predictors" into experimental	2013-02-27 09:46:12 -08:00
John Koleszar	6fd7dd1a70	Use 256-byte aligned filter tables This avoids duplicating all the filters twice. Includes fixups to the convolve routines and associated tests to make this work. Change-Id: I922f86021594e55072ddb63b42b2313605db6e00	2013-02-27 08:22:39 -08:00
John Koleszar	77f88e97fa	Combined motion compensation with scaled predictors This patch extends the previous support for using references of a different resolution in ZEROMV mode to all inter prediction modes. Subpixel based best-mv scoring is disabled when the reference frame differs in resolution from the current frame. Change-Id: Id4dc3e5e6692de98d9857fd56bfad3ac57e944ac	2013-02-27 08:22:39 -08:00
John Koleszar	472eeaf082	Set scale factors consistently for SPLITMV This commit updates the 4x4 prediction to consistently use the build_2x1_inter_predictor() method. That function is updated to calculate the scale offset, rather than relying on the caller to calculate it. In the case that the 2x1 prediction can not be used, the scale offset is recalculated for each 1x1 block. The idea here is that the offsets are calculated before each call to vp9_build_scaled_inter_predictor(). Change-Id: I0ac3343dd54e2846efa3c4195fcd328b709ca04d	2013-02-27 08:22:39 -08:00
Yaowu Xu	858b60e8d0	Merge "Improve 32x32 forward dct" into experimental	2013-02-27 07:56:42 -08:00
John Koleszar	eb939f45b8	Spatial resamping of ZEROMV predictors This patch allows coding frames using references of different resolution, in ZEROMV mode. For compound prediction, either reference may be scaled. To test, I use the resize_test and enable WRITE_RECON_BUFFER in vp9_onyxd_if.c. It's also useful to apply this patch to test/i420_video_source.h: --- a/test/i420_video_source.h +++ b/test/i420_video_source.h @@ -93,6 +93,7 @@ class I420VideoSource : public VideoSource { virtual void FillFrame() { // Read a frame from input_file. + if (frame_ != 3) if (fread(img_->img_data, raw_sz_, 1, input_file_) == 0) { limit_ = frame_; } This forces the frame that the resolution changes on to be coded with no motion, only scaling, and improves the quality of the result. Change-Id: I1ee75d19a437ff801192f767fd02a36bcbd1d496	2013-02-26 23:54:23 -08:00
Yunqing Wang	35bc02c6eb	Optimize vp9_dc_only_idct_add_c function Wrote SSE2 version of vp9_dc_only_idct_add_c function. In order to improve performance, clipped the absolute diff values to [0, 255]. This allowed us to keep the additions/subtractions in 8 bits. Test showed an over 2% decoder performance increase. Change-Id: Ie1a236d23d207e4ffcd1fc9f3d77462a9c7fe09d	2013-02-26 17:16:13 -08:00
Dmitry Kovalev	971ff2679f	Removing redundant 'extern' keyword from function declarations. Change-Id: I893fa36297b9bd9cff93d082f1736f6860b15c0d	2013-02-26 15:52:05 -08:00
John Koleszar	25686fc22d	Merge "Refactor inter recon functions to support scaling" into experimental	2013-02-26 11:45:28 -08:00
John Koleszar	6a4f708c25	Refactor inter recon functions to support scaling Ensure that all inter prediction goes through a common code path that takes scaling into account. Removes a bunch of duplicate 1st/2nd predictor code. Also introduces a 16x8 mode for 8x8 MVs, similar to the 8x4 trick we were doing before. This has an unexpected effect with EIGHTTAP_SMOOTH, so it's disabled in that case for now. Change-Id: Ia053e823a8bc616a988a0af30452e1e75a739cba	2013-02-26 10:03:29 -08:00
Yaowu Xu	66d94ac13c	Improve 32x32 forward dct The commit improves the 32x32 forward dct implementation: 1. change to use same constants and rounding as other forward dcts 2. select rounding to specifically minimize the roundtrip error, which improved average 19/block to .77/block using 100000 random input. Test showed a small but consistent gain on all test sets, about .15% Change-Id: If0afd6a71880a522f60c1c234be0462092c2eb53	2013-02-26 09:23:01 -08:00
Dmitry Kovalev	9bf3f75168	Changing pitch value meaning for fht and iht transforms. Pitch now means the number of elements, not the number of bytes. Change-Id: Idb9f2f012e39b09d596a3cc1802305a80b7c13af	2013-02-25 18:19:55 -08:00
Dmitry Kovalev	9770d564f4	Code cleanup. Removing switch statements for inverse hybrid transforms. Making code style consistent for all similar transform implementations. Renaming shortpitch and short_pitch variables to half_pitch. Change-Id: I875f7a82aae4e8063a58777bf1cc3f1e67b48582	2013-02-25 15:14:01 -08:00
Dmitry Kovalev	20b0cb599b	Code cleanup. Removing redundant parentheses, better code formatting, introducing ROUND_POWER_OF_TWO macro to replace repeated expression. Change-Id: I91aad7a53ed03482428b2419de4bb99fd92c6771	2013-02-25 13:38:18 -08:00
Jingning Han	77a3becf92	clean up forward and inverse hybrid transform Rebased. Remove the old matrix multiplication transform computation. The 16x16 ADST/DCT can be switched on/off and evaluated by setting ACTIVE_HT16 300/0 in vp9/common/vp9_blockd.h. Change-Id: Icab2dbd18538987e1dc4e88c45abfc4cfc6e133f	2013-02-25 09:16:12 -08:00
Ronald S. Bultje	0c9e2e9a1d	Split coefficient token tables intra vs. inter. Change-Id: I5416455f8f129ca0f450d00e48358d2012605072	2013-02-23 07:33:46 -08:00
Paul Wilkins	c17672a33d	Further changes to coefficient contexts. This patch alters the balance of context between the coefficient bands (reflecting the position of coefficients within a transform blocks) and the energy of the previous token (or tokens) within a block. In this case the number of coefficient bands is reduced but more previous token energy bands are supported. Some initial rebalancing of the default tables has been by running multiple derf clips at multiple data rates using the ENTOPY_STATS macro. Further balancing needs to be done using larger image formatsd especially in regard to the bigger transform sizes which are not as well represented in encodings of smaller image formats. Change-Id: If9736e95c391e711b04aef6393d26f60f36e1f8a	2013-02-23 07:29:09 -08:00
James Zern	e5fb6321a1	give vp9 variance struct a unique name variance_vtable clashed with vp8/common/variance.h Change-Id: I09c1de44d5519f1bd13f58c01144c0de4706de6f	2013-02-22 16:25:13 -08:00
John Koleszar	606a2561d6	Merge "Code cleanup." into experimental	2013-02-22 11:20:20 -08:00
Dmitry Kovalev	548b4dd5f2	Code cleanup. Removing redundant 'extern' keywords and parentheses, fixing indentation, making variable names lower case, using short expressions x = c instead of x = x c, minor code simplifications. Change-Id: If6a25fcf306d1db26e90d27e3c24a32735c607de	2013-02-22 11:03:14 -08:00
Jingning Han	babbd5d170	Forward butterfly hybrid transform This patch includes 4x4, 8x8, and 16x16 forward butterfly ADST/DCT hybrid transform. The kernel of 4x4 ADST is sin((2k+1)(n+1)/(2N+1)). The kernel of 8x8/16x16 ADST is of the form sin((2k+1)(2n+1)/4N). Change-Id: I8f1ab3843ce32eb287ab766f92e0611e1c5cb4c1	2013-02-21 18:24:28 -08:00
Ronald S. Bultje	35524e2231	Remove "eobs" array in MACROBLOCKD. The information is a duplicate of "eob" in BLOCKD. Change-Id: Ia6416273bd004611da801e4bfa6e2d328d6f02a3	2013-02-21 10:07:36 -08:00
Deb Mukherjee	048f593703	Merge "Refactoring of switchable filter search for speed" into experimental	2013-02-21 09:23:50 -08:00
John Koleszar	138ffb6ea9	Merge "Avoid division in intra prediction" into experimental	2013-02-21 08:33:17 -08:00
Deb Mukherjee	28b1db9278	Refactoring of switchable filter search for speed Refactors the switchable filter search in the rd loop to improve encode speed. Uses a piecewise approximation to a closed form expression to estimate rd cost for a Laplacian source with a given variance and quantization step-size. About 40% encode time reduction is achieved. Results (on a feb 12 baseline) show a slight drop: derf: -0.019% yt: +0.010% std-hd: -0.162% hd: -0.050% Change-Id: Ie861badf5bba1e3b1052e29a0ef1b7e256edbcd0	2013-02-20 18:34:42 -08:00
Dmitry Kovalev	e6c89a1f9b	Merge "Code cleanup." into experimental	2013-02-20 12:47:54 -08:00
Dmitry Kovalev	eb6aee50a4	Code cleanup. Change-Id: I7c6e3bebd94856b24dbe2aded7f9e04ef8bb8c08	2013-02-20 11:36:31 -08:00
Yaowu Xu	d262e26cc7	Merge lossless experiment Change-Id: I7b7b8d4fda3a23699e0c920d727f8c15d37d43aa	2013-02-20 07:54:28 -08:00
Tero Rintaluoma	56e6c66b49	Avoid division in intra prediction - Using multiplication and shifting instead of division in intra prediction. - Maximum absolute difference is 1 for division statements in d45, d27, d63 prediction modes. However, errors can cumulate for large block sizes when using already predicted values. - Maximum number of non-matching result values in loops using division are: 4x4 0/16 8x8 0/64 16x16 10/256 32x32 13/1024 64x64 122/4096 Overall PSNR derf: 0.005 yt: -0.022 std-hd: 0.021 hd: -0.006 Change-Id: I3979a02eb6351636442c1af1e23d6c4e6ec1d01d	2013-02-20 10:37:36 +02:00
Jingning Han	cd907b1601	16x16 butterfly inverse ADST/DCT hybrid transform rebased. This patch includes 16x16 butterfly inverse ADST/DCT hybrid transform. It uses the variant ADST of kernel sin((2k+1)*(2n+1)/4N), which allows a butterfly implementation. The coding gains as compared to DCT 16x16 are about 0.1% for both derf and std-hd. It is noteworthy that for std-hd sets many sequences gains about 0.5%, some 0.2%. There are also few points that provides -1% to -3% performance. Hence the average goes to about 0.1%. Change-Id: Ie80ac84cf403390f6e5d282caa58723739e5ec17	2013-02-19 09:07:00 -08:00
Yaowu Xu	93d6b86cfd	Use lossless for Q0 The commit changes the coding mode to lossless whenever the lowest quantizer is choosen. As expected, test results showed no difference for cif and std-hd set where Q0 is rarely used. For yt and yt-hd set, Q0 is used for a number of clips, where this commit helped a lot in the high end. Average over all clips in the sets: yt: 2.391% 1.017% 1.066% hd: 1.937% .764% .787% Change-Id: I9fa9df8646fd70cb09ffe9e4202b86b67da16765	2013-02-19 06:18:42 -08:00
Ronald S. Bultje	3af36ea8cc	Remove Y2 and Y-no-DC token types from the bitstream. Change-Id: I7a5314daca993d46b8666ba1ec2ff3766c1e5042	2013-02-15 14:06:30 -08:00
Ronald S. Bultje	48598e30b1	Remove y2dc/ac Q delta values from the bitstream. Since there is no Y2, these values are always zero. This changes the bitstream results slightly, hence a separate commit. Change-Id: I2f838f184341868f35113ec77ca89da53c4644e0	2013-02-15 14:06:30 -08:00
Ronald S. Bultje	46dff5d233	Remove some Y2-related code. Change-Id: I4f46d142c2a8d1e8a880cfac63702dcbfb999b78	2013-02-15 14:06:25 -08:00
Scott LaVarnway	7755657ea7	Merge "WIP: ssse3 version of convolve avg functions" into experimental	2013-02-15 07:54:21 -08:00
Scott LaVarnway	ae886d6bff	Moved vp9_get_coef_band to header file allowing the compiler to inline. Change-Id: I66e5caf5e7fefa68a223ff0603aa3f9e11e35dbb	2013-02-14 12:27:25 -08:00
Paul Wilkins	45712dc8c8	Merge "Abstract selection of coef band." into experimental	2013-02-14 03:23:31 -08:00
Ronald S. Bultje	51afedbe28	Merge "Remove 2nd-order transform for first-order DC coefficients." into experimental	2013-02-13 13:58:02 -08:00
Ronald S. Bultje	89a206ef2f	Add support for tile rows. These allow sending partial bitstream packets over the network before encoding a complete frame is completed, thus lowering end-to-end latency. The tile-rows are not independent. Change-Id: I99986595cbcbff9153e2a14f49b4aa7dee4768e2	2013-02-13 12:31:00 -08:00
Ronald S. Bultje	42d6be8080	Remove 2nd-order transform for first-order DC coefficients. Since addition of the larger-scale transforms (16x16, 32x32), these don't give a benefit at macroblock-sizes anymore. At superblock-sizes, 2nd-order transform was never used over the larger transforms. Future work should test whether there is a benefit for that use case. Change-Id: I90cadfc42befaf201de3eb0c4f7330c56e33330a	2013-02-13 12:28:19 -08:00
Paul Wilkins	9255ad107f	Abstract selection of coef band. This patch abstracts the selection of the coefficient band context into a function as a precursor to further experiments with the coefficient context. It also removes the large per TX size coefficient band structures and uses a single matrix for all block sizes within the test function. This may have an impact on quality (results to follow) but is only an intermediate step in the process of redefining the context. Also the quality impact will be larger initially because the default tables will be out of step with the new banding. In particular the 4x4 will in this case only use 7 bands. If needed we can add back block size dependency localized within the function, but this can follow on after the other changes to the definition of the context. Change-Id: Id7009c2f4f9bb1d02b861af85fd8223d4285bde5	2013-02-13 19:01:25 +00:00
Paul Wilkins	0d284ffed1	Abstract the selection of coefficient context. This is an initial step to facilitate experimentation with changes to the prior token context used to code coefficients to take better account of the energy of preceding tokens. This patch merely abstracts the selection of context into two functions and does not alter the output. Change-Id: I117fff0b49c61da83aed641e36620442f86def86	2013-02-13 18:56:30 +00:00

... 11 12 13 14 15 ...

1481 Commits