generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	fe2a201eb1	Replacing "txfm" with "tx" in identifiers. Consistent names with TX_SIZE, TX_MODE, and TX_MODE. Change-Id: I79592218bf5a40ace89197a34a06ee7de581ed8d	2013-08-02 17:28:23 -07:00
Dmitry Kovalev	5edc65d00d	Removing NMS_STATS defines. Change-Id: Iabab0e59042a33456df1d449c0d0f01debc00c7c	2013-08-02 17:10:15 -07:00
Dmitry Kovalev	7b50333e8f	Merge "Adding is_inter_block function."	2013-08-02 16:54:32 -07:00
Dmitry Kovalev	fec4ec4edd	Removing unused functions. Removed functions: model_rd_for_sb_y, block_error_sby, get_sb_variance Change-Id: Iec458df180caf6f8eac3605773841a4121dd3a8f	2013-08-02 16:41:09 -07:00
Dmitry Kovalev	603931e291	Merge "Changing function arg type from int_mv* to MV*."	2013-08-02 16:30:06 -07:00
Dmitry Kovalev	a6adc82e78	Merge "Cleanups around allow_high_precision_mv flag."	2013-08-02 16:27:05 -07:00
Dmitry Kovalev	680ec32d18	Adding is_inter_block function. Using it instead of long unclear verbose check "mbmi->ref_frame[0] != INTRA_FRAME". Change-Id: I9c7b4b3797942fa962bf3ba7460fff3084beabe9	2013-08-02 16:25:33 -07:00
Dmitry Kovalev	d4e020c4b1	Merge "Cleaning up set_contexts_on_border function."	2013-08-02 16:22:50 -07:00
Yunqing Wang	d340c114fb	Merge "Add more checking to using_small_partition_info"	2013-08-02 15:55:09 -07:00
Dmitry Kovalev	769bcab3f5	Cleaning up set_contexts_on_border function. Change-Id: I8f21c18b29f54b277fb1c167f278f109d9f3b996	2013-08-02 15:52:26 -07:00
Dmitry Kovalev	25b77e2569	Changing function arg type from int_mv* to MV*. Change-Id: Ic878d31df2ce783a2c9a8c4bc9ed301ec8ffe25e	2013-08-02 15:26:32 -07:00
Adrian Grange	60ff123536	Merge "Fixed typos and added a few explanatory comments"	2013-08-02 11:37:47 -07:00
Adrian Grange	075b11f004	Merge "Changed name of rd_pick_intra4x4mby_modes"	2013-08-02 11:36:46 -07:00
Dmitry Kovalev	86053d3ae2	Cleanups around allow_high_precision_mv flag. Change-Id: Ic07f5f8ffeaedd5b7513b464871f83afc82dcd5c	2013-08-02 11:21:16 -07:00
Dmitry Kovalev	b47153deed	Replacing long block size enum values with shorter ones. Change-Id: I0e9329490828684a4fd46f540d89114cc68e8407	2013-08-02 10:48:27 -07:00
Yunqing Wang	0d68080445	Merge "Comment out 2 unused speed features"	2013-08-02 09:58:46 -07:00
Dmitry Kovalev	741537f3ce	Cleanup: replacing xd->seg with seg, and xd->lf with lf. Change-Id: I73b59d7699a8e7e7acd3bf8041cb6c98ce9ba4bf	2013-08-01 15:38:16 -07:00
Dmitry Kovalev	9f4f001ba5	Merge "Cleanup: removing unused function arguments."	2013-08-01 15:07:12 -07:00
Dmitry Kovalev	ddf02e323a	Merge "Nice looking motion vector clamping functions."	2013-08-01 14:50:14 -07:00
Dmitry Kovalev	ce8dedc353	Cleanup: removing unused function arguments. Change-Id: I27471768980fc631916069f24bc7c482a5c9ca17	2013-08-01 13:41:38 -07:00
Dmitry Kovalev	b621e2d72e	Nice looking motion vector clamping functions. Removing assign_and_clamp_mv function, making implementation of clamp_mv and clamp_mv2 more clear and consistent. Change-Id: Iecd08e1c1bf0379f8314ebe01811f8253f4ade58	2013-08-01 13:40:26 -07:00
Deb Mukherjee	dbea726daf	Adds a source variance computation function Adds a function to compute source variance for various sb_types to be used for pruning mode and partition searches. [The existing activity measure function is currently specialized for only 16x16 MBs and needs to be updated]. Change-Id: I22a41e6f1430184201487326fdbebb9b47e6fc24	2013-08-01 13:01:54 -07:00
Yunqing Wang	215b010f4b	Add more checking to using_small_partition_info If the partition is out of partition size range, we don't need to process small partition information. Change-Id: Ice9bfbbdebe1f2ef79271a3aee17de0ed4608376	2013-08-01 11:37:41 -07:00
Yunqing Wang	7965a6ea34	Comment out 2 unused speed features use_min_partition_size and use_max_partition_size are not used currently, and could be added back if needed later. Change-Id: Ib22a9c06b064567a7c1d6d5445567ed77e0d3acc	2013-08-01 11:03:34 -07:00
Dmitry Kovalev	ff4bfa726b	Merge "Adding missing const to vp9_extra_bits array."	2013-08-01 10:19:51 -07:00
Adrian Grange	89e73c63c0	Fixed typos and added a few explanatory comments Change-Id: Ib4e4b41094b54874ee34343dd77c0c131ceed9d2	2013-08-01 09:23:49 -07:00
Adrian Grange	5271d47892	Changed name of rd_pick_intra4x4mby_modes The function name rd_pick_intra4x4mby_modes is confusing, so I changed it to rd_pick_intra_sub_8x8_y_modes to better reflect what the function does. Also added const qualifiers to some of the input parameters and removed camel-case. Change-Id: I23d53d4c7af5d79ed8a471acd59a09bbb47add39	2013-08-01 09:23:49 -07:00
Dmitry Kovalev	5b65246a71	Adding missing const to vp9_extra_bits array. Change-Id: Icd128ab58719e0b9066bdfa66a5d0d427a84d6df	2013-07-31 18:51:18 -07:00
Jingning Han	12f5762756	Remove unnecessary arguments in rd_pick_ref_frame This commit removes redundant arguments passing in the function of rd_pick_reference_frame. This resolves the clang warnings about potential use of uninitialized values. Change-Id: Ic68f949a9f8fcd0a583786b0c75321104ea44739	2013-07-31 17:04:13 -07:00
Dmitry Kovalev	9239e96536	Removing get_mi_{row, col} functions. Passing mi_row and mi_col parameters to functions explicitly. Removing unused xd argument from scale_mv function. Change-Id: Icb4c495ec72d26fb066c14470d3ae0b741fbf18a	2013-07-31 14:06:55 -07:00
Dmitry Kovalev	3be9fd9120	Merge "Removing unused "ishp" arguments."	2013-07-31 12:03:04 -07:00
Dmitry Kovalev	0e0a6f840b	Merge "Consistent update for inter_mode probabilities."	2013-07-31 12:02:35 -07:00
Dmitry Kovalev	500ade243a	Removing unused "ishp" arguments. Using different variable names "allow_hp" and "use_hp" instead of "usehp". Change-Id: I0cd5996ddeb46bd754473b680a993c0aaf8eb879	2013-07-31 11:27:53 -07:00
Jingning Han	ac7bab7575	Merge "Make the use of ref_frame index consistent"	2013-07-31 09:11:37 -07:00
Jingning Han	86c384d398	Make the use of ref_frame index consistent Refactor the frame buffer referencing in choose_partition and make it consistent with other places. This means to prevent potential issues when we extend reference frame buffer. Change-Id: I5ff33ed5f671e1f4cc7049622212769a9b4578d9	2013-07-30 19:49:36 -07:00
Dmitry Kovalev	8701bc11df	Consistent update for inter_mode probabilities. Using inter-mode counts instead of inter-mode-tree branch counts inside FRAME_COUNTS structure. Change-Id: I60dde13af37d06146d7d15543311c1b5044e9e04	2013-07-30 18:06:34 -07:00
Adrian Grange	fbd73648dd	Merge "Cleanup typos, remove unnecessary lines, replace switch"	2013-07-30 12:59:46 -07:00
Adrian Grange	b30a06b930	Cleanup typos, remove unnecessary lines, replace switch Removed unnecessary code lines, replaced switch with an if, fixed spelling errors and formatting. Change-Id: Ie48aa4604aa0ed48362ca359d792fb21b2ec1dc6	2013-07-30 12:10:32 -07:00
Yaowu Xu	88e48444da	Merge "removed duplication"	2013-07-30 09:38:02 -07:00
Yaowu Xu	a15d1f3134	removed duplication Change-Id: Ica23b66f6664e5a5b168499584f0afffbc54794f	2013-07-30 09:09:14 -07:00
Jingning Han	525745b17a	Remove a redundant branching in tokenize_b The tokenize_b function is only called when output flag is on. Hence removing the conditional branch on it therein. Change-Id: Ib709f47f23f39ca05a695faf86fa3377f11f2dd0	2013-07-29 17:08:13 -07:00
Jingning Han	455f2de20b	Tune tokenization/detokenization flow for speed-up This commit optimizes the tokenization and detokenization operational flow for speed-up. It makes the coding process about 0.3% faster at speed 0. Change-Id: I28008df7482874e4b5f237f2d418ff82a249dd56	2013-07-29 16:15:30 -07:00
Jingning Han	b5323ed89a	Skip redundant tokenization in rd loop This commit makes the encoder skip the redundant tokenization process in the rate-distortion optimization search loop, while updating the entropy contexts accordingly. It makes the speed 0 encoding process about 0.5% faster at no performance change. Change-Id: I34a4155a0b5332afeb45c93a51c7f35a294d685c	2013-07-29 16:09:16 -07:00
Jingning Han	5875d7a4a4	Merge "16x16 inverse 2D-DCT with DC only"	2013-07-29 15:29:25 -07:00
Jingning Han	a7c4de22e1	16x16 inverse 2D-DCT with DC only This commit provides special handle on 16x16 inverse 2D-DCT, where only DC coefficient is quantized to be non-zero value. Change-Id: I7bf71be7fa13384fab453dc8742b5b50e77a277c	2013-07-29 14:45:53 -07:00
Dmitry Kovalev	828119d6ab	Renaming txfm to tx for consistency in some places. Change-Id: I2a6a646570e2af66315e7c658d00d99f80c4b127	2013-07-29 14:35:55 -07:00
Dmitry Kovalev	730a34416f	Renaming NB_TXFM_MODES constant to TX_MODES. Change-Id: I10bf06e3a3d5271221ae6a42a36074d01d493039	2013-07-29 13:38:40 -07:00
Dmitry Kovalev	23391ea835	Renaming TX_SIZE_MAX_SB to TX_SIZES. Change-Id: I6aa4191935aa93461a07c41b59fdae1eb5f5f107	2013-07-29 12:25:34 -07:00
Jingning Han	decb1b94de	Merge "Shortcut 8x8/16x16 inverse 2D-DCT"	2013-07-29 11:04:07 -07:00
Ronald S. Bultje	118ccdcd30	Inverse dimension order in token_cost array. This allows us to increment the position at the band-level only as we go from one band to the next; more importantly, that allows us to use an add instead of multiply instruction, and omit the instruction altogether if the band doesn't change from one coef to the next, thus being slightly faster (probably more noticeable on systems where a multiply is expensive, like arm). Change-Id: I4343fe35b9f9a47fa00b217bdcbf5f91ff96c381	2013-07-26 17:30:04 -07:00
Ronald S. Bultje	dcacce6dd9	Merge "Save pixels instead of coefficients in intra4x4 RD loop."	2013-07-26 17:20:58 -07:00
Ronald S. Bultje	d30c8f41ef	Merge "Add best_rd breakout in intra4x4 RD loop."	2013-07-26 17:20:51 -07:00
Jingning Han	38fa487164	Shortcut 8x8/16x16 inverse 2D-DCT This commit brought back the shortcut implementation of 8x8/16x16 inverse 2D-DCT. When the eob <= 10, it skips the inverse transform operations on row 4:7/4:15 in the first round. For bus_cif at 1000 kbps, this provides about 2% speed-up at speed 0. Change-Id: I453e2d72956467d75be4ad8c04b4482ab889d572	2013-07-26 17:19:14 -07:00
Jingning Han	b9c3dd481a	Merge "Special handle on DC only inverse 8x8 2D-DCT"	2013-07-26 16:04:14 -07:00
Jingning Han	325e0aa650	Special handle on DC only inverse 8x8 2D-DCT This commit enables a special handle for the 8x8 inverse 2D-DCT, where only DC coefficient is quantized to be non-zero. For bus_cif at 2000 kbps, it provides about 1% speed-up at speed 0. Change-Id: I2523222359eec26b144cf8fd4c63a4ad63b1b011	2013-07-26 14:16:51 -07:00
Dmitry Kovalev	c09b81719f	Merge "General cleanups."	2013-07-26 13:59:39 -07:00
Yaowu Xu	4f75a1f4ed	Merge "Auto min and max partition size experiment."	2013-07-26 12:10:27 -07:00
Paul Wilkins	fe5e2a91bb	Auto min and max partition size experiment. Speed feature experiment to set an upper and lower partition size limit based on what has been seen in spatial neighbors. This seems to gives quite reasonable speed gains in local (10-15%) and when used with speed 0 the losses are small (0.25% derf, 0.35% stdhd). However, for now I am only enabling it on speed 1 as there may be clashes with the existing temporal partition selection in speed 2. Using a tighter min / max around the range derived from the neighbors increases speed further but at the cost of a bigger quality loss. However, I think this spatial method could be combined with data from either the last frame or a variance method (or both) to refine the range of minimum and maximum partition size. I.e. consider the min and max from spatial and temporal neighbors and the variance recommendation. Change-Id: I1b96bf8b84368d6aad0c7aa600fe141b4f07435f	2013-07-26 18:30:49 +01:00
Yunqing Wang	52256cdbca	Modify static threshold calculation Used 3 * standard_deviation in internal threshold calculation instead of fit curve. This actually approached the algorithm better. For comparison, similar tests were done: The overall psnr loss is less than before. 1. derf set: when static-thresh = 1, psnr loss is 0.329%; when static-thresh = 500, psnr loss is 0.970%; 2. stdhd set: when static-thresh = 1, psnr loss is 0.922%; when static-thresh = 500, psnr loss is 1.307%; Similar speedup is achieved. For example, clip bitrate static-thresh psnr time akiyo(cif) 500 0 48.952 5.077s(50f) akiyo 500 500 48.866 4.169s(50f) parkjoy(1080p) 4000 0 30.388 78.20s(30f) parkjoy 4000 500 30.367 70.85s(30f) sunflower(1080p) 4000 0 44.402 74.55s(30f) sunflower 4000 500 44.414 68.69s(30f) Change-Id: Ic78833642ce1911dbbd1cb6c899a2d7e2dfcc1f3	2013-07-25 19:59:33 -07:00
Yunqing Wang	845fd5011c	Merge "Add encoding option --static-thresh"	2013-07-25 14:58:00 -07:00
Yunqing Wang	d36852b702	Add encoding option --static-thresh This option exists in VP8, and it was rewritten in VP9 to support skipping on different partition levels. After prediction is done, we can check if the residuals in the partition block will be all quantized to 0. If this is true, the skip flag is set, and only prediction data are needed in reconstruction. Based on DCT's energy conservation property, the skipping check can be estimated in spatial domain. The prediction error is calculated and compared to a threshold. The threshold is determined by the dequant values, and also adjusted by partition sizes. To be precise, the DC and AC parts for Y, U, and V planes are checked to decide skipping or not. Test showed that 1. derf set: when static-thresh = 1, psnr loss is 0.666%; when static-thresh = 500, psnr loss is 1.162%; 2. stdhd set: when static-thresh = 1, psnr loss is 1.249%; when static-thresh = 500, psnr loss is 1.668%; For different clips, encoding speedup range is between several percentage and 20+% when static-thresh <= 500. For example, clip bitrate static-thresh psnr time akiyo(cif) 500 0 48.923 5.635s(50f) akiyo 500 500 48.863 4.402s(50f) parkjoy(1080p) 4000 0 30.380 77.54s(30f) parkjoy 4000 500 30.384 69.59s(30f) sunflower(1080p) 4000 0 44.461 85.2s(30f) sunflower 4000 500 44.418 78.1s(30f) Higher static-thresh values give larger speedup with larger quality loss. Change-Id: I857031ceb466ff314ab580ac5ec5d18542203c53	2013-07-25 14:28:05 -07:00
Dmitry Kovalev	7131cb0e3d	General cleanups. Removing unused constants, macros, and function declarations. Using ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving #include from .h to .c. Merging for loops for motion vectors. Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13	2013-07-25 14:13:48 -07:00
Dmitry Kovalev	d53fc9ee4e	Merge "Adding lookup table for size group."	2013-07-25 13:57:28 -07:00
Dmitry Kovalev	08fd41ccd7	Adding lookup table for size group. Change-Id: Ia6144d77ebed66e0739b62e4d673e26a95aa9550	2013-07-25 12:58:54 -07:00
Adrian Grange	e862c6f9eb	Merge "Simplify handling of sub-partition motion vectors"	2013-07-25 12:58:38 -07:00
Adrian Grange	6f0f0e4907	Merge "Use local variables rather than structure members"	2013-07-25 12:57:52 -07:00
Dmitry Kovalev	d604914f09	Merge "Removing vp9_adapt_mode_context function."	2013-07-25 12:46:31 -07:00
Jingning Han	d571af76d3	Merge "Make coeff_optimize initialized per-plane"	2013-07-25 12:46:14 -07:00
Yaowu Xu	51a8458822	Merge "fix a bug where flags are not reset"	2013-07-25 12:18:51 -07:00
Adrian Grange	be700e140a	Simplify handling of sub-partition motion vectors Simplified the code that extracts and uses the motion vectors for the 4 sub-partitions in rd_pick_partition. Change-Id: Iaf698ef7ee3aef9edd59015e1ae065dd359b17d9	2013-07-25 11:51:51 -07:00
Jingning Han	2f58faffa4	Make coeff_optimize initialized per-plane This commit makes the initialization of trellis coeff optimization a per-plane operation, thereby eliminating the redundant steps in encode_sby and encode_sbuv. It makes the encoder at speed 0 slightly faster. Change-Id: Iffe9faca6a109dafc0dd69dc7273cbdec19b17cd	2013-07-25 11:44:29 -07:00
Dmitry Kovalev	47d61f008f	Removing vp9_adapt_mode_context function. Moving code from vp9_adapt_mode_context to vp9_adapt_mode_probs. Change-Id: I60829c30b28968cd813551ef3a206dfb98d323c9	2013-07-25 10:48:45 -07:00
Yaowu Xu	3e386aefc2	fix a bug where flags are not reset The feature that uses small partition results as a measure to skip mode evaluation at larger partition requires the flags to be reset. The reset was missing in the code path that calls rd_use_partition(). Change-Id: Ia0a3a0aee1a862b6e2333d596808db7c48033d50	2013-07-25 10:28:38 -07:00
Scott LaVarnway	a0e8b45fee	Merge "pack_inter_mode_mvs cleanup"	2013-07-25 04:47:56 -07:00
Dmitry Kovalev	fcc34796d2	Removing CONFIG_BALANCED_COEFTREE experiment. Change-Id: I61a8b0101eac3ee2e0621d56151b90c269fd4db4	2013-07-24 15:53:42 -07:00
Dmitry Kovalev	9139ee0908	Adding condition inside get_tx_type_{4x4, 8x8, 16x16}. Adding plane type check condition because it was always used outside of get_tx_type_{4x4, 8x8, 16x16}. Change-Id: I02f0bbfee8063474865bd903eb25b54d26e07230	2013-07-24 12:55:45 -07:00
Adrian Grange	4cfd36d8fd	Use local variables rather than structure members Although local copies of the mode member variables (mode, ref_frame) were made, they were not used in all places. Also, made a local copy of the second_ref_frame member. Change-Id: I84d8c822e5cb3d8a02fc3de8a4037ca3fea8bfad	2013-07-24 11:17:44 -07:00
Adrian Grange	a183f17d33	Merge "Correct spelling mistakes"	2013-07-24 09:48:57 -07:00
Ronald S. Bultje	7817d3221f	Save pixels instead of coefficients in intra4x4 RD loop. Prevents doing duplicate IDCTs; encoding of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.0 to 1min3.5, i.e. 0.87% faster overall. Change-Id: I2df39e29ed9d5ea5e7d2704a34940ba622832ddd	2013-07-24 09:03:20 -07:00
Ronald S. Bultje	b72ecbb1b9	Add best_rd breakout in intra4x4 RD loop. Encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min5.4 to 1min4.0, i.e. 2.2% faster overall. Change-Id: I8c32f2aff9a649ce7dd49d910dc5ba16b99c3bc6	2013-07-24 09:02:05 -07:00
Adrian Grange	bc8b0529db	Correct spelling mistakes Change-Id: Id4138293efeac4503b2e01ce7a6c150a5abeef77	2013-07-24 07:58:26 -07:00
Ronald S. Bultje	47336afd8d	Merge "More optimizations for cost_coeffs()."	2013-07-23 21:36:12 -07:00
Jingning Han	666c266623	Merge "Unify the use of encode_b_args/optimize_block_args"	2013-07-23 18:08:50 -07:00
Dmitry Kovalev	1099a436d3	Moving counts from FRAME_CONTEXT to new struct FRAME_COUNTS. Counts are separate from frame context. We have several frame contexts but need only one copy of all counts. Change-Id: I5279b0321cb450bbea7049adaa9275306a7cef7d	2013-07-23 17:02:08 -07:00
Jingning Han	ab77828b36	Unify the use of encode_b_args/optimize_block_args The struct optimize_block_args is defined same as encode_b_args. Remove this redundant definition, and use encode_b_args consistently. Change-Id: I1703aeeb3bacf92e98a34f4355202712110173d9	2013-07-23 16:04:02 -07:00
Dmitry Kovalev	8d13b0d1df	Removing LOW_PRECISION_MV_UPDATE define. Change-Id: I78d16ee758e1fae0200b746f00031f6d9c6d6ce7	2013-07-23 15:41:45 -07:00
Dmitry Kovalev	a9bbabd94b	Merge "Removing vp9_is_interpolating_filter array."	2013-07-23 15:01:19 -07:00
Adrian Grange	719cd35f3a	Merge "Rolled-up several for loops into one"	2013-07-23 15:00:06 -07:00
Adrian Grange	646edbc1b2	Rolled-up several for loops into one Several consecutive for loops executed over the same index range, so I rolled them into one. Change-Id: I5cfcc8c38c738478965768409cca9d09adf224e1	2013-07-23 14:32:21 -07:00
Dmitry Kovalev	db7f5d28b9	Removing vp9_is_interpolating_filter array. All filters are interpolating now, so we don't need this array, all values from this array are evaluated to true. Change-Id: I9af6d8219ae0eb984063cd15e4e2296374ae4961	2013-07-23 14:24:39 -07:00
Dmitry Kovalev	2855d8aea1	Merge "Adding update_tx_counts function."	2013-07-23 13:57:59 -07:00
Jingning Han	e9e2fe8ec3	Make xform_quant operations tx_type independent The xform_quant() module is only used by inter modes, hence removing the redundant switches therein conditioned on tx_type. Change-Id: Ib87ce5b2f2e4cbf3ceb133a1108afa173c933a3f	2013-07-23 12:37:25 -07:00
James Zern	8dede954c7	Merge "vp9: make some static tables const"	2013-07-23 11:37:01 -07:00
Jingning Han	4ef1d35abf	Merge "Skip inverse transform when eob is zero"	2013-07-23 10:31:19 -07:00
Deb Mukherjee	9360fd3dcf	Merge "Diamond search change to accelerate movement"	2013-07-23 10:14:10 -07:00
Jingning Han	0359ad7f9a	Skip inverse transform when eob is zero When all the transform coefficients were quantized to zero, skip the inverse transform operation. For bus_cif at 1000 kbps, the runtime goes from 154967ms -> 149842ms, i.e., about 3% speed-up, at speed 0. Change-Id: Ic0a813fff5e28972d4888ee42d8747846a6c3cc6	2013-07-23 10:06:41 -07:00
Paul Wilkins	cedd24ec61	Merge "Renaming of segment constants."	2013-07-23 08:16:12 -07:00
Scott LaVarnway	7bc294a3fe	pack_inter_mode_mvs cleanup xd->mode_info_context is set to m prior to this call. Change-Id: Ibc442529961750c29ccf0c6cae08cb2b0431415f	2013-07-23 10:08:28 -04:00
Jim Bankoski	256ee00093	Merge "clean up bw, bh"	2013-07-23 06:58:28 -07:00
Jim Bankoski	86a9dec73c	clean up bw, bh many structures use bw and bh and they have different meanings. This cl attempts to start this clean up and remove unneccessary 2 step look up log and then shift operations... also removed partition type multiple operation code in bitstream.c. Change-Id: I7e03e552bdfc0939738e430862e3073d30fdd5db	2013-07-23 06:51:44 -07:00
Scott LaVarnway	2fd20eb37d	Merge "Eliminated prev_mip memsets/memcpys in encoder"	2013-07-23 06:43:52 -07:00
Paul Wilkins	7c134bc0cd	Merge "Reworked the auto_mv_step_size speed feature"	2013-07-23 04:49:55 -07:00
Paul Wilkins	32042af14b	Renaming of segment constants. Renamed: MAX_MB_SEGMENTS to MAX_SEGMENTS MB_SEG_TREE_PROBS to SEG_TREE_PROBS The minimum unit for segmentation in the segment map is now 8x8 so it is misleading to use MB_ as macro-block traditionally refers to a 16x16 region. Change-Id: I0b55a6f0426bb46dd13435fcfa5bae0a30a7fa22	2013-07-23 12:09:04 +01:00
James Zern	3c8cce353f	vp9: make some static tables const Change-Id: I8bcae51271673da8755c66a51aea005dfe6a3739	2013-07-22 19:19:13 -07:00
Ronald S. Bultje	e20fcd9585	More optimizations for cost_coeffs(). 4x4: 163 -> 123 cycles (33% faster) 8x8: 491 -> 399 cycles (23% faster) 16x16: 1889 -> 1763 cycles (7% faster) 32x32: 8311 -> 8180 cycles (1.6% faster) Overall encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.33 to 1min3.00, i.e. 2.11% faster. Change-Id: Ib52d1dbb5649b14de769d3e7a74af67440b5284f	2013-07-22 16:09:09 -07:00
Dmitry Kovalev	b2fc6fa969	Adding update_tx_counts function. Moving common encoder/decoder code to update_tx_counts. Also renaming vp9_get_pred_probs_tx_size to get_tx_probs2 and adding get_tx_probs to call vp9_get_pred_context_tx_size inside read_selected_tx_size only once (twice before). Change-Id: Ia50247f3893de88ef8e9041b0d44be44a40aaa4d	2013-07-22 14:57:43 -07:00
Yaowu Xu	6261d79206	Merge "fix a build error"	2013-07-22 13:02:15 -07:00
James Zern	76db4d599a	Merge "VP[89]_COMMON: remove golden/altref frame counts"	2013-07-22 12:55:07 -07:00
Yaowu Xu	fc186dcad6	fix a build error Change-Id: I3b05687f439ff6a7c426d2c97a6c58c831fa51ac	2013-07-22 12:37:30 -07:00
Jingning Han	416f315e82	Merge "Skip buffer update in sub8x8 rd loop"	2013-07-22 12:08:22 -07:00
Jingning Han	a5a9f5f7f3	Merge "Optimize operation flow in sub8x8 rd loop"	2013-07-22 12:08:15 -07:00
Deb Mukherjee	a1e2d50be9	Diamond search change to accelerate movement Optional change in diamond search to continue in the best move direction until that move turns worse. This is still WIP since the exact way the new method is to be used is under investigation. One option is to make it an option in diamond search and use it only when motion is large. Overall slightly positive on derfraw300 +0.02%, stdhdraw +0.13%, but works a lot better for high motion sequences (ex. football : +1%). Change-Id: If88e01a6021daa0cda934680cdc70be1ee04f798	2013-07-22 11:19:15 -07:00
Paul Wilkins	3798d7a641	Merge "Re-order mode search in rd."	2013-07-22 10:46:04 -07:00
Jingning Han	409e77f2d4	Optimize operation flow in sub8x8 rd loop Stack the rate-distortion statistics in the sub8x8 rd loop. This allows the encoder to skip the forward transform, quantization, and coeff cost estimation, in the sub8x8 rd optimization search, if the motion vector(s) are of integer pixel value, and have been tested in the previous prediction filter type rd loops of the same block. This gives about 2% speed-up for bus_cif at 2000 kpbs, for speed 0. Its efficacy depends how frequently the motion search will select an integer motion vector. Change-Id: Iee15d4283ad4adea05522c1d40b198b127e6dd97	2013-07-22 10:40:33 -07:00
Paul Wilkins	1d189d6464	Re-order mode search in rd. Mode search order in rd loop changed to better reflect observed hit counts. Also some adjustment of the baseline mode rd thresholds to reflect the order change and observed frequencies. Change-Id: I47a131cc83e11551df8add6d6d8d413d78d3a63c	2013-07-22 17:21:12 +01:00
Jim Bankoski	9ad604c6fb	Merge "fix left over overflow"	2013-07-22 08:51:26 -07:00
Jim Bankoski	2ac8b50cd8	fix left over overflow This cl fixes issues rbultje brought up. that I somehow neglected when I submitted yaowu's patch. Change-Id: I07ad18796317822510b96e951c88d29f194a3c2e	2013-07-22 06:39:39 -07:00
Paul Wilkins	888375d243	Fix build error. When CONFIG_POSTPROC is set there was a now invalid reference to cm->filter_level. Changed to cpi->mb.e_mbd.lf.filter_level in line with change Iaf5fb71c33719cdfa1b991f671caf071be9ea035 Change-Id: If746e60044903f7ba8d0d346225b3d015226c7d0	2013-07-22 14:01:43 +01:00
Dmitry Kovalev	ee1fe2f750	Merge "Removing pre probabilities from FRAME_CONTEXT."	2013-07-20 22:50:32 -07:00
Dmitry Kovalev	8962d975b2	Merge "Moving all loop filter related variables into new struct."	2013-07-20 22:45:24 -07:00
Dmitry Kovalev	39342db138	Merge "Consistent names for inter mode probabilities and encodings."	2013-07-20 22:40:51 -07:00
Dmitry Kovalev	f66821afbb	Merge "Removing frame_type field from MACROBLOCKD struct."	2013-07-20 22:40:06 -07:00
Dmitry Kovalev	2b089f149a	Merge "Removing unused static arrays from vp9_reatectrl.c."	2013-07-20 22:39:33 -07:00
Jingning Han	c725502bf3	Skip buffer update in sub8x8 rd loop This commit allows the encoder to skip a few buffer update steps in rd_pick_best_mbsegmentation, when early breakout has been triggered in the rd_check_segment_txsize. It provides about 1% speed-up for bus_cif at 2000 kbps, in the settings of speed 0. Change-Id: Ica034f10a24dec572b397d8389a2b81020ebc0b9	2013-07-20 21:38:12 -07:00
Yaowu Xu	ea284d6281	added checks to prevent rate/distortion overflow At speed 2, due to the threshold scheme used, it is possible the rate and distortion assigned with INT_MAX value. The patch added checking to prevent the INT_MAX value is used in further calculation of RD scores. The patch also changed the assertion in rd_use_partition() to be mirror similar assertion in rd_pick_partition(). Change-Id: Idb52c543cc1e10abdf6e6a5d6e9cb535a42214dc	2013-07-19 17:52:50 -07:00
Dmitry Kovalev	7e703de729	Removing pre probabilities from FRAME_CONTEXT. Using cm->frame_contexts[cm->frame_context_idx] as source of previous probabilities. Change-Id: Ie03778acf0e7bebdc3a1f6a51854d4a0712f24a1	2013-07-19 17:33:10 -07:00
Dmitry Kovalev	ee1771ebaa	Moving all loop filter related variables into new struct. Adding loopfilter struct with fields from MACROBLOCKD and VP9Common. Eventually it will be moved to vp9_loopfilter.h for better code structure. Change-Id: Iaf5fb71c33719cdfa1b991f671caf071be9ea035	2013-07-19 16:19:10 -07:00
Dmitry Kovalev	29f0f79317	Removing unused static arrays from vp9_reatectrl.c. Removed arrays: kf_boost_seperation_adjustment, gf_adjust_table, gf_intra_usage_adjustment, gf_interval_table. Change-Id: I62e400cb6e4d039787615169a3779e31ebf95893	2013-07-19 15:55:09 -07:00
Dmitry Kovalev	c3a56ee583	Merge "Moving Scale2Ration function from vp9_onyx.h to vp9_onyx_if.c."	2013-07-19 15:27:24 -07:00
Deb Mukherjee	302698fb12	Reworked the auto_mv_step_size speed feature This patch modifies the auto_mv_step_size speed feature to use a combination of the maximum magnitude mv from the last inter frame, and the maximum magnitude mv for the two reference mvs with the same reference. For arf frames, the max mav step for the resolution is used. The bounds therefore are slightly tighter. The feature is made a speed 1 feature. Rebased. Results (when this feature is turned on over speed 0): derfraw300: -0.046% psnr, about 5+% speedup (tested on football: goes from 4m30.760s to 4m17.410s). Change-Id: If492797a61b0b4b3e58c0b8f86afb880165fc9f6	2013-07-19 15:12:56 -07:00
Dmitry Kovalev	e71a4a77bb	Merge "Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE)."	2013-07-19 12:14:32 -07:00
Dmitry Kovalev	97e96bc4e9	Removing frame_type field from MACROBLOCKD struct. Change-Id: Ia4e83913251c1cdc7aa2abd64bf01ecb1a962119	2013-07-19 11:55:36 -07:00
Dmitry Kovalev	c0eb57406c	Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE). Moving TX_MODE enum to vp9_enums.h. Renaming txfm_mode variables to tx_mode. Change-Id: I459d1af6dd928ce7fccdf8ce30b6f1ca057bef92	2013-07-19 11:37:13 -07:00
Dmitry Kovalev	afe43d4089	Removing redundant VP9_COMMON* from function signatures. Functions: vp9_get_pred_context_switchable_interp, vp9_get_pred_context_intra_inter, vp9_get_pred_context_single_ref_p1, vp9_get_pred_context_single_ref_p2. Change-Id: I3d6fb8aee23c9062270768e1e6da416dd9bb8f96	2013-07-19 11:20:49 -07:00
Dmitry Kovalev	bc7acb134b	Consistent names for inter mode probabilities and encodings. Renaming vp9_sb_mv_ref_tree to vp9_inter_mode_tree, and vp9_sb_mv_ref_encoding_array to vp9_inter_mode_encodings. Change-Id: I0e91fbf81350d3ec5a2599064c74089b5d06133a	2013-07-19 10:40:04 -07:00
Paul Wilkins	f3ed9f5523	Alignment of THR_MODES to vp9_mode_order[] Change-Id: I4032dd0442043543954dcb3724df974b7cc7e515	2013-07-19 11:33:39 +01:00
Dmitry Kovalev	13253d6121	Merge "Removing kf_{y, uv}_mode_prob arrays from VP9Common."	2013-07-19 01:00:46 -07:00
Ronald S. Bultje	e4686c589e	Fix slightly quality drop caused at speed 1. We would skip the rectangular blocks for sub8x8 partitions because we would conclude that PARTITION_NONE was better than PARTITION_SPLIT, however, that conclusion was made before we actually really tested PARTITION_SPLIT. Change-Id: I8fa91e59894badc1d8cee3ba8a49e40ae4c4a489	2013-07-18 17:52:08 -07:00
Yaowu Xu	37d901a47a	Merge "Add best_rd breakout to keyframe partition selection also."	2013-07-18 17:50:39 -07:00
Yaowu Xu	67fb0679ee	Merge "Merge scale_factors and scale_factors_uv."	2013-07-18 17:50:34 -07:00
Yaowu Xu	55b52e32da	Merge "Do in-place UV intra mode selection."	2013-07-18 17:50:07 -07:00
Yaowu Xu	51972d1279	Merge "Change break statement in a 2d loop to a return statement."	2013-07-18 17:49:58 -07:00
Dmitry Kovalev	92f4198d52	Merge "Using VP9_REF_NO_SCALE instead of (1 << VP9_REF_SCALE_SHIFT)."	2013-07-18 17:29:05 -07:00
Dmitry Kovalev	0b562b2d3d	Using VP9_REF_NO_SCALE instead of (1 << VP9_REF_SCALE_SHIFT). Change-Id: Ide58a74d31ff948319445a6337d2c05e98720e34	2013-07-18 15:12:46 -07:00
Ronald S. Bultje	96e4db2660	Add best_rd breakout to keyframe partition selection also. Change-Id: I96b8058f6dfecf8aa3e152cdcbfd7e10071fbbc9	2013-07-18 14:10:56 -07:00
Ronald S. Bultje	5ebe503f04	Merge scale_factors and scale_factors_uv. This prevents a duplicate memcpy of a 128-byte struct every time set_scale_factors() is called (which is a lot), thus leading to a decrease from 3.7 MB to 1.85 MB of struct copying per 64x64 block RD/partition loop. Overall, this decreases encoding time of the first 50 frames of bus @ 1500kbps (speed 0) from 1min5.9 to 1min4.9, i.e. about a 1.5% overall speedup. We can likely get more gains by removing the copy of the other struct (and replacing it with an indexing) as well. Change-Id: I3dceb7e79f71e6fe911b11cc994cf89a869dde7a	2013-07-18 14:10:56 -07:00
Ronald S. Bultje	df4b4fab26	Do in-place UV intra mode selection. This means we only do UV intra mode selection if we find any intra mode to actually be useful at all; in addition, we only do UV intra mode selection for the transform sizes that were selected, rather than all sizes available in this partition. First 50 frames of bus @ 1500kbps (speed 0) gains about 5% with this change. Change-Id: I7b461eb8b803247f57896c5a9505f745b55502b3	2013-07-18 14:10:56 -07:00
Ronald S. Bultje	e54a5782b9	Change break statement in a 2d loop to a return statement. The break statement only breaks out of the nested loop, not the top-level loop, so it doesn't always work as intended. Changing it to a return statement does what's intended. Change-Id: I585419823b39a04ec8826b1c8a216099b1728ba7	2013-07-18 14:10:56 -07:00
Ronald S. Bultje	2d4929e340	Remove motion vectors from PARTITION_INFO. The same information already exists in union b_mode_info. Change-Id: Iac5086b99a3c3cc270380138062bb693e58f9e6d	2013-07-18 14:10:52 -07:00
James Zern	5f30a0c687	VP[89]_COMMON: remove golden/altref frame counts these are only used in the encoder. frames_since_golden / frames_till_alt_ref_frame -> VP[89]_COMP Change-Id: Ie14a6f46987bced685ddb449b85dc261caba6dfe	2013-07-18 14:09:21 -07:00
Dmitry Kovalev	9f3c0e34a9	Moving Scale2Ration function from vp9_onyx.h to vp9_onyx_if.c. Change-Id: Idfe2a850f72b38f519aea1aac1266d8c3aa813ee	2013-07-18 14:05:06 -07:00
Ronald S. Bultje	9da67da04a	Merge "Fix bug where we don't choose any mode in RD selection."	2013-07-18 12:47:50 -07:00
Ronald S. Bultje	247197d57b	Fix bug where we don't choose any mode in RD selection. This could happen during golden overlay frame coding from a previous alt-ref frame if the special overlay code was triggered. Change-Id: I3056d0c547cd26903b260ef93c94026e96bd9868	2013-07-18 12:13:15 -07:00
Ronald S. Bultje	4f5815290c	Merge "Fix bug which skips zeromv even if near/nearest is not 0,0."	2013-07-18 10:06:51 -07:00
Ronald S. Bultje	deb7456058	Fix bug which skips zeromv even if near/nearest is not 0,0. Change-Id: Id4f454831f3f11099f39c30246adeaa52857d08d	2013-07-18 09:35:19 -07:00
Jingning Han	ced3c20165	Use mv_check_bounds in sub8x8 rd loop Make the use of mv_check_bounds consistent for mvs of both ref_frame[0] and ref_frame[1]. Change-Id: I1ca24865cc7232ca9cbe5db566c53abad1592211	2013-07-17 17:13:51 -07:00
Dmitry Kovalev	f9f453ec8d	Removing kf_{y, uv}_mode_prob arrays from VP9Common. These arrays have constant values (no any updates). Removing two corresponding memcpy calls. Making a little cleanup in vp9_entropymode.h as well: removing redundant 'extern' keyword and moving all function declarations at the end. Change-Id: Ia16b38b46aec2e2500f5df29c40a297ae241dede	2013-07-17 16:50:52 -07:00
Ronald S. Bultje	facecd80da	Merge "Add a best_yrd shortcut in splitmv mode search."	2013-07-17 16:11:13 -07:00
Ronald S. Bultje	056111c822	Merge "Skip redundant nearest/near/zero encodes in splitmv."	2013-07-17 16:10:51 -07:00
Ronald S. Bultje	0b1eba25b2	Merge "Skip nearest/near/zero redundant encodes."	2013-07-17 16:10:41 -07:00
Ronald S. Bultje	607424449c	Merge "Best_rd breakout in rd partition search."	2013-07-17 16:10:22 -07:00
Yunqing Wang	3798db88e1	Remove unnecessary calling of vp9_init_quantizer() vp9_init_quantizer() is called in vp9_create_compressor(), and should not be called in vp9_set_speed_features(). Change-Id: Ic2f1f4b0531b9d46bb841d7e1d8da9812207dad6	2013-07-17 14:59:00 -07:00
Yaowu Xu	6ac5b7db2c	Merge "changed mode checking order"	2013-07-17 14:44:40 -07:00
Dmitry Kovalev	a7a1e96136	Merge changes Ieffea49e,Idf610746 * changes: Removing two unused arguments from vp9_inc_mv signature. Changing signature of vp9_get_pred_probs_tx_size.	2013-07-17 14:44:20 -07:00
Ronald S. Bultje	c6917528a5	Add a best_yrd shortcut in splitmv mode search. Encoding of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min6.2 to 1min5.9, i.e. 0.5% faster overall. Change-Id: I59d8a3b2f0a75010fa041d5e2646c8caac5bd683	2013-07-17 14:21:57 -07:00
Ronald S. Bultje	161c995658	Skip redundant nearest/near/zero encodes in splitmv. Encode of first 50 frames of bus @ 1500kbps (speed 0) goes from 1min7.3 to 1min6.2, i.e. 1.7% faster overall. Change-Id: I19d2deacfbffadd61d32551cee9586757ab4a987	2013-07-17 13:53:48 -07:00
Yaowu Xu	42facc292d	changed mode checking order Change-Id: Ic4c4b363ed840935e42f495f13ea5e601a56f1b2	2013-07-17 13:43:50 -07:00
Ronald S. Bultje	8fea880b6f	Skip nearest/near/zero redundant encodes. Encode of first 50 frames of bus @ 1500kbps (speed 0) goes from 1min12.8 to 1min7.3, i.e. 8% faster. Change-Id: Ia22d1c7b687316c553cc60eacae988b24e175b62	2013-07-17 11:33:15 -07:00
Yunqing Wang	10e83b0717	Enable disable_splitmv feature for other speeds Added disable_splitmv feature at other speed levels. For speed 3 or above, always turn it on. Change-Id: Ibb36f0a7ef12a34b4f8d0f9cb6193eab43b34360	2013-07-17 10:25:49 -07:00
Ronald S. Bultje	9f427bfe98	Best_rd breakout in rd partition search. About 15% faster for bus (speed 0) first 50 frames @ 1500kbps, which goes from 1min36 to 1min24. Results become slightly better (+0.2% on derf/yt, +0.4% on hd), probably because of a bugfix for skipmode in super_block_yrd(). Overall speed change (on derfraw300) is roughly -13%. This can probably be improved further by caching best_yrd between partition searches. Also, we might be able to get more speedups by always doing PARTITION_NONE before PARTITIONS_SPLIT, not just at the sb8x8 level. Change-Id: I83736949ebd5b4a3b400ee688d7661913fefc98b	2013-07-17 09:56:46 -07:00
Ronald S. Bultje	83c7e13a6b	Do a skip-block check for sub8x8 partitions also. +0.2% SSIM and glbPSNR on derfraw300. Change-Id: I9cba0bca55e606a22f557c7732b064f738efe84d	2013-07-17 09:46:47 -07:00
Yunqing Wang	df90d58f4f	Speed up motion estimation using small partitions' result(experiment) Current partition checking starts from small sizes, and then goes up to large sizes. This experiment uses the small partitions' motion estimation result, which is already available, to speed up the large partition's motion estimation. We can decide to skip some patition checkings if they are unlikely choices. We could use the motion vector(MV) result as current partition's prediction MV, limit the search range and reference frame. Current result at speed 1: psnr loss: 1.19% for stdhd, 0.287% for derf. speed gain: 14% for sunflower(hd), 11% for akiyo. Further improvement will be done later. Change-Id: I5abfd070e9cace2e91e2a0247d1325df313887ab	2013-07-17 09:11:47 -07:00
Paul Wilkins	d66eab15dd	Merge "Move uv intra mode selection in rd loop."	2013-07-17 05:19:26 -07:00
Paul Wilkins	154c34a3ee	Merge "Limit transform sizes searched for uv intra."	2013-07-17 03:40:11 -07:00
Paul Wilkins	2ee338ce3b	Move uv intra mode selection in rd loop. Use an estimate based on DC_PRED for intra uv cost within the rd loop then only do a full uv mode analysis if an intra mode is chosen. Significant speed gains in some cases. Currently only enabled for speed 2 pending speed/quality tests. Change-Id: Ie851a12400d5483bce47ec0e3ccb8516041e91c0	2013-07-17 11:11:21 +01:00
Paul Wilkins	6c667f0ffe	Limit transform sizes searched for uv intra. Apply limit if search_method == USE_LARGESTALL to the range of UV tx sizes searched. Change-Id: I6db29f0dd237285ffc50d75a37e8b68151ad821c	2013-07-17 11:08:55 +01:00
Paul Wilkins	5f4722c75f	Merge "Minor cleanup in code to fine uv tx_size."	2013-07-17 02:50:09 -07:00
Dmitry Kovalev	6638b6f63f	Merge "Removing MV_GROUP_UPDATE define and corresponding code."	2013-07-16 21:09:00 -07:00
Jingning Han	0b58fa80a0	Merge "Skip redundant motion search in 4x4 level rd loop"	2013-07-16 20:54:25 -07:00
Jingning Han	a142d6fc93	Skip redundant motion search in 4x4 level rd loop This commit makes the encoder to perform motion search only once per reference frame type for each 4x4/4x8/8x4 block. For bus_cif at 2000 kbps, the runtime goes from 253812ms -> 217817ms (14% speed-up) for speed 0. Change-Id: I5f17599ccc8cfaf93ccb4f98fcb6008af6d79e92	2013-07-16 17:21:11 -07:00
Dmitry Kovalev	41ae3d02d4	Removing two unused arguments from vp9_inc_mv signature. Change-Id: Ieffea49eb7a5e5092f21f8694c546aff69b07c6d	2013-07-16 17:01:08 -07:00
Dmitry Kovalev	5b65a71cdc	Changing signature of vp9_get_pred_probs_tx_size. Removing VP9_COMMON* argument and adding struct tx_probs* instead of MACROBLOCKD*. Change-Id: Idf61074631a90ec51eac22c8dcd977f44ac0757c	2013-07-16 16:34:54 -07:00
Dmitry Kovalev	3997da0d35	Removing MV_GROUP_UPDATE define and corresponding code. Change-Id: I4884cdc2557d25d50c7c4f7e19b1ad8bdb93cd63	2013-07-16 15:03:00 -07:00
Dmitry Kovalev	9482a0bf10	Cleaning up tile code. Removing tile_rows and tile_columns from VP9Common, removing redundant constants MIN_TILE_WIDTH and MAX_TILE_WIDTH, changing signature of vp9_get_tile_n_bits. Change-Id: I8ff3104a38179b2c6900df965c144c1d6f602267	2013-07-16 14:47:15 -07:00
James Zern	39ce4b13d5	Merge "use consistent framerate naming"	2013-07-16 14:22:52 -07:00
James Zern	9581eb6e8a	use consistent framerate naming s/frame_rate/framerate/g Change-Id: I6fc3e088e419c5f46e3a9390dd8a2cad2677a2fc	2013-07-16 14:12:47 -07:00
Dmitry Kovalev	5de96b3ce6	Merge "Rewriting vp9_set_pred_flag_{seg_id, mbskip}."	2013-07-16 13:34:42 -07:00
James Zern	5baa416b6c	Merge "vp9: remove frames_{since,till}.. from MACROBLOCKD"	2013-07-16 13:00:14 -07:00
James Zern	3a7c2665d0	Merge "yv12config: remove YUV_TYPE"	2013-07-16 12:16:04 -07:00
Dmitry Kovalev	863138a2ad	Rewriting vp9_set_pred_flag_{seg_id, mbskip}. Making implementation of vp9_set_pred_flag_{seg_id, mbskip} consistent with vp9_get_segment_id without using confusing sub(a, b) macro. Passing mi_row and mi_col to functions explicitly instead of replying on mb_to_right_edge and mb_to_bottom_edge. Change-Id: I54c1087dd2ba9036f8ba7eb165b073e807d00435	2013-07-16 10:44:48 -07:00
Paul Wilkins	30d2ea45ce	Minor cleanup in code to fine uv tx_size. Change-Id: I94b97a966b5efbc9a243048f1f5ddbbdc4b1846e	2013-07-16 18:27:33 +01:00
Jingning Han	dd97c62ab8	Merge "Skip inter-coded block reconstruction in rd loop"	2013-07-16 09:03:38 -07:00
Dmitry Kovalev	e8e7620a1f	Merge "Removing and moving around constant definitions."	2013-07-16 00:52:53 -07:00
Yaowu Xu	c5b0cd8405	Merge "Change to extend full border only when needed"	2013-07-15 21:35:32 -07:00
Yaowu Xu	5b915ebd92	Change to extend full border only when needed This is a short term optimization till we work out a decoder implementation requiring no frame border extension. Change-Id: I02d15bfde4d926b50a4e58b393d8c4062d1be70f	2013-07-15 20:52:13 -07:00
Dmitry Kovalev	ca75f1255f	Removing and moving around constant definitions. Removing unused and duplicated constants, moving them from .h to .c if possible. Change-Id: Ief4d6b984a3ca2e9b38504f0d855ed072cf7133f	2013-07-15 19:26:30 -07:00
Johann	6eae37f45c	Merge "Remove print_nmvcounts"	2013-07-15 18:43:41 -07:00
Ronald S. Bultje	1ff94fea56	Inline vp9_quantize() in xform_quant(). Cycle times: 4x4: 151 to 131 cycles (15% faster) 8x8: 334 to 306 cycles (9% faster) 16x16: 1401 to 1368 cycles (2.5% faster) 32x32: 7403 to 7367 cycles (0.5% faster) Total encode time of first 50 frames of bus @ 1500kbps (speed 0) goes from 1min39.2 to 1min38.6, i.e. a 0.67% overall speedup. Change-Id: I799a49460e5e3fcab01725564dd49c629bfe935f	2013-07-15 17:30:57 -07:00
Ronald S. Bultje	6fb418741f	Inline xform_quant() in encode_block_intra(). Also inline some of the block calculations to assist the compiler to not do silly things like calculating the same offset (or converting between raster/transform block offset or block, mi and pixel unit) many, many, many times. Cycle times: 4x4: 584 -> 505 cycles (16% faster) 8x8: 1651 -> 1560 cycles (6% faster) 16x16: 7897 -> 7704 cycles (2.5% faster) 32x32: 16096 -> 15852 cycles (1.5% faster) Overall, this saves about 0.5 seconds (1min49.8 -> 1min49.3) on the first 50 frames of bus (speed 0) @ 1500kbps, i.e. 0.5% overall. Change-Id: If3dd62453f8e2ab9d4ee616bc4ea956fb8874b80	2013-07-15 16:00:42 -07:00
Jingning Han	043e0f9dad	Skip inter-coded block reconstruction in rd loop Skip the inverse transform and reconstruction of inter-mode coded blocks in the rate-distortion optimization loop, when skip_encode_sb feature is turned on. This provides about 1% speed-up at speed 0, and 1.5% speed-up at speed 1. No performance change in both settings. Change-Id: I2932718bf4d007163702b61b16b6ff100cf9d007	2013-07-15 11:32:14 -07:00
Jingning Han	faff6ed0fb	Skip duplicate block encoding in the rd loop This speed feature allows the encoder to largely remove the spatial dependency between blocks inside a 64x64 superblock, thereby removing the need to repeatedly encode superblocks per partition type in the rate-distortion optimization loop. A major challenge lies in the intra modes tested in the rate-distortion optimization loop. The subsequent blocks do not have access to the reconstructed boundary pixels without the intermediate coding steps. This was resolved by using the original pixels for intra prediction in the rd loop, followed by an appropriately designed distortion modeling on the quantization parameters. Experiments also suggested that the performance impact is more discernible at lower bit-rate/psnr settings. Hence a quantizer dependent threshold is applied to deactivate skip of block coding. For bus_cif at 2000 kbps, speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB performance loss. speed 1: runtime 65312ms -> 61536ms, (7% speed-up) at 0.04dB performance loss. This operation is currently turned on in settings of speed 1. Change-Id: Ib689741dfff8dd38365d8c1b92860a3e176f56ec	2013-07-15 11:08:58 -07:00
James Zern	dc1d2331f6	vp9: remove frames_{since,till}.. from MACROBLOCKD frames_since_golden / frames_till_alt_ref_frame are unused. Change-Id: I348e7689d4d75412cf4de7703d885be942e4a26b	2013-07-13 18:02:11 -07:00
Dmitry Kovalev	429070987a	Using vp9_copy and vp9_zero instead of custom code. Change-Id: Id9b6ceeddca3f9b34bfada5c499b1e7a2f42c30b	2013-07-12 18:07:43 -07:00
Yaowu Xu	cdea4a7c66	Merge "Fix a build issue"	2013-07-12 16:17:22 -07:00
James Zern	4fc6c88e9c	yv12config: remove YUV_TYPE this was never fleshed out in the context of VP8, for which it was added. for VP9 it has no meaning. Change-Id: Iba2ecc026d9e947067b96690245d337e51e26eff	2013-07-12 15:25:48 -07:00
Dmitry Kovalev	cc662dd768	Adding struct tx_probs and struct tx_counts to cleanup the code. Also removing unused declarations from vp9_entropymode.h file. Change-Id: Ib9c5826db3584a32f6bb3297a76c522b99d83402	2013-07-12 15:22:38 -07:00
Yaowu Xu	fb754b182f	Fix a build issue Change-Id: I23a75c495ed7ea917d7f312bef0990e20a6b53d9	2013-07-12 11:38:44 -07:00
James Zern	0195fb53cb	vp9: consistent 'log2' variable naming lg2 -> log2 Change-Id: I0602ddff49e42c9c40c29c084d04b7592b9f8edf	2013-07-12 11:37:43 -07:00
Deb Mukherjee	94c481f9f1	Some minor cleanups for efficiency Implements some of the helper functions more efficiently with lookups rathers than branches. Modeling function is consolidated to reduce some computations. Also merged the two enums BLOCK_SIZE_TYPES and BlockSize into one because there is no need to keep them separate (even though the semantics are a little different). No bitstream or output change. About 0.5% speedup Change-Id: I7d71a66e8031ddb340744dc493f22976052b8f9f	2013-07-12 10:22:56 -07:00
Dmitry Kovalev	727631873d	Merge "Removing redundant code mostly from vp9_pred_common.{h, c}."	2013-07-12 10:22:30 -07:00
Paul Wilkins	b8ddc9f0d3	Merge "Speed 2 feature adjustment."	2013-07-12 02:14:01 -07:00
Jingning Han	84c3ac0476	Merge "Remove unnecessary tx_type branch in encode_block"	2013-07-11 21:52:27 -07:00
Dmitry Kovalev	dd150e8ea9	Removing redundant code mostly from vp9_pred_common.{h, c}. Removing redundant function arguments and curly braces. Change-Id: I46e02561f33fe02e84a3b19756f03b9504bd6a1b	2013-07-11 18:39:10 -07:00
Johann	e6ab476dd4	Remove print_nmvcounts For some reason iOS builds take a really long time to sort this function out. It's not used anywhere so remove it. Change-Id: Ia5c8513a0d9c7eb32641cca58ca1c1113e2dd9f4	2013-07-11 17:22:03 -07:00
Ronald S. Bultje	ee09dd9949	Remove unused function block_error(). Change-Id: I78a79fc51c2d7cc3c261f35b569155397f3dc0c4	2013-07-11 17:14:03 -07:00
Dmitry Kovalev	8c05e59065	Calling is_inter_mode() instead of custom code. Change-Id: Iccd4ab95ea51a6d57ed43947f2fd7ad92e8979cf	2013-07-11 14:14:47 -07:00
Dmitry Kovalev	c4ad3273c7	Moving segmentation related vars into separate struct. Adding segmentation struct to vp9_seg_common.h. Struct members are from macroblockd and VP9Common structs. Moving segmentation related constants and enums to vp9_seg_common.h. Change-Id: I23fabc33f11a359249f5f80d161daf569d02ec03	2013-07-11 11:57:57 -07:00
Dmitry Kovalev	f70c021d36	Merge "Adding write_compressed_header function."	2013-07-11 11:57:17 -07:00
Dmitry Kovalev	802e57535a	Merge "Removing unused TOKENEXTRA arg from pick_sb_modes function."	2013-07-11 11:46:06 -07:00
Jingning Han	b9381b6faf	Remove unnecessary tx_type branch in encode_block The function encode_block is called only by inter-prediction modes, hence removing the transform type branching there. Change-Id: I34a3172e28ce2388835efd0f8781922211bff857	2013-07-11 09:11:35 -07:00
Scott LaVarnway	f2a6bcfb18	Eliminated prev_mip memsets/memcpys in encoder This patch is in experimental but was not merged into master. This patch swaps ptrs instead of copying and uses the last show_frame flag instead of setting the entire buffer to zero. Change-Id: Ia0950466c8ba301a2a5bf917ff3d07bc1a2c2311	2013-07-11 10:47:28 -04:00
Paul Wilkins	5290eeab88	Speed 2 feature adjustment. With sf->auto_mv_step_size on it is questionable whether sf->reduce_first_step_size is worthwhile. At speed 2 it was not having a big impact. Even at speed 2 sf->optimize_coefficients = 0 is not having a big speed imapct so for now I have moved it down into a higher speed setting. Change-Id: I8a54de76d486ad37aabce76474889da2768b14c1	2013-07-11 13:59:12 +01:00
Jingning Han	aedc7c59b1	Merge "Fix tx_type bug in intra4x4 rd loop"	2013-07-10 20:13:25 -07:00
Ronald S. Bultje	c13e0bcb52	Remove unused fwalsh/fdct x86 SIMD implementations. Change-Id: Ia942e56cf322821d42ba06178672791eeee2847e	2013-07-10 18:22:51 -07:00
Dmitry Kovalev	544d8c3316	Removing unused TOKENEXTRA arg from pick_sb_modes function. Change-Id: I0543e72fa092eef3976b65e16bb597197c364873	2013-07-10 15:57:28 -07:00
Jingning Han	18803f9cc4	Fix tx_type bug in intra4x4 rd loop This commit fixed the mis-use of the tx_type for inverse transform in intra4x4 rate-distortion optimization loop. It improves the overall coding performance. Change-Id: I7fe9953175b74890357dbcee33c138573766e980	2013-07-10 15:49:49 -07:00
Deb Mukherjee	7494bba66b	Merge "Prunes out full-rd computation based on modeled rd"	2013-07-10 15:37:11 -07:00
Dmitry Kovalev	0ac5e4dd58	Adding write_compressed_header function. Change-Id: Ic5257fa8278e9b6297de230e4fd26a1e23ad2bb7	2013-07-10 15:08:34 -07:00
Jim Bankoski	68ef7a6b8a	configure with internal stats not working Change-Id: I5dea4570cb05df27a522abf6e7b695998654284a	2013-07-10 15:07:53 -07:00
Jim Bankoski	865ca76604	Merge "remove warnings when NDEBUG is set"	2013-07-10 14:39:39 -07:00
Jim Bankoski	6591cf2f7e	remove warnings when NDEBUG is set Change-Id: Ie0cb732fdcb98616a422c4463bff80642248d136	2013-07-10 14:27:20 -07:00
Deb Mukherjee	53ff43adc3	Prunes out full-rd computation based on modeled rd Adds a speed feature to eliminate full-rd computation if the modeled rd or rd based on a different parameter in the same mode is already a lot larger than the best rd yet. Specifically, only search the sharp and smooth filters if the modeled rd cost based on the regular filter is within a certain factor of the best rd cost so far. Also, skip full-rd computation of non splitmv inter modes if the modeled rd cost based on pred error is within the same factor of the best rd cost so far. Also adds some enhancements in the rd search for splitmv mode to speed things up by early breakouts. Negligible impact on performance. Resuts on derfraw300: psnr: -0.013% with the splitmv enhancements, -0.24% with the rd breakout feature on. speedup: 6% with splitmv enhancements, 20% with also residual breakout (tested on football sequence at 600 Kbps) Change-Id: I37abc308ea9f110c1679ce649b6a7e73ab1ad5fc	2013-07-10 13:49:49 -07:00
Jingning Han	114423538f	SSE2 16x16 ADST/DCT hybrid transform This commit enables 16x16 ADST/DCT forward hybrid transform using SSE2 operations. It reduces the runtime from 5433 cycles to 1621 cycles, at no compression performance loss. Change-Id: I75fd7f1984e9e28846af459f810ff0d6ae125230	2013-07-10 12:14:53 -07:00
Dmitry Kovalev	417df1d42e	Merge "Adding encode_tiles function to vp9_bitstream.c."	2013-07-10 11:43:50 -07:00
Yaowu Xu	e52eec490c	Merge "Add a feature to reduce chrome intra mode search"	2013-07-10 11:35:47 -07:00
Ronald S. Bultje	b1df674a99	Remove memcpy() in handle_inter_mode() filter selection. Encode time of first 50 frames of bus (speed 0) @ 1500kbps goes from 2min4.9 to 2min3.1, i.e. a 1.4% speedup overall. Change-Id: I9b25e87974430cb942caa276410bb2eda815bd83	2013-07-10 09:27:56 -07:00
Yaowu Xu	bed27a960a	Add a feature to reduce chrome intra mode search Change-Id: I721ebdeef2b53ce3e5c3eba3f7462ae2103c95a8	2013-07-10 08:59:18 -07:00
Jim Bankoski	fb027a7658	removing case statements around prediction entropy coding Removes SEG_ID Removes MBSKIP Removes SWITCHABLE_INTERP Removes INTRA_INTER Removes COMP_INTER_INTER Removes COMP_REF_P Removes SINGLE_REF_P1 Removes SINGLE_REF_P2 Removes TX_SIZE Change-Id: Ie4520ae1f65c8cac312432c0616cc80dea5bf34b	2013-07-09 20:10:16 -07:00
Yaowu Xu	059f2929e9	Merge "Revert "Remove memcpy() in handle_inter_mode() filter selection.""	2013-07-09 20:10:06 -07:00
Yaowu Xu	205efbc153	Revert "Remove memcpy() in handle_inter_mode() filter selection." This reverts commit `fcf7998a47`. Change-Id: Ic6532223faec9f1483b78adb2e37b79c7b1a0efb	2013-07-09 17:42:10 -07:00
Dmitry Kovalev	d82f459d1a	Adding encode_tiles function to vp9_bitstream.c. Change-Id: Ie44824ec25fd8fdb25d7c8124a9b28c26d802029	2013-07-09 15:59:19 -07:00
John Koleszar	f0d9f10d24	Remove all asm offset files from VP9 The files are empty and unused. Change-Id: Ieb4242d14273efdf24149bda33f9591540bba06a	2013-07-09 14:26:53 -07:00
Ronald S. Bultje	204d1b7058	Merge "Unbreak lossless."	2013-07-09 09:54:48 -07:00
Ronald S. Bultje	d8fa5d45cc	Merge "Make intra prediction pointers RTCD-based."	2013-07-09 09:54:43 -07:00
Ronald S. Bultje	059c0ba5d4	Unbreak lossless. Change-Id: I8130ec9b5371c65e885f245a5ac73840c23cb4a1	2013-07-09 09:46:37 -07:00
Dmitry Kovalev	c6c279aff0	Merge "Using mi_cols instead of mb_cols."	2013-07-08 20:09:19 -07:00
Dmitry Kovalev	1c65c580d6	Merge "Refactoring setup_pre_planes function."	2013-07-08 20:08:05 -07:00
Dmitry Kovalev	6254c8d780	Merge "Calling set_partition_seg_context() instead of code duplication."	2013-07-08 20:07:06 -07:00
Ronald S. Bultje	8350e7fe38	Make intra prediction pointers RTCD-based. This probably has a mildly negative impact on performance, but will (in future commits - or possibly merged with this one) allow SIMD implementations of individual intra prediction functions. We may perhaps want to consider having separate functions per txfm-size also (i.e. 4x4, 8x8, 16x16 and 32x32 intra prediction functions for each intra prediction mode), but I haven't played much with that yet. Change-Id: Ie739985eee0a3fcbb7aed29ee6910fdb653ea269	2013-07-08 17:25:51 -07:00
Ronald S. Bultje	a5062cc635	Don't call encode_sb() for the final of 4-split subpartitions. The resulting reconstruction is never used, thus it just wastes CPU cycles. Reduces encode time of first 50 frames of bus (speed 0) @ 1500kbps from 2min2.0 to 2min1.2, i.e. a 0.65% overall speedup. Change-Id: I74755ca3aadc21e2be220f486259060bd4088c45	2013-07-08 16:22:39 -07:00

... 3 4 5 6 7 ...

1638 Commits