generic-library/vpx

Author	SHA1	Message	Date
Jim Bankoski	5bbb0c6295	Clean up removal of vp9_pareto8 table. Change-Id: I5556e8d1fc150be8a3e93af21900829b59a500dc	2013-11-20 11:17:26 -08:00
Jim Bankoski	03276bf6e6	remove the model and copy in pack_mb_tokens Change-Id: I00a5203c8ed76c184d936fccf93d76e7c06773d3	2013-11-20 10:06:04 -08:00
Jim Bankoski	d6667dd54f	scan order table lookup same for encoder and decoder Change-Id: I473947b5ca70b7a81151926284bff86f8555492a	2013-11-19 15:31:43 -08:00
Jim Bankoski	8cf352abac	entropy code speedup Change-Id: Ic316d3374ff9a2b43897272260947d56765a0fdd	2013-11-19 14:31:38 -08:00
Deb Mukherjee	cfcd5c4f61	Simplifies band-getting with a static array Simplifies the code by implementing band mapping with static arrays. A lot of the code complexity introduced in a previous patch disappears. Change-Id: Ia3fac36e594fb5ad2d55ae141c58bba4c55c2d28	2013-11-13 22:15:16 -08:00
Dmitry Kovalev	20f34ff0db	Adding const to tree pointer inside vp9_extra_bit struct. Change-Id: I60e02fa3de930ff1f969687ab5af93dee40d86ad	2013-11-12 14:21:15 -08:00
Deb Mukherjee	5ade423774	Removes conditional statements from band getting Implements scan order to band map with arrays in both the encoder and decoder to remove conditional statements. Encoding seems to be about 1% faster at speed 0, tested on football. Decoding seems to be about 0.5-1% faster on a set of 25 videos. Change-Id: Idb233ca0b9e0efd790e30880642e8717e1c5c8dd	2013-11-12 10:13:27 -08:00
Dmitry Kovalev	7c524bbef4	Cleanup. Adding const to function pointer arguments. Change-Id: I12c67c8c0fa1aa7fb3f7d6cc2ef65be29c4ea292	2013-10-31 14:34:21 -07:00
Dmitry Kovalev	9deb614a57	Adding get_band_translate() function. Moving code that gets band_translate array from get_scan_and_band() function to get_band_translate() function. Renaming get_scan_and_band() to get_scan(). Change-Id: I43047c205a1ca2a6e24be44db39dc04b7a385008	2013-10-16 15:11:42 -07:00
Dmitry Kovalev	501a8c6b91	Merge "Removing print_prob_tree function and vp9_coeff_probs typedef."	2013-10-16 13:13:25 -07:00
Dmitry Kovalev	6965e6f3d5	Removing print_prob_tree function and vp9_coeff_probs typedef. Change-Id: If14265084e9b4c85c75b43e8d33a6fafad468cbc	2013-10-14 21:08:21 -07:00
Dmitry Kovalev	a97fe89538	Using constants instead of plain numbers. Replacing 22 with TREE_SIZE(MAX_ENTROPY_TOKENS) 12 with MAX_ENTROPY_TOKENS Change-Id: If24919336e8ace9cf64991bd5ae33fa6656f7b93	2013-10-14 20:33:37 -07:00
Dmitry Kovalev	e765aade0b	Merge "Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB."	2013-10-11 14:15:46 -07:00
Dmitry Kovalev	4a0f9478ef	Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB. Values of MODE_UPDATE_PROB and VP9_COEF_UPDATE_PROB are equal, so replacing them with one constant. Inlining appropriate arguments for functions: vp9_cond_prob_diff_update (encoder) vp9_diff_update_prob (decoder) Change-Id: I1255a1cb477743b799b3bfbbcd8de6b32b067338	2013-10-11 10:47:22 -07:00
Dmitry Kovalev	98400c1bc4	Removing vp9_tree_p typedef. It is used only two times and it is more clear to use real type instead of typedef. Change-Id: Idc25c16504c3da4d040e0cdb33a2987631bb6a5b	2013-10-10 17:16:20 -07:00
Dmitry Kovalev	9a1250e3e0	Merge "Moving all scan/iscan code into separate vp9_scan.{h, c} files."	2013-10-10 10:45:07 -07:00
Dmitry Kovalev	23cc1cd8e6	Removing redundant vp9_pt_energy_class declarations. Declaring vp9_pt_energy_class in vp9_entropy.h instead of many external places. Change-Id: I66e8a3fc119a43f88d130d0dae4133c825a047a3	2013-10-07 14:11:01 -07:00
Dmitry Kovalev	e3597c6af7	Moving all scan/iscan code into separate vp9_scan.{h, c} files. Now we have entropy code separate from scan/iscan code. The next step in future is to move iscan code from common part to the encoder. Change-Id: Id9732f7d80aec00af35c1d58d1137c4c96c91451	2013-10-07 13:55:56 -07:00
Dmitry Kovalev	c982a73b9f	Removing unused vp9_coeff_stats_model typedef. Change-Id: I6973e7121b6393379b5759f288632e8eab763d3e	2013-09-30 15:10:00 -07:00
Dmitry Kovalev	209c6cbf8f	Removing vp9_get_coef_neighbors_handle function. Change-Id: I6be72c8b048d1ccc7ef43764cf84c32360098970	2013-09-27 14:11:13 -07:00
Dmitry Kovalev	a23c2a9e7b	Adding get_scan_and_band function. Extracting get_scan_and_band function from get_entropy_context to remove duplicated code. Change-Id: I5da1f5a60263017e887da68bc834317b5f084cb2	2013-09-19 16:53:48 +04:00
Jim Bankoski	2e4ca9d1a5	resolve clang warnings : uninitialized vars in vp9_entropy.h This helps clear out some of the warnings Change-Id: Ie7ccaca8fd92542386a7f1b257398e1bdf2f55dc	2013-09-04 18:38:41 -07:00
James Zern	d765df2796	consistently name VP9_COMMON variables #3 stragglers Change-Id: Ib1e853f9a331b7b66639dc34d79568d84d1930f1	2013-08-29 13:27:41 -07:00
Dmitry Kovalev	a93992e725	Adding get_entropy_context function. Moving common code from encoder and decoder to this function. Change-Id: I60fa643fb1ddf7ebbff5e83b6c4710137b0195ef	2013-08-27 14:17:53 -07:00
Dmitry Kovalev	45870619f3	Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder. Adding temporary "typedef BLOCK_SIZE BLOCK_SIZE_TYPE" which will go away after encoder's patch. Change-Id: I06ec6a6f079401439843ec981d1496234fd7775c	2013-08-26 11:33:16 -07:00
Dmitry Kovalev	c43da352ab	Cleaning up reset_skip_context function. Change-Id: Ib3e72671eb8da6f2e9767a6de292ec7c7cde6bc7	2013-08-21 16:31:51 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Dmitry Kovalev	5b65246a71	Adding missing const to vp9_extra_bits array. Change-Id: Icd128ab58719e0b9066bdfa66a5d0d427a84d6df	2013-07-31 18:51:18 -07:00
John Koleszar	a31effca75	Remove unnecessary 64 byte alignment Fixes a warning on MSVS 2012 where the alignment of vp9_default_iscan_8x8 didn't match between its declaration and definition. Change-Id: I1466a15635f4b22594d705d570b7e399bfb6cf21	2013-07-29 14:02:02 -07:00
Dmitry Kovalev	7131cb0e3d	General cleanups. Removing unused constants, macros, and function declarations. Using ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving #include from .h to .c. Merging for loops for motion vectors. Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13	2013-07-25 14:13:48 -07:00
Ronald S. Bultje	e20fcd9585	More optimizations for cost_coeffs(). 4x4: 163 -> 123 cycles (33% faster) 8x8: 491 -> 399 cycles (23% faster) 16x16: 1889 -> 1763 cycles (7% faster) 32x32: 8311 -> 8180 cycles (1.6% faster) Overall encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.33 to 1min3.00, i.e. 2.11% faster. Change-Id: Ib52d1dbb5649b14de769d3e7a74af67440b5284f	2013-07-22 16:09:09 -07:00
Dmitry Kovalev	ca75f1255f	Removing and moving around constant definitions. Removing unused and duplicated constants, moving them from .h to .c if possible. Change-Id: Ief4d6b984a3ca2e9b38504f0d855ed072cf7133f	2013-07-15 19:26:30 -07:00
Ronald S. Bultje	26b6318de8	Make get_coef_context() branchless. This should significantly speedup cost_coeffs(). Basically what the patch does is to make the neighbour arrays padded by one item to prevent an eob check in get_coef_context(), then it populates each col/row scan and left/top edge coefficient with two times the same neighbour - this prevents a single/double context branch in get_coef_context(). Lastly, it populates neighbour arrays in pixel order (rather than scan order), so we don't have to dereference the scantable to get the correct neighbours. Total encoding time of first 50 frames of bus (speed 0) at 1500kbps goes from 2min10.1 to 2min5.3, i.e. a 2.6% overall speed increase. Change-Id: I42bcd2210fd7bec03767ef0e2945a665b851df56	2013-07-01 16:34:10 -07:00
Ronald S. Bultje	7353ceab9d	Quantize (64-bit only, for now) SSSE3 SIMD. Total encoding time for first 50 frames of bus (speed 0) @ 1500kbps goes 2min34.8 to 2min14.4, i.e. a 10.4% overall speedup. The code is x86-64 only, it needs some minor modifications to be 32bit compatible, because it uses 15 xmm registers, whereas 32bit only has 8. Change-Id: I2df53770c2e850813ffa713e1a91b45b0082b904	2013-07-01 11:36:07 -07:00
Ronald S. Bultje	d00b8e5f82	Inline vp9_get_coef_context() (and remove vp9_ prefix). Makes cost_coeffs() a lot faster: 4x4: 236 -> 181 cycles 8x8: 888 -> 588 cycles 16x16: 3550 -> 2483 cycles 32x32: 17392 -> 12010 cycles Total encode time of first 50 frames of bus (speed 0) @ 1500kbps goes from 2min51.6 to 2min43.9, i.e. 4.7% overall speedup. Change-Id: I16b8d595946393c8dc661599550b3f37f5718896	2013-06-28 10:40:21 -07:00
John Koleszar	08b1798ae7	Move vp9_full_to_model_counts to encoder This function is not called from the decoder, so it doesn't need to be in common/. Change-Id: I6977dd462a25b4ff39c9c7e1b0b5b16aa58ee733	2013-06-24 15:46:15 -07:00
John Koleszar	5c32215e27	Remove unused vp9_model_to_full_probs_sb() This function never referenced. Change-Id: I1c42cd355bfa88e17d169f7335a44be682af58cc	2013-06-21 15:38:55 -07:00
John Koleszar	0f7a66e962	Remove constant vp9_coef_update_prob table All elements of this table are equal to 252, so replace it with a single constant VP9_COEF_UPDATE_PROB. Change-Id: I1e2d1d284326ce6df9899a740c2fc344b3ec81c9	2013-06-14 15:12:31 -07:00
Deb Mukherjee	b8b3f1a46d	Balancing coef-tree to reduce bool decodes This patch changes the coefficient tree to move the EOB to below the ZERO node in order to save number of bool decodes. The advantages of moving EOB one step down as opposed to two steps down in the other parallel patch are: 1. The coef modeling based on the One-node becomes independent of the tree structure above it, and 2. Fewer conext/counter increases are needed. The drawback is that the potential savings in bool decodes will be less, but assuming that 0s are much more predominant than 1's the potential savings is still likely to be substantial. Results on derf300: -0.237% Change-Id: Ie784be13dc98291306b338e8228703a4c2ea2242	2013-05-29 16:25:52 -07:00
Deb Mukherjee	d8c0989d56	Clean up related to coefficient modeling Uses reduced arrays for probabilities and branch counts in the encoder. No change in bitstream. Change-Id: Iec605446f44db4cd325eb45fa12a3003a6ee29db	2013-05-28 16:32:03 -07:00
Paul Wilkins	e41fd6e3e2	Fix bug in 4x4 band definition. Also some unused data structures/references removed. Change-Id: I295809e887173543e794250cb60ddaf1475ffd24	2013-05-23 17:51:02 -07:00
Paul Wilkins	33ecd6ad54	Merge Scatter Scan experiment. Removal from under configure flag. A bit renaming Change-Id: I2213229dfe852001dfec16b149f47c52ce88f3aa	2013-05-23 13:09:27 +01:00
Deb Mukherjee	de4d682ca4	Using 128 entry look up table for coef models Reverts to using 128 bit LUT for the coef models rather than 48 to ease hardware implementation. Also incorporates some cleanups including removing various hooks to support different lookup tables based on block_type and ref_type. Change-Id: I54100c120cca07a2ebd3a7776bc4630fa6a153f6	2013-05-22 08:44:31 -07:00
Deb Mukherjee	7a645e4e12	Merging the model coef prob experiment Merges the experiment. Change-Id: I4eb19af6de6df6aa3a96a2e82f231d47ed9b3ae9	2013-05-21 14:44:38 -07:00
Deb Mukherjee	07443f1589	Refinements on modelcoef expt to reduce storage Uses more aggrerssive interpolation to reduce storage for the model tables by almost more than half. Only 48 lists of probs are stored (as opposed to 128 before), corresponding to ONE_NODE probabilities of: 1, 3, 7, 11, ..., 115, 119, 127, 135, ..., 247, 255. Besides, only 1 table is used as opposed to 2 before. So the overall memory needed for the tables is just 48 * 8 = 384 bytes. The table currently used is based on a new Pareto distribution with heavier tail than a generalized Gaussian - which improves results on derf by about 0.1% over a single table Generaized Gaussian. Results overall on derfraw300 is -0.14%. Change-Id: I19bd03559cbf5894a9f8594b8023dcc3e546f6bd	2013-05-21 10:06:56 -07:00
Deb Mukherjee	39a90bc8e8	Updating the model coef experiment Cleans up the experiment. Actually uses reduced counts for backward updates, and reduced number of probabilities in the context. No change in bitstream when the experiment is on. Between expt on and off: derfraw300 is down only -0.062% (which is better than when expts were run previously). Change-Id: I55285a049a0c22810bdb42914212ab5a4f8521b5	2013-05-20 12:46:36 -07:00
Paul Wilkins	e5f715201a	Change to band calculation. Change band calculation back to simpler model based on the order in which coefficients are coded in scan order not the absolute coefficient positions. With the scatter scan experiment enabled the results were appear broadly neutral on derf (-0.028) but up a little on std-hd +0.134). Without the scatterscan experiment on the results were up derf as well. Change-Id: Ie9ef03ce42a6b24b849a4bebe950d4a5dffa6791	2013-05-13 17:21:49 +01:00
Dmitry Kovalev	a0b6b8a7d4	Merge "Removing unused code + little cleanup." into experimental	2013-05-08 11:23:14 -07:00
Dmitry Kovalev	cba0a5db2b	Removing unused code + little cleanup. Change-Id: I81c19a8f19cfb5c7183609656ade833d72feb500	2013-05-07 16:56:22 -07:00
Paul Wilkins	a14ae84749	Deprecate code_zerogroup experiment. Delete code under the CONFIG_CODE_ZEROGROUP flag. Change-Id: I5fe6c7b42a5da9b73118e33594301da4129f320a	2013-05-07 16:52:55 -07:00

1 2

90 Commits