generic-library/vpx

Author	SHA1	Message	Date
John Koleszar	ab77b4e898	RTCD: add remaining IDCT functions This commit continues the process of converting to the new RTCD system. Change-Id: I03c4dbf30dfd3558b0e256ff9d3ff4c012aadc80	2012-01-30 12:08:22 -08:00
John Koleszar	55f74c59c7	RTCD: add loopfilter functions This commit continues the process of converting to the new RTCD system. Change-Id: Ic8a4047d72ff3a54ec98977dd90e70c13213db71	2012-01-30 12:06:31 -08:00
John Koleszar	a910049aea	New RTCD implementation This is a proof of concept RTCD implementation to replace the current system of nested includes, prototypes, INVOKE macros, etc. Currently only the decoder specific functions are implemented in the new system. Additional functions will be added in subsequent commits. Overview: RTCD "functions" are implemented as either a global function pointer or a macro (when only one eligible specialization available). Functions which have RTCD specializations are listed using a simple DSL identifying the function's base name, its prototype, and the architecture extensions that specializations are available for. Advantages over the old system: - No INVOKE macros. A call to an RTCD function looks like an ordinary function call. - No need to pass vtables around. - If there is only one eligible function to call, the function is called directly, rather than indirecting through a function pointer. - Supports the notion of "required" extensions, so in combination with the above, on x86_64 if the best function available is sse2 or lower it will be called directly, since all x86_64 platforms implement sse2. - Elides all references to functions which will never be called, which could reduce binary size. For example if sse2 is required and there are both mmx and sse2 implementations of a certain function, the code will have no link time references to the mmx code. - Significantly easier to add a new function, just one file to edit. Disadvantages: - Requires global writable data (though this is not a new requirement) - 1 new generated source file. Change-Id: Iae6edab65315f79c168485c96872641c5aa09d55	2012-01-30 12:06:27 -08:00
John Koleszar	9951f46133	Merge "Hook up VP8D_GET_LAST_REF_USED"	2012-01-27 11:31:35 -08:00
John Koleszar	8be41bba80	Hook up VP8D_GET_LAST_REF_USED Commit `892e23a5b` introduced support for the VP8D_GET_LAST_REF_USED, but missed the mapping of the control id to the underlying function, so it was unavailable to applications. In addition, the underlying function vp8_references_buffer() is moved from common/postproc.c to decoder/onyxd_if.c as postproc.c is not built in all configurations. Change-Id: I426dd254e7e6c4c061b70d729b69a6c384ebbe44	2012-01-27 10:13:20 -08:00
John Koleszar	319f7c4d56	Merge changes I17e1a348,Iad710941 * changes: Correct clamping in use of vp8_find_near_mvs() Revert "Multithreaded encoder, late sync loopfilter"	2012-01-26 14:33:28 -08:00
John Koleszar	83cef816fd	Correct clamping in use of vp8_find_near_mvs() Commit `e06c242ba` introduced a change to call vp8_find_near_mvs() only once instead of once per reference frame by observing that the only effect that the frame had was on the bias applied to the motion vector. By keeping track of the sign_bias value, the mv to use could be flip-flopped by multiplying its components by -1. This behavior was subtley wrong in the case when clamping was applied to the motion vectors found by vp8_find_near_mvs(). A motion vector could be in-bounds with one sign bias, but out of bounds after inverting the sign, or vice versa. The clamping must match that done by the decoder. This change modifies vp8_find_near_mvs() to remove the clamping from that function. The vp8_pick_inter_mode() and vp8_rd_pick_inter_mode() functions instead track the correctly clamped values for both bias values, switching between them by simple assignment. The common clamping and inversion code is in vp8_find_near_mvs_bias() Change-Id: I17e1a348d1643497eca0be232e2fbe2acf8478e1	2012-01-26 09:37:27 -08:00
Attila Nagy	294aa37745	Rename save_neon_reg.asm as save_reg_neon.asm Easier to filter out all NEON asm. Change-Id: I0022dae8321a9608e864b09d4181414c5fff4610	2012-01-26 09:44:00 +02:00
John Koleszar	630d3b95e2	Revert "Multithreaded encoder, late sync loopfilter" This commit is incomplete, as it does not synchronize the loop filter before returning a handle to the reconstructed frame in vpx_codec_get_preview_frame(), which can cause (false?) failures when running the test_reconstruct_buffer test. This may be related to a bug that does cause visible artifacts, which is also under investigation. This reverts commit `380d64ecb1`. Change-Id: Iad710941e7731d44fc2bde63bc63d6763cc4629e	2012-01-24 15:41:59 -08:00
Fritz Koenig	c14754be1d	Merge "Disconnect ARM tgt_isa from dsp extensions"	2012-01-23 11:13:33 -08:00
Scott LaVarnway	8a6af9f98f	Improved uv mv calculations in build inter predictor Changed calculations to use shifts instead of if-then-else. Eliminates branches. Change-Id: I11b75e8bb305301ffd9cb577fb7df059a3cf9ea4	2012-01-23 11:34:43 -05:00
Fritz Koenig	892102842a	Disconnect ARM tgt_isa from dsp extensions A processor with ARMv7 instructions does not necessarily have NEON dsp extensions. This CL has the added side effect of allowing the ability to enable/disable the dsp extensions cleanly. Change-Id: Ie1e879b8fe131885bc3d4138a0acc9ffe73a36df	2012-01-20 10:38:15 -08:00
Deb Mukherjee	f357e5e2f7	Merge "Overhauling the thresholds and mixing proportions for mfqe postprocessor."	2012-01-20 08:45:42 -08:00
Deb Mukherjee	47dcd769c1	Overhauling the thresholds and mixing proportions for mfqe postprocessor. Makes the thresholds for the multiframe quality enhancement module depend on the difference between the base quantizers. Also modifies the mixing function to weigh the current low quality frame less if the difference in quantizer is large. With the above modifications mfqe works well for both scalable patterns as well as low quality key frames. Change-Id: If24e94f63f3c292f939eea94f627e7ebfb27cb75	2012-01-20 05:11:00 -08:00
Jeff Faust	ac97b089d1	Merge "Simplify an assignment statement"	2012-01-18 21:14:51 -08:00
John Koleszar	6a4ff6f325	Merge "get_plane_pointers: use u/v planes consistently"	2012-01-18 14:22:55 -08:00
John Koleszar	4753ee4166	get_plane_pointers: use u/v planes consistently The prior commit accidentally used the u plane where it should have used the v plane. Change-Id: Ib6c8443b99061536389f05ac25b8e0a307ace637	2012-01-18 12:50:06 -08:00
Jeff Faust	15c29afeca	Simplify an assignment statement Separated a double assignment that looked suspiciously like an assignment and equality typo. Change-Id: I7813979e9d7ea2539afb3c8ae6074f9df5ebdf52	2012-01-18 12:49:43 -08:00
Jim Bankoski	de368fd0b5	Merge "vp8d - valgrind warnings in mb post processor"	2012-01-18 11:12:37 -08:00
John Koleszar	0e06bc817a	Merge changes I1ebe76aa,Ia079b52b * changes: rdopt/pickinter: factor out some common setup rdopt: remove unused frame_lf_or_gf	2012-01-18 09:30:46 -08:00
Deb Mukherjee	d3879738bf	Merge "Modifying the base q propagation in the mfqe post processing filter in a way such that when there is a single bad frame, the post-processing is applied not only to just that frame but a few subsequent frames as well."	2012-01-18 09:15:24 -08:00
Jim Bankoski	ed208f7d2f	vp8d - valgrind warnings in mb post processor Solved by extending the border in the postproc buffer as necessary Change-Id: Ic3f61397fe5bc8e4db6fc78050b0b160bd0aee86	2012-01-17 17:27:39 -08:00
Deb Mukherjee	90b9f993c1	Modifying the base q propagation in the mfqe post processing filter in a way such that when there is a single bad frame, the post-processing is applied not only to just that frame but a few subsequent frames as well. Change-Id: Iba5d9896eed77244eb76b4a74692a93f8ecff634	2012-01-17 09:28:03 -08:00
Adrian Grange	e479379abb	Fixed bugs in multi-layer code related to changing params When running multi-layer (ML) encodes and dynamically changing coding parameters on the fly (e.g. frame duration/rate, bandwidths allocated to each layer) the encoder would not produce sensible output. In certain cases the rate targeting would be hideously inaccurate. These fixes make it possible to change these coding parameters correctly and to maintain accurate control of the rate targeting. I also added the specification of the input timebase into the test program, vp8_scalable_patterns.c. Patch 2: Moved declaration to appease MS compiler) Change-Id: Ic8bb5a16daa924bb64974e740696e040d07ae363	2012-01-13 16:52:25 -08:00
John Koleszar	4ade079633	rdopt/pickinter: factor out some common setup Add new get_predictor_pointers() and get_reference_search_order() functions for code shared between the two implementations. Change-Id: I1ebe76aa8f168b1f5cfabc00d05d8f19a0d4d207	2012-01-11 14:43:52 -08:00
John Koleszar	bd5bfd94b8	rdopt: remove unused frame_lf_or_gf This flag was set but unused. Change-Id: Ia079b52b88ffbe3b16fdbde4b84e2b87304eaa13	2012-01-11 13:02:19 -08:00
Deb Mukherjee	9c2ca8c1ca	Allowing the mfqe post-processing filter to be used in conjunction with deblock or demacroblock filters. When --mfqe is used together with --demacroblock or --deblock, mfqe is applied first and then demacroblock/deblock is applied to the mfqe result. Change-Id: Id83ee01f1b4a33a116f071dcf26d59c7f3497c32	2012-01-10 14:14:41 -08:00
John Koleszar	e6c91b625e	Merge "fix: roundoff initializer is not a constant"	2012-01-10 13:33:26 -08:00
James Berry	6ce1f15dfb	fix: roundoff initializer is not a constant precision used in initialization of roundoff is not a constant updated to use #define MFQE_PRECISION 4 Change-Id: If2fc3d3d633d58a7f4ab34d258c232ec1e5f0a79	2012-01-10 14:19:30 -05:00
Jim Bankoski	892e23a5ba	vp8d - function to check if a reference frame is used. Change-Id: Id683b4d7f46ffa99145fc4b824c7232ab4182f21	2012-01-10 10:10:26 -08:00
Deb Mukherjee	28aa08748e	Merge "Multiframe quality enhancement postprocessing"	2012-01-09 10:23:24 -08:00
James Zern	80528410fc	Merge "Reduce the default kf_max_dist to 128."	2012-01-06 12:32:07 -08:00
John Koleszar	66da859e5e	Merge "Reduced the size of Y1Dequant and friends to [128][2]"	2012-01-06 11:59:06 -08:00
Ralph Giles	2a0d7b1a55	Reduce the default kf_max_dist to 128. The default maximum keyframe interval is 9999, or over five minutes at normal video rates. When the encoder produces streams with such a long interval seeking (with correct output) is more expensive, and live streaming is impossible. Of course the encoding application should set this parameter based on its knowledge of the intended use of the stream, but reducing the default gives better results for applications which do not. Change-Id: I900b15d74ce72ecc3ade4d43f758c5cf97a2098a	2012-01-06 11:34:45 -08:00
Scott LaVarnway	5f25d4c175	Reduced the size of Y1Dequant and friends to [128][2] This patch removes the local copies of the dequantize constants and implements John's idea as described in "Make a local copy of the dequantized data" commit. Change-Id: Ic6b7d681f00bf63263f71ff1e39ab2f80729e8b2	2012-01-06 11:12:00 -08:00
Deb Mukherjee	87aa846b47	Multiframe quality enhancement postprocessing Adds a multiframe postprocessing module to enhance the quality of certain frames that are coded at lower quality than preceding frames. The module can be invoked from the commandline by use of the --mfqe option, and will be most beneficial for enhancing the quality of frames decoded using scalable patterns. Uses the vp8_variance_var16x16 and vp8_variance_sad16x16 function pointers to compute SAD and Variance of blocks. Change-Id: Id73d2a6e3572d07f9f8e36bbce00a4fc5ffd8961	2012-01-05 10:21:48 -08:00
Johann	0780f258da	Merge "Improve SSSE3 fast quantizer function"	2012-01-05 10:09:39 -08:00
Scott LaVarnway	b2c8dff727	Merge "Removed unused diff buffer"	2012-01-05 09:06:28 -08:00
Scott LaVarnway	89cdfdb231	Merge "SSE2 optimizations for vp8_build_intra_predictors_mby{,_s}()"	2012-01-05 09:05:19 -08:00
Scott LaVarnway	77119a5cd8	Merge "Improved sse2 version of simple loopfilter"	2012-01-04 13:26:13 -08:00
Scott LaVarnway	5bfa29b6c5	Merge "Make a local copy of the dequantized data"	2012-01-04 07:50:13 -08:00
Yunqing Wang	9f1083e9a0	Merge "Improve vp8cx_init_quantizer()"	2012-01-04 06:22:15 -08:00
Scott LaVarnway	33d9ea5471	Merge "Remove useless g_common.h"	2012-01-03 09:48:35 -08:00
Yunqing Wang	2b2c0c9bda	Improve SSSE3 fast quantizer function Simplified the EOB calculation in the function. Change-Id: I7422f18be40ae270358f5cb0811d66e64436b56f	2011-12-29 12:05:50 -05:00
John Koleszar	3cb92b85b9	Remove unused MACROBLOCK member vector_range Change-Id: Ie2dc0d72363ff38e0f71b59f6e2d1a2d70c5266b	2011-12-28 14:58:38 -08:00
John Koleszar	31e86192ba	Remove unused BLOCK member force_empty Change-Id: I72ed49ce14ca0124dd0d31bfcf4c7630a4681587	2011-12-28 13:57:51 -08:00
Yunqing Wang	b510863f8f	Improve vp8cx_init_quantizer() Except zrun_zbin_boost, 15 AC values are the same for all other parameters. Removed unneccessary calculation. Change-Id: I6101c0fe8080bd2b4387c3b04d7ddedbf6010409	2011-12-28 13:55:55 -05:00
John Koleszar	03fadc4b20	Merge "Remove unnecessary ternary constructs"	2011-12-22 13:01:05 -08:00
John Koleszar	d48ea5a2ab	Merge "Remove legacy integer types"	2011-12-22 13:00:23 -08:00
John Koleszar	adb10c47a8	Merge "Use lookup tables for mode_check_freq"	2011-12-22 12:59:47 -08:00
John Koleszar	64c4be2669	Merge "Use lookup tables for thresh_mult"	2011-12-22 10:31:21 -08:00
John Koleszar	0c2b2c79ae	Remove unnecessary ternary constructs The code had a number of constructs like (condition)?1:0, which is redundant with C's semantics. In the cases where a boolean operator was used in the condition, simply remove the ternary part. Otherwise adjust the surrounding expression to remove the condition (eg, for rounding up. See pickinter.c and rdopt.c) Change-Id: Icb2372defa3783cf31857d90b2630d06b2c7e1be	2011-12-22 10:09:46 -08:00
John Koleszar	f56918ba9c	Remove legacy integer types Remove BOOL, INTn, UINTn, etc, in favor of C99-style fixed width types. Change-Id: I396636212fb5edd6b347d43cc940186d8cd1e7b5	2011-12-22 09:58:40 -08:00
John Koleszar	aa8650dd7f	Use lookup tables for mode_check_freq Mostly cosmetic. Trying for a more compact representation of speed selection thresholds. Change-Id: I339e7840049b91ad569aabbdc9c702a496110d3b	2011-12-22 09:43:44 -08:00
John Koleszar	efb4783d36	Use lookup tables for thresh_mult Mostly cosmetic. Trying for a more compact representation of speed selection thresholds. Change-Id: Icaebea632c7bb71ca8e07b4def04a046d4515e27	2011-12-22 09:43:40 -08:00
John Koleszar	a2407935d2	Merge "Remove opaque pointer VP8D_PTR"	2011-12-22 09:36:11 -08:00
John Koleszar	0c2f8e77cc	Remove useless g_common.h This file declared a bunch of nonexistent, unreferenced global function pointers. Change-Id: Ic26bb8c7712deba754c49fc01f383b53afc9e728	2011-12-21 15:02:23 -08:00
John Koleszar	bf1a8073c3	Remove opaque pointer VP8D_PTR Use an opaque struct rather than typecasting through VP8D_PTR, an int*. Change-Id: Ia260b7d53d7e0950cfa1e00f4ecead1099bd3b87	2011-12-21 14:48:10 -08:00
James Zern	b651875e24	squash some signed/unsigned comparison warnings Change-Id: Ifc64cf990ae04d77934da3324d0afb3993f061e7	2011-12-21 13:49:19 -08:00
Johann	db389cb804	Make a local copy of the dequantized data Multithreaded encoding was breaking at low bitrates Please review/comment. Not sure if this is the best fix. Change-Id: I87468c765372593fd865bc82e25121ebb8ca6af2	2011-12-21 12:39:39 -08:00
John Koleszar	bb1915274f	Remove unreferenced includes These files are legacy and have no current references. Change-Id: I38224961fafeb33bc3eb6150bb0c2249ccbb4f60	2011-12-21 11:01:11 -08:00
John Koleszar	16a8948c45	Merge "Remove opaque pointer VP8_PTR"	2011-12-21 09:59:22 -08:00
Scott LaVarnway	1d7d18c69c	Improved sse2 version of simple loopfilter Change-Id: Iae406d16fab5bace47fbcf5ef7ed021f08af159d	2011-12-21 12:52:18 -05:00
John Koleszar	63d9c4da5e	Merge "tokenizer: use correct block type context in stuff1st_order_b"	2011-12-21 09:20:21 -08:00
John Koleszar	b0056c3b5e	Remove opaque pointer VP8_PTR Use an opaque struct rather than typecasting through VP8_PTR, an int*. Change-Id: I5ed4d9238ba2e8d51bfa07a8da87a2eb4c8fa43a	2011-12-21 09:13:51 -08:00
John Koleszar	056bcc8771	remove armv6 files from armv5 build Make bilinearfilter_arm.c compiled only when HAVE_ARMV6, as its definitions are v6 only. This is normally not a problem for static builds as the file is elided at link time, but this was not being done properly for the --enable-shared --enable-pic build. Change-Id: Ic800a7cde751f74f22555c5b247f99f9df5e550d	2011-12-19 13:51:11 -08:00
Johann	080919b3c2	Merge "Avoid heap allocation of firstpass stats"	2011-12-19 10:11:23 -08:00
John Koleszar	c75f0ec379	Merge "fix: make sure ss_err is large enough"	2011-12-19 09:50:12 -08:00
Yunqing Wang	fd294c553a	Merge "Merge mr_pick_inter_mode and pick_inter_mode"	2011-12-19 08:42:20 -08:00
Yunqing Wang	c647ec4462	Merge mr_pick_inter_mode and pick_inter_mode Merged multi-resolution motion estimation with regular motion estimation function in order to remove duplicated part. This caused slight changes in multi-resulotion encoder quality & performance. Change-Id: Ib4ecc7acfebfe5eea959b5b91febae6db7b95fd1	2011-12-16 18:02:29 -05:00
James Berry	24196dd987	fix: make sure ss_err is large enough increase size of ss_err by one to make sure there is room for 64 elements. Change-Id: I355cb8c499aa7da3b9675f2326a8d25a74bb88d2	2011-12-16 17:43:55 -05:00
John Koleszar	26c6a44c66	Avoid heap allocation of firstpass stats The total_stats, this_frame_stats, and total_left_stats structures were previously create by a heap allocation, despite being of fixed size. These structures were allocated and deallocated during {de,}allocate_compressor_data, which is reinvoked whenever the frame size changes. Unfortunately, this clobbers the total_stats and total_left_stats data. Historically, these were variable size at one time, due to the first pass motion map, which necessitated their being created by a unique heap allocation. However, this bug with the total_stats being clobbered has probably been present since that initial implementation. These structures are instead moved to be stored within the struct twopass_rc directly, rather than being heap allocated separately. Change-Id: I7f9e519e25c58b92969071f0e99fa80307e0682b	2011-12-16 11:40:23 -08:00
Scott LaVarnway	0ccefd2c8f	Fixed mb_skip_coeff bug When mb_skip_coeff is set, the idct is not necessary. Prior to this patch, the code would call idcts based on leftover eob information. This patch will now skip the idct for SPLIT_MV and clear out the eobs for B_PRED, forcing the idct to be skipped. Change-Id: If5b0d2ed3ebd07789d30ec5160df927485fcaa17	2011-12-16 13:48:01 -05:00
Scott LaVarnway	a53d5a4c44	Moved dequant idct into common These functions are now used by the encoder. This is WIP with the goal of creating a common idct/add for the encoder and decoder. A boost of 1.8% was seen for the HD rt test clip used. [Tero] Added needed changes to ARM side. Change-Id: Ibbb8000be09034203d7adffc457d3c3f8b06a5bf	2011-12-15 14:23:41 -05:00
Yunqing Wang	c8df1656bd	Merge "Only call vp8_find_near_mvs() once for each macroblock"	2011-12-15 09:53:25 -08:00
Yunqing Wang	e06c242baa	Only call vp8_find_near_mvs() once for each macroblock While doing motion search on a macroblock, we usually call vp8_find_near_mvs once per reference frame. Actually, for different reference frames, the only difference in calculating these near_mvs is they may have different sign_bias, which causes a sign change in resulting near_mvs. In this change, we only do find_near_mvs for the first reference frame. For other reference frames, only need to adjust the near_mvs according to that reference frame's sign_bias value. Change-Id: I661394b49c6ad79fed7d0f2eb2be239b9c56f149	2011-12-15 11:19:18 -05:00
Yunqing Wang	d7e09b6ada	Merge "Force realtime version 1 streams to only use simple loopfilter"	2011-12-15 05:38:57 -08:00
John Koleszar	72f459c77f	Merge "Avoid multiple test for same lvl in auto filter lvl pick"	2011-12-14 16:28:13 -08:00
John Koleszar	4f8f360098	Merge "add check to ensure that cq_level falls within min and max q"	2011-12-14 12:10:06 -08:00
John Koleszar	e542627b0c	Merge "fix: active_worst_quality could be set above 127"	2011-12-14 11:22:00 -08:00
James Berry	c1c47e83b0	add check to ensure that cq_level falls within min and max q Add the notion of deferred validation of parameters. We don't want to validate the cq_level at initialization time, because it won't have been set via set_param() yet. Change-Id: Ia1308395e8c10e0b1dc4e9af3a09b2bd6744cc30	2011-12-14 10:39:06 -08:00
Attila Nagy	51c4f9e6b1	Avoid multiple test for same lvl in auto filter lvl pick Sometimes same level is tested 2-3 times; store and reuse the calculated error value. Change-Id: Ia1c04a2568232edf9a5a62c4e2d8e8a50d85e00e	2011-12-14 15:56:29 +02:00
Attila Nagy	55fbdd58ac	Force realtime version 1 streams to only use simple loopfilter ...regardless of the speed settings. Change-Id: I4b91ac7a7208efd690dfc69e175f8eb769b6ce03	2011-12-14 12:57:49 +02:00
James Berry	f8b431c334	fix: active_worst_quality could be set above 127 add check to set active_worst_quality to 127 if it is set above 127 Change-Id: I7db353d5c1b1c8516a116542b6ed21c0110bb512	2011-12-13 14:58:59 -05:00
John Koleszar	d6020f9d52	tokenizer: use correct block type context in stuff1st_order_b The fast-path for skipped MBs was not correctly respecting the block type during update of the coefficient counts. Extracted this from part of change I365cfb6ac636f19c545f682e3aeac185253abaef Change-Id: I53d8cf0a00a98034b97b0ed3414b703bae74a739	2011-12-13 11:58:56 -08:00
Jim Bankoski	6b2792b0e0	Merge "vp8e - entropy stats per frame type"	2011-12-12 09:08:34 -08:00
Scott LaVarnway	c4aa1d508e	Removed unused diff buffer Change-Id: I9211358cca89b1c4f84b53a202a63ecf9e79ae4c	2011-12-12 11:06:55 -05:00
Scott LaVarnway	afa1b66108	Merge "Improved mmx/sse2 versions of iwalsh"	2011-12-12 06:40:28 -08:00
Jim Bankoski	6de67cd6e8	vp8e - entropy stats per frame type Change-Id: I4168eb6ea22ae541471738a7a3453e7d52059275	2011-12-09 16:56:18 -08:00
Scott LaVarnway	9fa6132fc5	Improved mmx/sse2 versions of iwalsh Removed unnecessary transposes. Change-Id: I029fbaf8afafee34d54a4f3333c22023c15003c3	2011-12-08 14:37:59 -05:00
Johann	a69810b893	Merge "Reduce mem copies in encoder loopfilter level picking"	2011-12-07 10:41:00 -08:00
Attila Nagy	e570b0406d	Reduce mem copies in encoder loopfilter level picking Do the test filtering in the existing backup frame buffer instead of the original. Copy the original data into extra buffer before doing the filtering. This way there is no need to restore the original unfiltered frame at the end of level picking process. This came up in some discussions with Johann. Thanks! Change-Id: I495f4301d983854673276c34ec0ddf9a9d622122	2011-12-07 09:59:50 +02:00
Yunqing Wang	aa7335e610	Multiple-resolution encoder The example encoder down-samples the input video frames a number of times with a down-sampling factor, and then encodes and outputs bitstreams with different resolutions. Support arbitrary down-sampling factor, and down-sampling factor can be different for each encoding level. For example, the encoder can be tested as follows. 1. Configure with multi-resolution encoding enabled: ../libvpx/configure --target=x86-linux-gcc --disable-codecs --enable-vp8 --enable-runtime_cpu_detect --enable-debug --disable-install-docs --enable-error-concealment --enable-multi-res-encoding 2. Run make 3. Encode: If input video is 1280x720, run: ./vp8_multi_resolution_encoder 1280 720 input.yuv 1.ivf 2.ivf 3.ivf 1 (output: 1.ivf(1280x720); 2.ivf(640x360); 3.ivf(320x180). The last parameter is set to 1/0 to show/not show PSNR.) 4. Decode: ./simple_decoder 1.ivf 1.yuv ./simple_decoder 2.ivf 2.yuv ./simple_decoder 3.ivf 3.yuv 5. View video: mplayer 1.yuv -demuxer rawvideo -rawvideo w=1280:h=720 -loop 0 -fps 30 mplayer 2.yuv -demuxer rawvideo -rawvideo w=640:h=360 -loop 0 -fps 30 mplayer 3.yuv -demuxer rawvideo -rawvideo w=320:h=180 -loop 0 -fps 30 The encoding parameters can be modified in vp8_multi_resolution_encoder.c, for example, target bitrate, frame rate... Modified API. John helped a lot with that. Thanks! Change-Id: I03be9a51167eddf94399f92d269599fb3f3d54f5	2011-12-05 17:59:42 -05:00
John Koleszar	6127af60c1	Merge "Speed selection support for disabled reference frames"	2011-12-05 14:36:54 -08:00
Yunqing Wang	06fc0f83b6	Populate q_index in multi-thread encoding This value needs to be copied to each thread's data structure. This fixed artifact problem in multi-thread encoder. Change-Id: Iab6d9745a1d44846aa503184705376f63a505597	2011-11-28 15:58:28 -05:00
Scott LaVarnway	34d7c8b3d4	Added vp8_dequant_idct_add_y_block_sse2 setup In Change I83202ffd, I deleted one too many lines. Change-Id: If05d7c8988eb5c00898dc7c833ad7d99b5eb23e7	2011-11-28 13:06:13 -05:00
Scott LaVarnway	f46e17fd6f	Merge "Modified the inverse walsh to output directly"	2011-11-28 07:26:07 -08:00
Scott LaVarnway	4a91541c94	Modified the inverse walsh to output directly to the dqcoeff or qcoeff buffer. The encoder would populate the dc coeffs of the y blocks as a separate stage (recon_dcblock) and the decoder would use a special version of the idct. This change eliminates the extra copy and reduces the code footprint. [Tero] Added needed changes to armv6 and NEON assembly. Change-Id: I83202ffdbaf83f6e5dd69f4ba2519fcf0b13b3ba	2011-11-25 09:24:04 +02:00
Johann	e2bacd581a	Merge "Move shared data to shared location"	2011-11-23 11:20:54 -08:00
Johann	15ea268d62	Merge "Fix encoder partitioned output on ARM"	2011-11-23 08:44:21 -08:00
Attila Nagy	97259b460c	Fix encoder partitioned output on ARM API was not returning correct partition sizes on arm targets. The armv5 token packing functions were not storing the information to the partition size table. As a fix, have one boolcoder instance allocated for each partition so that partition sizes are internally available after all partitions were encoded. This will also allow more flexibility in producing several partitions in parallel. Use buffer validation (overflow check) in all ARM bitpacking functions. Change-Id: I31c8a11d8a7613676f0ff50928cb2a2ab14fd169	2011-11-23 12:29:43 +02:00
John Koleszar	b79879c2e3	Merge "Decoder fixes to better support reference picture selection."	2011-11-22 17:12:06 -08:00
Stefan Holmer	b5ee7b12d2	Decoder fixes to better support reference picture selection. Change-Id: Id3388985d754706b9fd1f079c47121e79a63efdf	2011-11-21 10:25:21 +01:00
Johann	f2cd4ded22	Move shared data to shared location Storing vp8_bilinear_filters_mmx in an mmx file and using it in an sse2 file is bad Moving towards allowing --disable-mmx Change-Id: I20493b35bdedcdcfc0915e6f05fdbe6c81a4a742	2011-11-18 16:23:14 -08:00
John Koleszar	e55974bf86	Speed selection support for disabled reference frames There was an implicit reference frame test order (typically LAST, GOLD, ARF) in the mode selection logic, but this doesn't provide the expected results when some reference frames are disabled. For instance, in real-time mode, the speed selection logic often disables the ARF modes. So if the user disables the LAST and GOLD frames, the encoder was always choosing INTRA, when in reality searching the ARF in this case has the same speed penalty as searching LAST would have had. Instead, introduce the notion of a reference frame search order. This patch preserves the former priorities, so if a frame is disabled, the other frames bump up a slot to take its place. This patch lays the groundwork for doing something smarter in the frame test order, for example considering temporal distance or looking at the frames used by nearby blocks. Change-Id: I1199149f8662a408537c653d2c021c7f1d29a700	2011-11-18 13:53:21 -08:00
Attila Nagy	c84d42f864	Validate encoder buffer writes for single token partition Extend buffer write validation (overflow check) to single token partition packing, both mb and row based functions. Change-Id: I36e19b7d37fc43712d05c70e3ad223d3eb5b973d	2011-11-18 12:49:27 +02:00
Scott LaVarnway	3c755577b8	Merge "Added predictor stride argument(s) to subtract functions"	2011-11-17 10:17:53 -08:00
Scott LaVarnway	edd98b7310	Added predictor stride argument(s) to subtract functions Patch set 2: 64 bit build fix Patch set 3: 64 bit crash fix [Tero] Patch set 4: Updated ARMv6 and NEON assembly. Added also minor NEON optimizations to subtract functions. Patch set 5: x86 stride bug fix Change-Id: I1fcca93e90c89b89ddc204e1c18f208682675c15	2011-11-15 12:53:01 -05:00
John Koleszar	bdd35c13cc	avoid resetting framerate during vpx_codec_enc_config_set() The calculated frame_rate is a state variable in the codec, and shouldn't be maintained in the configuration struct. Move it to the main part of cpi so that it isn't clobbered when the configuration struct is updated. The initial framerate estimate is moved from the vp8_cx_iface.c wrapper into the body of init_config() in onyx_if.c, so that it is only called once and not reset on every call to vp8_change_config(). Change-Id: I8d9a3d1283330d1ee297d07e9d78d1f2875f2465	2011-11-11 14:45:58 -08:00
Scott LaVarnway	df49c7c58d	SSE2 optimizations for vp8_build_intra_predictors_mby{,_s}() Ronald recently sent me this patch that he did in April. > From: Ronald S. Bultje <rbultje@google.com> > Date: Thu, 28 Apr 2011 17:30:15 -0700 > Subject: [PATCH] SSE2 optimizations for > vp8_build_intra_predictors_mby{,_s}(). HD decode tests have shown a performance boost up to 1.5%, depending on material. Patch set 3: Fixed encoder crash. Change-Id: Ie1fd1fa3dc750eec1a7a20bfa2decc079dcf48c8	2011-11-09 15:30:35 -05:00
Scott LaVarnway	9532bda0fb	Merge "Relocated idct/add calls for encoder"	2011-11-09 10:17:43 -08:00
Johann	ea2229bab6	Merge "ARMv6 optimized Intra4x4 prediction"	2011-11-09 09:36:33 -08:00
John Koleszar	2999ca3094	Merge "Reset FPU state after calc_plane_error()"	2011-11-09 09:35:08 -08:00
John Koleszar	3fcf0e3668	Merge "Compiler warning fix for const array."	2011-11-09 09:34:50 -08:00
John Koleszar	82e8884ad8	Merge "Remove unused file recon.c"	2011-11-09 09:31:23 -08:00
Scott LaVarnway	861ed6a5c1	Relocated idct/add calls for encoder Call the idct/add after the tokenize. This is WIP with the goal of creating a common idct/add for the encoder and decoder. This move is necessary because the decoder's version of the idct clobbers qcoeff, which is used by the tokenize. Change-Id: I6b08d8e8397cd873647fa4fb9469884e3c876756	2011-11-09 10:41:05 -05:00
Tero Rintaluoma	5a2fd63a2a	ARMv6 optimized Intra4x4 prediction Added ARM optimized intra 4x4 prediction - 2x faster on Profiler compared to C-code compiled with -O3 - Function interface changed a little to improve BLOCKD structure access Change-Id: I9bc2b723155943fe0cf03dd9ca5f1760f7a81f54	2011-11-09 09:13:51 +02:00
James Zern	9d60506130	threading: avoid defining _WIN32_WINNT The referenced function (SignalObjectAndWait) isn't used. Reduces the warnings with mingw32-w64 which defines this. Change-Id: I4ce592879ec9372bf196dac640204c4d370bd210	2011-11-08 18:50:45 -08:00
John Koleszar	f89e109f56	Remove unused file recon.c File not referenced from anywhere and no longer compiles. Change-Id: I38b11bd60db615c2c2c9d7ad35caba3a1adf1750	2011-11-08 15:54:56 -08:00
Yunqing Wang	4c14efd234	Fix checks in MB quantizer initialization vp8cx_mb_init_quantizer() needs to be called at least once to get all values calculated. This change added one check to decide if we could skip initialization or not. Change-Id: I3f65eb548be57580a61444328336bc18c25c085b	2011-11-08 12:11:48 -05:00
Adrian Grange	b615a6d47f	Third set of checks of buffer level against maximum buffer size Additional check of buffer level to ensure it doesn't exceed the maximum buffer size. Change-Id: I1ba4f8b09bbec89646885040ff47470196af521e	2011-11-07 17:15:28 -08:00
Adrian Grange	fa25a31ed4	Additional clipping of buffer level to maximum buffer size Added additional check of buffer level against maximum buffer size. Change-Id: Iaf1fbaf008601161e402b43ce82c3dbc129bf740	2011-11-07 16:54:40 -08:00
Adrian Grange	9dc95b0a12	Added check to make sure maximum buffer size not exceeded Added code to clip the buffer level to the maximum buffer size. Without this the buffer level would increase unchecked. This bug was found when encoding an essentially static scene at 2Mb/s. The encoder is unable to generate frames consistent with the high data-rate because Q bottoms out at Qmin. As frames generated are consistently undersized the buffer level increases and does not get checked against the maximum size specified by the user (or default). Change-Id: Id8a3c6323d3246da50f7cb53ddbf78b5528032c6	2011-11-07 16:28:13 -08:00
James Zern	f89ea3432f	fix file permissions all of googletest import (0ab00a22) was marked executable Change-Id: Id7b7ee03efc21ab998bb03349bd91644e8af25da	2011-11-04 18:50:35 -07:00
Fritz Koenig	f0c01413fb	Compiler warning fix for const array. Fix compiler warning for passing a non const array to a function expecting a const array by using an intermediary pointer and casting. Change-Id: I9bdd358ebdc926223993fb8fb2098ffedd2f3fc7	2011-11-04 18:19:26 -07:00
Yunqing Wang	e1a55b504a	Merge "Add checks in MB quantizer initialization"	2011-11-04 11:52:27 -07:00
Scott LaVarnway	44b5f76e34	Merge "Fix issue 374: eob read incorrectly"	2011-11-04 11:31:17 -07:00
John Koleszar	7ca6c91732	Merge "Changing decoder input partition API to input fragments."	2011-11-04 09:36:27 -07:00
Tero Rintaluoma	d497ec688d	Fix issue 374: eob read incorrectly Updated eob changes to check_reset_2nd_coeffs function. Change-Id: Id1b21c91c7f0fd286640b487ffe47867009b717d	2011-11-04 09:36:49 +02:00
Scott LaVarnway	46639567a0	Merge "Change use of eob in the encoder"	2011-11-03 08:06:06 -07:00
Tero Rintaluoma	e4f2ec7a52	Change use of eob in the encoder Changed 'int eob' to 'char *eob' in BLOCKD so that both encoder and decoder will use eobs[25] array from MACROBLOCKD structure. In future, this will enable use of the decoder side IDCT in the encoder. Change-Id: I6e1c011628cb8864fd4a0b80f0279ce16a5ca978	2011-11-03 16:08:09 +02:00
Yaowu Xu	8002c31804	Merge "added code to clear 2nd order block when appropriate"	2011-11-02 08:22:58 -07:00
John Koleszar	63bf108731	Merge "Fix: Increase default cx_data_size"	2011-11-01 15:25:47 -07:00
Stefan Holmer	1427205215	Changing decoder input partition API to input fragments. Adding support for several partitions within one input fragment. This is necessary to fully support all possible packetization combinations in the VP8 RTP profile. Several partitions can be transmitted in the same packet, and they can only be split by reading the partition lengths from the bitstream. Change-Id: If7d7ea331cc78cb7efd74c4a976b720c9a655463	2011-11-01 14:44:37 -07:00
Yunqing Wang	e44720af84	Add checks in MB quantizer initialization In some situations (f.g. error-resilient is turned on), vp8cx_mb _init_quantizer() was called once per macroblock. Added checks to avoid calculations when there is no change. Change-Id: Ie4f0a5ade2202041254990a4e9d5b03bd1ac5aea	2011-11-01 17:41:22 -04:00
John Koleszar	9bf3bc9a72	Correct SPLITMV clamping Prior to this fix, the clamping state of the last subblock partition dominated, whereas the correct behavior is to clamp if any partition needs clamping. This bug was introduced by v0.9.6-232-g6b25501 See also: [1]: http://code.google.com/p/webm/issues/detail?id=371 [2]: https://bugzilla.mozilla.org/show_bug.cgi?id=696390 Change-Id: I444db492b4c4f05f039c7da6f4216da8207dc138	2011-10-31 14:42:51 -07:00
Yaowu Xu	88e24f07ae	added code to clear 2nd order block when appropriate It is discovered that in rare situations the 2nd order block may produce a few small magnitude coefficients that has no effect on reconstruction. The situations are a combination of low quantizer values (high quality) and low energy in residual signals (content dependent). This commit added code to detect such cases and reset the 2nd order block to all 0. Patch 1 to 4 used code to do all-zero-check on idct result buffer, and tests on derf set showed a consistent gain of .12%-.14% on all metrics.But due to a recent change Ie31d90b, the idct result buffer is not longer populated. So patch 5&6 use an alternative method to detect the situations. Tests on derf set now shows a consistent quality gain of .16%-.20%. As suggested by Jim, Patch 7&8 removed the condition of all first order block not having any coefficient, instead we reset 2nd order coefficients to all 0 if sum of absolute value of the coefficients is small. So it does slightly more than just detecting the oddity as discussed above, but tests on derf set now show a consistent gain of .20%-.23% on all metrics. It is worth noting here that this change does not have any effect on mid/high quantizer range, it only affects the quantizer value 18 or blow. Within this range, the change helps compression by up to 2.5% on clips in the derf set. Change-Id: I718e19cf59a4fc2462cb7070832759beb9f7e7dd	2011-10-28 12:07:21 -07:00
Scott LaVarnway	e0309e1509	Merge "Improved decode_split_mv()"	2011-10-28 09:27:17 -07:00
Johann	cd1ef53d12	Merge "Fix ARM build problem introduced by CL I3fab6f2b"	2011-10-27 11:17:54 -07:00
Scott LaVarnway	6064384d59	Improved decode_split_mv() Tests showed ~1.2% performance boost on the HD clip used. Performance will vary based on material. Change-Id: Icbcf1a828750d5b4ae5252bf596b3ef594042e8a	2011-10-27 11:26:30 -04:00
Scott LaVarnway	0db5599957	Merge "Improved mv_bias"	2011-10-27 06:14:00 -07:00
Attila Nagy	9452dce181	Fix ARM build problem introduced by CL I3fab6f2b Update ARM asm implementation of vp8_start_encode to new definition. Change-Id: Ic44791c969e351082331ba6146c3384c01a0dfad	2011-10-27 09:06:45 +03:00
Johann	294777b915	Merge "Reduce partial frame copy in encoder's pick_filter_level_fast"	2011-10-26 11:33:14 -07:00
Scott LaVarnway	21970d1dc2	Improved mv_bias Small performance gains. Change-Id: I709b9390a8a27a70f5f23574313b8db85ac7f23d	2011-10-26 11:46:10 -04:00
Scott LaVarnway	efa69d26a1	Merge "Improved read_mb_modes_mv()"	2011-10-26 08:26:30 -07:00
Scott LaVarnway	ff1d170e69	Improved read_mb_modes_mv() Interleaved vp8_find_near_mvs and vp8_mv_ref_probs. 2.5% to 4% performance improvement for the HD clips used. Change-Id: Id888b667cf5ae2f0e19da18743140f055ff7de8d	2011-10-26 10:46:36 -04:00
Attila Nagy	de82809444	Reduce partial frame copy in encoder's pick_filter_level_fast The partial frame copy function used to copy an extra 8 lines above and below. The partial frame filtering can only modify 3 pixel rows above the partial frame. Reduce copy to bare minimum needed, which is 4 lines, so that partial filtering on copied frame is possible. Define the "magic" fraction number for partial filtering in loopfilter.h . Change-Id: I4791ffc541b6884b12759a0d0714a8faf16147ec	2011-10-26 15:25:07 +03:00
Johann	f9dba66877	Merge "remove uninitialized variable warning"	2011-10-25 14:42:21 -07:00
Scott LaVarnway	e03330bd80	Merge "Improved token decoder"	2011-10-25 10:04:11 -07:00
Johann	9409af2083	remove uninitialized variable warning Restructure if statement to clarify the error condition. Trigger the error before clobbering pc-> variables. Change-Id: Id01cab798a341ce9899078fdcec265a0e942a0b7	2011-10-25 09:24:40 -07:00
Scott LaVarnway	3579baa115	Merge "Removed read_mv_ref"	2011-10-25 08:01:32 -07:00
Johann	a82cc0205d	remove unused variable warning Change-Id: I4fcd6e4656d9823aead941616cd63501aecbd6e2	2011-10-24 16:33:45 -07:00
Scott LaVarnway	49ea2bc3f4	Removed read_mv_ref Decode the mv mode with if-then-elses instead of traversing the vp8_mv_ref_tree data structure. This will make it easier to interleave vp8_find_near_mvs and vp8_mv_ref_probs. Change-Id: I1e798d6ec40fcaeeff06ccc82f81201978d12f74	2011-10-24 16:16:08 -04:00
Scott LaVarnway	f182376dd6	Moved the split motion vector decode into a function. Change-Id: Ia023a0587100a52cb084f5d9d5512efa6198dad3	2011-10-24 13:52:15 -04:00
Scott LaVarnway	231339932b	Merge "Removed redundant mv clamps for nearmv and nearestmv"	2011-10-24 10:27:53 -07:00
James Berry	bac6c229e5	Fix: Increase default cx_data_size Prior size could be too small in some instances resulting in an error. Change-Id: Ic601e49cbae92c98a0e7fb51ba8c186b352ffba6	2011-10-24 11:50:27 -04:00
Scott LaVarnway	a99c20c0f4	Removed redundant mv clamps for nearmv and nearestmv Did some cleanup as well. Patchset 2: Fixed bug. Will revisit the segmentation logic. Change-Id: Idf9fbcff9aaf467bdace9fbd58ef2cea6c602049	2011-10-24 11:37:52 -04:00
Scott LaVarnway	e59d53e999	Merge "Remove unused DETOK structure"	2011-10-21 06:30:57 -07:00
Tero Rintaluoma	bdb4fb8991	Remove unused DETOK structure DETOK structure is not used anymore. Change-Id: Id22e1af78fb85d4bb151237a60290d9364faf217	2011-10-21 09:33:49 +03:00
John Koleszar	2c0b4a24b9	Merge "Fix: check cx_data buffer prior to write"	2011-10-20 17:36:40 -07:00
James Berry	bc7151131d	Fix: check cx_data buffer prior to write check to make sure that cx_data buffer has enough room before writting to it, prior behavior did not which could result in a crash. Change-Id: I3fab6f2bc4a96d7c675ea81acd39ece121738b28	2011-10-20 15:55:00 -04:00
Johann	7cdc986cdf	Don't copy borders for loop_filter_pick During the _pick only the Y plane is examined. In addition, data beyond the borders of the frame is not read. Change-Id: Ic549adfca70fc6e0b55f8aab0efe81f0afac89f9	2011-10-19 18:54:14 -07:00
Johann	f382173225	Merge "enc: save entropy probs only when needed for refresh"	2011-10-19 14:36:29 -07:00
Scott LaVarnway	5e54085703	Improved token decoder Tests showed over 2% improvement on various HD clips. Change-Id: I94a30d209c92cbd5fef285122f9fc570688635fe	2011-10-19 13:38:35 -04:00
Scott LaVarnway	63a77cbed9	Merge "Remove usage of predict buffer for decode"	2011-10-19 10:24:48 -07:00
Scott LaVarnway	ed9c66f584	Remove usage of predict buffer for decode Instead of using the predict buffer, the decoder now writes the predictor into the recon buffer. For blocks with eob=0, unnecessary idcts can be eliminated. This gave a performance boost of ~1.8% for the HD clips used. Tero: Added needed changes to ARM side and scheduled some assembly code to prevent interlocks. Patch Set 6: Merged (I1bcdca7a95aacc3a181b9faa6b10e3a71ee24df3) into this commit because of similarities in the idct functions. Patch Set 7: EC bug fix. Change-Id: Ie31d90b5d3522e1108163f2ac491e455e3f955e6	2011-10-18 12:06:50 -04:00
Attila Nagy	a5cd42feb9	Fix: vp8cx_pack_tokens_into_partitions_armv5 crash It was crashing when number of partitions was bigger than the number of MB rows (ex. 128x96 with 8 partitions). Start point was not checked against mb_rows, plus extra "empty" partitions were not written out. Change-Id: I9c2f013b9ec022354b658fab4ef799ff8b1de93d	2011-10-14 10:53:04 +03:00
Adrian Grange	04182a121a	Merge "Added rate-targeted temporal scalability"	2011-10-11 12:54:52 -07:00
Adrian Grange	217591fde5	Added rate-targeted temporal scalability Added the ability to create rate-targeted, temporally scalable, VP8 compatible bitstreams. The application vp8_scalable_patterns.c demonstrates how to use this capability. Users can create output bitstreams containing upto 5 temporally separable streams encoded as a single VP8 bitstream. (previously abandoned as: I92d1483e887adb274d07ce9e567e4d0314881b0a) Change-Id: I156250a3fe930be57c069d508c41b6a7a4ea8d6a	2011-10-11 12:49:12 -07:00
John Koleszar	07ba411914	Reset FPU state after calc_plane_error() Fixes a MMX/SSE2 mismatch when building with --enable-internal-stats. Change-Id: I0c50a1f246f6916b7a5fc6f36864ceb362f25520	2011-10-11 08:43:30 -07:00
James Berry	05bde9d4a4	bug fix - starting/optimal/max and buffer_level changed from int to int64_t buffer_level in VP8_COMP and starting_buffer_level, optimal_buffer_level and maximum_buffer_size in VP8_CONFIG changed from int to int64_t to avoid potential crash issues for larger target bit rates. Change-Id: I0d5ab6c8a44c2fef51f30cd8df4bb4b739c5df26	2011-10-10 12:16:55 -04:00
Attila Nagy	c0de35b413	enc: save entropy probs only when needed for refresh Previous entropy probs need to be saved (and restored) only when current updates are not propagated. Change-Id: Ie6ee0543066e30874e56258be0a6b7d2dd2fdb2b	2011-10-10 13:44:54 +03:00
Scott LaVarnway	af12c23e8e	Merge "Improved tokenize"	2011-10-04 09:57:42 -07:00
John Koleszar	8f8b526b54	Merge "Fix uninitialized new_mv_count in first pass file"	2011-10-04 07:40:49 -07:00
Yunqing Wang	538865dfa5	Merge "Multithreaded encoder, late sync loopfilter"	2011-10-04 07:04:30 -07:00
John Koleszar	86712c50f2	Fix uninitialized new_mv_count in first pass file Uninitialized data could be written to the first pass file when no motion vectors are present in the frame. Also fix a number of compiler warnings. Change-Id: Icc9f53b6d33da9de4563d86d9fd591910473ea90	2011-10-04 09:50:52 -04:00
Johann	2aa408524c	Merge "Reduce computational complexity of generic C loop filter."	2011-09-30 16:17:56 -07:00
Johann	48b1917112	Merge "combine loopfilter data access"	2011-09-30 15:47:56 -07:00
Scott LaVarnway	ab00d209bc	Improved tokenize For a realtime HD encodings, up to 1.6% gains seen. Change-Id: If45028e23db95124da63f9d38ffe06e05596cc6e	2011-09-30 12:49:46 -04:00
Johann	3556deaca3	combine loopfilter data access The data processed by the loopfilter overlaps. At the block level, this results in some redundant transforms. Grouping the filtering allows for a single 16x16 transpose (and inversion) instead of three 16x8 transposes (and three more inversions). This implementation is x86_64 only. We retain the previous implementation for x86. Improvements are obviously material dependant, but it seems to be ~%1 in tests here. Change-Id: I467b7ec3655be98fb5f1a94b5d145e5e5a660007	2011-09-30 07:38:35 -07:00
Alpha Lam	7bce513afe	Call vp8_find_near_mvs lazily vp8_find_near_mvs() is being called on all possible reference frames but the data computed may be used if the loop exits early, which can be due to x->skip beign set to 1. Optimize this by call vp8_find_near_mvs() laziy only if it is going to be used and not computed yet. Change-Id: Iccdbd4c962a670c9f2c99b8aca8096042ca5dc98	2011-09-30 14:48:18 +01:00
Paul Wilkins	a572ac8327	Merge "CQ and two pass rate control."	2011-09-30 02:57:54 -07:00
Paul Wilkins	b6e27d5f0b	CQ and two pass rate control. Changes to the selection of Q limits for two pass and two pass CQ mode. Allowance made for Mode and motion vector costs. Some refactoring of common code. For Derf and YT sets CQ mode average improvement circa 1% (SSIM and Global PSNR). Some increased tendency to undershoot even when user CQ not reached. Patch2: Removed some test code accidentally merged. Change-Id: Icf74d13af77437c08602571dc7a97e747cce5066	2011-09-30 10:55:52 +01:00
Aaron Watry	69aa303d96	Reduce computational complexity of generic C loop filter. Change-Id: I1e7f9ed3cd907844a495b9e0073bc140b87e5c06	2011-09-29 17:25:48 -05:00
Attila Nagy	380d64ecb1	Multithreaded encoder, late sync loopfilter Sync with loopfilter thread just at the beginning of next frame encoding. This returns control to application faster and allows a better multicore scaling. When PSNR packets are generated the final filtered frame is needed imediatly so we cannot delay the sync. Change-Id: I288d97b5e331d41d6f5bb49d97986fa12ac6f066	2011-09-29 10:06:24 +03:00
John Koleszar	6f9457ec12	Merge "clamp_mvs() using the wrong motion vector information"	2011-09-22 11:54:15 -07:00
John Koleszar	3c85c532bb	Merge changes Ie650e9b8,I2427e494 * changes: vpxenc: get version string programatically Install missing default_coef_probs.h	2011-09-22 11:18:00 -07:00
Johann	9f41a8b0aa	Merge "Replace vpx_ports/config.h with vpx_config.h"	2011-09-22 09:30:18 -07:00
John Koleszar	4a6ac727fe	Install missing default_coef_probs.h Make sure that this header is listed as one of the sources, so that it will be installed if necessary. Change-Id: I2427e494488126b179151dc21043c1e2c8ba5991	2011-09-22 11:08:24 -04:00
Attila Nagy	1a7d25a484	Replace vpx_ports/config.h with vpx_config.h Just a clean-up. Change-Id: Iea5b6dc925dcfa7db548bc1ab1a13d26ed5a2c9a	2011-09-22 13:33:54 +03:00
Fritz Koenig	bd0c3409a8	Move neon only arm functions under arm/neon. These files don't contain generic arm code, so should only be compiled by neon. Change-Id: Ie712823aa04d4235e7cfe7a3b725e73ee4c3e564	2011-09-20 10:51:06 -07:00
Johann	6829e62718	Merge "NEON FDCT updated to match current C code"	2011-09-20 09:51:05 -07:00
Johann	86e07525d5	Merge "NEON walsh transform updated to match C"	2011-09-20 09:50:42 -07:00
Johann	3a16276cf7	Merge "Updated ARMv6 forward transforms to match C"	2011-09-20 09:50:36 -07:00
Johann	fdd51829b1	Merge "Fixed armv5te multiplications"	2011-09-20 09:50:19 -07:00
Tero Rintaluoma	0c2529a812	NEON FDCT updated to match current C code - Removed fast_fdct4x4_neon and fast_fdct8x4_neon - Uses now short_fdct4x4 and short_fdct8x4 - Gives ~1-2% speed-up on Cortex-A8/A9 Change-Id: Ib62f2cb2080ae719f8fa1d518a3a5e71278a41ec	2011-09-20 10:20:55 +03:00
Tero Rintaluoma	3c19bc3fb3	Fixed armv5te multiplications Rd and Rm registers should be different in 'mul'. This register combination results in unpredictable behaviour. GCC will give a warning and RVCT an error in this case. Restriction applies only to armv5 targets and not for armv6 and above. Change-Id: I378d17c51e1f16a6820814fbed43e115aaabb03e	2011-09-20 09:59:27 +03:00
Stefan Holmer	e529a825f7	Fix necessary for input partitions iface to match the RTP profile These changes fixes a glitch between the RTP profile and the input partitions interface. Since there's no way for the user to know the actual number of partitions, the decoder have to read the multi_token_paritition bits also when input partitions mode is enabled. Included are also a couple of fixes for issues with independent partitions and uninitialized memory reads. Change-Id: I6f93b15287d291169ed681898ed3fbcc5dc81837	2011-09-19 15:00:21 +02:00
Tero Rintaluoma	4c3ad66b7f	Updated ARMv6 forward transforms to match C - Updated walsh transform to match C (based on Change Id24f3392) - Changed fast_fdct4x4 and 8x4 to short_fdct4x4 and 8x4 correspondingly Change-Id: I704e862f40e315b0a79997633c7bd9c347166a8e	2011-09-19 10:26:59 +03:00
Tero Rintaluoma	2a4b2a000c	NEON walsh transform updated to match C Modified original patch If2f07220885c4c3a0cae0dace34ea0e36124f001 according to comments. Scheduled code a little bit to prevent some interlocks. Change-Id: I338f02b881098782f82af63d97f042b85e63e902	2011-09-19 10:15:33 +03:00
John Koleszar	35ce4eb01d	Merge "Fixes the boundary checks for extrapolated and interpolated MVs."	2011-09-16 08:09:44 -07:00
Scott LaVarnway	c0ee870b0a	clamp_mvs() using the wrong motion vector information In the "Removed bmi copy to/from BLOCKD" commit, the copy to the bmi in BLOCKD was eliminated. The clamp_mvs() used the bmi in BLOCKD, which now contains incorrect values. This patch fixes this problem. Change-Id: I8eca1eaf4015052b0b63e90876f7ad321aba7cff	2011-09-16 11:03:53 -04:00
Stefan Holmer	b854bbd844	Fixes the boundary checks for extrapolated and interpolated MVs. Change-Id: I5b47d39d1604f2650d2f2d1ca2a3f40843c8e1ea	2011-09-16 11:58:57 +02:00
Scott LaVarnway	5bc7b3a68e	Fixed encoder crash caused by the "Removed bmi copy to/from BLOCKD" commit. Change-Id: I9fae71bdc34c8ecc07bb81cd3ccf498b91ce3ec7	2011-09-13 11:46:33 -04:00
Scott LaVarnway	c4b9089bb9	Merge "Skip computation of distortion in vp8_pick_inter_mode if active_map is used"	2011-08-31 07:18:52 -07:00
Scott LaVarnway	222c72e50f	Merge "Removed bmi copy to/from BLOCKD"	2011-08-31 06:57:20 -07:00
Alpha Lam	0e05f2c6c9	Skip computation of distortion in vp8_pick_inter_mode if active_map is used If a block is marked to be inactive then set distortion to 0. Change-Id: Ib415f19642a2ff7b5cf5cfaedd60ebbd79732272	2011-08-31 14:06:55 +01:00
John Koleszar	800b70a3bf	Merge "Recalculate zbin_extra only if regular quantizer is being used"	2011-08-30 12:49:24 -07:00
Alpha Lam	bc9293b815	Recalculate zbin_extra only if regular quantizer is being used vp8_update_zbin_extra() is called all the time even though the fast quantizer doesn't use it. Skip this call if fast quantizer is used. Change-Id: Ia711c38431930cc2486cf59b8466060ef0e9d9db	2011-08-30 19:23:34 +01:00
Yunqing Wang	1f20202e2c	Minor modification on key frame decision This change makes sure that no key frame recoding in real-time mode even if CONFIG_REALTIME_ONLY is not configured. Change-Id: Ifc34141f3217a6bb63cc087d78b111fadb35eec2	2011-08-25 16:54:45 -04:00
Fritz Koenig	4797a97215	Quiet warning by removing unused variable. fwd_boost_score was not being computed or referenced, so remove declaration. Change-Id: Iece36cde1ec113e3c6afaff1407d24cdf12bd0a8	2011-08-24 15:47:09 -07:00
Scott LaVarnway	b870947d42	Removed bmi copy to/from BLOCKD for SPLITMV and B_PRED modes. Modified code to use the bmi found in mode_info_context instead of BLOCKD. On the decode side, the uvmvs are calculated only when required, instead of every macroblock. This is WIP. (bmi should eventually be removed from BLOCKD) Small performance gains noticed for RT encodes and decodes.(VGA) Change-Id: I2ed7f0fd5ca733655df684aa82da575c77a973e7	2011-08-24 14:42:26 -04:00
Fritz Koenig	112bd4e2b4	Fix naming of sse2 idct functions. Prepend idct function names with vp8_ so that under profiling they show up associated with libvpx. Change-Id: I4fe357b50236cb7730a4cc00164c0a3487a1d8b4	2011-08-24 10:25:32 -07:00
Scott LaVarnway	1de5da80c9	Merge "Faster vp8_default_coef_probs"	2011-08-24 07:52:10 -07:00
Johann	85358d04cd	Fix data accesses for simple loopfilters The data that the simple horizontal loopfilter reads is aligned, treat it accordingly. For the vertical, we only use the bottom 4 bytes, so don't read in 16 (and incur the penalty for unaligned access). This shows a small improvement on older processors which have a significant penalty for unaligned reads. postproc_mmx.c is unused Change-Id: I87b29bbc0c3b19ee1ca1de3c4f47332a53087b3d	2011-08-23 20:42:45 -04:00
Fritz Koenig	c5f890af2c	Use local labels for jumps/loops in x86 assembly. Prepend . to local labels in assembly code. This allows non unique labels within a file. Also makes profiling information more informative by keeping the function name with the loop name. Change-Id: I7a983cb3a5ba2413d5dafd0a37936b268fb9e37f	2011-08-23 09:05:29 -07:00
Fritz Koenig	694d4e7777	Reclassify optimized ssim calculations as SSE2. Calculations were incorrectly classified as either SSE3 or SSSE3. Only using SSE2 instructions. Cleanup function names and make non-RTCD code work as well. Change-Id: I48ad0218af0cc51c5078070a08511dee43ecfe09	2011-08-22 12:36:28 -07:00
Fritz Koenig	b7a6f1d20e	Merge "Revert "Reclasify optimized ssim calculations as SSE2.""	2011-08-22 12:32:12 -07:00
Fritz Koenig	734b1b2041	Revert "Reclasify optimized ssim calculations as SSE2." This reverts commit `01376858cd`	2011-08-22 11:31:12 -07:00
Fritz Koenig	f8e3d23b99	Merge "Reclasify optimized ssim calculations as SSE2."	2011-08-22 09:20:33 -07:00
Fritz Koenig	01376858cd	Reclasify optimized ssim calculations as SSE2. Calculations were incorrectly classified as either SSE3 or SSSE3. Only using SSE2 instructions. Cleanup function names and make non-RTCD code work as well. Change-Id: I29f5c2ead342b2086a468029c15e2c1d948b5d97	2011-08-19 08:51:27 -07:00
John Koleszar	edec5eb5e7	Merge "Copy less when active map is in use"	2011-08-19 07:31:00 -07:00
Alpha Lam	4e8d35a461	Copy less when active map is in use When active map is specified and the current frame is not a key frame, golden frame nor a altref frame then copy only those active regions. This significantly reduces encoding time by as much as 19% on the test system where realtime encoding is used. This is particularly useful when the frame size is large (e.g. 2560x1600) and there's only a few action macroblocks. Change-Id: If394a813ec2df5a0201745d1348dbde4278f7ad4	2011-08-19 10:29:41 -04:00
Paul Wilkins	744f482350	Small boost to every other frame. Instead of a single mid GF boost apply a few extra bits to every other frame. This gives a very small average metrics improvement on both derf and YT sets. Also use min GF interval as min KF interval. Change-Id: Iee238b8cae0ffaed850a5a944ac825cee18da485	2011-08-17 14:14:23 +01:00
Scott LaVarnway	19987dcbfa	Faster vp8_default_coef_probs Copies from a generated table instead of building the default coeff probabilities during runtime. Change-Id: I4d9551ea3a2d7d4a4f7ce9eda006495221a8de50	2011-08-16 16:21:21 -04:00
John Koleszar	9cc1611588	Merge v0.9.7-p1 release int 'origin/master' Change-Id: I93388d2f8846615ad1e26b975308c5e96b9b1918	2011-08-15 17:10:01 -04:00
Stefan Holmer	99d870a472	Don't set the bmi mode when doing error concealment Since the block will be interpreted as an inter block, the mode will be interpreted as a motion vector, resulting in bad concealment. Change-Id: Ifcc685ae1cc883492bce6dbd61e418d91a89b053	2011-08-15 11:46:04 -04:00
Stefan Holmer	ff35649758	Don't set the bmi mode when doing error concealment Since the block will be interpreted as an inter block, the mode will be interpreted as a motion vector, resulting in bad concealment. Change-Id: Ifcc685ae1cc883492bce6dbd61e418d91a89b053	2011-08-15 09:56:07 +02:00
John Koleszar	e96131705a	Revert "Improved 1-pass CBR rate control" This reverts commit `b5ea2fbc2c`. Further testing showed noticable keyframe popping in some cases, reverting this for now to give time for a proper fix. Conflicts: vp8/encoder/onyx_if.c vp8/encoder/ratectrl.c Change-Id: I159f53d1bf0e24c035754ab3ded8ccfd58fd04af	2011-08-12 14:51:36 -04:00
John Koleszar	a4c2211ea3	Propagate macroblock MV to subblocks for error concealment EC expects the subblock MVs to be populated, but `f1d6cc79e4` removed this code. This commit restores it, protected by CONFIG_ERROR_CONCEALMENT. May move this to the EC code more directly in the future. Change-Id: I44f8f985720cb9a1bf222e59143f9e69abf56ad2	2011-08-12 14:49:35 -04:00
Stefan Holmer	a609be5633	Disable error concealment until first key frame is decoded When error concealment is enabled the first key frame must be successfully received before error concealment is activated. Error concealment will be activated when the delta following delta frame is received. Also fixed a couple of bugs related to error tracking in multi-threading. And avoiding decoding corrupt residual when we have multiple non-resilient partitions. Change-Id: I45c4bb296e2f05f57624aef500a874faf431a60d	2011-08-12 14:49:34 -04:00
John Koleszar	cdae03a4eb	Fix potential OOB read with Error Concealment This patch fixes an OOB read when error concealment is enabled and the partition sizes are corrupt. The partition size read from the bitstream was not being validated in EC mode. Change-Id: Ia81dfd4bce1ab29ee78e42320abe52cee8318974	2011-08-12 14:49:34 -04:00
John Koleszar	4645c89889	Merge "Disable error concealment until first key frame is decoded"	2011-08-12 11:45:26 -07:00
John Koleszar	91206793c2	Propagate macroblock MV to subblocks for error concealment EC expects the subblock MVs to be populated, but `f1d6cc79e4` removed this code. This commit restores it, protected by CONFIG_ERROR_CONCEALMENT. May move this to the EC code more directly in the future. Change-Id: I44f8f985720cb9a1bf222e59143f9e69abf56ad2	2011-08-12 11:34:40 -04:00
Stefan Holmer	3e10be93f2	Disable error concealment until first key frame is decoded When error concealment is enabled the first key frame must be successfully received before error concealment is activated. Error concealment will be activated when the delta following delta frame is received. Also fixed a couple of bugs related to error tracking in multi-threading. And avoiding decoding corrupt residual when we have multiple non-resilient partitions. Change-Id: I45c4bb296e2f05f57624aef500a874faf431a60d	2011-08-12 16:10:04 +02:00
John Koleszar	810a06b12c	Fix potential OOB read with Error Concealment This patch fixes an OOB read when error concealment is enabled and the partition sizes are corrupt. The partition size read from the bitstream was not being validated in EC mode. Change-Id: Ia81dfd4bce1ab29ee78e42320abe52cee8318974	2011-08-11 18:07:03 -04:00
Yunqing Wang	b84e8f20c3	Merge "Adjust half-pixel only search"	2011-08-05 12:15:32 -07:00
John Koleszar	238dae8604	Fix source buffer selection This patch fixes a bug in the interaction between the recode loop and spatial resampling. If the codec was in a spatial resampling state, and a subsequent iteration of the recode loop disables resampling, then the source buffer must be reset to the unscaled source. Change-Id: I4e4cd47b943f6cd26a47449dc7f4255b38e27c77	2011-08-03 16:13:15 -04:00
Yunqing Wang	b9f19f8917	Adjust half-pixel only search Changed motion search in vp8_find_best_half_pixel_step() to be the same as in vp8_find_best_sub_pixel_step(), which checks 5 points instead of 8 points. This only affects real-time mode with cpu-used >=9. Tests showed it gives 2% encoding speedup with a quality loss(psnr) of up to 0.5%. Change-Id: I16049cad1535002346d46cfdfad345bfc3dc5146	2011-08-03 11:51:07 -04:00
Johann	30e5deae5d	update extend frame borders the neon code made several assumptions which were broken by a recent change: https://review.webmproject.org/2676 update the code with new assumptions and guard them with a compile time assert Change-Id: I32a8378030759966068f34618d7b4b1b02e101a0	2011-08-02 19:26:46 -04:00
James Berry	27ee521753	include asm_com/dec_offsets for make dist Change-Id: Ia1ad66066a24c01915cd9e3ff75c7e070cc984c8	2011-08-02 13:42:03 -04:00
John Koleszar	f475f0c1bb	Merge "include the arm header files in make dist" into cayuga	2011-08-02 05:21:10 -07:00
John Koleszar	06c3d5bb9a	Fix building with --disable-postproc Change-Id: I7e6bc28e7974a376da747300744e0dd5dc1d21e9	2011-08-01 17:50:23 -04:00
Johann	3e8c6d3d35	include the arm header files in make dist Change-Id: Ibcf5b4b14153f65ce1b53c3bfba87ad2feb17bbd	2011-08-01 17:20:21 -04:00
John Koleszar	6f080f9cec	Merge "Convert rc_max_intra_bitrate_pct to control"	2011-07-29 11:57:48 -07:00
John Koleszar	1f71d2e2c8	Correctly track sharpness in vp8cx_pick_filter_level_fast Make sure to update last_sharpness_level from the current sharpness_level whenever it changes. Change-Id: I0258d2f5b11a407abf6176a8d4c4994d925943f0	2011-07-29 12:27:03 -04:00
John Koleszar	1654ae9a2a	Convert rc_max_intra_bitrate_pct to control Since this is the only ABI incompatible change since the last release, convert it to use the control interface instead. The member of the configuration struct is replaced with the VP8E_SET_MAX_INTRA_BITRATE_PCT control. More significant API changes were expected to be forthcoming when this control was first introduced, and while they continue to be expected, it's not worth breaking compatibility for only this change. Change-Id: I799d8dbe24c8bc9c241e0b7743b2b64f81327d59	2011-07-28 09:17:35 -04:00
Yunqing Wang	2f2302f8d5	Preload reference area in sub-pixel motion search (real-time mode) This change implemented same idea in change "Preload reference area to an intermediate buffer in sub-pixel motion search." The changes were made to vp8_find_best_sub_pixel_step() and vp8_find_best_half _pixel_step() functions which are called when speed >= 5. Test result (using tulip clip): 1. On Core2 Quad machine(Linux) rt mode, speed (-5 ~ -8), encoding speed gain: 2% ~ 3% rt mode, speed (-9 ~ -11), encoding speed gain: 1% ~ 2% rt mode, speed (-12 ~ -14), no noticeable encoding speed gain 2. On Xeon machine(Linux) Test on speed (-5 ~ -14) didn't show noticeable speed change. Change-Id: I21bec2d6e7fbe541fcc0f4c0366bbdf3e2076aa2	2011-07-27 14:19:10 -04:00
Yunqing Wang	f11613b620	Merge "Fix range checks in motion search"	2011-07-27 09:34:13 -07:00
Yunqing Wang	bde2afbe23	Fix range checks in motion search There were some situations that the start motion vectors were out of range. This fix adjusted range checks to make sure they are checked and clamped. Change-Id: Ife83b7fed0882bba6d1fa559b6e63c054fd5065d	2011-07-27 10:37:33 -04:00

... 3 4 5 6 7 ...

1416 Commits