generic-library/vpx

Author	SHA1	Message	Date
Scott LaVarnway	ba420f1097	Merge "Broken EC after MODE_INFO size reduction"	2011-05-27 07:52:04 -07:00
Yunqing Wang	5a8cbb8955	Merge "Remove unused code"	2011-05-27 07:25:25 -07:00
Yunqing Wang	2dc24635ec	Remove unused code Hex search is not called in rdopt.c Change-Id: I67347f03e13684147a7c77fb9e9147e440bb5e8e	2011-05-27 10:20:49 -04:00
John Koleszar	2fa7fe66c4	Merge remote branch 'origin/master' into experimental Change-Id: I6d6692418eecf54e23e00a08394b0b37d6e7682b	2011-05-27 00:05:12 -04:00
John Koleszar	fb50cad10f	Merge remote branch 'internal/upstream' into HEAD	2011-05-27 00:05:09 -04:00
Scott LaVarnway	4f586f7bd0	Broken EC after MODE_INFO size reduction This patch fixes the compiler errors and the seg fault when running decode_with_partial_drops. Change-Id: I7c75369e2fef81d53b790d5dabc327218216838b	2011-05-26 15:13:00 -04:00
John Koleszar	1fe5070b76	Merge "Do not copy data between encoder reference buffers."	2011-05-26 09:58:26 -07:00
Yaowu Xu	9a248f1593	Merge "fix the mix use of errorperbit and sadperbit"	2011-05-26 09:39:41 -07:00
John Koleszar	d1910cc484	Merge remote branch 'internal/upstream' into HEAD	2011-05-26 11:45:14 -04:00
John Koleszar	4c7fdd1800	Merge remote branch 'internal/upstream' into HEAD	2011-05-26 11:44:58 -04:00
John Koleszar	9dccdc1f08	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/encoder/encodeframe.c vp8/encoder/ethreading.c Change-Id: I4becf6f101756923de6b98ca6a2132c3605c6ea5	2011-05-26 11:44:36 -04:00
Scott LaVarnway	40b850b458	Merge "Use int_mv instead of MV in vp8_mv_cont"	2011-05-26 07:01:38 -07:00
John Koleszar	26fd970b15	Merge remote branch 'origin/master' into experimental Change-Id: Ica721b36ffaa6c4c02e1cf82850496c7063ce577	2011-05-26 00:05:13 -04:00
Yaowu Xu	d8c525b8b1	fix the mix use of errorperbit and sadperbit error_per_bit and sad_per_bit were designed as estimates of a bit worth of sum squared error and sum absolute difference respectively. Under this assumption, error_per_bit should be used in combination with 2nd order errors (variance or sum squared error) while sad_per_bit should be used in combination with 1st order SADs in motion estimation. There were a few places where sad_per_bit has been misused with variances, this commit changes to use error_per_bit for those places, also changes parameter names to properly indicate which constant is being used. On cif set, the change has a universal gain by all metrics: 0.13% by average/overall psnr and 0.1% by ssim. Change-Id: I4850fdcc3fd6886b30f784bd843f13dd401215fb	2011-05-25 16:48:10 -07:00
Yunqing Wang	13b56eeb7a	Merge " Use var8x8 instead of get8x8var in VP8_UVSSE"	2011-05-25 11:35:42 -07:00
Yunqing Wang	f299d628f3	Merge "Return sse value in vp8_variance SSE2 functions"	2011-05-25 11:31:07 -07:00
Yaowu Xu	22c05c0575	remove code not in use Change-Id: I6e5e86235d341cce3b02abda26dbeb71940ed955	2011-05-25 09:46:37 -07:00
Yunqing Wang	b6679879b8	Return sse value in vp8_variance SSE2 functions Minor modification. Change-Id: I09511d38fd1451d5c4106a48acdb3f766ce59cb7	2011-05-25 11:55:41 -04:00
Attila Nagy	a615c40499	Use var8x8 instead of get8x8var in VP8_UVSSE 'sum' returned by get8x8var is not used and var8x8 has optimizations for more platforms. Change-Id: I4a907fb1a05f285669fb0b95dc71d42182c980f6	2011-05-25 12:54:34 +03:00
John Koleszar	117fcb207e	Merge remote branch 'origin/master' into experimental Change-Id: I9e5c28f898d92091e39f62193f6329b593968819	2011-05-25 00:05:14 -04:00
Yunqing Wang	d75eb73653	Fix a bug happening while encoding at profile=3 While profile=3, there is no sub-pixel search. Distortion and SSE have to calculated using get_inter_mbpred_error(). Change-Id: Ifb36e17eef7750af93efa7d0e2870142ef540184	2011-05-24 16:28:23 -04:00
Scott LaVarnway	a39321f37e	Use int_mv instead of MV in vp8_mv_cont Less operations. Change-Id: Ibb9cd5ae66b8c7c681c9a654d551c8729c31c3ae	2011-05-24 16:01:12 -04:00
Scott LaVarnway	cfab2caee1	Removed unused variable warnings Change-Id: I6e5e921f03dc15a72da89a457848d519647677a3	2011-05-24 15:17:03 -04:00
Scott LaVarnway	b5278f38b0	Merge "MODE_INFO size reduction"	2011-05-24 12:08:24 -07:00
Scott LaVarnway	e11f21af9a	MODE_INFO size reduction Declared the bmi in MODE_INFO as a union instead of B_MODE_INFO. This reduced the memory footprint by 518,400 bytes for 1080 resolutions. The decoder performance improved by ~4% for the clip used and the encoder showed very small improvements. (0.5%) This reduction was first mentioned to me by John K. and in a later discussion by Yaowu. This is WIP. Change-Id: I8e175fdbc46d28c35277302a04bee4540efc8d29	2011-05-24 13:24:52 -04:00
John Koleszar	fbea372817	Merge "Fixing bug in VP8_SET_REFERENCE decoder control command"	2011-05-24 05:57:44 -07:00
Yunqing Wang	69aad3a720	Merge "Rewrite hex search function"	2011-05-24 05:26:16 -07:00
Henrik Lundin	a126cd1760	Fixing bug in VP8_SET_REFERENCE decoder control command In vp8dx_set_reference, the new reference image is written to an unused reference frame buffer. Change-Id: I9e4f2cef5a011094bb7ce7b2719cbfe096a773e8	2011-05-24 09:03:43 +02:00
John Koleszar	b9f98a52c8	Merge remote branch 'origin/master' into experimental Change-Id: I56a5665a5d4e2ed590d75a5ad49e8feb54393f6e	2011-05-24 00:05:10 -04:00
John Koleszar	f7044d4058	Merge remote branch 'internal/upstream' into HEAD	2011-05-24 00:05:09 -04:00
Yaowu Xu	99fb568e67	Merge "use get8x8var directly for non-subpixel motion case in VP8_UVSSE"	2011-05-23 14:49:56 -07:00
Yunqing Wang	7838f4cfff	Rewrite hex search function Reduced some bound checks in hex search function. Change-Id: Ie5f73a6c227590341c960a74dc508cff80f8aa06	2011-05-23 16:18:52 -04:00
Yaowu Xu	ab2dfd22f3	use get8x8var directly for non-subpixel motion case in VP8_UVSSE VP8_UVSSE mistakenly used subpixvar8x8 to calculate SSE for non-subpixl motion cases. Change-Id: I4a5398bb9ef39c211039f6af4540546d4972e6a9	2011-05-23 09:11:28 -07:00
John Koleszar	4d240d1eae	Merge remote branch 'origin/master' into experimental Change-Id: I90a1d0095712e0474b0c03773b57376911027fc6	2011-05-21 00:05:14 -04:00
John Koleszar	e4be958e08	Merge remote branch 'internal/upstream' into HEAD	2011-05-21 00:05:14 -04:00
John Koleszar	ad6fe4a88c	Merge "bug fix active_worst_quality set below active_best_quality"	2011-05-20 11:23:10 -07:00
John Koleszar	8196cc85f8	Merge "cleanup: collect twopass variables"	2011-05-20 11:20:44 -07:00
Johann	6d82d2d22e	Merge "Fixed iwalsh_neon build problems with RVDS4.1"	2011-05-20 07:51:11 -07:00
Yaowu Xu	1fbc81a970	Merge "revise two function definitions with less parameters"	2011-05-20 07:45:42 -07:00
John Koleszar	a0c11928db	Merge "Remove unused members of VP8_COMP"	2011-05-20 07:39:03 -07:00
John Koleszar	54bc4fde77	Merge remote branch 'origin/master' into experimental Conflicts: configure Change-Id: I91b9059e5b724a96368c7765c147fdf5a5ce03f2	2011-05-20 08:33:51 -04:00
John Koleszar	27331e1377	Merge remote branch 'internal/upstream' into HEAD	2011-05-20 00:05:16 -04:00
Yaowu Xu	a4c69e9a0f	revise two function definitions with less parameters Change-Id: Ia96e5bf915e4d3c0ac9c1795114bd9e5dd07327a	2011-05-19 19:06:03 -07:00
Yaowu Xu	1f3f18443d	Merge "disable trellis optimization for first pass"	2011-05-19 17:25:31 -07:00
Yaowu Xu	d5b8f7860f	disable trellis optimization for first pass also remove 2 #defines and 1 function declaration that are not in use. Change-Id: I8f743d0e3dd9ebf1de24a8b0c30ff09f29b00c53	2011-05-19 17:22:14 -07:00
James Berry	caa1b28be3	bug fix active_worst_quality set below active_best_quality fixed a bug where active_worst_quality could be set below active_best_quality which could result in an infinite loop. Change-Id: I93c229c3bc5bff2a82b4c33f41f8acf4dd194039	2011-05-19 18:10:31 -04:00
John Koleszar	63cb1a7ce0	cleanup: collect twopass variables This patch collects the twopass specific memebers of VP8_COMP into a dedicated struct. This is a first step towards isolating the two pass rate control and aids readability by decorating these variables with the 'twopass.' namespace. This makes it clear to the reader in what contexts the variable will be valid, and is a hint that a section of code might be a good candidate to move to firstpass.c in later refactoring. There likely will be other rate control modes that need their own specific data as well. This notation is probably overly verbose in firstpass.c, so an alternative would be to access this struct through a pointer like 'rc->' instead of 'cpi->firstpass.' in that file. Feel free to make a review comment to that effect if you prefer. Change-Id: I0ab8254647cb4b493a77c16b5d236d0d4a94ca4d	2011-05-19 17:26:09 -04:00
Scott LaVarnway	dba79821f0	Merge "Using partition_info instead of blockd info for splitmv"	2011-05-19 13:22:59 -07:00
John Koleszar	048497720c	Remove unused members of VP8_COMP Various members that were either completely unreferenced or written and not read. Change-Id: Ie41ebac0ff0364a76f287586e4fe09a68907806e	2011-05-19 15:49:09 -04:00
Scott LaVarnway	99b9757685	Using partition_info instead of blockd info for splitmv The partition_info struct contains info just for SPLITMV, so it should be used instead of BLOCKD. Eventually, I want to reduce the size of B_MODE_INFO struct found in BLOCKD, so this is the first step toward that goal. Also, since SPLITMV is not supported in vp8_pick_inter_mode(), the unnecessary mem copies and checks were removed. For rt encodes, this gave a slight performance improvement. Change-Id: I5585c98fa9d5acbde1c7e0f452a01d9ecc080574	2011-05-19 15:03:36 -04:00
Scott LaVarnway	914f7c36d7	Merge "Make hor UV predict ~2x faster (73 vs 132 cycles) using SSSE3."	2011-05-19 11:22:01 -07:00
John Koleszar	c684d5e5f2	Merge "changed configure option name to reduce confusion"	2011-05-19 11:17:08 -07:00
John Koleszar	ff39958cee	Merge "Make activity masking functions static"	2011-05-19 11:12:18 -07:00
John Koleszar	21ca4c4d5d	Merge "Fix segv without --enable-error-concealment"	2011-05-19 10:58:24 -07:00
John Koleszar	7def902261	Fix segv without --enable-error-concealment Missed wrapping one function call in #if CONFIG_ERROR_CONCEALMENT. Change-Id: I5746b1e6e4531670dbed1130467331fe309bdcae	2011-05-19 13:57:45 -04:00
John Koleszar	e3081b2502	Merge "Adding error-concealment to the decoder."	2011-05-19 10:48:58 -07:00
Stefan Holmer	d04f852368	Adding error-concealment to the decoder. The error-concealer is plugged in after any motion vectors have been decoded. It tries to estimate any missing motion vectors from the motion vectors of the previous frame. Intra blocks with missing residual are replaced with inter blocks with estimated motion vectors. This feature was developed in a separate sandbox (sandbox/holmer/error-concealment). Change-Id: I5c8917b031078d79dbafd90f6006680e84a23412	2011-05-19 13:46:33 -04:00
John Koleszar	a84177b432	Make activity masking functions static These don't need extern linkage. Change-Id: I21220ada926380a75ff654f24df84376ccc49323	2011-05-19 11:14:13 -04:00
John Koleszar	87254e0b7b	Move quantizer init functions to quantize.c Group related functions together. Change-Id: I92fd779225b75a7204650f1decb713142c655d71	2011-05-19 11:07:41 -04:00
Attila Nagy	f96d56c4aa	Fixed iwalsh_neon build problems with RVDS4.1 rvct 4.1 was complaining about vstmia.16, store multiple expects 64 data type. optimized the implementation. Change-Id: I0701052cabd685c375637bbc3796ff6d88f5972c	2011-05-19 10:27:26 +03:00
John Koleszar	a741e0e3cb	Merge remote branch 'origin/master' into experimental Change-Id: I2f9fd68d7fd52e0aebc57e561c77ebe99e9c33e4	2011-05-19 00:05:12 -04:00
John Koleszar	23d525a503	Merge remote branch 'internal/upstream' into HEAD	2011-05-19 00:05:12 -04:00
Yunqing Wang	00a1e2f8e4	Merge "Modify MVcount in pick_inter_mode to eliminate calling of vp8_find_near_mvs"	2011-05-18 12:53:27 -07:00
Yunqing Wang	9c62f94129	Fix a bug in vp8_clamp_mv function Scott fixed the bug in MV clamping function in encoder, which could cause artifacts. Change-Id: Id05f2794c43c31cdd45e66179c8811f3ee452cb9	2011-05-18 09:52:56 -04:00
Yunqing Wang	f62b33f140	Modify MVcount in pick_inter_mode to eliminate calling of vp8_find_near_mvs Moved MVcount modification in pick_inter_mode, and eliminated calling of vp8_find_near_mvs. Change-Id: Icd47448a1dfc8fdf526f86757d0e5a7f218cb5e8	2011-05-17 10:59:42 -04:00
John Koleszar	11b9b14691	Merge remote branch 'origin/master' into experimental Conflicts: vp8/encoder/rdopt.c Change-Id: I85275aab07625bd30bbef16a752b08b18f4451ab	2011-05-16 09:11:37 -04:00
John Koleszar	a5074a8b8b	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/encoder/encodeframe.c vp8/encoder/rdopt.c Change-Id: I3c66714e704b22569aff701cc5b9b2a5b70989f3	2011-05-16 09:09:36 -04:00
John Koleszar	eafdc5e10a	Merge "Improve framerate adaptation"	2011-05-13 11:18:42 -07:00
Yaowu Xu	5608c14020	Merge "adjusting rd constant slightly by ~10%"	2011-05-13 09:28:26 -07:00
Paul Wilkins	0e86235265	Merge "Restructure of activity masking code."	2011-05-13 09:23:50 -07:00
John Koleszar	72913435cb	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/common/blockd.h vp8/decoder/decodemv.c Change-Id: Ib97c226d5b33b1ac1675d9c96eac1986af4dd579	2011-05-13 10:16:37 -04:00
Paul Wilkins	ff52bf3691	Restructure of activity masking code. This commit restructures the mb activity masking code to better facilitate experimentation using different metrics etc. and also allows for adjustment of the zero bin either for encode only or both the encode and mode selection stages It also uses information from the current frame rather than the previous frame and the default strength has been reduced. Change-Id: Id39b19eace37574dc429f25aae810c203709629b	2011-05-13 10:37:50 +01:00
John Koleszar	71a0eaf33c	Merge remote branch 'origin/master' into experimental Change-Id: Idf2dead51d2936984eb9827dd6d2cb704817f4c8	2011-05-13 00:05:14 -04:00
John Koleszar	5ed116e220	Improve framerate adaptation This patch improves the accuracy of frame rate estimation by using a larger, 1 second window. It also more quickly adapts to step changes in the input frame rate (ie 30fps to 15fps) Change-Id: I39e48a8f5ac880b4c4b2ebd81049259b81a0218e	2011-05-12 15:07:50 -04:00
Scott LaVarnway	71a7501bcf	Removed mv_bits_sadcost This sad cost is being generated but never used. Change-Id: I562eebdcb792b743770954feca365b5b37491ecd	2011-05-12 11:20:41 -04:00
Scott LaVarnway	6b25501bf1	Using int_mv instead of MV The compiler produces better assembly when using int_mv for assignments. The compiler shifts and ors the two 16bit values when assigning MV. Change-Id: I52ce4bc2bfbfaf3f1151204b2f21e1e0654f960f	2011-05-12 11:08:16 -04:00
Yunqing Wang	6ed81fa5b3	Merge "Modification and issue fix in full-pixel refining search"	2011-05-12 07:20:44 -07:00
Yunqing Wang	b4da1f83e6	Modification and issue fix in full-pixel refining search Further modification and wrong implementation fix which caused refining_search and refining_searchx4 result mismatching. Change-Id: I80cb3a44bf5824413fd50c972e383eebb75f9b6f	2011-05-12 10:18:40 -04:00
Yaowu Xu	bd9d890605	adjusting rd constant slightly by ~10% This is to reflect the RD improvement in the encoder. The change has a small positive impact on quality (0.25% by VPXSSIM and 0.05% by PSNR) Change-Id: Ic66ffc19b10870645088c0624c85556f009fd210	2011-05-11 23:32:06 -07:00
John Koleszar	5c849a64d9	Merge remote branch 'origin/master' into experimental Change-Id: I3149502b80e7c30decc125a2ddc5ad12b12b3667	2011-05-11 00:05:10 -04:00
John Koleszar	65b1648f35	Merge remote branch 'internal/upstream' into HEAD	2011-05-11 00:05:07 -04:00
John Koleszar	6edd07d656	Merge remote branch 'internal/upstream-experimental' into HEAD	2011-05-11 00:05:07 -04:00
Yaowu Xu	ba6f60dba7	Merge "remove a variable no longer in use"	2011-05-10 20:20:59 -07:00
Yaowu Xu	1bcf4e66bb	Merge "fix a bug related to gf_active_flags in multi-threaded encoder"	2011-05-10 19:59:52 -07:00
Yaowu Xu	f7cf439b34	remove a variable no longer in use The variable is introduced in commit `2e53e9e53` to make more use of trellis quantization, but this is no longer necessary after RDMULT was made adaptive in a number of later commits. Change-Id: I7420522ec7723f38cf77033466c25afb405d52ae	2011-05-10 19:57:51 -07:00
Johann	df2023a6cb	set up Global Offset Table in recon global values were being referenced, but the GOT was not being set up. as the GOT is only required for PIC, this issue wasn't caught in the default configuration. Change-Id: I8006e53776139362a76f2c80cf9d0f8458602b2f http://code.google.com/p/webm/issues/detail?id=328	2011-05-10 15:58:56 -04:00
Yunqing Wang	c7a56f677d	Merge "Use diamond search to replace full search in full-pixel refining search"	2011-05-10 06:59:38 -07:00
John Koleszar	b08c6fa699	Merge remote branch 'origin/master' into experimental Change-Id: I24a548e3ce7794409b6731829f83befc0d465800	2011-05-10 00:05:10 -04:00
Yunqing Wang	cb7b1fb144	Use diamond search to replace full search in full-pixel refining search In NEWMV mode, currently, full search is used as the refining search after n-step search. By replacing it with an iterative diamond search of radius 1 largely reduced the computation complexity, but still maintained the same encoding quality since the refining search is done for every macroblock instead of only a small precentage of macroblocks while using full search. Tests on the test set showed a 3.4% encoding speed increase with none psnr & ssim loss. Change-Id: Ife907d7eb9544d15c34f17dc6e4cfd97cb743d41	2011-05-09 14:07:06 -04:00
Johann	a7d4d3c550	clean up unused variable warnings Change-Id: I9467d7a50eac32d8e8f3a2f26db818e47c93c94b	2011-05-09 12:56:20 -04:00
John Koleszar	cadb2d6651	Merge remote branch 'origin/master' into experimental Change-Id: I22f61430b52348b32078253d5ef38e68e7f91939	2011-05-07 00:05:11 -04:00
John Koleszar	017e85cf58	Merge remote branch 'internal/upstream' into HEAD	2011-05-07 00:05:11 -04:00
Yaowu Xu	89c6017cc0	fix a bug related to gf_active_flags in multi-threaded encoder Paul pointed out that the pointer to the gf_active_flags is not being properly incremented in multithreaded encoder. This commit fixes the issue by making sure the gf_active_ptr points to the starting of next group of mb rows. Change-Id: I3246e657d23beabb614dfb880733a68a5fd7e34c	2011-05-06 09:00:44 -07:00
John Koleszar	5c756005aa	Merge "Don't override active_worst_quality in 2 pass"	2011-05-06 08:59:05 -07:00
Johann	52490354f3	Merge "neon fast quantizer updated"	2011-05-06 08:54:14 -07:00
John Koleszar	abc9958c52	Don't override active_worst_quality in 2 pass Commit `db5057c` introduced a bug in that the active_worst_quality selected by the 2 pass rate controller was being overridden for key frames, causing a severe quality loss. Change-Id: I4865a6fbe3e94e9b4fb9271c7dd68b455d7b371d	2011-05-06 11:48:53 -04:00
Tero Rintaluoma	33fa7c4ebe	neon fast quantizer updated vp8_fast_quantize_b_neon function updated and further optimized. - match current C implementation of fast quantizer - updated to use asm_enc_offsets for structure members - updated ads2gas scripts to handle alignment issues Change-Id: I5cbad9c460ad8ddb35d2970a8684cc620711c56d	2011-05-06 08:59:52 +03:00
Aron Rosenberg	eeb8117303	Fix semaphore emulation on Windows The existing emulation of posix semaphores on Windows uses SetEvent() and WaitForSingleObject(), which implements a binary semaphore, not a counting semaphore as implemented by posix. This causes deadlock when used with the expected posix semantics. Instead, this patch uses the CreateSemaphore() and ReleaseSemaphore() calls (introduced in Windows 2000) which have the expected behavior. This patch also reverts commit `eb16f00`, which split a semaphore that was being used with counting semantics into two binary semaphores. That commit is unnecessary with corrected emulation. Change-Id: If400771536a27af4b0c3a31aa4c4e9ced89ce6a0	2011-05-06 00:13:59 -04:00
John Koleszar	e965d8f6f3	Merge remote branch 'origin/master' into experimental Change-Id: Ib6c8596030140ed2b5e1dea76de024d27ad8ed86	2011-05-06 00:05:11 -04:00
John Koleszar	39e36f8604	Merge remote branch 'internal/upstream' into HEAD	2011-05-06 00:05:10 -04:00
Yunqing Wang	eb16f00cf2	Fix rare hang in multi-thread encoder on Windows This patch is to fix a rare hang in multi-thread encoder that was only seen on Windows. Thanks for John's help in debugging the problem. More test is needed. Change-Id: Idb11c6d344c2082362a032b34c5a602a1eea62fc	2011-05-05 10:42:29 -04:00
Johann	ca5c1b17a2	Merge "Loopfilter NEON: Use VMOV for constant vectors instead of VLD."	2011-05-05 06:16:21 -07:00
Yunqing Wang	aeb86d615c	Merge "Runtime detection of available processor cores."	2011-05-05 04:59:54 -07:00
Attila Nagy	a6aa389d2f	Loopfilter NEON: Use VMOV for constant vectors instead of VLD. Change-Id: I562b6e01c32bb51d00f3b95faf757fc7dc29a3a3	2011-05-04 11:29:23 +03:00
John Koleszar	7f1c9c6a13	Merge remote branch 'origin/master' into experimental Change-Id: I6db2326eb0eca9d8d5941dab1bd8577c7a545825	2011-05-04 00:05:09 -04:00
John Koleszar	848c18e9be	Merge remote branch 'internal/upstream' into HEAD	2011-05-04 00:05:08 -04:00
Yunqing Wang	3fbade23a2	Merge "Modify HEX search"	2011-05-03 11:59:32 -07:00
Yunqing Wang	04ec930abc	Modify HEX search Changed 8-neighbor searching to 4-neighour searching, and continued searching until the center point is the best match. Test on test set showed 1.3% encoding speed improvement as well as 0.1% PSNR and SSIM improvement at speed=-5 (rt mode). Will continue to improve it. Change-Id: If4993b1907dd742b906fd3f86fee77cc5932ee9a	2011-05-03 14:26:33 -04:00
Yaowu Xu	e9465daee3	Merge "change to use fast ssim code for internal ssim calculations"	2011-05-03 11:20:52 -07:00
Yaowu Xu	6c565fada0	change to use fast ssim code for internal ssim calculations The commit also removed the slow ssim calculation that uses a 7x7 kernel, and revised the comments to better describe how sample ssim values are computed and averaged Change-Id: I1d874073cddca00f3c997f4b9a9a3db0aa212276	2011-05-03 08:36:17 -07:00
John Koleszar	b336dc6bff	Merge remote branch 'origin/master' into experimental Change-Id: Ibcddf16cdbfde86d2e3fc0adb7b727072a3d12e9	2011-05-03 00:05:09 -04:00
John Koleszar	e2990fcc48	Merge remote branch 'internal/upstream' into HEAD	2011-05-03 00:05:05 -04:00
John Koleszar	c09d8c1419	Merge "Fix documentation typos"	2011-05-02 06:50:22 -07:00
John Koleszar	a66d8d33dd	Fix compile error with --enable-postproc-visualizer Typo. Change-Id: I9cc6a4587c3d93c9f0da5e101d376741fc9622a4	2011-05-02 09:28:37 -04:00
Thijs Vermeir	8942f70cdf	Fix documentation typos Change-Id: I97124670926433bf1593c91660d8b8f8482ea9ce	2011-04-30 09:34:59 +02:00
John Koleszar	518c551903	Merge remote branch 'origin/master' into experimental Change-Id: I9c995f1fdb46c098b0c519bf333318dff651cb40	2011-04-30 00:05:06 -04:00
John Koleszar	8398449cbf	Merge remote branch 'internal/upstream' into HEAD	2011-04-30 00:05:05 -04:00
Ronald S. Bultje	5a23352c03	Make hor UV predict ~2x faster (73 vs 132 cycles) using SSSE3. Change-Id: I658a1df7d825f820573cb2d11ad402f9d2791035	2011-04-29 11:52:09 -07:00
Yaowu Xu	57ad189129	changed configure option name to reduce confusion Renamed configure option "enable-psnr" to "enable-internal-stats" to better reflect the purpose of the option and eliminate the confusion reported in http://code.google.com/p/webm/issues/detail?id=35 Change-Id: If72df6fdb9f1e33dab1329240ba4d8911d2f1f7a	2011-04-29 09:39:05 -07:00
Yunqing Wang	dfa9e2c5ea	Merge "Use insertion sort instead of quick sort"	2011-04-29 08:27:58 -07:00
Scott LaVarnway	1b2abc5f49	Merge "Consolidated build inter predictors"	2011-04-29 07:13:49 -07:00
John Koleszar	89c3269636	Merge remote branch 'origin/master' into experimental Change-Id: I993021d0b2d7fbe44d6371464f2686eed3ccfaae	2011-04-29 00:05:07 -04:00
John Koleszar	57afffbcbb	Merge remote branch 'internal/upstream' into HEAD	2011-04-29 00:05:07 -04:00
James Berry	f10732554b	bug fix removed inline from recon_wrapper_sse2.c removed inline from recon_wrapper_sse2.c to build for visual stuido Change-Id: I74a3482950448e2cdb30e9cd7087145b440d8a22	2011-04-28 15:12:00 -04:00
Scott LaVarnway	219ba87a93	Merge "Use psadbw to get the sum of bytes in a line."	2011-04-28 07:58:20 -07:00
Scott LaVarnway	ccd6f7ed77	Consolidated build inter predictors Code cleanup. Change-Id: Ic8b0167851116c64ddf08e8a3d302fb09ab61146	2011-04-28 10:53:59 -04:00
John Koleszar	c26bb0fe8f	Merge remote branch 'origin/master' into experimental Change-Id: I7d91efbc3662c86d6efa2d7495eb4689ccdb0ced	2011-04-28 00:05:07 -04:00
John Koleszar	e1b90ce862	Merge remote branch 'internal/upstream' into HEAD	2011-04-28 00:05:07 -04:00
Ronald S. Bultje	1e7ded69cf	Use psadbw to get the sum of bytes in a line. Thanks Jason for pointing that out on #vp8. ;-). Change-Id: I5330a753e752a8704b78a409597472628e0b26a5	2011-04-27 13:49:21 -07:00
Scott LaVarnway	2e102855f4	Removed unused code in reconinter The skip flag is never set by the encoder for SPLITMV. Change-Id: I5ae6457edb3a1193cb5b05a6d61772c13b1dc506	2011-04-27 15:25:32 -04:00
John Koleszar	085fb4b737	Merge "SSE2/SSSE3 optimizations for build_predictors_mbuv{,_s}()."	2011-04-27 12:02:55 -07:00
Ronald S. Bultje	1083fe4999	SSE2/SSSE3 optimizations for build_predictors_mbuv{,_s}(). decoding before 10.425 10.432 10.423 =10.426 after: 10.405 10.416 10.398 =10.406, 0.2% faster encoding before 14.252 14.331 14.250 14.223 14.241 14.220 14.221 =14.248 after 14.095 14.090 14.085 14.095 14.064 14.081 14.089 =14.086, 1.1% faster Change-Id: I483d3d8f0deda8ad434cea76e16028380722aee2	2011-04-27 11:31:27 -07:00
Yunqing Wang	5abafcc381	Use insertion sort instead of quick sort Insertion sort performs better for sorting small arrays. In real- time encoding (speed=-5), test on test set showed 1.7% performance gain with 0% PSNR change in average. Change-Id: Ie02eaa6fed662866a937299194c590d41b25bc3d	2011-04-27 13:53:28 -04:00
John Koleszar	64355ecad3	Merge "Speed up VP8DX_BOOL_DECODER_FILL"	2011-04-27 09:03:45 -07:00
John Koleszar	f8ffecb176	Merge "Update VP8DX_BOOL_DECODER_FILL to better detect EOS"	2011-04-27 09:03:24 -07:00
John Koleszar	5e1fd41357	Speed up VP8DX_BOOL_DECODER_FILL The end-of-buffer check is hoisted out of the inner loop. Gives about 0.5% improvement on x86_64. Change-Id: I8e3ed08af7d33468c5c749af36c2dfa19677f971	2011-04-27 10:25:03 -04:00
John Koleszar	9594370e0c	Update VP8DX_BOOL_DECODER_FILL to better detect EOS Allow more reliable detection of truncated bitstreams by being more precise with the count of "virtual" bits in the value buffer. Specifically, the VP8_LOTS_OF_BITS value is accumulated into count, rather than being assigned, which was losing the prior value, increasing the required tolerance when testing for the error condition. Change-Id: Ib5172eaa57323b939c439fff8a8ab5fa38da9b69	2011-04-27 10:24:39 -04:00
John Koleszar	b93faff5a0	Merge remote branch 'origin/master' into experimental Change-Id: I76db6b5bd9f3817d5a3e32cad5891154ff9c9b18	2011-04-27 00:05:07 -04:00
John Koleszar	5944829d6d	Merge remote branch 'internal/upstream' into HEAD	2011-04-27 00:05:07 -04:00
John Koleszar	db5057c742	Refactor calc_iframe_target_size Combine calc_iframe_target_size, previously only used for forced keyframes, with calc_auto_iframe_target_size, which handled most keyframes. Change-Id: I227051361cf46727caa5cd2b155752d2c9789364	2011-04-26 16:55:35 -04:00
John Koleszar	81d2206ff8	Move pick_frame_size() to ratectrl.c This is a first step in cleaning up the redundancies between vp8_calc_{auto_,}iframe_target_size. The pick_frame_size() function is moved to ratectrl.c, and made to be the primary interface. This means that the various calc_*_target_size functions can be made private. Change-Id: I66a9a62a5f9c23c818015e03f92f3757bf3bb5c8	2011-04-26 16:49:54 -04:00
Scott LaVarnway	0da77a840b	Merge "Test vector mismatch fix"	2011-04-26 10:12:37 -07:00
Scott LaVarnway	7a2b9c50a3	Test vector mismatch fix Fixed test vector mismatch that was introduced in the "Removed dc_diff from MB_MODE_INFO" (Ie2b9cdf9e0f4e8b932bbd36e0878c05bffd28931) Change-Id: I98fa509b418e757b5cdc4baa71202f4168dc14ec	2011-04-26 09:37:19 -04:00
John Koleszar	0a77e59847	Merge remote branch 'origin/master' into experimental Conflicts: vp8/common/alloccommon.c vp8/encoder/rdopt.c Change-Id: I142167d31d1b9cffe143774f6915bca463df67f0	2011-04-26 08:28:51 -04:00
John Koleszar	bbc24a65c4	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/common/alloccommon.c vp8/encoder/rdopt.c Change-Id: Ic34b33577423031e277235ffa6bcaff7b252e5cb	2011-04-26 08:27:39 -04:00
Johann	d5c46bdfc0	Merge "remove simpler_lpf"	2011-04-25 14:51:07 -07:00
Johann	01527e743f	remove simpler_lpf the decision to run the regular or simple loopfilter is made outside the function and managed with pointers stop tracking the option in two places. use filter_type exclusively Change-Id: I39d7b5d1352885efc632c0a94aaf56b72cc2fe15	2011-04-25 17:37:41 -04:00
John Koleszar	fd6da3b2e7	Fix duplicate vp8_compute_frame_size_bounds Likely introduced by a bad automatic merge from gerrit. Change-Id: I0c6dd6ec18809cf9492f524d283fa4a3a8f4088b	2011-04-25 14:30:57 -04:00
John Koleszar	1f32b1489c	Merge "Remove unused functions"	2011-04-25 11:05:00 -07:00
John Koleszar	47bc1c7013	Remove unused functions Remove estimate_min_frame_size() and calc_low_ss_err(), as they are never referenced. Change-Id: I3293363c14ef70b79c4678ca27aa65b345077726	2011-04-25 13:54:23 -04:00
John Koleszar	cfbfd39de8	Merge "Change rc undershoot/overshoot semantics"	2011-04-25 10:49:32 -07:00
John Koleszar	76557e34d2	Merge "Limit size of initial keyframe in one-pass."	2011-04-25 10:48:13 -07:00
John Koleszar	d9f898ab6d	Merge "Add rc_max_intra_bitrate_pct control"	2011-04-25 10:47:57 -07:00
John Koleszar	454cbc96b7	Limit size of initial keyframe in one-pass. Rather than using a default size of 1/2 or 3/2 seconds for the first frame, use a fraction of the initial buffer level to give the application some control. This will likely undergo further refinement as size limits on key frames are currently under discussion on codec-devel@, but this gives much better behavior for small buffer sizes as a starting point. Change-Id: Ieba55b86517b81e51e6f0a9fe27aabba295acab0	2011-04-25 13:47:20 -04:00
John Koleszar	aa926fbd27	Add rc_max_intra_bitrate_pct control Adds a control to limit the maximum size of a keyframe, as a function of the per-frame bitrate. See this thread[1] for more detailed discussion: [1]: http://groups.google.com/a/webmproject.org/group/codec-devel/browse_thread/thread/271b944a5e47ca38 Change-Id: I7337707642eb8041d1e593efc2edfdf66db02a94	2011-04-25 13:47:14 -04:00
John Koleszar	2089b2cee5	Merge "bug fix possible keyframe context divide by zero"	2011-04-25 09:35:12 -07:00
James Berry	8d5ce819dd	bug fix possible keyframe context divide by zero vp8_adjust_key_frame_context() divides by estimate_keyframe_frequency() which can return 0 in the case where --kf-max-dist=0. Change-Id: Idfc59653478a0073187cd2aa420e98a321103daa	2011-04-25 12:16:36 -04:00
Johann	aeca599087	Merge "keep values in registers during quantization"	2011-04-25 06:52:38 -07:00
Scott LaVarnway	c36b6d4d01	Merge "Removed unnecessary frame type checks"	2011-04-25 06:45:43 -07:00
Scott LaVarnway	5b67329747	Merge "Removed dc_diff from MB_MODE_INFO"	2011-04-25 06:45:32 -07:00
John Koleszar	308e31a3ef	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/decoder/onyxd_int.h Change-Id: Icf445b589c2bc61d93d8c977379bbd84387d0488	2011-04-25 09:13:41 -04:00
John Koleszar	5227798c57	Merge remote branch 'origin/master' into experimental Change-Id: Iaaa51ec66768fe7cf4de0035602165efcc5fc5e4	2011-04-23 00:05:08 -04:00
Ronald S. Bultje	496bcbb0de	Fix overflow in temporal_filter_apply_sse2(). The accumulator array is an integer array, so use paddd instead of paddw to add values to it. Fixes overflows when using large --arnr-maxframes (>8) values. Change-Id: Iad83794caa02400a65f3ab5760f2517e082d66ae	2011-04-22 10:00:38 -04:00
John Koleszar	5dfd6f51cb	Merge remote branch 'origin/master' into experimental Change-Id: I6f77e7c10a54c54b26126b8acd5edd0a03358a41	2011-04-22 00:05:08 -04:00
John Koleszar	73c3d32705	Merge "Remove unused kf rate variables"	2011-04-21 16:54:14 -07:00
Adrian Grange	d2a6eb4b1e	Corrected format specifiers in debug print statements The arguments to these fprintfs are int not long int so the format specifier should be "%d" and not "%ld". This was writing garbage in the linux build. Change-Id: I3d2aa8a448d52e6dc08858d825bf394929b47cf3	2011-04-21 15:45:57 -07:00
Johann	508ae1b3d5	keep values in registers during quantization add an sse4 quantizer so we can use pinsrw/pextrw and keep values in xmm registers instead of proxying through the stack. and as long as we're bumping up, use some ssse3 instructions in the EOB detection (see ssse3 fast quantizer) pick up about a percent on 32bit and about two on 64bit. Change-Id: If15abba0e8b037a1d231c0edf33501545c9d9363	2011-04-21 15:47:55 -04:00
Scott LaVarnway	6f6cd3abb9	Removed unnecessary frame type checks ref_frame is set to INTRA_FRAME for keyframes. The B_PRED mode is only used in intra frames. Change-Id: I9bac8bec7c736300d47994f3cb570329edf11ec0	2011-04-21 14:59:42 -04:00
Scott LaVarnway	3698c1f620	Removed dc_diff from MB_MODE_INFO The dc_diff flag is used to skip loopfiltering. Instead of setting this flag in the decoder/encoder, we now check for this condition in the loopfilter. Change-Id: Ie2b9cdf9e0f4e8b932bbd36e0878c05bffd28931	2011-04-21 14:38:36 -04:00
John Koleszar	b59bd22cc0	Merge remote branch 'origin/master' into experimental Change-Id: I78a30fb4438ddd0730262691d7c120d67cbcaaa9	2011-04-21 00:05:08 -04:00
Scott LaVarnway	7a49accd0b	Removed force_no_skip force_no_skip is always set to zero. Change-Id: I89b61c5e0bee34627a9c07c05f3517e1db76af77	2011-04-20 15:45:12 -04:00
Scott LaVarnway	09c933ea80	Removed redundant checks of the mode_info_context flags Code cleanup. The build inter predictor functions are redundantly checking the mode_info_context for either INTRA_FRAME or SPLITMV. Change-Id: I4d58c3a5192a4c2cec5c24ab1caf608bf13aebfb	2011-04-20 14:06:40 -04:00
Attila Nagy	43464e94ed	Do not copy data between encoder reference buffers. Golden and ALT reference buffers were refreshed by copying from the new buffer. Replaced this by index manipulation. Also moved all the reference frame updates to one function for easier tracking. Change-Id: Icd3e534e7e2c8c5567168d222e6a64a96aae24a1	2011-04-20 15:26:55 +03:00
John Koleszar	65b44c2911	Merge remote branch 'origin/master' into experimental Change-Id: I9e9ece0424b2f4b6861e9c7c0986f6eccc9159d6	2011-04-20 00:05:12 -04:00
John Koleszar	ad6a8ca58b	Remove unused kf rate variables Remove tot_key_frame_bits and prior_key_frame_size[] as they were tracked but never used. Remove intra_frame_target, as it was only used to initialize prior_key_frame_size. Refactor vp8_adjust_key_frame_context() some to remove unnecessary calculations. Change-Id: Icbc2c83d2b90e184be03e6f9679e678f3a4bce8f	2011-04-19 16:14:57 -04:00
Johann	4a2b684ef4	modify SAVE_XMM for potential 64bit use the win64 abi requires saving and restoring xmm6:xmm15. currently SAVE_XMM and RESTORE XMM only allow for saving xmm6:xmm7. allow specifying the highest register used and if the stack is unaligned. Change-Id: Ica5699622ffe3346d3a486f48eef0206c51cf867	2011-04-19 10:42:45 -04:00
Johann	a9b465c5c9	Merge "Add save/restore xmm registers in x86 assembly code"	2011-04-19 06:32:10 -07:00
John Koleszar	a5d3febc13	Merge remote branch 'origin/master' into experimental Change-Id: I920c3ed6af244ef9032b744675d9f664e5878d0e	2011-04-19 00:05:09 -04:00
Johann	c7cfde42a9	Add save/restore xmm registers in x86 assembly code Went through the code and fixed it. Verified on Windows. Where possible, remove dependencies on xmm[67] Current code relies on pushing rbp to the stack to get 16 byte alignment. This broke when rbp wasn't pushed (vp8/encoder/x86/sad_sse3.asm). Work around this by using unaligned memory accesses. Revisit this and the offsets in vp8/encoder/x86/sad_sse3.asm in another change to SAVE_XMM. Change-Id: I5f940994d3ebfd977c3d68446cef20fd78b07877	2011-04-18 16:30:38 -04:00
Yunqing Wang	48438d6016	Merge "Use sub-pixel search's SSE in mode selection"	2011-04-18 13:20:04 -07:00
Yunqing Wang	b8f0b59985	Use sub-pixel search's SSE in mode selection Passed SSE from sub-pixel search back to pick_inter_mode function, which is compared with the encode_breakout to see if we could skip evaluating the remaining modes. Change-Id: I4a86442834f0d1b880a19e21ea52d17d505f941d	2011-04-18 16:12:28 -04:00
Yunqing Wang	d5069b5af0	Merge "Handle long delay between video frames in multi-thread decoder(issue 312)"	2011-04-18 10:11:41 -07:00
Johann	cd103a5721	Merge "store quant_shift as an unsigned char"	2011-04-18 10:03:40 -07:00
Yaowu Xu	c619f6cb0f	Merge "fixed an overflow in ssim calculation"	2011-04-18 07:44:34 -07:00
Scott LaVarnway	e1a8b6c8d5	Removed unused timers Change-Id: I209803b9dbed2b2f6d02258fd7a3963a6645f4ab	2011-04-18 09:09:57 -04:00
John Koleszar	0ba3fffc3a	Merge remote branch 'origin/master' into experimental Change-Id: I6ee7c49138576326887b32316cffe8d3e48aa044	2011-04-16 00:05:08 -04:00
John Koleszar	9d75a502c4	Merge remote branch 'internal/upstream' into HEAD	2011-04-16 00:05:07 -04:00
Yunqing Wang	8ba58951e9	Handle long delay between video frames in multi-thread decoder(issue 312) This is reported by m...@hesotech.de (see issue 312): "The decoder causes an access violation when you decode the first frame, then make a pause of about 60 seconds and then decode further frames. But only if vpx_codec_dec_cfg_t.threads> 1. This is caused by a timeout of WaitForSingleObject. When I change the definition of VPXINFINITE to INFINITE(0xFFFFFFFF), the problem is solved." Reproduced the crash and verified the changes on Windows platform. This brings the behavior inline with the other platforms using sem_wait(). Change-Id: I27b32f90bce05846ef2684b50f7a88f292299da1	2011-04-15 17:27:26 -04:00
Johann	d889035fe6	Merge "remove dead code, add missing RESTORE_XMM"	2011-04-15 13:32:54 -07:00
Johann	f64f425a50	remove executable bit source files are not executable Change-Id: Id2c7294695a22217468426423979f68f02d82340	2011-04-15 13:43:24 -04:00
Adrian Grange	0d2abe3084	Merge "Fix usage of value returned by vp8_pick_intra4x4mby_modes"	2011-04-15 08:37:19 -07:00
Yunqing Wang	1312a7a2e2	Merge "Reduce unnecessary distortion computation"	2011-04-15 08:17:03 -07:00
Johann	487c0299c9	remove dead code, add missing RESTORE_XMM vp8_filter_block1d16_h4_ssse3 was never called because UNSHADOW_ARGS moves the stack by 'mov rsp, rbp', the issue was masked. however, if/when win64 used those registers for persistant data, issues could/will arise. Change-Id: I56d6effca0aeba1f86082689771cb10145d39651	2011-04-15 10:11:53 -04:00
John Koleszar	a3399291ad	Fix off-by-one in copy_and_extend_plane Should only copy h lines, not h+1. Change-Id: I802a85686635900459c6dc79596189033e5298d8	2011-04-15 08:44:39 -04:00
John Koleszar	b4bb910b57	Merge remote branch 'origin/master' into experimental Change-Id: Iacd40d38693f433cd25b071fc8420f563b242696	2011-04-15 00:05:09 -04:00
John Koleszar	b709794929	Merge remote branch 'internal/upstream' into HEAD	2011-04-15 00:05:08 -04:00
Yunqing Wang	918fb5487e	Reduce unnecessary distortion computation In vp8_pick_inter_mode(), for NEWMV mode, use the error result got from motion search as distortion. This helps performance in real- time mode. Change-Id: I398c4e46cc5381f7d874e748cf78827ef0e0860c	2011-04-14 15:53:33 -04:00
John Koleszar	63f15987a5	Merge "Refactor lookahead ring buffer"	2011-04-14 12:35:01 -07:00
Fritz Koenig	e749ae510f	Merge "Use consistent delimiters."	2011-04-14 11:56:18 -07:00
Adrian Grange	8608de1c6f	Fix usage of value returned by vp8_pick_intra4x4mby_modes The value of distortion2 returned by vp8_pick_intra4x4mby_modes was being overwritten by the value returned by get16x16prederror before it was tested. Change-Id: If00e80332b272c5545c3a7e381c8041e8319b41a	2011-04-14 10:50:00 -07:00
Fritz Koenig	33cefd6f6e	Use consistent delimiters. opsnr.stt file was using \t for delimiters on everything except between VPXSSIM and Time. Change-Id: I6284c4e40c05ff642bf4b0170dca062c279a42df	2011-04-13 15:06:17 -07:00
Adrian Grange	8861174624	Fixed use of early breakout in vp8_pick_intra4x4mby_modes Index i is used to detect early breakout from the first loop, but its value is lost due to reuse in the second for loop. I moved the position of the second loop and did some format cleanup. Change-Id: I02780eae1bd89df4b6c000fb8a018b0837aac2e5	2011-04-13 12:56:46 -07:00
John Koleszar	88841f1059	Refactor lookahead ring buffer This patch cleans up the source buffer storage and copy mechanism to allow access through a standard push/pop/peek interface. This approach also avoids an extra copy in the case where the source is not a multiple of 16, fixing issue #102. Change-Id: I05808c39f5743625cb4c7af54cc841b9b10fdbd9	2011-04-13 14:26:45 -04:00
Johann	70f30aa95d	store quant_shift as an unsigned char in encodframe.c, quant_shift is set to 0 or 1 in vp8cx_invert_quant only use 8 bits to store this, instead of 16. will allow saving an xmm register in an updated version of the regular quantize Change-Id: Ie88c47fe2aff5af0283dab1147fb2791e4b12f90	2011-04-13 13:50:12 -04:00
John Koleszar	cb3e0aaba3	Merge remote branch 'origin/master' into experimental Change-Id: I231e4dd65adcf4f5c158e3749880a18b8c36cbe4	2011-04-13 00:05:09 -04:00
John Koleszar	8b20b578bf	Merge remote branch 'internal/upstream' into HEAD	2011-04-13 00:05:07 -04:00
John Koleszar	c99f9d7abf	Change rc undershoot/overshoot semantics This patch changes the rc_undershoot_pct and rc_overshoot_pct controls to set the "aggressiveness" of rate adaptation, by limiting the amount of difference between the target buffer level and the actual buffer level which is applied to the target frame rate for this frame. This patch was initially provided by arosenberg at logitech.com as an attachment to issue #270. It was modified to separate these controls from the other unrelated modifications in that patch, as well as to use the pre-existing variables rather than introducing new ones. Change-Id: Id542e3f5667dd92d857d5eabf29878f2fd730a62	2011-04-12 20:49:33 -04:00
John Koleszar	538f110407	Merge "Bugfix for error accumulator stats"	2011-04-12 06:59:00 -07:00
John Koleszar	e689a27d62	Bugfix for error accumulator stats Previous to commit `de4e9e3`, there was an early return in the alt-ref case that was inadvertantly removed when the function was refactored to return void. This patch restores the prior behavior. Change-Id: I783ffd594a4690297e2742f99526fd7ad67698b2	2011-04-12 08:47:33 -04:00
John Koleszar	fd09009227	Merge "Fix encoder range check for frame width and height"	2011-04-12 05:34:12 -07:00
Attila Nagy	1aadcedcfb	Fix encoder range check for frame width and height 14 bits available in the bistream => valid range [1..16383] Removed unused local vars. Change-Id: Icf3385e47a9fa13af70053129c2248671f285583	2011-04-12 15:07:37 +03:00
John Koleszar	7ff5084f33	Merge remote branch 'origin/master' into experimental Change-Id: Ib42656b05f2b099f17fd6c2033bbc3445421150c	2011-04-12 00:05:09 -04:00
John Koleszar	f809f4f93c	Merge remote branch 'internal/upstream' into HEAD	2011-04-12 00:05:08 -04:00
Yunqing Wang	4fd81a99f8	Set cpu_used range to [-16, 16] in real-time mode Remove encoding speed limitation in real-time mode. Change-Id: Ib5e35d8bb522b2a25f3e4ad5cfe2788ebebb3617	2011-04-11 15:55:04 -04:00
Yunqing Wang	d1abe62d1c	Define RDCOST only once Clean up the code. Change-Id: I7db048efa4d972b528d553a7921bc45979621129	2011-04-11 11:53:56 -04:00
John Koleszar	a9ce3e3834	Remove unused files Change-Id: I36ca3f2f4620358033da34daf764f0b388dacd08	2011-04-11 10:34:40 -04:00
John Koleszar	e33241bb13	Merge remote branch 'origin/master' into experimental Change-Id: I1a58ce4643377bae4cc6bf9c89320251f724ca66	2011-04-09 00:05:08 -04:00
John Koleszar	f6360955f4	Merge remote branch 'internal/upstream' into HEAD	2011-04-09 00:05:08 -04:00
Yunqing Wang	4b43167ad1	Fix input MV for full search Input MV needs to be modified to full-pixel precision. Change-Id: Ic5d78e41bf27077e325024332b9fe89f76c44f0c	2011-04-08 16:29:41 -04:00
Johann Koenig	6e156a4cd7	Merge "use asm_offsets with vp8_fast_quantize_b_sse3"	2011-04-08 10:05:47 -07:00
John Koleszar	921a32a306	Merge "Error accumulator stats bug."	2011-04-08 08:20:32 -07:00
Paul Wilkins	de4e9e3b44	Error accumulator stats bug. The error accumulator stats values cpi->prediction_error and cpi->intra_error were being populated with rd values not distortion values. These are only "currently" used in a limited way for RT compress key frame detection. Change-Id: I2702ba1cab6e49ab8dc096ba75b6b34ab3573021	2011-04-08 14:21:36 +01:00
John Koleszar	be3dee8903	Merge remote branch 'origin/master' into experimental Change-Id: Ib70851b1d801d719edb8f5cd48d2f8fb210d3867	2011-04-08 00:05:08 -04:00
John Koleszar	fd599efb25	Merge remote branch 'internal/upstream' into HEAD	2011-04-08 00:05:07 -04:00
Jim Bankoski	d4cdb683a4	fixed an overflow in ssim calculation This commit fixed an overflow in ssim calculation, added register save and restore to make sure assembly code working for x64 platform. It also changed the sampling points to every 4x4 instead of 8x8 and adjusted the constants in SSIM calculation to match the scale of previous VPXSSIM. Change-Id: Ia4dbb8c69eac55812f4662c88ab4653b6720537b	2011-04-07 14:25:25 -07:00
Johann Koenig	08702002e8	use asm_offsets with vp8_fast_quantize_b_sse3 on the same order as the sse2 fast quantize change: ~2% except for 32bit. only a slight improvment there. Change-Id: Iff80e5f1ce7e646eebfdc8871405458ff911986b	2011-04-07 16:40:05 -04:00
James Berry	aec5487cdd	Use correct 32 bit comparisons for SAD breakout. Rax updated to eax to avoid uninitialized memory usage. Change-Id: Iedb953f104329ede2a786fc648a47f1be2f3798a	2011-04-07 15:08:03 -04:00
John Koleszar	6e4f6c96b3	Merge remote branch 'origin/master' into experimental Change-Id: Icee86a4b25e53dc04b508179101b1a782b688f61	2011-04-07 00:05:11 -04:00
John Koleszar	1805223162	Merge remote branch 'internal/upstream' into HEAD	2011-04-07 00:05:06 -04:00
Johann	2de858b9fc	Merge "use asm_offsets with vp8_fast_quantize_b_sse2"	2011-04-06 10:53:55 -07:00
Yunqing Wang	9e9f61a317	Merge "Minor modification"	2011-04-06 06:12:13 -07:00
Yunqing Wang	02423b2e92	Minor modification A small change. Change-Id: I2e7726e58370a95d0319361f4f6ad231138d1328	2011-04-06 09:08:47 -04:00
John Koleszar	d64aa018be	Merge remote branch 'origin/master' into experimental Change-Id: Ied0fedb05342dead6d34740209cf75997f155e72	2011-04-06 00:05:10 -04:00
John Koleszar	77058ad62b	Merge remote branch 'internal/upstream' into HEAD	2011-04-06 00:05:09 -04:00
John Koleszar	a6be45c9ca	Merge remote branch 'origin/master' into experimental Change-Id: I53be500dad1a98e21d0a28f9e07761d8d03fdcf6	2011-04-05 00:05:10 -04:00
John Koleszar	89bdcc211e	Merge remote branch 'internal/upstream' into HEAD	2011-04-05 00:05:07 -04:00
Johann	c32e0ecc59	use asm_offsets with vp8_fast_quantize_b_sse2 on the same order as the regular quantize change: ~2% Change-Id: I5c9eec18e89ae7345dd96945cb740e6f349cee86	2011-04-04 16:23:29 -04:00
Scott LaVarnway	f212a98ee7	Fixed unused variable warnings for firstpass.c Change-Id: I8378a9a541ade2f098359a7b20fa08e6c1596d80	2011-04-04 14:18:31 -04:00
John Koleszar	91036996ac	Merge "Slightly simplify vp8_decode_mb_tokens."	2011-04-04 08:58:25 -07:00
Johann	610dd90288	Merge "tweak vp8_regular_quantize_b_sse2"	2011-04-04 08:56:25 -07:00
Gaute Strokkenes	15f03c2f13	Slightly simplify vp8_decode_mb_tokens. Change-Id: I0058ba7dcfc50a3374b712197639ac337f8726be	2011-04-04 16:47:22 +01:00
Yunqing Wang	f5c0d95e8c	Merge "Use full-pixel MV in mvsadcost calculation"	2011-04-04 08:40:51 -07:00
John Koleszar	b3b2657f21	Merge remote branch 'internal/upstream' into HEAD	2011-04-02 00:05:11 -04:00
John Koleszar	2588b8fe05	Merge remote branch 'origin/master' into experimental Change-Id: I1cd5ad3df61463ca7d946857a548d7611d65c593	2011-04-02 00:05:10 -04:00
Yunqing Wang	3d6815817c	Use full-pixel MV in mvsadcost calculation MV sad cost error is only used in full-pixel motion search, which only need full-pixel resolution instead of quarter-pixel resolution. This change reduced mvsadcost table size, and removed unneccessary pamameter passing since this table is constant once it is generated. Change-Id: I9f931e55f6abc3c99011321f1dfb2f3562e6f6b0	2011-04-01 16:41:58 -04:00
Johann	8520b5c785	tweak vp8_regular_quantize_b_sse2 rather than look up rc in the zig zag table, embed it in the macro. this also allows us to shuffle some values in the macro and keep *d in rsi gains of about the same order as the obj_int_extract implementation: ~2% Change-Id: Ib7252dd10eee66e0af8b0e567426122781dc053d	2011-04-01 09:58:23 -04:00
Johann	ba11e24d47	Merge "Wrapper function removed from vp8_subtract_b_neon function call"	2011-04-01 05:47:21 -07:00
Tero Rintaluoma	cec76a36d6	Wrapper function removed from vp8_subtract_b_neon function call Address calculations moved from encodemb_arm.c file to neon optimized assembly function to save cycles in function calls. - vp8_subtract_b_neon_func replaced with vp8_subtract_b_neon that contains all needed address calculations - unnecessary file encodemb_arm.c removed - consistent with ARMv6 optimized version Change-Id: I6cbc1a2670b56c2077f59995fcf8f70786b4990b	2011-04-01 10:06:44 +03:00
John Koleszar	305c9b57b2	Merge remote branch 'internal/upstream' into HEAD	2011-04-01 00:05:12 -04:00
John Koleszar	88ed17298f	Merge remote branch 'origin/master' into experimental Change-Id: Ie59ab2f2e93464df0f484bd73d2394d05640536d	2011-04-01 00:05:12 -04:00
Johann	9d138379a2	Merge "ARMv6 optimized subtract functions"	2011-03-31 08:40:10 -07:00
Attila Nagy	297b27655e	Runtime detection of available processor cores. Detect the number of available cores and limit the thread allocation accordingly. On decoder side limit the number of threads to the max number of token partition. Core detetction works on Windows and Posix platforms, which define _SC_NPROCESSORS_ONLN or _SC_NPROC_ONLN. Change-Id: I76cbe37c18d3b8035e508b7a1795577674efc078	2011-03-31 10:23:01 +03:00
Attila Nagy	7d335868df	Fix: lpf semaphore was signaled in single threaded run After picking filter level, post the loopfilter semaphore just when multiple threads are in use. Change-Id: If7bfb64601d906adef703f454dafc25e978b93c6	2011-03-30 15:55:29 +03:00
John Koleszar	23afb5810e	Merge remote branch 'origin/master' into experimental Change-Id: Ie86a006320f3cea6a068a6b235267e19c3a19c4e	2011-03-30 00:05:07 -04:00
John Koleszar	9a82cc7455	Merge remote branch 'internal/upstream' into HEAD	2011-03-30 00:05:06 -04:00
Johann	0e43668546	Merge "Half pixel variance further optimized for ARMv6"	2011-03-29 12:14:54 -07:00
Yunqing Wang	534ea700bd	Merge "Fix a crash while enabling shared (--enable-shared)"	2011-03-29 09:04:22 -07:00
Yunqing Wang	b843aa4eda	Fix a crash while enabling shared (--enable-shared) Fixed a bug in SSSE3 sub-pixel filter functions. Change-Id: I2e2126652970eb78307ffcefcace1efd5966fb0a	2011-03-29 11:31:06 -04:00
Johann	f0c22a3f33	use GLOBAL correctly on 32bit shared libraries http://code.google.com/p/webm/issues/detail?id=309 Change-Id: I6fce9e2f74bc09a9f258df7f91ab599812324e8c	2011-03-29 11:27:03 -04:00
Tero Rintaluoma	6fdc9aa79f	ARMv6 optimized subtract functions Adds following ARMv6 optimized functions to encoder: - vp8_subtract_b_armv6 - vp8_subtract_mby_armv6 - vp8_subtract_mbuv_armv6 Gives 1-5% speed-up depending on input sequence and encoding parameters. Functions have one stall cycle inside the loop body on Cortex pipeline. Change-Id: I19cca5408b9861b96f378e818eefeb3855238639	2011-03-29 16:52:00 +03:00
John Koleszar	b9f2356182	Merge remote branch 'origin/master' into experimental Change-Id: Iae24496ca5ceb4446211c1e27351434c16b09dd1	2011-03-29 00:05:07 -04:00
John Koleszar	057ace0d92	Merge remote branch 'internal/upstream' into HEAD	2011-03-29 00:05:04 -04:00
Johann	4be062bbc3	add asm_enc_offsets.c for all targets now that we need asm_enc_offsets.c for x86 and arm and it is harmless to build it for other targets, add it unconditionally Change-Id: I320c5220afd94fee2b98bda9ff4e5e34c67062f3	2011-03-28 10:43:47 -04:00
Tero Rintaluoma	f5e433464b	Half pixel variance further optimized for ARMv6 Half pixel interpolations optimized in variance calculations. Separate function calls to vp8_filter_block2d_bil_x_pass_armv6 are avoided.On average, performance improvement is 6-7% for VGA@30fps sequences. Change-Id: Idb5f118a9d51548e824719d2cfe5be0fa6996628	2011-03-28 09:51:51 +03:00
John Koleszar	b8a78cfa49	Merge remote branch 'origin/master' into experimental Change-Id: Ibffdedc3bd2e1ec349e79ba038b065c98db77d06	2011-03-25 00:05:04 -04:00
John Koleszar	cdada23377	Merge remote branch 'internal/upstream' into HEAD	2011-03-25 00:05:04 -04:00
Johann	beaafefcf1	Merge "use asm_offsets with vp8_regular_quantize_b_sse2"	2011-03-24 11:06:36 -07:00
Johann	8edaf6e2f2	use asm_offsets with vp8_regular_quantize_b_sse2 remove helper function and avoid shadowing all the arguments to the stack on 64bit systems when running with --good --cpu-used=0: ~2% on linux x86 and x86_64 ~2% on win32 x86 msys and visual studio more on darwin10 x86_64 significantly more on x86_64-win64-vs9 Change-Id: Ib7be12edf511fbf2922f191afd5b33b19a0c4ae6	2011-03-24 13:34:48 -04:00
John Koleszar	3f4291e6e0	Merge remote branch 'origin/master' into experimental Change-Id: I2e36f806ae5551c5015243de697aac3e9e29334d	2011-03-24 00:05:06 -04:00
John Koleszar	1f1526f8b8	Merge remote branch 'internal/upstream' into HEAD	2011-03-24 00:05:05 -04:00
Johann	4cde2ab765	Merge "ARMv6 optimized fdct4x4"	2011-03-23 07:52:51 -07:00
John Koleszar	51bcf621c1	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/decoder/decodemv.c vp8/decoder/onyxd_if.c vp8/encoder/ratectrl.c vp8/encoder/rdopt.c Change-Id: Ia1c1c5e589f4200822d12378c7749ba62bd17ae2	2011-03-23 00:27:52 -04:00
John Koleszar	5f6db3591c	Merge remote branch 'origin/master' into experimental Conflicts: vp8/encoder/ratectrl.c vp8/encoder/rdopt.c Change-Id: I4cc58acb432662d2c47aceda1680e52982adbc06	2011-03-23 00:24:25 -04:00
Yunqing Wang	73065b67e4	Merge "Fix multithreaded encoding for 1 MB wide frame"	2011-03-21 07:41:31 -07:00
John Koleszar	2cbd962088	Remove unused vp8_get4x4sse_cs_mmx declaration This declaration did not match the prototype_sad() prototype, but was unused in this translation unit, so it is removed instead. Fixes issue 290. Change-Id: I168854f88a85f73ca9aaf61d1e5dc0f43fc3fdb3	2011-03-21 07:53:53 -04:00
John Koleszar	769c74c0ac	Merge "Increase static linkage, remove unused functions"	2011-03-21 04:51:51 -07:00
Tero Rintaluoma	a61785b6a1	ARMv6 optimized fdct4x4 Optimized fdct4x4 (8x4) for ARMv6 instruction set. - No interlocks in Cortex-A8 pipeline - One interlock cycle in ARM11 pipeline - About 2.16 times faster than current C-code compiled with -O3 Change-Id: I60484ecd144365da45bb68a960d30196b59952b8	2011-03-21 13:33:45 +02:00
Attila Nagy	bfe803bda3	Fix multithreaded encoding for 1 MB wide frame Thread synchronization was not correct when frame width was 1 MB. Number of allocated encoding threads is limited by the sync_range. There is no point having more because each thread lags sync_range MBs behind the thread processing the row above. http://code.google.com/p/webm/issues/detail?id=302 Change-Id: Icaf67a883beecc5ebf2f11e9be47b6997fdf6f26	2011-03-18 12:35:30 +02:00
John Koleszar	4a1c3cf7d8	Merge remote branch 'origin/master' into experimental Change-Id: If77de7e96a971edd8666ea0b1bd5eac6b09c6912	2011-03-18 00:05:07 -04:00
John Koleszar	cba980e3eb	Merge remote branch 'internal/upstream' into HEAD	2011-03-18 00:05:06 -04:00
John Koleszar	429dc676b1	Increase static linkage, remove unused functions A large number of functions were defined with external linkage, even though they were only used from within one file. This patch changes their linkage to static and removes the vp8_ prefix from their names, which should make it more obvious to the reader that the function is contained within the current translation unit. Functions that were not referenced were removed. These symbols were identified by: $ nm -A libvpx.a \| sort -k3 \| uniq -c -f2 \| grep ' [A-Z] ' \ \| sort \| grep '^ *1 ' Change-Id: I59609f58ab65312012c047036ae1e0634f795779	2011-03-17 20:53:47 -04:00
Ralph Giles	185557344a	Set bounds from the array when iterating mmaps. The mmap allocation code in vp8_dx_iface.c was inconsistent. The static array vp8_mem_req_segs defines two descriptors, but only the first is real. The second is a sentinel and isn't actually allocated, so vpx_codec_alg_priv is declared with mmaps[NELEMENTS(vp8_mem_req_segs)-1]. Some functions use this reduced upper bound when iterating though the mmap array, but these two functions did not. Instead, this commit calls NELEMENTS(...->mmaps) to directly query the bounds of the dereferenced array. This fixes an array-bounds warning from gcc 4.6 on vp8_xma_set_mmap. Change-Id: I918e2721b401d134c1a9764c978912bdb3188be1	2011-03-17 14:52:05 -07:00
Ralph Giles	de5182eef3	Remove commented-out VP6 code from vp8_finalize_mmaps Change-Id: I48642c380353043bed96026f56de5908fcee270a	2011-03-17 14:51:31 -07:00
John Koleszar	8431e768c9	Merge "Fix "used uninitialized" warning in vp8_pack_bitstream()"	2011-03-17 14:25:04 -07:00
John Koleszar	53d11fa6ad	Merge remote branch 'origin/master' into experimental Change-Id: I3f6c1e297fc0d33dc239bb4dd41d5afbcd91de19	2011-03-17 00:05:08 -04:00
John Koleszar	42f9104d5c	Merge remote branch 'internal/upstream' into HEAD	2011-03-17 00:05:07 -04:00
John Koleszar	de50520a8c	apple: include proper mach primatives Fixes implicit declaration warning for 'mach_task_self'. This change is an update to Change I9991dedd1ccfddc092eca86705ecbc3b764b799d, which fixed this issue for the decoder but not the encoder. Change-Id: I9df033e81f9520c4f975b7a7cf6c643d12e87c96	2011-03-16 13:59:32 -04:00
John Koleszar	386ceca8d2	Merge remote branch 'origin/master' into experimental Change-Id: If09b27454f82265fd5e3b25c85c1eea70c6c637f	2011-03-16 00:05:07 -04:00
John Koleszar	dc3451b086	Merge remote branch 'internal/upstream' into HEAD	2011-03-16 00:05:06 -04:00
Attila Nagy	71bcd9f1af	Add vp8_variance8x8_armv6 and vp8_sub_pixel_variance8x8_armv6 functions Change-Id: I08edaffc62514907fa5e90e1689269e467c857f5	2011-03-15 15:50:44 +02:00
John Koleszar	54c59a03f3	Merge remote branch 'origin/master' into experimental Change-Id: Ice13978071e98a88cf8ae5c069c6423d74425dea	2011-03-15 00:05:07 -04:00
John Koleszar	b210797a6a	Merge remote branch 'internal/upstream' into HEAD	2011-03-15 00:05:07 -04:00
John Koleszar	8c48c943e7	Merge "Fix an unused variable warning."	2011-03-14 14:13:53 -07:00
Johann	d0ec28b3d3	Merge "Add vp8_mse16x16_armv6 function"	2011-03-14 12:47:42 -07:00
John Koleszar	ba83622a00	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/encoder/onyx_if.c Change-Id: Ieef9a58a2effdc68cf52bc5f14d90c31a1dbc13a	2011-03-14 08:53:02 -04:00
John Koleszar	eeb8c8004e	Merge remote branch 'origin/master' into experimental Conflicts: vp8/encoder/onyx_if.c Change-Id: I230b63cef209cd1ac98357729a91ec07597756bd	2011-03-14 08:48:44 -04:00
Attila Nagy	e54dcfe88d	Add vp8_mse16x16_armv6 function Change-Id: I77e9f2f521a71089228f96e2db72524189364ffb	2011-03-14 14:38:31 +02:00
Johann	3788b3564c	Merge "Move build_intra_predictors_mby to RTCD framework"	2011-03-11 10:23:48 -08:00
John Koleszar	27972d2c1d	Move build_intra_predictors_mby to RTCD framework The vp8_build_intra_predictors_mby and vp8_build_intra_predictors_mby_s functions had global function pointers rather than using the RTCD framework. This can show up as a potential data race with tools such as helgrind. See https://bugzilla.mozilla.org/show_bug.cgi?id=640935 for an example. Change-Id: I29c407f828ac2bddfc039f852f138de5de888534	2011-03-11 13:04:50 -05:00
Johann	5c60a646f3	Merge "ARMv6 optimized quantization"	2011-03-11 08:29:00 -08:00
John Koleszar	75051c8b59	Merge "Only enable ssim_opt.asm on X86_64"	2011-03-11 08:28:05 -08:00
John Koleszar	5db0eeea21	Only enable ssim_opt.asm on X86_64 Fix compiling on 32 bit x86. Change-Id: I6210573e1d9287ac49acbe3d7e5181e309316107	2011-03-11 11:27:08 -05:00
Paul Wilkins	6e73748492	Clean up of vp8_init_config() Clean up vp8_init_config() a bit and remove null pointer case, as this code can't be called any more and is not an adequate trap anyway, as a null pointer would cause exceptions before hitting the test. Change-Id: I937c00167cc039b3aa3f645f29c319d58ae8d3ee	2011-03-11 11:06:51 -05:00
John Koleszar	170b87390e	Merge "1 Pass CQ and VBR bug fixes"	2011-03-11 08:06:09 -08:00
Paul Wilkins	2ae91fbef0	1 Pass CQ and VBR bug fixes Issue 291 highlighted the fact that CQ mode was not working as expected in 1 pass mode, This commit fixes that specific problem but in so doing I also uncovered an overflow issue in the VBR code for 1 pass and some data values not being correctly initialized. For some clips (particularly short clips), the resulting improvement is dramatic. Change-Id: Ieefd6c6e4776eb8f1b0550dbfdfb72f86b33c960	2011-03-11 10:59:34 -05:00
John Koleszar	e34e417d94	Merge "Fix incorrect macroblock counts in twopass rate control"	2011-03-11 06:06:04 -08:00
Yunqing Wang	3c9dd6c3ef	Merge "Align SAD output array to be 16-byte aligned"	2011-03-11 05:56:02 -08:00
John Koleszar	c5c5dcd0be	Merge "vp8cx - psnr converted to call assemblerized sse"	2011-03-11 05:54:00 -08:00
John Koleszar	29c46b64a2	Merge "vp8cx- alternate ssim function with optimizations"	2011-03-11 05:53:41 -08:00
Jim Bankoski	3dc382294b	vp8cx - psnr converted to call assemblerized sse Change-Id: Ie388d4618c44b131f96b9fe526618b457f020dfa	2011-03-11 08:51:22 -05:00
Jim Bankoski	3f6f7289aa	vp8cx- alternate ssim function with optimizations Change-Id: I91921b0a90dbaddc7010380b038955be347964b3	2011-03-11 08:51:21 -05:00
Yunqing Wang	b2aa401776	Align SAD output array to be 16-byte aligned Use aligned store. Change-Id: Icab4c0c53da811d0c52bb7e8134927f249ba2499	2011-03-11 08:24:23 -05:00
Yunqing Wang	76ec21928c	Merge "Encoder loopfilter running in its own thread"	2011-03-11 04:55:05 -08:00
Attila Nagy	9c836daf65	Fix "used uninitialized" warning in vp8_pack_bitstream() Change-Id: Iadcbdba717439f47a2c24e65fd69a3a1464174b5	2011-03-11 12:36:28 +02:00
Attila Nagy	3ae2465788	Encoder loopfilter running in its own thread In multithreaded mode the loopfilter is running in its own thread (filter level calculation and frame filtering). Filtering is mostly done in parallel with the bitstream packing. Before starting the packing the loopfilter level has to be calculated. Also any needed reference frame copying is done in the filter thread. Currently the encoder will create n+1 threads, where n > 1 is the number of threads specified by application and 1 is the extra filter thread. With n = 1 the encoder runs in single thread mode. There will never be more than n threads running concurrently. Change-Id: I4fb29b559a40275d6d3babb8727245c40fba931b	2011-03-11 10:52:51 +02:00
Tero Rintaluoma	7ab08e1fee	ARMv6 optimized quantization Adds new ARMv6 optimized function vp8_fast_quantize_b_armv6 to the encoder. Change-Id: I40277ec8f82e8a6cbc453cf295a0cc9b2504b21e	2011-03-11 10:48:42 +02:00
John Koleszar	314631ca61	Merge remote branch 'origin/master' into experimental Change-Id: Ibc4a75dbbc8b35ce298477e055e5a88df080d4b3	2011-03-11 00:05:09 -05:00
John Koleszar	31ce8f419c	Merge remote branch 'internal/upstream' into HEAD	2011-03-11 00:05:07 -05:00
Adrian Grange	6daacdb785	Added missing format specifier in print statement Printout of firstpass stats for frame had one fewer format specifiers than arguments. Change-Id: I5a42c85aa79c471e1a70afd75e24a91546b7a1cd	2011-03-10 12:43:49 -08:00
Adrian Grange	ed40ff9e2d	Removed firstpass motion map The firstpass motion map consists of an 8-bit flag for each MB indicating how strongly the firstpass code believes it should be filtered during the second pass ARNR filtering. For long or large format material the motion map can become extremely large and hamper the operation of the encoding process. This change removes the motion map altogether, leaving the second pass to rely on the magnitude of the motion compensated error to determine the filter weight to use for the MB during ARNR filtering. Tests on the derf set indicate that the effect of this change is neutral, with some small wins and losses. The motion map has therefore been removed based on a cost/benefit evaluation. Change-Id: I53e07d236f5ce09a6f0c54e7c4ffbb490fb870f6	2011-03-10 11:32:48 -08:00
James Berry	f3e9e2a0f8	Fix incorrect macroblock counts in twopass rate control The previous calculation of macroblock count (w*h)/256 is not correct when the width/height are not multiples of 16. Use the precalculated macroblock count from cpi->common instead. This manifested itself as a divide by zero when the number of pixels was less than 256. num_mbs updated in estimate_max_q, estimate_q, estimate_kf_group_q, and estimate_cq Change-Id: I92ff98587864c801b1ee5485cfead964673a9973	2011-03-10 13:33:06 -05:00
John Koleszar	dc29ed27bd	Merge remote branch 'origin/master' into experimental Change-Id: Icb795cef47a205f33f180f3852d88c36113b673e	2011-03-10 00:05:06 -05:00
John Koleszar	820b2b927f	Merge remote branch 'internal/upstream' into HEAD	2011-03-10 00:05:04 -05:00
Yunqing Wang	a0306ea660	Merge "Add vp8_sub_pixel_variance16x8_ssse3 function"	2011-03-09 12:26:37 -08:00
John Koleszar	c5a049babd	Merge branch 'bali' Change-Id: Icf18b4981afb12ef255fca431d4ba45860dd22c9	2011-03-09 14:11:54 -05:00
John Koleszar	5c24071504	Add missing filter.h to build system Missing file causes 'make dist' to not include a complete copy of the source. Change-Id: I3f55aeb5a86d0e81234e4e4588cb8086ba4cfc4a	2011-03-09 13:43:31 -05:00
Yunqing Wang	7b8e7f0f3a	Add vp8_sub_pixel_variance16x8_ssse3 function Added SSSE3 function Change-Id: I8c304c92458618d93fda3a2f62bd09ccb63e75ad	2011-03-09 12:33:21 -05:00
Yunqing Wang	4561109a69	Remove unused functions Removed some unused functions Change-Id: Ifdfc27453e53cfc75997b38492901d193a16b245	2011-03-09 10:45:03 -05:00
Yunqing Wang	7966dd5287	Merge "Improve SSE2 half-pixel filter funtions"	2011-03-09 07:23:06 -08:00
John Koleszar	fa836faede	Merge "Configuration updates:Making a clear distinction between Init and Change"	2011-03-09 05:07:11 -08:00
John Koleszar	016fb2b554	Merge remote branch 'origin/master' into experimental Change-Id: Ie52ff118b00ce462bb110ae349108e55d3d8ff3b	2011-03-09 00:05:07 -05:00
John Koleszar	96208f2e45	Merge remote branch 'internal/upstream' into HEAD	2011-03-09 00:05:06 -05:00
Ralph Giles	56efffdcd1	Fix an unused variable warning. Move the update of the loopfilter info to the same block where it is used. GCC 4.5 is not able trace the initialization of the local filter_info across the other calls between the two conditionals on pbi->common and issues an uninitialized variable warning. Change-Id: Ie4487b3714a096b3fb21608f6b0c74e745e3c6fc	2011-03-08 14:56:15 -08:00
Yunqing Wang	419f638910	Improve SSE2 half-pixel filter funtions Rewrote these functions to process 16 pixels once instead of 8. Change-Id: Ic67e80124467a446a3df4cfecfb76a4248602adb	2011-03-08 16:25:06 -05:00
Yunqing Wang	859abd6b5d	Merge "Add zero offset checking in SSE2 sub-pixel filter function"	2011-03-08 12:26:58 -08:00
Yunqing Wang	8432a1729f	Add zero offset checking in SSE2 sub-pixel filter function Skip filter at zero offset. Change-Id: I95fc7e211869bc0ab5bcfb7ab2e3259d1c0ccf38	2011-03-08 15:22:07 -05:00
Yunqing Wang	e8f7b0f7f5	Merge "Write SSSE3 sub-pixel filter function"	2011-03-08 10:58:30 -08:00
Yunqing Wang	244e2e1451	Write SSSE3 sub-pixel filter function 1. Process 16 pixels at one time instead of 8. 2. Add check for both xoffset =0 and yoffset=0, which happens during motion search. This change gave encoder 1%~3% performance gain. Change-Id: Idaa39506b48f4f8b2fbbeb45aae8226fa32afb3e	2011-03-08 13:29:01 -05:00
Ralph Giles	e6948bf0f9	Fix a multi-line format-string warning. GCC 4.5 and 4.6 both issue a warning about the multi-line format string introduced in `bc9c30a0`, which also changed the whitespace in the associated stt file by line-wrapping the long format string. Instead, use multiple string constants, which the compiler will concatenate. This maintains the original formatting, but remains legible within the standard line length. Change-Id: I27c9f92d46be82d408105a3a4091f145f677e00e	2011-03-08 07:14:12 -08:00
Paul Wilkins	de87c420ef	Corrected minor typos. Change-Id: Icc9f12bd1e1bdaf51256dc8a90d08aa9be89ef34	2011-03-08 14:46:22 +00:00
Paul Wilkins	0eccee4378	Merge changes I00c3e823,If8bca004 * changes: Improved key frame detection. Improved KF insertion after fades to still.	2011-03-08 06:40:11 -08:00
John Koleszar	5d1d9911cb	correct zbin boost for splitmv mode Disable zbin boost in SPLITMV mode as intended. Was incorrectly looking at vp8_ref_frame_order instead of vp8_mode_order when comparing against SPLITMV. This condition should have always been false, as SPLITMV is not in the range of valid reference frames. Change-Id: I0408cc7595eff68f00efef6d008e79f5b60d14bf	2011-03-07 20:58:37 -05:00
Paul Wilkins	bc9c30a003	Improved key frame detection. In some cases where clips have been encoded with borders (eg. some wide-screen content where there is a border top and bottom and slide shows containing portrait format photographs (border left and right)) key frames were not being correctly detected. The new code looks to measure cases where a portion of the image can be coded equally easily using intra or inter modes and where the resulting error score is also very low. These "neutral" areas are then discounted in the key frame detection code. Change-Id: I00c3e8230772b8213cdc08020e1990cf83b780d8	2011-03-07 15:58:07 +00:00
Paul Wilkins	9fc8cb39aa	Improved KF insertion after fades to still. This code extends what was previously done for GFs, to pick cases where insertion of a key frame after a fade (or other transition or complex motion) followed by a still section, will be beneficial and will reduce the number of forced key frames. Change-Id: If8bca00457f0d5f83dc3318a587f61c17d90f135	2011-03-07 15:11:09 +00:00
John Koleszar	01eb7c2874	Merge remote branch 'origin/master' into experimental Change-Id: I70ac5a4f8388a7bfa058178c0ae53f6bdb0bb6e5	2011-03-05 00:05:07 -05:00
John Koleszar	89d66cbb20	Merge remote branch 'internal/upstream' into HEAD	2011-03-05 00:05:05 -05:00
John Koleszar	0bc31f1887	Merge "Fixing divide by zero"	2011-03-04 05:40:33 -08:00
John Koleszar	fb37eda3e2	Merge "Fix drastic undershoot in long form content"	2011-03-04 05:39:40 -08:00
John Koleszar	eed2ce58e3	Merge "Fix counter of fixed keyframe distance"	2011-03-04 05:28:38 -08:00
Mikhal Shemer	84f7f20985	Configuration updates:Making a clear distinction between Init and Change Change-Id: I7b2fb326e1aabc08b032177a7b914a5b8bb7376f	2011-03-03 10:35:09 -08:00
Mikhal Shemer	1de99a2a81	Fixing divide by zero Change-Id: I9d8a98a2f7ed1e3116d0bae35164618c41998bac	2011-03-03 10:33:36 -08:00
John Koleszar	2c5638334e	Merge remote branch 'origin/master' into experimental Conflicts: vp8/vp8_cx_iface.c Change-Id: Ib30d0cfbdaeb605ee4b846f683d204cd07e0c028	2011-03-03 09:01:10 -05:00
John Koleszar	ca29f6a7c4	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/vp8_cx_iface.c Change-Id: Iecfd4532ab1c722d10ecce8a5ec473e96093cf3b	2011-03-03 08:59:34 -05:00
John Koleszar	738a791917	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/common/blockd.h Change-Id: Ica2bd1c3da614eab5ce23acfb597e777d16b3983	2011-03-03 08:58:57 -05:00
John Koleszar	36be4f7f06	Fix drastic undershoot in long form content When the modified_error_left accumulator exceeds INT_MAX, an incorrect cast to int resulted in a negative value, causing the rate control to allocate no bits to that keyframe group, leading to severe undershoot and subsequent poor quality. This error was exposed by the recent change to the rolling target and actual spend accumulators in commit `305be4e4` which fixed them to actually calculate the average value rather than be re-initialized on every frame to the average per-frame bitrate. When this bug was triggered, the target bitrate could be 0, so the rolling target becomes small, which causes the undershoot. The code prior to `305be4e4` did not exhibit this behavior because the rolling target was always set to a reasonable value and was independent of the actual target bitrate. With this patch, the actual target bitrate is calculated correctly, and the rate control tracks as expected. This cast was likely added to silence a compiler warning on a comparison between a double (modified_error_left) and an int (0). Instead, this patch removes the cast and changes the comparison to be against 0.0, which should prevent the warning from reoccuring. This fixes issue #289. Special thanks to gnafu for his efforts in reporting and debugging this fix. Change-Id: Ie5cc1a7b516c578a76c3a50c892a6f04a11621fe	2011-03-02 22:52:27 -05:00
Johann	6f5189c044	Merge "ARMv6 optimized half pixel variance calculations"	2011-03-02 05:48:46 -08:00
Yunqing Wang	cfaee9f7c6	Merge "Add prefetch before variance calculation"	2011-02-28 11:42:28 -08:00
Scott LaVarnway	3e6d476ac3	Merge "Avoid double copying of key frames into alt and golden buffer"	2011-02-28 10:16:33 -08:00
Yunqing Wang	d96ba65a23	Add prefetch before variance calculation This improved encoding performance by 0.5% (good, speed 1) to 1.5% (good, speed 5). Change-Id: I843d72a0d68a90b5f694adf770943e4a4618f50e	2011-02-28 11:25:55 -05:00
Johann	31dab574cc	Merge "Remove a second check for invalid ptr in vp8_get_compressed_data"	2011-02-25 11:44:18 -08:00
Johann	e4fa638653	Merge "Remove temporal alt ref from realtime only build"	2011-02-25 06:55:17 -08:00
Johann	1fae7018a8	Merge "Handle mem allocation failure in vp8e_init"	2011-02-25 06:55:10 -08:00
Attila Nagy	d8fc974ac0	Avoid double copying of key frames into alt and golden buffer Change-Id: I726976a297a593a35ed6cba3c660e372562f7b27	2011-02-25 09:03:16 +02:00
Attila Nagy	6da2018789	Remove a second check for invalid ptr in vp8_get_compressed_data Check is done first when function si entered. Change-Id: Ief0d0cbd4860aaf492b78728f8d22f24029b1174	2011-02-25 08:41:13 +02:00
John Koleszar	1a7ce50a6c	Merge remote branch 'origin/master' into experimental Change-Id: I52f21ff6f9a1dca7099a8459657f6f288c5bfe40	2011-02-25 00:05:08 -05:00
Scott LaVarnway	861175ef00	Removed vp8_block2type and used defines instead. Change-Id: Idb56e0295d004793f406dfd2d8d8c546aad62e03	2011-02-24 14:35:18 -05:00
Scott LaVarnway	d53492bba4	Merge "Revisited rd_pick_intra4x4block"	2011-02-24 11:25:21 -08:00
Scott LaVarnway	658454a04c	Revisited rd_pick_intra4x4block Removed unnecessary copies. No noticeable speed gains. Change-Id: I996c50c23fedd06d54ee7a3e762cbf559cc4a9d1	2011-02-24 13:31:47 -05:00
Paul Wilkins	b862c108dd	Overflow of frame error accumulators. This fixes an overflow problem in the frame error accumulators. The overflow condition is extreme but did trigger when Frank B. coded some high motion interlaced HD content. The observed effect was a catastrophic breakdown of the rate control leading to massive undershoot and poor bit allocation. All the error values should really be unsigned but I will look at this separately. Change-Id: I9745f5c5ca2783620426b66b568b2088b579151f	2011-02-24 15:49:41 +00:00
Tero Rintaluoma	8ae92aef66	ARMv6 optimized half pixel variance calculations Adds following ARMv6 optimized functions to the encoder: - vp8_variance_halfpixvar16x16_h_armv6 - vp8_variance_halfpixvar16x16_v_armv6 - vp8_variance_halfpixvar16x16_hv_armv6 Change-Id: I1e9c2af7acd2a51b72b3845beecd990db4bebd29	2011-02-23 13:27:27 +02:00
Attila Nagy	e6db21ecc4	Handle mem allocation failure in vp8e_init Change-Id: I0d0445c57eb0889082f83de1948852d57b38fefb	2011-02-23 12:36:03 +02:00
Attila Nagy	7af0d906e3	Remove temporal alt ref from realtime only build It is not used in realtime mode. Reduces memory footprint. Change-Id: I7f163225762368df5457cfd413050161d3704a3f	2011-02-22 12:53:32 +02:00
John Koleszar	b21fe3b278	Merge remote branch 'internal/upstream' into HEAD	2011-02-19 00:05:44 -05:00
John Koleszar	bbfca323fb	Merge remote branch 'origin/master' into experimental Change-Id: Ia3197f432b424213a34b20071e5171a413ba1aaf	2011-02-19 00:05:11 -05:00
Johann	945dad277d	Revert "use unaligned load" This reverts commit `f50f2fd2a7`. Change Ib7506e3e aligns the buffer Change-Id: Ie0f8bd3e57cfdfef81d39638a1451458ebbae2e0	2011-02-18 10:23:02 -05:00
John Koleszar	c764c2a20f	Merge "clean up unused files"	2011-02-18 06:33:05 -08:00
John Koleszar	3ed8fe8778	remove unused vp8_predict_dc function Change-Id: I64fa47889c54cfed094a674c49ef0996d49bdd42	2011-02-18 09:12:20 -05:00
John Koleszar	cbf923b12c	clean up unused files Removed a number of files that were unused or little-used. Change-Id: If9ae5e5b11390077581a9a879e8a0defe709f5da	2011-02-18 09:09:49 -05:00
John Koleszar	d371ca93e5	cosmetic: remove unnecessary scope Clean up some unnecessary scoping around pick_filter_level. Change-Id: Ic57fa33e3fcae37fe6beae977e5743783399d5af	2011-02-18 08:46:07 -05:00
John Koleszar	597d02b508	Merge "Dont pick encoder filter level when loopfilter is disabled."	2011-02-18 05:26:23 -08:00
Attila Nagy	fb5a692d27	Reinitialize quantizer only when any delta is changing No need to reinitialize for base Q changes. Change-Id: Ie76ec21dd3c5582d5183dbed75ed73a1eed3e291	2011-02-18 14:23:37 +02:00
Attila Nagy	c6ef75690f	Dont pick encoder filter level when loopfilter is disabled. Change-Id: I58154faf4f3ece24f9927a5c3ab7e830e0887fb6	2011-02-18 08:53:00 +02:00
John Koleszar	f13212b728	Merge remote branch 'internal/upstream' into HEAD	2011-02-18 00:05:13 -05:00
John Koleszar	4fafc4d985	Merge remote branch 'origin/master' into experimental Change-Id: I8999a33db82d38eb85482f3c423db238d6ee3ed9	2011-02-18 00:05:11 -05:00
John Koleszar	b2ae57f1b6	Merge "Use endian-neutral bitstream packing/unpacking"	2011-02-17 12:34:16 -08:00
John Koleszar	562f1470ce	Use endian-neutral bitstream packing/unpacking Eliminate unnecessary checks on target endianness and associated macros. Change-Id: I1d4e6a9dcee9bfc8940c8196838d31ed31b0e4aa	2011-02-17 15:20:53 -05:00
John Koleszar	ac10665ad8	Merge "Removed unused vp8_recon_intra4x4mb function"	2011-02-17 11:30:13 -08:00
Scott LaVarnway	07f7b66fae	Removed unused vp8_recon_intra4x4mb function Change-Id: I4a328ce152d9dbe6b0d1606d1b523e8e7bfb468e	2011-02-17 13:34:38 -05:00
John Koleszar	c351aa7f1b	Merge "Fix relative include paths"	2011-02-17 04:13:44 -08:00
John Koleszar	c88dbb2dce	Merge remote branch 'internal/upstream' into HEAD	2011-02-17 00:05:14 -05:00
John Koleszar	1293116895	Merge remote branch 'origin/master' into experimental Change-Id: I3efb725e4da4e7c75b2512b80db6af51dec51f79	2011-02-17 00:05:13 -05:00
Yunqing Wang	da9402fbf6	Merge "Allocate source buffers to be multiples of 16"	2011-02-16 11:35:06 -08:00
Yunqing Wang	da227b901d	Allocate source buffers to be multiples of 16 Currently, when the video frame width is not multiples of 16, the source buffer has a stride of non-multiples of 16, which forces an unaligned load in SAD function and hurts the performance. To avoid that, this change allocates source buffers to be multiples of 16. Change-Id: Ib7506e3eb2cea06657d56be5a899f38dfe3eeb39	2011-02-16 12:57:17 -05:00
Johann	0c2cfff9b0	Merge "ARMv6 optimized sad16x16"	2011-02-16 05:22:38 -08:00
John Koleszar	e786bd3a01	Merge remote branch 'internal/upstream' into HEAD	2011-02-16 00:05:13 -05:00
John Koleszar	9e95a1a0cd	Merge remote branch 'origin/master' into experimental Change-Id: If846b0e4ec862b54b98d08608f4b5f9a7b7f94ef	2011-02-16 00:05:10 -05:00
James Zern	0030303b69	Remove redundant ptr checks in calls to vpx_free vpx_free if used contains this check. If replaced, well behaved free will behave similarly. Change-Id: I25483aaa8b39255b9a8cf388d6e5eaa20a908ae1	2011-02-15 12:43:35 -08:00
John Koleszar	c6ea558c05	Merge remote branch 'internal/upstream' into HEAD	2011-02-15 00:05:39 -05:00
John Koleszar	cf8aa08348	Merge remote branch 'origin/master' into experimental Change-Id: I4b1a7a2ad0d62bdcabfed66c9dfdbe9b6bfa8b5e	2011-02-15 00:05:29 -05:00
Yunqing Wang	7725a7eb56	Merge "Improve vp8_sad16x16_sse3 function"	2011-02-14 14:09:25 -08:00
Yaowu Xu	27dad21548	Merge "Improved vp8_rd_pick_intra_mbuv_mode"	2011-02-14 13:58:12 -08:00
Scott LaVarnway	94d4fee08f	Improved vp8_rd_pick_intra_mbuv_mode Eliminated unnecessary calculations. Very small change to performance. Change-Id: Ib7213d43c64e36955177c4d47950ff472266f822	2011-02-14 16:34:33 -05:00
Yunqing Wang	2debd5b5f7	Improve vp8_sad16x16_sse3 function In real-time mode, vp8_sad16x16 function is called heavily in motion search part. Improvement of this function gives 1.2% encoding performance gain (real-time mode, tulip clip). Change-Id: I23c401fc40c061f732a9767e8d383737a179bd58	2011-02-14 16:23:49 -05:00
Yaowu Xu	404e998eb7	Merge "mem leak fix for cpi->tplist"	2011-02-14 11:29:22 -08:00
James Berry	d3dfcde0f7	mem leak fix for cpi->tplist checks added to make sure that cpi->tplist is freed correctly in vp8_dealloc_compressor_data and vp8_alloc_compressor_data. Change-Id: I66149dbbd25c958800ad94f4379d723191d9680d	2011-02-14 14:02:52 -05:00
Scott LaVarnway	d419b93e3e	Improved rd_pick_intra4x4block Eliminated unnecessary calculations. Improved performance by 10% on keyframes and 1.6% overall for the test clip used. Change-Id: I87671b26af5e2cc439e81d0fee3b15c7cd2a3309	2011-02-14 13:32:58 -05:00
Johann	0ff10bb1f7	Merge "remove assembly detokenizer"	2011-02-14 05:10:16 -08:00
John Koleszar	1f8e42e7b8	Merge remote branch 'internal/upstream' into HEAD	2011-02-12 00:05:14 -05:00
John Koleszar	70dc0ed003	Merge remote branch 'origin/master' into experimental Change-Id: I1cd33708d12bd51dfd1e78db4a7500653abc53c9	2011-02-12 00:05:11 -05:00
Johann	bb6bcbccda	remove assembly detokenizer hasn't been kept up to date. remove it to avoid confusion. Change-Id: I52ffde19b59fec5c7a381299ca2e85cb38330be7	2011-02-11 11:09:00 -05:00
Yunqing Wang	353246bd60	Merge "Add improved_mv_pred flag in real-time mode"	2011-02-11 07:20:17 -08:00
Yunqing Wang	9d0b2cbbce	Add improved_mv_pred flag in real-time mode As mentioned in check-in "Improve motion search in real-time mode", MV prediction calculation causes speed loss for speed 7 and above. This change added a flag to turn off this calculation for speed>6 in real-time mode. Change-Id: I9f4ae5a8bf449222d1784b54e7d315fc8347b2d1	2011-02-11 09:59:41 -05:00
Tero Rintaluoma	1ef86980b9	ARMv6 optimized sad16x16 Adds a new ARMv6 optimized function vp8_sad16x16_armv6 to encoder. Change-Id: Ibbd7edb8b25cb7a5b522d391b1e9a690fe150e57	2011-02-11 11:14:07 +02:00
Yaowu Xu	4f8a166058	Merge "Redefining good quality speed settings"	2011-02-10 21:38:19 -08:00
John Koleszar	64aebb6c7a	Merge remote branch 'internal/upstream' into HEAD	2011-02-11 00:05:19 -05:00
John Koleszar	809dae2458	Merge remote branch 'origin/master' into experimental Change-Id: Icf1a7c61a3b07da2ccfd94bca9e8810c01e46b2c	2011-02-11 00:05:14 -05:00
Yunqing Wang	6f53e59641	Merge "Improve motion search in real-time mode"	2011-02-10 12:42:44 -08:00
John Koleszar	02321de0f2	Fix relative include paths Allow compiling without adding vp8/{common,encoder,decoder} to the include paths. Change-Id: Ifeb5dac351cdfadcd659736f5158b315a0030b6c	2011-02-10 15:09:44 -05:00
John Koleszar	ec3b8f1f32	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/decoder/onyxd_int.h Change-Id: Id9aa577f03e37b4f406ba3b593c3c4330812a49e	2011-02-10 14:26:40 -05:00
Yunqing Wang	41e6eceb28	Improve motion search in real-time mode Applied better MV prediction in real-time mode, which improves the encoding quality. Used quarter-pixel search instead of iterative sub-pixel search for speed >=5 to improve encoding performance. Tests on the test set showed: 1. For speed=-5, quality improvement: 1.7% on AvgPSNR and 2.1% on SSIM, performance improvement: 3.6% (This counts in the performance lose caused by MV prediction calculation in "Improve MV prediction in vp8_pick_inter_mode() for speed>3"). 2. For speed=-8, quality improvement: 2.1% on AvgPSNR and 2.5% on SSIM. but, 6.9% performance decrease because of MV prediction calculation. This should be improved later. Change-Id: I349a96c452bd691081d8c8e3e54419e7f477bebd	2011-02-10 13:40:24 -05:00
Johann	7d8199f0c3	Merge "Adds armv6 optimized variance calculation"	2011-02-10 06:06:46 -08:00
John Koleszar	96ddc5c26e	Merge remote branch 'origin/master' into experimental Change-Id: Ie85d40c44bb23d56a519010356b2856c02fb4c05	2011-02-10 00:05:10 -05:00
Scott LaVarnway	19054ab6da	Redefining good quality speed settings Created a new speed 1 which is in the middle of the old speed 0 and speed 1. (for both quality and performance) Change-Id: I4802133cdb43f359ca787646c090899679dd5d84	2011-02-09 17:18:28 -05:00
James Berry	fffa2a61d7	fixed stride in vp8_temporal_filter_predictors_mb_c stride would not be calculated correctly for material with odd sized frame widths. Change-Id: I1710f6aef9ebb93d36249c9239c68c5baa9791f8	2011-02-09 16:55:39 -05:00
John Koleszar	c2b43164bd	Merge "correct cost for implicit bit in mvs"	2011-02-09 11:20:12 -08:00
John Koleszar	9954d05ca6	correct cost for implicit bit in mvs Use 0xFFF0 vice 240 (0xF0) for determining whether the sometimes implicit bit 3 will be transmitted. This is consistent with the decoder and encode_mvcomponent(). Change-Id: Ic1304d0ab56844bed8236edd1c5243a6767fc6b1	2011-02-09 12:50:17 -05:00
John Koleszar	a39b5af10b	Merge "Put more code under #if CONFIG_MULTITHREAD."	2011-02-09 08:31:36 -08:00
Gaute Strokkenes	315e3c2518	Put more code under #if CONFIG_MULTITHREAD. Change-Id: Icf4b692099d7d249fe3553852b1022b027b28e4b	2011-02-09 11:21:18 -05:00
Scott LaVarnway	85e79ce288	Merge "Added early breakout for vp8_rd_pick_intra4x4mby_modes"	2011-02-09 07:55:04 -08:00
John Koleszar	c96031da69	Merge "vp8e_get_preview fixed for resized frames"	2011-02-09 07:41:40 -08:00
Tero Rintaluoma	cb14764fab	Adds armv6 optimized variance calculation Adds vp8_sub_pixel_variance16x16_armv6 function to encoder. Integrates ARMv6 optimized bilinear interpolations from vp8/common/arm/armv6 and adds new assembly file for variance16x16 calculation. - vp8_filter_block2d_bil_first_pass_armv6 (integrated) - vp8_filter_block2d_bil_second_pass_armv6 (integrated) - vp8_variance16x16_armv6 (new) - bilinearfilter_arm.h (new) Change-Id: I18a8331ce7d031ceedd6cd415ecacb0c8f3392db	2011-02-09 10:23:43 -05:00
John Koleszar	b2ad177942	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/vp8_common.mk Change-Id: I2094ddf20834c0b7dfe912feac6a79500bb8cce2	2011-02-09 08:34:48 -05:00
John Koleszar	6e6b46d972	Merge remote branch 'origin/master' into experimental Change-Id: Ibc762883a5e117f5db64dc01a46a9c78438e6c33	2011-02-09 00:05:12 -05:00
Johann	e5aaac24bb	clean up bilinear filter make reference version of bilinear_filters short. use reference versions of bilinear_filters and sub_pel_filters when possible. recognize that Width was being passed into filter_block2d_bil_first_pass multiple times. ARM version had already fixed this. propegate to C. change references to src_pixels_per_line to src_pitch and standardize on src/dst (instead of input/output). recognize that first_pass is only run in the verticle and second_pass only horizontal. ARM version had already fixed this. propegate to C Change-Id: I292d376d239a9a7ca37ec2bf03cc0720606983e2	2011-02-08 17:42:54 -05:00
Scott LaVarnway	13db80c282	Added early breakout for vp8_rd_pick_intra4x4mby_modes Improved performance of good quality, speed 0 (3% average) with no average quality loss. Change-Id: Ica34473f99bd74260eaebde6b132185e09e3c09d	2011-02-08 16:50:43 -05:00
Johann	40dcae9c2e	clarify _offsets.asm differences it's difficult to mux the _offsets.c files because of header conflicts. make three instead, name them consistently and partititon the contents to allow building them as required. Change-Id: I8f9768c09279f934f44b6c5b0ec363f7943bb796	2011-02-08 16:35:43 -05:00
James Berry	ddacf1cf69	vp8e_get_preview fixed for resized frames preview_img d_w and d_h along with w and h would not be updated for resized frames. now uses sd.y_width and sd.y_height Change-Id: I52241de4cc1de5e73f865e668bd70a7cbd954390	2011-02-08 14:27:00 -05:00
John Koleszar	9683198e7b	Merge remote branch 'origin/master' into experimental Change-Id: I7897261eb2956f778f9f9885ce2005b1e134b28f	2011-02-08 00:05:11 -05:00
John Koleszar	c540bbc367	Merge remote branch 'internal/upstream' into HEAD	2011-02-07 14:16:24 -05:00
John Koleszar	2bb322380d	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/encoder/encodeframe.c vp8/encoder/ethreading.c vp8/encoder/onyx_int.h Change-Id: I1c562d2fe6e42c0d1d86f68c77c0e899066e02bd	2011-02-07 14:16:09 -05:00
Andoni Morales Alastruey	48140167cd	Fix counter of fixed keyframe distance When the keyframe distance is fixed the first interval has the right distance but, the next ones have kf_distance + 1. Change-Id: I44f1190fe7146124bd07660a5e0ef08829e3ae07	2011-02-07 18:30:04 +01:00
Johann	3273c7b679	move one of the offset files common/arm/vpx_asm_offsets moves up a level. prepare for muxing with encoder/arm/vpx_vp8_enc_asm_offsets Change-Id: I89a04a5235447e66571995c9d9b4b6edcb038e24	2011-02-07 11:35:30 -05:00
John Koleszar	adaf2b697c	Merge "remove unused dboolhuff code"	2011-02-07 05:36:26 -08:00
Yunqing Wang	58d2e70fc5	Fix link error in real-time mode make vp8_mv_pred() and vp8_cal_sad() available in real-time mode. Change-Id: I71dbae241b486ba943458dcbae552ec4a51689d3	2011-02-07 08:21:14 -05:00
John Koleszar	318a14c637	Merge remote branch 'origin/master' into experimental Change-Id: Ib487cbd7b214a6e3f13180bc0e5dcb792d8a406e	2011-02-05 00:05:11 -05:00
Johann	bb9c95ea53	remove unused dboolhuff code we were holding on to this "just in case." purge it instead Change-Id: I77a367b36d0821d731019f2566ecfffdae1d4b8a	2011-02-04 16:00:00 -05:00
Yunqing Wang	350ffe8dae	Merge "Improve MV prediction in vp8_pick_inter_mode() for speed>3"	2011-02-04 10:10:15 -08:00
John Koleszar	63fc44dfa5	correct quantizer initialization The encoder was not correctly catching transitions in the quantizer deltas. If a delta_q was set, then the quantizer would be reinitialized on every frame, but if they transitioned to 0, the quantizer would not be reinitialized, leading to a encode-decode mismatch. This bug was triggered by commit `999e155`, which sets a Y2 delta Q for very low base Q levels. Change-Id: Ia6733464a55ee4ff2edbb82c0873980d345446f5	2011-02-04 11:37:47 -05:00
John Koleszar	6bf7e2cc37	Merge "Remove duplicate loopfilter parameters."	2011-02-04 07:07:45 -08:00
Gaute Strokkenes	ffc6aeef14	Remove duplicate loopfilter parameters. Change-Id: I0d41415e3961c2c9492d342290c1999f9d02e6d8	2011-02-04 14:55:02 +00:00
John Koleszar	c0a9cbebe1	Merge "Delay auto key frame insertion in realtime configuration"	2011-02-04 05:16:15 -08:00
John Koleszar	16bbf27fa9	Merge remote branch 'origin/master' into experimental Change-Id: I242ca4854cb21f3d63efb979bd6ecc9f06f67f33	2011-02-04 00:05:13 -05:00
Gaute Strokkenes	bf5f585b0d	Make vp8_adjust_mb_lf_value return the updated value rather than manipulating it in situ via a pointer. Change-Id: If4a87a4eccd84f39577c0e91e171245f4954c5cf	2011-02-03 19:24:16 +00:00
Scott LaVarnway	4aa12b6c5f	Merge "Zero out block mv when an intra mode is selected"	2011-02-03 07:16:52 -08:00
Yunqing Wang	a870315629	Merge "Improved encoder threading"	2011-02-03 05:44:57 -08:00
Attila Nagy	e5904f2d5e	Delay auto key frame insertion in realtime configuration Whe auto keyframe insertion is enabled and conditions are right (scene change) the encoder can decide to insert a key frame and does a re-encoding. This can introduce extra latency. In RT mode we do not do the re-encoding of the current frame but force the next frame to key frame. Change-Id: I15c175fa845ac4c1a1f18bea3676e154669522a7	2011-02-02 13:54:40 +02:00
John Koleszar	2d9a394503	Merge remote branch 'internal/upstream' into HEAD	2011-02-02 00:05:14 -05:00
John Koleszar	9aeb6ac4ea	Merge remote branch 'origin/master' into experimental Change-Id: I585615400697b77c50dd05480616f868f2637aa7	2011-02-02 00:05:11 -05:00
Scott LaVarnway	07a7c08aef	Zero out block mv when an intra mode is selected instead of each time mode is tested. Change-Id: Ief0f5586dafde54cc14d348dcecdacb182e7c1d5	2011-02-01 12:55:51 -05:00
Scott LaVarnway	a5ecaca6a7	Removed unnecessary B_MODE_INFO memset. Change-Id: I2bcef6a8e47f88542861fd1356631ca934e2a0e7	2011-02-01 11:35:08 -05:00
Scott LaVarnway	b18df82e1d	Moved rd calculation into vp8_pick_intra4x4mby_modes Then removed unnecessary code. Change-Id: I142658815d843c9396b07881dbdd8d387c43c90e	2011-02-01 11:26:04 -05:00
Scott LaVarnway	4e7e79f770	Removed intra_modes from vp8cx_encode_intra_macro_block Restructured function in order to eliminate the prediction modes save/restore. Code cleanup also. Change-Id: I816e3b910de64d0f0f0ddc2398805c63263191e8	2011-02-01 10:05:35 -05:00
Attila Nagy	385c2a76d1	Improved encoder threading Reduce the number of sync points by letting each thread continue imediatly with a new MB row. Better multicore scaling, improves performance by 5-20% on ARM multicore. Change-Id: Ic97e4d1c4886a842c85dd3539a93cb217188ed1b	2011-02-01 12:17:58 +02:00
John Koleszar	a2bca1a52d	Merge remote branch 'internal/upstream' into HEAD	2011-02-01 00:05:13 -05:00
John Koleszar	76878a0354	Merge remote branch 'origin/master' into experimental Change-Id: Id1d4bbe257cd126bb5f44347b896ddb659724f0b	2011-02-01 00:05:10 -05:00
Scott LaVarnway	9e7fec216e	Removed prediction_error accumulation from vp8cx_encode_intra_macro_block. prediction_error is used when deciding if a frame should be a keyframe. After reviewing this with Yaowu, it was pointed out that vp8cx_encode_intra_macro_block is only called for keyframes, so the accumulation is unnecessary. Change-Id: Id79dc81b80d4f5d124f3a0dba1b923887e2e1ec8	2011-01-31 19:53:02 -05:00
Scott LaVarnway	317f0da91e	Removed last_auto_filter_prediction_error last_auto_filter_prediction_error is not really used. Change-Id: Ic6e56c4076bbd250ef783ee1be46964c85f62864	2011-01-31 19:41:09 -05:00
Scott LaVarnway	4a15e55793	Possible bug in vp8cx_encode_intra_macro_block vp8_pick_intra4x4mby_modes uses the passed in distortion for an early breakout. The best distortion was never saved and the distortion for TM_PRED was always used. Change-Id: Idbaf73027408a4bba26601713725191a5d7b325e	2011-01-31 17:43:18 -05:00
Scott LaVarnway	60fde4d342	Merge "Performance improvement of first pass"	2011-01-31 13:02:23 -08:00
Yaowu Xu	6d19d40718	Merge "change the threshold of DC check for encode breakout"	2011-01-31 11:00:46 -08:00
John Koleszar	f6214d1db8	Merge "validate min_q against max_q"	2011-01-31 07:33:55 -08:00
John Koleszar	2d03f073a7	validate min_q against max_q min_q is required to be <= max_q. Change-Id: I28eccf96df3b52a94913762b54c4fbe0d021ce5e	2011-01-31 10:33:00 -05:00
John Koleszar	de4b3352b8	Merge remote branch 'internal/upstream' into HEAD Conflicts: configure Change-Id: I74063d859de31a62285c8908bcb1821e050b9f3c	2011-01-31 09:11:52 -05:00
John Koleszar	933dfe0a94	Merge remote branch 'origin/master' into experimental Conflicts: configure Change-Id: I18c2292256d2387ff09da209aa9cf6891e1864a0	2011-01-31 09:10:35 -05:00
Adrian Grange	408a8adc15	Merge "Changed condition for using RD in Intra Mode"	2011-01-31 02:18:40 -08:00
Yaowu Xu	8f279596cb	change the threshold of DC check for encode breakout Previously, the DC check is to make sure there is no code-able DC shift for quantizer Q0, which has been verified rather conservative. This commit changes the criteria to have two components, DC and AC, to address the conservativeness. First, it checks if all AC energy is enough to contribute a single non-zero quantized AC coefficient. Second, for DC, the decision to skip further considers two possible scenarios: 1. There is no code-able 2nd order DC coefficient at all; 2 The residue is relatively flat, but the uniform DC change is very small, i.e. less than 1/2 gray level per pixel. Comparing to previous criteria, the new criteria is about 10% to 15% faster in encoding time with a very small quality loss. (threshold ~1000 and quality range 33db-45db) It should be noted that this commit enables "automatic" static threshold for encodebreakout if a non-zero small value is passed in to encoder. Change-Id: I0f77719a1ac2c2dfddbd950d84920df374515ce3	2011-01-28 09:43:23 -08:00
Johann	f3cb9ae459	Merge "Adds "armvX-none-rvct" targets"	2011-01-28 09:03:58 -08:00
Yunqing Wang	7cbe684ef5	Improve MV prediction in vp8_pick_inter_mode() for speed>3 Applied same method used in vp8_rd_pick_inter_mode() to improve the accuracy of MV prediction. Change-Id: Ia50ae26208b18482695601f32febd99fe89fbc17	2011-01-28 10:00:20 -05:00
Adrian Grange	e9f513d74a	Changed condition for using RD in Intra Mode The condition for using RD when selecting the intra coding mode for a MB is that the RD flag is set AND we're not in real-time mode. Previously the code used RD if either the RD flag was set OR we were not using real-time mode. Change-Id: Ic711151298468a3f99babad39ba8375f66d55a08	2011-01-28 14:47:36 +00:00
John Koleszar	f1db3e8358	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/encoder/rdopt.c Change-Id: I68d04397a12f565b9f1bd35d4e50f1cc9afb76ff	2011-01-28 08:37:44 -05:00
John Koleszar	a4c887da63	Merge remote branch 'origin/master' into experimental Conflicts: vp8/encoder/rdopt.c Change-Id: Ic17907df70fff45c9e766b5d0cbab0c5f1a1095f	2011-01-28 08:33:52 -05:00
Paul Wilkins	dcb23e2aaa	Inconsistent distortion metric in vp8_rd_pick_intra_mbuv_mode This function was using a variance metric compared to and SSE metric in other places (eg. vp8_rd_inter_uv) Change-Id: I9109fcc5a13bca9db1d7ead500fe14999ab233eb	2011-01-28 13:13:30 +00:00
Tero Rintaluoma	11a222f5d9	Adds "armvX-none-rvct" targets Adds following targets to configure script to support RVCT compilation without operating system support (for Profiler or bare metal images). - armv5te-none-rvct - armv6-none-rvct - armv7-none-rvct To strip OS specific parts from the code "os_support"-config was added to script and CONFIG_OS_SUPPORT flag is used in the code to exclude OS specific parts such as OS specific includes and function calls for timers and threads etc. This was done to enable RVCT compilation for profiling purposes or running the image on bare metal target with Lauterbach. Removed separate AREA directives for READONLY data in armv6 and neon assembly files to fix the RVCT compilation. Otherwise "ldr <reg>, =label" syntax would have been needed to prevent linker errors. This syntax is not supported by older gnu assemblers. Change-Id: I14f4c68529e8c27397502fbc3010a54e505ddb43	2011-01-28 12:47:39 +02:00
Johann	73207a1d8b	warning: pointer targets differ in signedness vp8/encoder/rdopt.c:728: warning: pointer targets in passing argument 3 of 'macro_block_yrd' differ in signedness vp8/encoder/rdopt.c:541: note: expected 'int ' but argument is of type 'unsigned int ' distortion is signed when calling macro_block_yrd is both other cases, as well as for RDCOST Change-Id: I5e22358b7da76a116f498793253aac8099cb3461	2011-01-27 11:53:26 -05:00
Johann	27000ed6d9	clean up implicit declaration warnings for neon Change-Id: I6ca2d89f355839c4c770773c09fc69dcea7c1406 warning: implicit declaration of function 'vp8_variance_halfpixvar16x16_[h\|v\|hv]_neon' 'vp8_sub_pixel_variance16x16_neon_func'	2011-01-27 11:31:59 -05:00
Scott LaVarnway	8a5c255b3d	Merge "Removed unused members from VP8_COMP"	2011-01-27 08:12:22 -08:00
Yunqing Wang	bb30ffc4dc	Merge "Remove copies of same functions"	2011-01-27 08:11:26 -08:00
Yunqing Wang	3ee4e1e79f	Merge "Refine motion vector prediction for NEWMV mode"	2011-01-27 08:10:53 -08:00
Scott LaVarnway	3c18a2bb2e	Performance improvement of first pass Improved the performance of the first pass only (~6% on 720p test clip) by making use of LUT instead of the float calculations. Might try a SIMD version later. Also started to make use of int_mv instead of MV. Change-Id: If2a217c7d6b59cd2c25c5553e0ca7e0502403af8	2011-01-26 16:42:56 -05:00
Yunqing Wang	cac54404b9	Remove copies of same functions Reduce the code size. Change-Id: I2e1998557a3c8776e262c442fd758c25e17aff7a	2011-01-26 15:37:00 -05:00
Scott LaVarnway	c4887da39c	Removed unused members from VP8_COMP Change-Id: I8f3f2642b02975fbdb14982984a29821f80d30d3	2011-01-26 15:07:17 -05:00
Paul Wilkins	35bb74a6bd	Rationalize vp8_rd_pick_intra16x16mby_mode() Use the function macro_block_yrd() to calculate error and distortion in keeping with what is done for inter frames. The old code was using a variance metric for once case and an SSE function for measuring distortion in the other case. The function vp8_encode_intra16x16mbyrd() is no longer used. Change-Id: Ic228cb00a78ff637f4365b43f58fbe5a9273d36f	2011-01-26 18:46:34 +00:00
Paul Wilkins	e8e09d33df	Merge "Correction to buffer update for non-viewable frames."	2011-01-26 09:33:48 -08:00
Yaowu Xu	82266a1ac9	Merge "cap the best quantizer for 2nd order DC"	2011-01-26 09:27:11 -08:00
John Koleszar	be3e0ff7c3	Merge "Adds vpx_vp8_enc_asm_offsets.c.o to OBJS-yes list"	2011-01-26 07:29:19 -08:00
Attila Nagy	0def48b60f	Adds vpx_vp8_enc_asm_offsets.c.o to OBJS-yes list Change-Id: Ibd6e3bc82471839904b1086b499efc55f7c5cbaf	2011-01-26 17:06:09 +02:00
Paul Wilkins	a3f71ccff6	Correction to buffer update for non-viewable frames. The code previously tested cpi->common.refresh_alt_ref_frame but there are situations where this flag may be set for viewable frames. The correct test should be !cm->show_frame. Change-Id: Ia1a600622992a4a68fe1d38ac23bf6b34b133688	2011-01-26 12:52:31 +00:00
Paul Wilkins	2caa36aa4f	Merge "Fix for incorrect variable declaration."	2011-01-26 01:53:53 -08:00
Yaowu Xu	999e155f55	cap the best quantizer for 2nd order DC This commit also removes artificial RDMULT cap for low quantizers. The intention is to address some abnormal behavior of mode selections at the low quantizer end, where many macroblocks were coded with SPLITMV with all partitions using same motion vector including (0,0). This change improves the compression quality substantially for high quality encodings in both PSNR and SSIM terms. Overall effect on mid/low rate range is also positive for all metrics, but smaller in magnitude. Change-Id: I864b29c4bd9ff610d2545fa94a19cc7e80c02667	2011-01-25 22:26:18 -08:00
John Koleszar	794ff6843f	Merge remote branch 'internal/upstream' into HEAD	2011-01-26 00:05:16 -05:00

... 8 9 10 11 12 ...

1506 Commits