generic-library/vpx

Author	SHA1	Message	Date
Paul Wilkins	f874391e02	Replaced recode loop test with a function call Replaced existing code to decide if a frame recode is required with a function call. This is to simplify addition of extra clauses that may be needed for the planned constrained quality mode. Also fixed a bug where by alt ref not considered in the test. Change-Id: I3d40bb21abe3e19f8456761e6849deb171738b60	2010-11-17 15:12:04 +00:00
John Koleszar	8d94796cad	vp8mt_alloc_temp_buffers: make prototype return void This function was never called in a context expecting a return value, the return value was always a constant, and the !CONFIG_MULTITHREAD path didn't have a return statement, which caused a compiler warning. This patch changes the function to return void instead. Fixes issue #231 Change-Id: I9ef7f56e54418b7265026c54fc4ed5660c1418d1	2010-11-17 09:13:57 -05:00
John Koleszar	79e2b1f39b	Disable compile warning for ERROR macro The ERROR macro collides wiith the MS SDK on Windows. Since we're not making any win32 calls in this function, just #undef it first to take ownership. Change-Id: Ic18c60dfa3a33c52e6c49d3f4f8d3e7e3ac3341d	2010-11-17 09:08:51 -05:00
John Koleszar	3a778de77a	Merge remote branch 'origin/master' into experimental	2010-11-17 00:05:05 -05:00
Fritz Koenig	99d02c0f9f	Merge "Comments for alt ref flags."	2010-11-16 16:11:39 -08:00
Fritz Koenig	69ee697fef	Comments for alt ref flags. Clarify what the alt ref flags do when encoding. Change-Id: I71f78e0f42edae633fb91840f29dfbe64362c44c	2010-11-16 15:16:24 -08:00
Yaowu Xu	d49da085c0	correct errors in token alphabet descriptions There were a few errors in the comment section that describe VP8 token alphabet table. Change-Id: Ie6728a0e08bc3798893221b60408d5b201064bdc	2010-11-16 10:51:43 -08:00
John Koleszar	791cae74da	Merge remote branch 'origin/master' into experimental	2010-11-16 00:05:04 -05:00
Fritz Koenig	e180255375	Remove stack shadowing for x86-x64 for SAD functions. x86-64 passes arguments in registers. There is no need to push them to the stack before using them. This fixes `15acc84f10` where ebx was not getting preserved on x86. Change-Id: I1214b5f818a0201f75ab6ad7d5c6f448e09b16c2	2010-11-15 10:56:02 -08:00
Paul Wilkins	f4709d2895	Merge "Bad cost tables used in ARNR filtering."	2010-11-15 09:55:35 -08:00
Paul Wilkins	373f5c3144	Bad cost tables used in ARNR filtering. The use of incorrect mv costing tables in the ARNR sub-pel filtering code led to corruption of the altref buffer in some cases, particularly at low data rates. The average gain from this fix is about 0.3% but there are a few extreme cases where nasty and visible artifacts manifested and for these few data points the improvement is > 10%. PGW and AWG Change-Id: I95cc02b196a433e71d0d2bd2b933fe68ed31e796	2010-11-15 17:47:12 +00:00
Yaowu Xu	73189f21b3	Merge "make rdmult adaptive for intra in quantizer RDO"	2010-11-15 09:22:45 -08:00
John Koleszar	25fa447acb	Merge remote branch 'origin/master' into experimental	2010-11-13 00:05:04 -05:00
John Koleszar	7d799d2ced	Merge remote branch 'origin/master' into experimental	2010-11-12 00:05:03 -05:00
Yaowu Xu	ef2f27f10e	make rdmult adaptive for intra in quantizer RDO This intends to correct the tendency that VP8 aggressively favors rate on intra coded frames. Experiments tested different numbers in [0, 1] and found 9/16 overall provided about 2-4% gains for all-intra coded clips based on vpx-ssim metric. The impact on regular encoded clips is much smaller but positive overall. Overall impact on psnr is also positive even though very small. Change-Id: If808553aaaa87fdd44691f9787820ac9856d9f8a	2010-11-11 11:33:35 -08:00
John Koleszar	0a49747b01	quantizer: fix assertion in fast quantizer path The fast quantizer assembly code has not been updated to match the new exact quantizer, which was made the default in commit `6adbe09`. Specifically, they are not aware of the potential for the coefficient to be scaled, which results in the quantized result exceeding the range of the DCT. This patch restores the previous behavior of using the non-shifted coefficients when in the fast quantizer code path, but unfortunately requires rebuilding the tables when switching between the two. Change-Id: I0a33f5b3850335011a06906f49fafed54dda9546	2010-11-11 13:05:20 -05:00
Fritz Koenig	58083cb34d	Revert "Remove stack shadowing for x86-64" This reverts commit `15acc84f10`. Change-Id: Ia640be8cbc134432914849c1750f62575ea084e6	2010-11-11 08:20:02 -08:00
John Koleszar	f225211256	Merge remote branch 'origin/master' into experimental Conflicts: configure Change-Id: Ifa63e4610657f75cb953aa7ca08f997267612cc0	2010-11-11 09:25:10 -05:00
Paul Wilkins	213f7b0907	Merge "Relax rate control for last few frames"	2010-11-11 02:39:20 -08:00
Fritz Koenig	9b1ece2cca	Merge "Remove stack shadowing for x86-64"	2010-11-10 14:36:10 -08:00
Fritz Koenig	5f0e0617ba	FDCT optimizations. Fixed up the fdct for mmx and 8x4 sse2 to match them most recent changes. Change-Id: Ibee2d6c536fe14dcf75cd6eb1c73f4848a56d719	2010-11-10 14:34:02 -08:00
Fritz Koenig	647df00f30	postproc : Re-work posproc calling to allow more flags. Debugging in postproc needs more flags to allow for specific block types to be turned on or off in the visualizations. Must be enabled with --enable-postproc-visualizer during configuration time. Change-Id: Ia74f357ddc3ad4fb8082afd3a64f62384e4fcb2d	2010-11-10 14:14:46 -08:00
Paul Wilkins	513f8e6814	Relax rate control for last few frames VBR rate control can become very noisy for the last few frames. If there are a few bits to spare or a small overshoot then the target rate and hence quantizer may start to fluctuate wildly. This patch prevents further adjustment of the active Q limits for the last few frames. Patch also removes some redundant variables and makes one small bug fix. Change-Id: Ic167831bec79acc9f0d7e4698bcc4bb188840c45	2010-11-10 10:09:45 +00:00
Paul Wilkins	6adbe09058	Tuning for the more exact quantizer. Small changes to the default zero bin and rounding tables. Though the tables are currently the same for the Y1 and Y2 cases I have left them as separate tables in case we want to tune this later. There is now some adjustment of the zbin based on the prediction mode. Previously this was restricted to an adjustment for gf/arf 0,0 MV. The exact quantizer now marginal outperforms and is the default. The overall average gain is about 0.5% Change-Id: I5e4353f3d5326dde4e86823684b236a1e9ea7f47	2010-11-10 09:52:58 +00:00
John Koleszar	3a99784b5e	Merge remote branch 'origin/master' into experimental	2010-11-10 00:05:06 -05:00
John Koleszar	458f4fedd2	Merge "improve average framerate calculation"	2010-11-09 08:52:16 -08:00
John Koleszar	2fa664a4e2	Merge remote branch 'origin/master' into experimental	2010-11-06 00:05:08 -04:00
John Koleszar	4d1b0d2a2d	Merge commit 'fix integer promotion bug in partition size check' Change-Id: I4081917b46013fa8f4218cade8bd12cb2d013aee	2010-11-05 16:49:32 -04:00
John Koleszar	9fb80f7170	fix integer promotion bug in partition size check The check '(user_data_end - partition < partition_size)' must be evaluated as a signed comparison, but because partition_size was unsigned, the LHS was promoted to unsigned, causing an incorrect result on 32-bit. Instead, check the upper and lower bounds of the segment separately. Change-Id: I6266aba7fd7de084268712a3d2a81424ead7aa06	2010-11-05 14:52:53 -04:00
John Koleszar	7a590c902b	Merge remote branch 'origin/master' into experimental Conflicts: configure ivfenc.c vp8/common/alloccommon.c vp8/common/onyxc_int.h vp8/vp8_cx_iface.c	2010-11-05 12:30:33 -04:00
John Koleszar	f7e187d362	improve average framerate calculation Change Ice204e86 identified a problem with bitrate undershoot due to low precision in the timestamps passed to the library. This patch takes a different approach by calculating the duration of this frame and passing it to the library, rather than using a fixed duration and letting the library average it out with higher precision timestamps. This part of the fix only applies to vpxenc. This patch also attempts to fix the problem for generic applications that may have made the same mistake vpxenc did. Instead of calculating this frame's duration by the difference of this frame's and the last frame's start time, we use the end times instead. This allows the framerate calculation to scavenge "unclaimed" time from the last frame. For instance: start \| end \| calculated duration ======+=======+==================== 0ms 33ms 33ms 33ms 66ms 33ms 66ms 99ms 33ms 100ms 133ms 34ms Change-Id: I92be4b3518e0bd530e97f90e69e75330a4c413fc	2010-11-05 08:42:46 -04:00
Fritz Koenig	0e7b60617f	postproc : Update visualizations. Change color reference frame to blend the macro block edge. This helps with layering of visualizations. Add block coloring for intra prediction modes. Change-Id: Icefe0e189e26719cd6937cebd6727efac0b4d278	2010-11-04 10:35:02 -07:00
Fritz Koenig	0a29bd9793	postproc : Fix display of motion vectors. Split motion vectors were all being treated as 4x4 blocks. Now correctly handle 16x8, 8x16, 8x8, 4x4 blocks. Change-Id: Icf345c5e69b5e374e12456877ed7c41213ad88cc	2010-11-02 13:29:13 -07:00
Scott LaVarnway	b8f43aec66	Merge "SSSE3 version of fast quantizer"	2010-11-02 06:27:29 -07:00
Fritz Koenig	90c505f218	Merge "postproc : Added SPLITMV visualization, fix line constrain."	2010-11-01 14:41:41 -07:00
Fritz Koenig	9f61a83bf9	postproc : Added SPLITMV visualization, fix line constrain. Now draw 16 vectors for SPLITMV mode. Fixed constrain line to block divide by zero issues. Blend block was not centering the shaded area correctly. Change-Id: I1edabd8b4e553aac8d980f7b45c80159e9202434	2010-11-01 13:27:13 -07:00
Scott LaVarnway	ff4a71f4c2	SSSE3 version of fast quantizer (test clip: tulip) For good quality mode with speed=1, this gave the encoder a small (2 - 3%) performance boost. Change-Id: I8a1d4269465944ac0819986c2f0be4b0a2ee0b35	2010-11-01 16:24:15 -04:00
Scott LaVarnway	dcee88ea37	Finding first label Using tables for the label count and label offset. Change-Id: Iac3d5b292c37341a881be0af282f5cac3b3e01eb	2010-10-29 10:01:04 -04:00
Yunqing Wang	6614563b8f	Save XMM registers in asm functions XMM6/7 are used in these functions, and need to be saved. Change-Id: I3dfaddaf2a69cd4bf8e8735c7064b17bac5a14e5	2010-10-28 16:59:03 -04:00
Yunqing Wang	f57fc7bcc6	Merge "Fix full-search SAD function crash in Visual Studio"	2010-10-28 13:46:35 -07:00
Yunqing Wang	7e3a1e7361	Fix full-search SAD function crash in Visual Studio Unlike GCC, Visual Studio compiler doesn't allocate SAD output array 16-byte aligned, which causes crash in visual studio. Change-Id: Ia755cf5a807f12929bda8db94032bb3c9d0c2362	2010-10-28 15:26:58 -04:00
Timothy B. Terriberry	c4d7e5e67e	Eliminate more warnings. This eliminates a large set of warnings exposed by the Mozilla build system (Use of C++ comments in ISO C90 source, commas at the end of enum lists, a couple incomplete initializers, and signed/unsigned comparisons). It also eliminates many (but not all) of the warnings expose by newer GCC versions and _FORTIFY_SOURCE (e.g., calling fread and fwrite without checking the return values). There are a few spurious warnings left on my system: ../vp8/encoder/encodemb.c:274:9: warning: 'sz' may be used uninitialized in this function gcc seems to be unable to figure out that the value shortcut doesn't change between the two if blocks that test it here. ../vp8/encoder/onyx_if.c:5314:5: warning: comparison of unsigned expression >= 0 is always true ../vp8/encoder/onyx_if.c:5319:5: warning: comparison of unsigned expression >= 0 is always true This is true, so far as it goes, but it's comparing against an enum, and the C standard does not mandate that enums be unsigned, so the checks can't be removed. Change-Id: Iaf689ae3e3d0ddc5ade00faa474debe73b8d3395	2010-10-27 18:08:04 -07:00
Fritz Koenig	a097e18964	postproc: Tweaks to line drawing and blending. Turned down the blending level to make colored blocks obscure the video less. Not blending the entire block to give distinction to macro block edges. Added configuration so that macro block blending function can be optimized. Change to constrain line as to when dx and dy are computed. Now draw two lines to form an arrow. Change-Id: Id3ef0fdeeab2949a6664b2c63e2a3e1a89503f6c	2010-10-27 13:20:03 -07:00
Yunqing Wang	71ecb5d7d9	Full search SAD function optimization in SSE4.1 Use mpsadbw, and calculate 8 sad at once. Function list: vp8_sad16x16x8_sse4 vp8_sad16x8x8_sse4 vp8_sad8x16x8_sse4 vp8_sad8x8x8_sse4 vp8_sad4x4x8_sse4 (test clip: tulip) For best quality mode, this gave encoder a 5% performance boost. For good quality mode with speed=1, this gave encoder a 3% performance boost. Change-Id: I083b5a39d39144f88dcbccbef95da6498e490134	2010-10-27 13:36:31 -04:00
John Koleszar	a0ae3682aa	Fix half-pixel variance RTCD functions This patch fixes the system dependent entries for the half-pixel variance functions in both the RTCD and non-RTCD cases: - The generic C versions of these functions are now correct. Before all three cases called the hv code. - Wire up the ARM functions in RTCD mode - Created stubs for x86 to call the optimized subpixel functions with the correct parameters, rather than falling back to C code. Change-Id: I1d937d074d929e0eb93aacb1232cc5e0ad1c6184	2010-10-27 13:00:30 -04:00
Johann	927f29a644	Merge "fix implicit declarations"	2010-10-27 09:59:28 -07:00
Johann	787733d855	Merge "RTCD build is bringing old errors to light"	2010-10-27 09:59:01 -07:00
Fritz Koenig	cf127474d8	vpxdec : Change --pp-debug-info to be a bit field. This allows multiple post processor debug levels to be overlayed. i.e. can show colored reference blocks and visual motion vectors. Change-Id: Ic4a1df438445b9f5780fe73adb3126e803472e53	2010-10-27 09:53:37 -07:00
Fritz Koenig	36ff6a6743	Merge "postproc: Add mode and refrence frame visualizers."	2010-10-27 09:04:39 -07:00
Johann	b90a072f10	fix implicit declarations ARM used to explicitly remove this file from the build. With the RTCD changes, that's no longer possible. These errors also exist for x86 w/o RTCD, but that's not the default configuration Change-Id: I3e10e5553ddf3278e8d3c9365ca6fb84f52f5066	2010-10-27 11:21:02 -04:00
Johann	abcf36c758	RTCD build is bringing old errors to light needs to be _recon_ not _recon_recon_ Change-Id: I7a8b9ddcb4fb72c2b723c563932c9ea52ff15982	2010-10-27 10:47:48 -04:00
John Koleszar	1747207700	Merge "Add half-pixel variance RTCD functions"	2010-10-26 20:05:02 -07:00
John Koleszar	1320e54d95	Merge "make vp8_recon16x16mb{,y} RTCD functions"	2010-10-26 20:02:57 -07:00
John Koleszar	87e17737e9	Merge "make arm hex search the generic implementation"	2010-10-26 20:02:37 -07:00
John Koleszar	53f64a7736	Merge "arm: move unrolled loops back to generic code"	2010-10-26 20:02:18 -07:00
John Koleszar	9fdd90c9aa	Merge "arm: remove duplicate functions"	2010-10-26 20:01:54 -07:00
John Koleszar	209d82ad72	Add half-pixel variance RTCD functions NEON has optimized 16x16 half-pixel variance functions, but they were not part of the RTCD framework. Add these functions to RTCD, so that other platforms can make use of this optimization in the future and special-case ARM code can be removed. A number of functions were taking two variance functions as parameters. These functions were changed to take a single parameter, a pointer to a struct containing all the variance functions for that block size. This provides additional flexibility for calling additional variance functions (the half-pixel special case, for example) and by initializing the table for all block sizes, we don't have to construct this function pointer table for each macroblock. Change-Id: I78289ff36b2715f9a7aa04d5f6fbe3d23acdc29c	2010-10-26 20:00:56 -07:00
Fritz Koenig	a0ccc97d8a	postproc: Add mode and refrence frame visualizers. Post process option to color the block for either the mode of the macro block, or the frame that the macro block references. Change-Id: Ie498175497f2d20e3319924d352dc4ddc16f4134	2010-10-26 16:00:14 -07:00
John Koleszar	d6c67f02c9	make vp8_recon16x16mb{,y} RTCD functions ARM NEON has a platform specific version of vp8_recon16x16mb, though it's just a stub to extract the various parameters from the MACROBLOCKD struct and pass them to vp8_recon16x16mb_neon(). Using that function's prototype directly will be a better long term solution, but it's quite an invasive change. Change-Id: I04273149e2ade34749e2d09e7edb0c396e1dd620	2010-10-26 13:23:36 -04:00
John Koleszar	96cf6588de	make arm hex search the generic implementation The ARM version of vp8_hex_search() is a faster implementation of the same algorithm. Since it doesn't use any ARM specific code, it can be made the default implementation. This removes a linking error. Change-Id: I77d10f2c16b2515bff4522c350004e03b7659934	2010-10-26 10:46:31 -04:00
John Koleszar	1e7c05e0b4	Merge "add missing GET_GOT/RESTORE_GOT pairs"	2010-10-26 07:05:21 -07:00
John Koleszar	19638c2309	arm: move unrolled loops back to generic code Some of the ARM functions differed from their generic counterparts only by unrolling their loops. Since this change may be useful on other platforms, or might even supercede the looped version in the generic case, move it back to the generic file. This code is left under #if ARCH_ARM for now, but it may be worth considering a different (possibly new) conditional for these. If it turns out that this should be runtime selectable, these functions will have to move to the RTCD infrastructure. Don't want to take that step at this time without more profile data. Change-Id: I4612fdbc606fbebba4971a690fb743ad184ff15f	2010-10-26 09:51:35 -04:00
John Koleszar	d330a5876b	arm: remove duplicate functions These functions were true duplicates of functions present in the generic code. This fixes some of the link errors when building with --enable-shared --enable-pic. Change-Id: Idff26599d510d954e439207883607ad6b74df20c	2010-10-26 09:37:44 -04:00
Jim Bankoski	0a5a638c60	Merge commit 'refs/changes/09/809/1' of https://review.webmproject.org/p/libvpx	2010-10-26 07:34:57 -04:00
John Koleszar	b523dd51bd	add missing GET_GOT/RESTORE_GOT pairs These functions made global references but did not set up the GOT, causing compilation failures in PIC mode. Change-Id: Iac473bf46733f87eb2e001cd736af4acf73fa51d	2010-10-25 23:45:02 -04:00
Fritz Koenig	1d70aaf08b	Merge "Debug option for drawing motion vectors."	2010-10-25 15:40:22 -07:00
Fritz Koenig	d1a4cce809	Debug option for drawing motion vectors. Postproc level that uses Bresenham's line algorithm to draw motion vectors onto the postproc buffer. Change-Id: I34c7daa324f2bdfee71e84fcb1c50b90fa06f6fb	2010-10-25 15:39:04 -07:00
Johann	a3b002fc90	Merge "quiet compiler"	2010-10-25 13:26:55 -07:00
Martin Ettl	c3fd2c4ea7	Fix leaked file descriptor with ENTROPY_STATS cppcheck found a leaked file descriptor in the debugging code enabled by defining ENTROPY_STATS. Fixes issue #60. Change-Id: I0c1d0669cb94d44fed77860f97b82763be06b7cb	2010-10-25 13:16:39 -04:00
Johann	385865f820	quiet compiler clean up compiler warnings, man in the yellow hat warnings, and start to remove unused #includes Change-Id: I6267e98d9b3024b6fb1ef2732b29067a33cb96f6	2010-10-25 10:07:35 -04:00
Johann	1376f061da	reuse common loopfilter code there were four versions for the regular and macroblock loopfilters: horizontal [y\|uv] vertical [y\|uv] this moves all the common code into 2 functions: vp8_loop_filter_neon vp8_mbloop_filter_neon this provides no gain in performance. there's a bit of jitter, but it trends down ~0.25-0.5%. however, this is a huge gain maintenance. also, there is the potential to drop some stack usage in the macroblock loopfilter. Change-Id: I91506f07d2f449631ff67ad6f1b3f3be63b81a92	2010-10-25 09:48:50 -04:00
Timothy B. Terriberry	b71962fdc9	Add runtime CPU detection support for ARM. The primary goal is to allow a binary to be built which supports NEON, but can fall back to non-NEON routines, since some Android devices do not have NEON, even if they are otherwise ARMv7 (e.g., Tegra). The configure-generated flags HAVE_ARMV7, etc., are used to decide which versions of each function to build, and when CONFIG_RUNTIME_CPU_DETECT is enabled, the correct version is chosen at run time. In order for this to work, the CFLAGS must be set to something appropriate (e.g., without -mfpu=neon for ARMv7, and with appropriate -march and -mcpu for even earlier configurations), or the native C code will not be able to run. The ASFLAGS must remain set for the most advanced instruction set required at build time, since the ARM assembler will refuse to emit them otherwise. I have not attempted to make any changes to configure to do this automatically. Doing so will probably require the addition of new configure options. Many of the hooks for RTCD on ARM were already there, but a lot of the code had bit-rotted, and a good deal of the ARM-specific code is not integrated into the RTCD structs at all. I did not try to resolve the latter, merely to add the minimal amount of protection around them to allow RTCD to work. Those functions that were called based on an ifdef at the calling site were expanded to check the RTCD flags at that site, but they should be added to an RTCD struct somewhere in the future. The functions invoked with global function pointers still are, but these should be moved into an RTCD struct for thread safety (I believe every platform currently supported has atomic pointer stores, but this is not guaranteed). The encoder's boolhuff functions did not even have _c and armv7 suffixes, and the correct version was resolved at link time. The token packing functions did have appropriate suffixes, but the version was selected with a define, with no associated RTCD struct. However, for both of these, the only armv7 instruction they actually used was rbit, and this was completely superfluous, so I reworked them to avoid it. The only non-ARMv4 instruction remaining in them is clz, which is ARMv5 (not even ARMv5TE is required). Considering that there are no ARM-specific configs which are not at least ARMv5TE, I did not try to detect these at runtime, and simply enable them for ARMv5 and above. Finally, the NEON register saving code was completely non-reentrant, since it saved the registers to a global, static variable. I moved the storage for this onto the stack. A single binary built with this code was tested on an ARM11 (ARMv6) and a Cortex A8 (ARMv7 w/NEON), for both the encoder and decoder, and produced identical output, while using the correct accelerated functions on each. I did not test on any earlier processors. Change-Id: I45cbd63a614f4554c3b325c45d46c0806f009eaa	2010-10-25 09:23:29 -04:00
Johann	e81e30c25d	isolate new temporal filtering code onyx_if is getting pretty big. split out the temporal code to make it easier to look at. Change-Id: I207c3a94c90e91b32e3ea5e1836a53b7a990fabd	2010-10-25 09:11:03 -04:00
John Koleszar	3b9e72b210	Merge "Improve handling of invalid frames." Change-Id: Icef5226a70260607c190126c1c0cc28b796e759c	2010-10-22 11:54:49 -04:00
Timothy B. Terriberry	09bcc1f710	Improve handling of invalid frames. The code was not checking for frame sizes smaller than 3 bytes, and the partition size checks might have failed if the input buffer was within 16MB of the top of the heap. In addition, the reference count on the current frame buffer was not being decremented on error, so after a small number of errors, no new frame buffer could be found and it would run off the list of them. Change-Id: I0c60dba6adb1e2a29df39754f72a56ab6c776b46	2010-10-22 11:50:56 -04:00
Timothy B. Terriberry	8f75ea6b5c	Convert [4][4] matrices to [16] arrays. Most of the code that actually uses these matrices indexes them as if they were a single contiguous array, and coverity produces reports about the resulting accesses that overflow the static bounds of the first row. This is perfectly legal in C, but converting them to actual [16] arrays should eliminate the report, and removes a good deal of extraneous indexing and address operators from the code. Change-Id: Ibda479e2232b3e51f9edf3b355b8640520fdbf23	2010-10-21 17:04:30 -07:00
Frank Galligan	45e6494177	Change altref times to preceding pts+1. Change the pts of the altref frame to be as close as possible to the pts of the preceding frame and still be strictly increasing. Change-Id: Iae3033a4c89ae5a9d0e5c4198e9196e5f3ee57c7	2010-10-21 14:11:58 -04:00
John Koleszar	1ee3ebcd66	Merge "Move firstpass motion map to stats packet"	2010-10-21 11:09:02 -07:00
John Koleszar	bb7dd5b1ba	Move firstpass motion map to stats packet The first implementation of the firstpass motion map for motion compensated temporal filtering created a file, fpmotionmap.stt, in the current working directory. This was not safe for multiple encoder instances. This patch merges this data into the first pass stats packet interface, so that it is handled like the other (numerical) firstpass stats. The new stats packet is defined as follows: Numerical Stats (16 doubles) -- 128 bytes Motion Map -- 1 byte / Macroblock Padding -- to align packet to 8 bytes The fpmotionmap.stt file can still be generated for debugging purposes in the same way that the textual version of the stats are available (defining OUTPUT_FPF in firstpass.c) Change-Id: I083ffbfd95e7d6a42bb4039ba0e81f678c8183ca	2010-10-21 14:04:20 -04:00
Yunqing Wang	4cefb4434f	Add MMWORD PTR/XMMWORD PTR in subtract_sse2.asm Change-Id: Ia649b500ef020225d8bbf611799d0f47658dc2ac	2010-10-21 13:42:24 -04:00
Yunqing Wang	31752f2f41	Merge "Rewrite vp8_short_walsh4x4_sse2()"	2010-10-21 10:31:23 -07:00
Yunqing Wang	0918747520	Merge "Add SSE2 subtract functions"	2010-10-21 10:30:27 -07:00
Fritz Koenig	15acc84f10	Remove stack shadowing for x86-64 x86-64 passes most arguments in registers. There is no need to push them to the stack before using them. Change-Id: I13c683f1358782682ecafaf1df3fb0af23b978ea	2010-10-21 10:28:08 -07:00
Yunqing Wang	fc94ffcea4	Rewrite vp8_short_walsh4x4_sse2() This rewriting reflects changes made in commit "Improve the accuracy of forward walsh-hadamard transform". Since this function is not called much, only a small encoder performance gain (~0.5% ) is seen. Change-Id: Ie9df58a43028a11fd5b115c4bbe3141f7596578b	2010-10-21 13:02:55 -04:00
John Koleszar	bdf469c91e	Merge "Update arnr strength range form 1-6 to 0-6."	2010-10-19 20:20:31 -07:00
Frank Galligan	15542721ee	Update arnr strength range form 1-6 to 0-6. Change-Id: I8eb49c56f7509f0a8074d440e8345b9e3344b85b	2010-10-19 20:18:13 -07:00
Yaowu Xu	fc2f8dafaf	Merge "fixed a typo that mis-used Y plane stride for UV blocks."	2010-10-19 16:23:31 -07:00
Yaowu Xu	b9fe6d4da4	Merge "change to make use of more trellis quantization"	2010-10-19 08:11:52 -07:00
Yunqing Wang	4db2076594	Add SSE2 subtract functions Instead of doing 8-bit data unpack and 16-bit subtraction, use psubb to do 16 8-bit subtractions and pcmpgtb to preserve the sign information. This does not bring noticable gain since these functions are not called frequently. Change-Id: I90a0dfaa3db9d422e4ada324076596ffb178548e	2010-10-18 14:15:15 -04:00
Johann	ce1ce992ce	copy compiler warning fixes generic version got fixed, but not the arm version. fixes: vp8/encoder/arm/mcomp_arm.c: In function 'vp8_full_search_sadx3': vp8/encoder/arm/mcomp_arm.c:1208: warning: pointer targets in passing argument 5 of 'fn_ptr->sdx3f' differ in signedness vp8/encoder/arm/mcomp_arm.c:1208: note: expected 'unsigned int ' but argument is of type 'int ' and another unsigned change to keep the files similar Change-Id: I1b6255dc3a03b90394a791ee0d15d8167d9454db	2010-10-18 13:23:39 -04:00
Johann	963bcd6c87	remove dead code vp8_diamond_search_sadx4 isn't used in arm because there is no corrosponding sdx4df as in x86. rather than keep it in sync with ../mcomp.c, delete it vp8_hex_search had the original, more readable/understandable code if`d out. it's also available in ../mcomp.c, so remove the dead copy Change-Id: Ia42aa6e23b3a2e88040f467280befec091ec080e	2010-10-15 15:37:09 -04:00
Yaowu Xu	2e53e9e53f	change to make use of more trellis quantization when a subsequent frame is encoded as an alt reference frame, it is unlikely that any mb in current frame will be used as reference for future frames, so we can enable quantization optimization even when the RD constant is slightly rate-biased. The change has an overall benefit between 0.1% to 0.2% bit savings on the test sets based on vpxssim scores. Change-Id: I9aa7bc5cd573ea84e3ee655d2834c18c4460ceea	2010-10-15 10:14:34 -07:00
Jim Bankoski	39f41a4f36	safety check to avoid divide by 0s	2010-10-14 16:19:06 -04:00
Yunqing Wang	a2b598a2f9	Merge "Fix one gcc compiler warning"	2010-10-14 12:20:25 -07:00
Yunqing Wang	7804befb55	Fix one gcc compiler warning ../libvpx/vp8/encoder/bitstream.c: In function ‘pack_inter_mode_mvs’: ../libvpx/vp8/encoder/bitstream.c:1026: warning: array subscript has type ‘char’ Change-Id: Ic77491e0a172fa1821e5b3e914d0dc41fe87c00f	2010-10-14 15:15:35 -04:00
Yunqing Wang	7f31d987f0	Merge "Improve bounds checking in vp8_diamond_search_sadx4()"	2010-10-14 11:29:24 -07:00
Yunqing Wang	d6da7b8ea1	Improve bounds checking in vp8_diamond_search_sadx4() In order to know if all 4/8 neighbor points are within the bounds, 4 bounds checking are enough instead of checking 4 bounds for each points (16/32 checkings). This improvement reduces cost of vp8_diamond_search_sadx4() by 30%, and gives encoder a 1.5% performance gain (test options: 1 pass, good, speed=4). Change-Id: Ie8da29d18a6ecfc9829e74ac02f6fa70e042331a	2010-10-14 11:06:37 -04:00
Fritz Koenig	1dc0ca1340	Fix compiler warning about vp8_fast_quantize_b_impl_ssse2. Typo had function defined as _ssse2 and prototyped as _sse2. Change-Id: If9f19da1a83cff40774a90cf936d601c0bf1b7fe	2010-10-13 17:08:13 -07:00
Fritz Koenig	92df4a06d2	Correct QWORD usage in assembly files QWORD was being undefined because it was being used incorrectly. Change-Id: I3610cefa3d6f0da4054316760f78b9694cde3876	2010-10-13 16:57:57 -07:00
John Koleszar	136857475e	Centralize mb skip state calculation This patch moves the scattered updates to the mb skip state (mode_info_context->mbmi.mb_skip_coeff) to vp8_tokenize_mb. Recent changes to the quantizer exposed a bug where if a macroblock could be coded as a skip but isn't, the encoder would run the loopfilter but the decoder wouldn't, causing a reference buffer mismatch. The loopfilter is controlled by a flag called dc_diff. The decoder looks at the number of decoded coefficients when setting this flag. The encoder sets this flag based on the skip state, since any skippable macroblock should be transmitted as a skip. The coefficient optimization pass (vp8_optimize_b()) could change the coefficients such that a block that was not a skip becomes one. The encoder was not updating the skip state in this situation for intra coded blocks. The underlying issue predates it, but this bug was recently triggered by enabling trellis quantization on the Y2 block in commit `dcd29e3`, and by changing the quantizer range control in commit `305be4e`. Change-Id: I5cce5da0dbc2d22f7d79ee48149f01e868a64802	2010-10-12 09:03:19 -04:00

1 2 3 4 5 ...

396 Commits