generic-library/vpx

Author	SHA1	Message	Date
John Koleszar	209d82ad72	Add half-pixel variance RTCD functions NEON has optimized 16x16 half-pixel variance functions, but they were not part of the RTCD framework. Add these functions to RTCD, so that other platforms can make use of this optimization in the future and special-case ARM code can be removed. A number of functions were taking two variance functions as parameters. These functions were changed to take a single parameter, a pointer to a struct containing all the variance functions for that block size. This provides additional flexibility for calling additional variance functions (the half-pixel special case, for example) and by initializing the table for all block sizes, we don't have to construct this function pointer table for each macroblock. Change-Id: I78289ff36b2715f9a7aa04d5f6fbe3d23acdc29c	2010-10-26 20:00:56 -07:00
Fritz Koenig	a0ccc97d8a	postproc: Add mode and refrence frame visualizers. Post process option to color the block for either the mode of the macro block, or the frame that the macro block references. Change-Id: Ie498175497f2d20e3319924d352dc4ddc16f4134	2010-10-26 16:00:14 -07:00
John Koleszar	75afcee962	vpxenc: add deterministic output option By baking the version number into the output file, a hash of the file will vary from commit to commit, even if the output is otherwise bit exact. Add a -D option to suppress this behavior, for use when bisecting or other debugging. Change-Id: I5089a8ce5719920ffaf47620fa9069b81fa15673	2010-10-26 16:22:22 -04:00
John Koleszar	91d927f9b3	Merge "Update AUTHORS"	2010-10-26 13:10:59 -07:00
John Koleszar	ad760b9660	Update AUTHORS Change-Id: I18e0a9e00731c23a2bdd1a978c8cb38f71e9029d	2010-10-26 16:10:22 -04:00
John Koleszar	d6c67f02c9	make vp8_recon16x16mb{,y} RTCD functions ARM NEON has a platform specific version of vp8_recon16x16mb, though it's just a stub to extract the various parameters from the MACROBLOCKD struct and pass them to vp8_recon16x16mb_neon(). Using that function's prototype directly will be a better long term solution, but it's quite an invasive change. Change-Id: I04273149e2ade34749e2d09e7edb0c396e1dd620	2010-10-26 13:23:36 -04:00
John Koleszar	96cf6588de	make arm hex search the generic implementation The ARM version of vp8_hex_search() is a faster implementation of the same algorithm. Since it doesn't use any ARM specific code, it can be made the default implementation. This removes a linking error. Change-Id: I77d10f2c16b2515bff4522c350004e03b7659934	2010-10-26 10:46:31 -04:00
John Koleszar	1e7c05e0b4	Merge "add missing GET_GOT/RESTORE_GOT pairs"	2010-10-26 07:05:21 -07:00
John Koleszar	19638c2309	arm: move unrolled loops back to generic code Some of the ARM functions differed from their generic counterparts only by unrolling their loops. Since this change may be useful on other platforms, or might even supercede the looped version in the generic case, move it back to the generic file. This code is left under #if ARCH_ARM for now, but it may be worth considering a different (possibly new) conditional for these. If it turns out that this should be runtime selectable, these functions will have to move to the RTCD infrastructure. Don't want to take that step at this time without more profile data. Change-Id: I4612fdbc606fbebba4971a690fb743ad184ff15f	2010-10-26 09:51:35 -04:00
John Koleszar	d330a5876b	arm: remove duplicate functions These functions were true duplicates of functions present in the generic code. This fixes some of the link errors when building with --enable-shared --enable-pic. Change-Id: Idff26599d510d954e439207883607ad6b74df20c	2010-10-26 09:37:44 -04:00
Jim Bankoski	0a5a638c60	Merge commit 'refs/changes/09/809/1' of https://review.webmproject.org/p/libvpx	2010-10-26 07:34:57 -04:00
John Koleszar	b523dd51bd	add missing GET_GOT/RESTORE_GOT pairs These functions made global references but did not set up the GOT, causing compilation failures in PIC mode. Change-Id: Iac473bf46733f87eb2e001cd736af4acf73fa51d	2010-10-25 23:45:02 -04:00
John Koleszar	bdc9262a6d	Merge WebM input/output branch Change-Id: I83a6f18d2314e5d97759b4ae49afaa52fd8b3c44	2010-10-25 22:59:23 -04:00
John Koleszar	0a6bf29eb9	vpxenc: warn against webm output to pipes The WebM writer requires a seekable stream. Change-Id: I192e00706a0685362d41b8d2faf80add63d564b9	2010-10-25 22:35:22 -04:00
John Koleszar	5329189a33	vpxenc: specify output file with -o Requiring the output file to be specified with the -o option opens up the possibility of supporting multiple input files in the future. Change-Id: I14c9b75e9b21184b47081e1ccf30cf4c91315964	2010-10-25 22:32:23 -04:00
John Koleszar	933d44b818	vpxdec: rework default output parameters This patch reworks the default behavior of the tool to output Y4M instead of writing individual raw frames. The relevant controls are now: --yv12, --i420 - These options change the output format to be raw planar data. The output will be Y4M unless one of these options is specified. --flipuv - Swaps the chroma planes. Works with Y4M output. -o, --output - Sets the output filename. Defaults to stdout if not specified. Supports escape character expansion for frame width (%w) height (%h) and sequence number (%1..%9). The --prefix option has been removed in favor of this escape expansion. Since the output defaults to stdout if -o is not specified, an error will be thrown if stdout is not connected to a pipe. This can be overridden by specifying '-o -'. Change-Id: I94e42c57ca75721fdd57a6129e79bcdb2afe5d4d	2010-10-25 22:32:03 -04:00
John Koleszar	4b578ea6c4	vpxdec: replace --quiet with --verbose Be quiet by default, to play nicer with scripts. Change-Id: I68f6c88411fd5487566f268fb73b4e55ae64410c	2010-10-25 22:24:36 -04:00
John Koleszar	5d12e04d16	vpxdec: use the same output for --progress and --summary Update the timing information in-place for the --progress option. Change-Id: I8efea57050db72963c0bc5c994425e7e692d1502	2010-10-25 22:22:40 -04:00
John Koleszar	dea6193d09	usage: fix horizontal alignment of options When showing the command usage information for vpxenc and vpxdec, options with both a short and long version that do not take an argument were not properly aligned. Change-Id: I8d65b5ab85bcb5a5dc8bc0d4b293b5189d56dedb	2010-10-25 22:20:32 -04:00
John Koleszar	28f177cd3a	vpxenc: change --framerate to --fps Saves a little typing. FPS is a well known abbreviation. Change-Id: I53730ea36afb9309732eb1c72c52d824d5365fec	2010-10-25 22:18:13 -04:00
John Koleszar	456bfb196b	vpxenc: output webm by default WebM should be preferred to IVF output, since it has wider tool support. Change-Id: I5ac3d5cb68722e6c8af917cdba32ac01dd5e0ea2	2010-10-25 22:16:02 -04:00
John Koleszar	ea68ee0458	rename ivf{enc,dec} to vpx{enc,dec} The new WebM output support should be preferred to IVF, but we can't change the default behavior of the ivf* tools. There are a few other default behaviors for these tools that are counterintuitive for historical reasons, and changing the binary name provides the opportunity to clean those up as well. This patch takes the first step by renaming the binaries. Change-Id: I647008ae37cc352dd27ec1da7ed13489e0609b24	2010-10-25 22:12:44 -04:00
John Koleszar	dc66630cca	ivfenc: webm output support This patch adds the --webm option, to allow the creation of WebM streams without having to remux ivf into webm. Change-Id: Ief93c114a6913c55a04cf51bce38f594372d0ad0	2010-10-25 22:12:14 -04:00
John Koleszar	f9d9824047	Import webmquicktime webm writer Initial import of the libmkv directory from the webmquicktime[1] project, at commit fedbda1. [1]: git://review.webmproject.org/webmquicktime.git commit fedbda18de899ff94855cb334de7e471036fbf1d Change-Id: I1564a0ebfa72293fc296ee02178196530dfd90e4	2010-10-25 22:06:57 -04:00
Frank Galligan	1258cf62ae	Fixed the timebase parameter of ivfenc. Ivfenc will use timebase if it is set. If it is not set ivfenc will still double the timebase so altref frames will have a unique pts. Patch Set #3: Use integer math to generate source pts. Added a framerate parameter. Increased the default timebase to milliseconds to remove the *2 everywhere. Change-Id: I8d25b5b2cb26deef7eb72d74b5f76c98cafaf4db	2010-10-25 22:04:38 -04:00
John Koleszar	cfe3f9173f	ivfdec: support y4m output from raw input The width and height needed to write the Y4M header can be found by probing the stream with vpx_codec_peek_stream_info(). This also has the consequence of supporting multiple codecs from raw files with automatic detections, should we add additional codecs in the future. Change-Id: I7522a8f4c7577b6ed9876d744c59cd86d30c6049	2010-10-25 22:02:33 -04:00
John Koleszar	ad252daf65	ivfdec: webm reader support This patch enables ivfdec to decode WebM files. WebM demuxing is provided by the Matthew Gregan's Nestegg library. This patch also makes minor changes to the timebase->framerate handling when doing Y4M output. For WebM files, the framerate is guessed by looking at the first second of video. For IVF files, the timebase=1/(2*fps) hack is still in place, but is only used if the timebase denominator is less than 1000. This is in anticipation of change I8d25b5b, which introduces the distinction between framerate and timebase to ivfenc. In the case of high resolution timebases, like 100ns, we would have to guess the framerate like we do for WebM, but since WebM support in ivfenc will deprecate IVF output, we just assume 30fps rather than writing the lookahead code. Change-Id: I1dd8600f13bf6071533d2816f005da9ede4f60a2	2010-10-25 22:01:40 -04:00
Fritz Koenig	1d70aaf08b	Merge "Debug option for drawing motion vectors."	2010-10-25 15:40:22 -07:00
Fritz Koenig	d1a4cce809	Debug option for drawing motion vectors. Postproc level that uses Bresenham's line algorithm to draw motion vectors onto the postproc buffer. Change-Id: I34c7daa324f2bdfee71e84fcb1c50b90fa06f6fb	2010-10-25 15:39:04 -07:00
Johann	a3b002fc90	Merge "quiet compiler"	2010-10-25 13:26:55 -07:00
John Koleszar	56e87274cd	Merge "Remove legacy release.sh script"	2010-10-25 13:23:19 -07:00
Aaron Watry	53f61ce226	Add sparc-solaris-gcc as a build target. Solaris 10 requires -lposix4 to build successfully on gcc. I only have a Sparc machine to test with on Solaris 10, but this change leaves OpenSolaris x86 in a usable state w/ gnu-generic. I am of the belief that this change should fix Solaris 10 on Sparc, but will leave other Solaris architectures as is. If someone has an x86 Solaris 10 machine to test on, they may add x86-solaris-gcc to libvpx/configure and give it a go. Change-Id: I17a282028bb4d3e9fd8764159f95665160f7b62a	2010-10-25 16:20:14 -04:00
Martin Ettl	c3fd2c4ea7	Fix leaked file descriptor with ENTROPY_STATS cppcheck found a leaked file descriptor in the debugging code enabled by defining ENTROPY_STATS. Fixes issue #60. Change-Id: I0c1d0669cb94d44fed77860f97b82763be06b7cb	2010-10-25 13:16:39 -04:00
John Koleszar	2ad4810a21	NASM: trailing slash for ASFLAGS includes Fix out-of-tree builds using NASM. NASM expects its include paths to have a trailing slash. These aren't used used when doing in-tree builds (./configure) Change-Id: I38d469d15acb1b7e65733a2e5ca8c9d86fa4ad86	2010-10-25 10:28:45 -04:00
Johann	385865f820	quiet compiler clean up compiler warnings, man in the yellow hat warnings, and start to remove unused #includes Change-Id: I6267e98d9b3024b6fb1ef2732b29067a33cb96f6	2010-10-25 10:07:35 -04:00
Johann	1376f061da	reuse common loopfilter code there were four versions for the regular and macroblock loopfilters: horizontal [y\|uv] vertical [y\|uv] this moves all the common code into 2 functions: vp8_loop_filter_neon vp8_mbloop_filter_neon this provides no gain in performance. there's a bit of jitter, but it trends down ~0.25-0.5%. however, this is a huge gain maintenance. also, there is the potential to drop some stack usage in the macroblock loopfilter. Change-Id: I91506f07d2f449631ff67ad6f1b3f3be63b81a92	2010-10-25 09:48:50 -04:00
Timothy B. Terriberry	b71962fdc9	Add runtime CPU detection support for ARM. The primary goal is to allow a binary to be built which supports NEON, but can fall back to non-NEON routines, since some Android devices do not have NEON, even if they are otherwise ARMv7 (e.g., Tegra). The configure-generated flags HAVE_ARMV7, etc., are used to decide which versions of each function to build, and when CONFIG_RUNTIME_CPU_DETECT is enabled, the correct version is chosen at run time. In order for this to work, the CFLAGS must be set to something appropriate (e.g., without -mfpu=neon for ARMv7, and with appropriate -march and -mcpu for even earlier configurations), or the native C code will not be able to run. The ASFLAGS must remain set for the most advanced instruction set required at build time, since the ARM assembler will refuse to emit them otherwise. I have not attempted to make any changes to configure to do this automatically. Doing so will probably require the addition of new configure options. Many of the hooks for RTCD on ARM were already there, but a lot of the code had bit-rotted, and a good deal of the ARM-specific code is not integrated into the RTCD structs at all. I did not try to resolve the latter, merely to add the minimal amount of protection around them to allow RTCD to work. Those functions that were called based on an ifdef at the calling site were expanded to check the RTCD flags at that site, but they should be added to an RTCD struct somewhere in the future. The functions invoked with global function pointers still are, but these should be moved into an RTCD struct for thread safety (I believe every platform currently supported has atomic pointer stores, but this is not guaranteed). The encoder's boolhuff functions did not even have _c and armv7 suffixes, and the correct version was resolved at link time. The token packing functions did have appropriate suffixes, but the version was selected with a define, with no associated RTCD struct. However, for both of these, the only armv7 instruction they actually used was rbit, and this was completely superfluous, so I reworked them to avoid it. The only non-ARMv4 instruction remaining in them is clz, which is ARMv5 (not even ARMv5TE is required). Considering that there are no ARM-specific configs which are not at least ARMv5TE, I did not try to detect these at runtime, and simply enable them for ARMv5 and above. Finally, the NEON register saving code was completely non-reentrant, since it saved the registers to a global, static variable. I moved the storage for this onto the stack. A single binary built with this code was tested on an ARM11 (ARMv6) and a Cortex A8 (ARMv7 w/NEON), for both the encoder and decoder, and produced identical output, while using the correct accelerated functions on each. I did not test on any earlier processors. Change-Id: I45cbd63a614f4554c3b325c45d46c0806f009eaa	2010-10-25 09:23:29 -04:00
Johann	e81e30c25d	isolate new temporal filtering code onyx_if is getting pretty big. split out the temporal code to make it easier to look at. Change-Id: I207c3a94c90e91b32e3ea5e1836a53b7a990fabd	2010-10-25 09:11:03 -04:00
John Koleszar	3b9e72b210	Merge "Improve handling of invalid frames." Change-Id: Icef5226a70260607c190126c1c0cc28b796e759c	2010-10-22 11:54:49 -04:00
Timothy B. Terriberry	09bcc1f710	Improve handling of invalid frames. The code was not checking for frame sizes smaller than 3 bytes, and the partition size checks might have failed if the input buffer was within 16MB of the top of the heap. In addition, the reference count on the current frame buffer was not being decremented on error, so after a small number of errors, no new frame buffer could be found and it would run off the list of them. Change-Id: I0c60dba6adb1e2a29df39754f72a56ab6c776b46	2010-10-22 11:50:56 -04:00
Timothy B. Terriberry	8f75ea6b5c	Convert [4][4] matrices to [16] arrays. Most of the code that actually uses these matrices indexes them as if they were a single contiguous array, and coverity produces reports about the resulting accesses that overflow the static bounds of the first row. This is perfectly legal in C, but converting them to actual [16] arrays should eliminate the report, and removes a good deal of extraneous indexing and address operators from the code. Change-Id: Ibda479e2232b3e51f9edf3b355b8640520fdbf23	2010-10-21 17:04:30 -07:00
Frank Galligan	45e6494177	Change altref times to preceding pts+1. Change the pts of the altref frame to be as close as possible to the pts of the preceding frame and still be strictly increasing. Change-Id: Iae3033a4c89ae5a9d0e5c4198e9196e5f3ee57c7	2010-10-21 14:11:58 -04:00
John Koleszar	1ee3ebcd66	Merge "Move firstpass motion map to stats packet"	2010-10-21 11:09:02 -07:00
John Koleszar	bb7dd5b1ba	Move firstpass motion map to stats packet The first implementation of the firstpass motion map for motion compensated temporal filtering created a file, fpmotionmap.stt, in the current working directory. This was not safe for multiple encoder instances. This patch merges this data into the first pass stats packet interface, so that it is handled like the other (numerical) firstpass stats. The new stats packet is defined as follows: Numerical Stats (16 doubles) -- 128 bytes Motion Map -- 1 byte / Macroblock Padding -- to align packet to 8 bytes The fpmotionmap.stt file can still be generated for debugging purposes in the same way that the textual version of the stats are available (defining OUTPUT_FPF in firstpass.c) Change-Id: I083ffbfd95e7d6a42bb4039ba0e81f678c8183ca	2010-10-21 14:04:20 -04:00
Yunqing Wang	4cefb4434f	Add MMWORD PTR/XMMWORD PTR in subtract_sse2.asm Change-Id: Ia649b500ef020225d8bbf611799d0f47658dc2ac	2010-10-21 13:42:24 -04:00
Yunqing Wang	31752f2f41	Merge "Rewrite vp8_short_walsh4x4_sse2()"	2010-10-21 10:31:23 -07:00
Yunqing Wang	0918747520	Merge "Add SSE2 subtract functions"	2010-10-21 10:30:27 -07:00
Fritz Koenig	15acc84f10	Remove stack shadowing for x86-64 x86-64 passes most arguments in registers. There is no need to push them to the stack before using them. Change-Id: I13c683f1358782682ecafaf1df3fb0af23b978ea	2010-10-21 10:28:08 -07:00
Yunqing Wang	fc94ffcea4	Rewrite vp8_short_walsh4x4_sse2() This rewriting reflects changes made in commit "Improve the accuracy of forward walsh-hadamard transform". Since this function is not called much, only a small encoder performance gain (~0.5% ) is seen. Change-Id: Ie9df58a43028a11fd5b115c4bbe3141f7596578b	2010-10-21 13:02:55 -04:00
John Koleszar	28d191ada4	Import nestegg webm/mkv parser Initial import of nestegg[1] parser lib, at commit 0d51131. [1]: http://github.com/kinetiknz/nestegg commit 0d51131519a1014660b5e111e28a78785d76600f Change-Id: I191d388b7e5140ef96624511ccdd65d0e183076d	2010-10-21 10:55:14 -04:00

... 5 6 7 8 9 ...

703 Commits