generic-library/vpx

Author	SHA1	Message	Date
Scott LaVarnway	b07e5b6fa1	Finished vp8_sixtap_predict4x4_ssse3 function Added vp8_filter_block1d4_h6_ssse3 and vp8_filter_block1d4_v6_ssse3 assembly routines. Also removed unused assembly. Change-Id: I01c1021835f2edda9da706822345f217087ca0d0	2010-08-11 13:49:00 -04:00
Johann	c0ba42d3c0	rename DETOK_[AL] everything else uses lowercase detok Change-Id: I9671e2e90eb2961208dfa81c00b3accb5749ec04	2010-08-11 13:36:35 -04:00
Scott LaVarnway	99f46d62d9	Moved gf_active code to encoder only The gf_active code is only used by the encoder, so it was moved from common and decoder. Change-Id: Iada15acd5b2b33ff70c34668ca87d4cfd0d05025	2010-08-11 11:54:25 -04:00
Yaowu Xu	c404fa42ac	Removed duplicate functions Change-Id: Ie587972ccefd3c762b8cdf8ef39345cd22924b9b	2010-08-10 21:45:34 -07:00
Yaowu Xu	3b95a46c55	Normalize quantizer's zero bin and rounding factors This patch changes a few numbers in the two constant arrays for quantizer's zerobin and rounding factors, in general to make the sum of the two factors for any Q to be 128. While it might be beneficial to calibrate the two arrays for best quantizer performance, it is not the purpose of this patch. Normalizing the two arrays will enable quick optimization of the current faster quantizer, i.e .zerobin check can be removed. Change-Id: If9abfd7929bf4b8e9ecd64a79d817c6728c820bd	2010-08-10 21:12:04 -07:00
Timothy B. Terriberry	8fa38096a3	Add trellis quantization. Replace the exponential search for optimal rounding during quantization with a linear Viterbi trellis and enable it by default when using --best. Right now this operates on top of the output of the adaptive zero-bin quantizer in vp8_regular_quantize_b() and gives a small gain. It can be tested as a replacement for that quantizer by enabling the call to vp8_strict_quantize_b(), which uses normal rounding and no zero bin offset. Ultimately, the quantizer will have to become a function of lambda in order to take advantage of activity masking, since there is limited ability to change the quantization factor itself. However, currently vp8_strict_quantize_b() plus the trellis quantizer (which is lambda-dependent) loses to vp8_regular_quantize_b() alone (which is not) on my test clip. Patch Set 3: Fix an issue related to the cost evaluation of successor states when a coefficient is reduced to zero. With this issue fixed, now the trellis search almost exactly matches the exponential search. Patch Set 2: Overall, the goal of this patch set is to make "trellis" search to produce encodings that match the exponential search version. There are three main differences between Patch Set 2 and 1: a. Patch set 1 did not properly account for the scale of 2nd order error, so patch set 2 disable it all together for 2nd blocks. b. Patch set 1 was not consistent on when to enable the the quantization optimization. Patch set 2 restore the condition to be consistent. c. Patch set 1 checks quantized level L-1, and L for any input coefficient was quantized to L. Patch set 2 limits the candidate coefficient to those that were rounded up to L. It is worth noting here that a strategy to check L and L+1 for coefficients that were truncated down to L might work. (a and b get trellis quant to basically match the exponential search on all mid/low rate encodings on cif set, without a, b, trellis quant can hurt the psnr by 0.2 to .3db at 200kbps for some cif clips) (c gets trellis quant to match the exponential search to match at Q0 encoding, without c, trellis quant can be 1.5 to 2db lower for encodings with fixed Q at 0 on most derf cif clips) Change-Id: Ib1a043b665d75fbf00cb0257b7c18e90eebab95e	2010-08-10 20:58:24 -07:00
Scott LaVarnway	e4fe866949	Added ssse3 version of sixtap filters Improved decoder performance by 9% for the clip used. Change-Id: I8fc5609213b7bef10248372595dc85b29f9895b9	2010-08-10 17:33:49 -04:00
Yunqing Wang	ba2e107d28	First modification of multi-thread decoder This is the first modification of VP8 multi-thread decoder, which uses same threads to decode macroblocks and then do loopfiltering for each frame. Inspired by Rob Clark, synchronization was done on every 8 macroblocks instead of every macroblock to reduce lock contention. Comparing with the original code, this implementation gave about 15%- 20% performance gain while decoding my test clips on a Core2 Quad platform (Linux). The work is not done yet. Test on other platforms are needed. Change-Id: Ice9ddb0b511af1359b9f71e65066143c04fef3b5	2010-08-10 14:09:57 -04:00
John Koleszar	618c7d27a0	Mark loopfilter C functions as static Clang defaults to C99 mode, and inline works differently in C99. (gcc, on the other hand, defaults to a special gnu-style inlining, which uses different syntax.) Making the functions static makes sure clang doesn't decide to discard a function because it's too large to inline. Thanks to eli.friedman for the patch. Fixes http://code.google.com/p/webm/issues/detail?id=114 Change-Id: If3c1c3c176eb855a584a60007237283b0cc631a4	2010-08-09 09:36:44 -04:00
John Koleszar	cfb204eaf7	Merge "Issue 150: Fixing linker warning in extend.c."	2010-08-02 09:35:05 -07:00
John Koleszar	4e6827a013	configure: support directories containing .o Fixes http://code.google.com/p/webm/issues/detail?id=96 The regex which postprocesses the gcc make-deps (-M) output was too greedy and matching in the dependencies part of the rule rather than the target only. The patch provided with the issue was not correct, as it tried to match the .o at the end of the line, which isn't correct at least for my GCC version. This patch matches word characters instead of .* Thanks to raimue and the MacPorts community for isolating this issue. Change-Id: I28510da2252e03db910c017101d9db12e5945a27	2010-08-02 10:21:55 -04:00
Jan Kratochvil	0e8f108fb0	nasm: avoid space before the :data symbol type. global label:data ^^ Provide nasm compatibility. No binary change by this patch with yasm on {x86_64,i686}-fedora13-linux-gnu. Few longer opcodes with nasm on {x86_64,i686}-fedora13-linux-gnu have been checked as safe. Change-Id: I10f17eb1e4d4a718d4ebd1d0ccddc807c365e021	2010-08-02 09:20:42 -04:00
Jan Kratochvil	0327d3df90	nasm: end labels with colon (':') Labels should end by colon (':'), nasm requires it. Provide nasm compatibility. No binary change by this patch with yasm on {x86_64,i686}-fedora13-linux-gnu. Few longer opcodes with nasm on {x86_64,i686}-fedora13-linux-gnu have been checked as safe. Change-Id: I0b2ec6f01afb061d92841887affb5ca0084f936f	2010-08-02 09:20:03 -04:00
Jan Kratochvil	c8134bc54a	nasm: use OWORD vs DQWORD nasm knows only OWORD. yasm knows both OWORD and DQWORD. Provide nasm compatibility. No binary change by this patch with yasm on {x86_64,i686}-fedora13-linux-gnu. Few longer opcodes with nasm on {x86_64,i686}-fedora13-linux-gnu have been checked as safe. Change-Id: I62151390089e90df9a7667822fa594ac20b00e78	2010-08-02 09:17:14 -04:00
John Koleszar	675298216d	Merge "Replace pinsrw (SSE) with MMX instructions"	2010-08-02 06:16:26 -07:00
Philip Jägenstedt	7d243701d9	Replace pinsrw (SSE) with MMX instructions Fixes http://code.google.com/p/webm/issues/detail?id=136 Change-Id: I5a3e294061644a1a9718e8ba4a39548ede25cc42	2010-08-02 09:15:45 -04:00
John Koleszar	38a20e030f	apple: include proper mach primatives Fixes implicit declaration warning for 'mach_task_self'. Patch courtesy of timeless at gmail.com Change-Id: I9991dedd1ccfddc092eca86705ecbc3b764b799d	2010-07-29 17:04:44 -04:00
Yaowu Xu	c2a8d8b54c	Merge "Enable the switch between two versions of quantizer"	2010-07-29 07:17:40 -07:00
Frank Galligan	062e6c1886	Removed two unused global variables. Removed the global variables vp8_an and vp8_cd. vp8_an was causing problems because it was increasing the .bss by 1572864 bytes. Change-Id: I6c12e294133c7fb6e770c0e4536d8287a5720a87	2010-07-28 17:25:09 -04:00
Yaowu Xu	f95c80b60f	Enable the switch between two versions of quantizer To facilitate more testing related to quantizer and rate control, the old version quantizer is added back. old and new quantizer can be switched back and forth by define or un-define the macro "EXACT_QUANT". Change-Id: Ia77e687622421550f10e9d65a9884128a79a65ff	2010-07-28 10:51:34 -07:00
John Koleszar	23d68a5f30	configure: pass original arguments through to make dist When running configure automatically through the make dist target, reuse the arguments passed to the original configure command. Change-Id: I40e5b8384d6485a565b91e6d2356d5bc9c4c5928	2010-07-27 14:32:07 -04:00
John Koleszar	aa82363c46	Merge "msvs: fix install of codec sources"	2010-07-27 11:21:42 -07:00
Johann	a570bbd418	x86/sse2: disable asm quantizer follow up to Change I0e51492d: neon: disable asm quantizer Now x86 doesn't segfault with --disable-runtime-cpu-detect and -p=2 Change-Id: I8ca127bb299198efebbcbd5a661e81788361933f	2010-07-27 12:54:43 -04:00
Johann	b9a038a5ed	Fix build w/o RTCD So many places to update ... Change-Id: Ide957b40cc833f99c2d1849acade6850fbf7585d	2010-07-27 11:56:19 -04:00
John Koleszar	d8009c077a	neon: disable asm quantizer The assembly version of the quantizer has not been updated to match the new exact quantizer introduced in commit `e04e2935`. That commit tried to disable this code but missed the non-RTCD case. Thanks to David Baker <david.baker at openmarket.com> for isolating the issue and testing this fix. Change-Id: I0e51492dc6f8e44d2c10b587427448bf94135c65	2010-07-27 11:16:19 -04:00
Fritz Koenig	1743f9486b	Merge "update arm idct functions"	2010-07-26 06:05:39 -07:00
Fritz Koenig	3de8a95831	Merge changes I896fe6f9,I90d8b167 * changes: Change the x86 idct functions to do reconstruction at the same time Combine idct and reconstruction steps	2010-07-26 06:05:30 -07:00
Johann	56f5a9a060	update arm idct functions Jeff Muizelaar posted some changes to the idct/reconstruction c code. This is the equivalent update for the arm assembly. This shows a good boost on v6, and a minor boost on neon. Here are some numbers for highway in qcif, 2641 frames: HEAD neon: ~161 fps new neon: ~162 fps HEAD v6: ~102 fps new v6: ~106 fps The following functions have been updated for armv6 and neon: vp8_dc_only_idct_add vp8_dequant_idct_add vp8_dequant_dc_idct_add Conflicts: vp8/decoder/arm/armv6/dequantdcidct_v6.asm vp8/decoder/arm/armv6/dequantidct_v6.asm Resolved by removing these files. When I rewrote the functions, I also moved the files to dequant_dc_idct_v6.asm/dequant_idct_v6.asm Change-Id: Ie3300df824d52474eca1a5134cf22d8b7809a5d4	2010-07-26 08:55:19 -04:00
Justin Lebar	1d8277f8e8	Issue 150: Fixing linker warning in extend.c.	2010-07-23 16:42:25 -07:00
Fredrik Söderquist	2add72d9bc	Don't dereference ctx->priv if it hasn't been setup correctly.	2010-07-23 19:13:50 -04:00
Fredrik Söderquist	eafcf918a0	Only touch ctx->priv if vp8_mmap_alloc succeeded.	2010-07-23 19:13:34 -04:00
Jeff Muizelaar	98fcccfe97	Change the x86 idct functions to do reconstruction at the same time Change-Id: I896fe6f9664e6849c7cee2cc6bb4e045eb42540f	2010-07-23 15:21:36 -04:00
Jeff Muizelaar	b2fa74ac18	Combine idct and reconstruction steps This moves the prediction step before the idct and combines the idct and reconstruction steps into a single step. Combining them seems to give an overall decoder performance improvement of about 1%. Change-Id: I90d8b167ec70d79c7ba2ee484106a78b3d16e318	2010-07-23 15:21:36 -04:00
Fritz Koenig	0ce3901282	Swap alt/gold/new/last frame buffer ptrs instead of copying. At the end of the decode, frame buffers were being copied. The frames are not updated after the copy, they are just for reference on later frames. This change allows multiple references to the same frame buffer instead of copying it. Changes needed to be made to the encoder to handle this. The encoder is still doing frame buffer copies in similar places where pointer reference could be done. Change-Id: I7c38be4d23979cc49b5f17241ca3a78703803e66	2010-07-23 14:53:59 -04:00
Paul Wilkins	68cf24310b	Merge commit 'refs/changes/51/351/1' of ssh://review.webmproject.org:29418/libvpx into KfRateBugMerged	2010-07-23 17:45:26 +01:00
Yaowu Xu	f5cf8553a2	Merge "Make the quantizer exact."	2010-07-23 09:26:26 -07:00
Paul Wilkins	9404c7db6d	Rate control bug with long key frame interval. In two pass encodes, the calculation of the number of bits allocated to a KF group had the potential to overflow for high data rates if the interval is very long. We observed the problem in one test clip where there was one section where there was an 8000 frame gap between key frames. Change-Id: Ic48eb86271775d7573b4afd166b567b64f25b787	2010-07-23 17:01:12 +01:00
Timothy B. Terriberry	e04e293522	Make the quantizer exact. This replaces the approximate division-by-multiplication in the quantizer with an exact one that costs just one add and one shift extra. The asm versions have not been updated in this patch, and thus have been disabled, since the new method requires different multipliers which are not compatible with the old method. Change-Id: I53ac887af0f969d906e464c88b1f4be69c6b1206	2010-07-23 08:48:01 -07:00
Paul Wilkins	d576690ba1	80 character line length on Arnr LUT Tweaked table to fit to 80 characters. Change-Id: Ie6ba80e0b31b33e23d2bf78599abe223369fcefb	2010-07-23 16:47:54 +01:00
Fritz Koenig	08eed049d4	Remove CONFIG_NEW_TOKENS files. These files were out of date and no longer maintained. Token decoding has implemented the no-crash code which is incompatible with this arm assembly code. Change-Id: Ibf729886c56fca48181af60b44bda896c30023fc	2010-07-22 19:00:21 -04:00
John Koleszar	4d86ef3534	msvs: fix install of codec sources The libs.mk file must be installed for the vpx.vcproj file to be generated. It was being installed, but not in the src/ directory as expected. Also missed include files yasm.rules, quantize_x86.h Change-Id: Ic1a6f836e953bfc954d6e42a18c102a0114821eb	2010-07-22 18:33:25 -04:00
Tom Finegan	b791dca979	Change devenv.com command line. Change /build to -build to avoid problems when builds are run within msys bash shells. Change-Id: Ie68d72f702adad00d99be8a01c7a388c3af7657d	2010-07-22 17:51:17 -04:00
Tom Finegan	72d4ba92f0	Add vs9 targets. Add targets x86-win32-vs9 and x86_64-win64-vs9 for support of Visual Studio 2008-- this removes the need to convert the vs8 projects before using them within the IDE. Change-Id: Idb83e2ae701e07d98db1be71638280a493d770a2	2010-07-22 13:44:16 -04:00
Johann	160d671e34	Merge "limit range checking code for L[k] to CONFIG_DEBUG. patch by timeless@gmail.com"	2010-07-21 12:59:39 -07:00
Yaowu Xu	7a89d4c3d4	Merge "Improve the accuracy of forward walsh-hadamard transform"	2010-07-19 07:50:26 -07:00
Paul Wilkins	0ba32632cd	ARNR Lookup Table. Change submitted for Adrian Grange. Convert threshold calculation in ARNR filter to a lookup table. Change-Id: I12a4bbb96b9ce6231ce2a6ecc2d295610d49e7ec	2010-07-19 14:46:42 +01:00
Paul Wilkins	02277b8aa3	Parameter limit change. Change maximum ARNR filter width to 15. Change-Id: I3b72450ea08e96287445ec18810630ee2292954c	2010-07-19 14:39:43 +01:00
Paul Wilkins	bf18069ceb	Rate control fix for ARNR filtered frames. Previously we had assumed that it was necessary to give a full frame's bit allocation to the alt ref frame if it has been created through temporal filtering. This is not the case. The active max quantizer control insures that sufficient bits are allocated if needed and allocating a full frame's worth of bits creates an excessive overhead for the ARF. Change-Id: I83c95ed7bc7ce0e53ccae6ff32db5a97f145937a	2010-07-19 14:10:07 +01:00
Paul Wilkins	7c938f4d3c	Fix: Incorrect 'cols' calculation in temporal filter. Change-Id: I37f10fbe4fbb505c1d34980a59af3e817c287e22	2010-07-16 15:57:17 +01:00
Michael Kohler	80f0e7a7d0	limit range checking code for L[k] to CONFIG_DEBUG. patch by timeless@gmail.com	2010-07-12 18:41:45 +02:00

... 33 34 35 36 37 ...

1894 Commits