generic-library/vpx

Author	SHA1	Message	Date
Johann	fe74c4286a	Rename quantize_sse2_intrinsics.c The only reason for the _intrinsics part of the file name was for the interim period where only one of the functions was redone and the base file name was the same. Change-Id: I7851154f1633d48821bee885b1cadb2148e65a23	2013-04-24 09:08:56 -07:00
John Koleszar	771fc832f3	Merge branch 'master' into experimental Pick up VP8 encryption, quantization changes, and some fixes to vpxenc Conflicts: test/decode_test_driver.cc test/decode_test_driver.h test/encode_test_driver.cc vp8/vp8cx.mk vpxdec.c vpxenc.c Change-Id: I9fbcc64808ead47e22f1f22501965cc7f0c4791c	2013-03-27 10:46:19 -07:00
Shimon Doodkin	907016fdc7	Remove gcc-specific __label__ Use unique names and ditch the local label declaration. Visual Studio does not support it. https://code.google.com/p/webm/issues/detail?id=561 Change-Id: Ica643cf5abb56ee6156371f5bf73fdeb58014422	2013-03-22 10:08:19 -07:00
Ronald S. Bultje	f60f6db716	Rename quantize_sse2.c to quantize_sse2_intrinsics.c to avoid collision. Change-Id: I5637d491eb6a9b7633f72e03fd9df72131eeb121	2013-03-04 12:25:01 -08:00
Johann	403145032d	Merge "Use intrinsics for sse2 regular quantize"	2013-03-01 17:20:26 -08:00
Johann	eca59cad0b	Use intrinsics for sse2 regular quantize Remove dependency of this function on asm_offsets. ssse3/sse4 next. Change quant_shift calculation so it can be done using SIMD. Pre-calculate as much as possible to simplify EOB selection. Take advantage of qcoeff being zero'd by tying the if statements together. Speed parity with previous implementation with gcc x86_64 linux Change-Id: Ife97556a1eca3a74b09def1a3d04084974dff1fb	2013-02-28 18:06:15 -08:00
Johann	67978d1380	Merge "vp8 fast quantizer with intrinsics"	2013-02-28 11:32:03 -08:00
Jan Kratochvil	82ed3f9a41	Fix --as=nasm compatibility for new asm code. s/movd/movq/ Change-Id: Id1a56de91551f8dc796f14f1056c565dfc1ba626	2013-02-27 09:55:38 -08:00
Johann	ef887974aa	vp8 fast quantizer with intrinsics Reduce dependency on offsets file by using intrinsics. Disassembly shows improvements over previous assembly specifically in register management, preloading, and {pro,epi}log. Speed change is within margin of error. Change-Id: I8131b4b4d62bc092407fe847bfaa8f2c0e1384ff	2013-02-26 10:48:24 -08:00
Frank Galligan	f67d740b34	Add support for x64 and win64 yasm flags. Some projects must define only win64 for Windows 64bit builds using yasm. Change-Id: I1d09590d66a7bfc8b4412e1cc8685978ac60b748	2013-01-31 16:25:37 -08:00
James Zern	9dab3ce624	add emmintrin_compat.h for builds with gcc < 4 Change-Id: If7822e6fcd0d3568b934032322b19ba3e401df26	2012-12-20 14:56:13 -08:00
John Koleszar	a9c7597adc	support building vp8 and vp9 into a single lib Change-Id: Ib8f8a66c9fd31e508cdc9caa662192f38433aa3d	2012-11-15 10:46:17 -08:00
John Koleszar	7b8dfcb5a2	Rough merge of master into experimental Creates a merge between the master and experimental branches. Fixes a number of conflicts in the build system to allow either VP8 or VP9 to be built. Specifically either: $ configure --disable-vp9 $ configure --disable-vp8 --disable-unit-tests VP9 still exports its symbols and files as VP8, so that will be resolved in the next commit. Unit tests are broken in VP9, but this isn't a new issue. They are fixed upstream on origin/experimental as of this writing, but rebasing this merge proved difficult, so will tackle that in a second merge commit. Change-Id: I2b7d852c18efd58d1ebc621b8041fe0260442c21	2012-11-07 11:30:16 -08:00
Ronald S. Bultje	4b2c2b9aa4	Rename vp8/ codec directory to vp9/. Change-Id: Ic084c475844b24092a433ab88138cf58af3abbe4	2012-11-01 16:31:22 -07:00
Ronald S. Bultje	6a4b1e5958	Remove vp8 in local symbols. For non-static functions, change the prefix to vp9_. For static functions, remove the prefix. Also fix some comments, remove unused code or unused function prototypes. Change-Id: I1f8be05362f66060fe421c3d4c9a906fdf835de5	2012-11-01 10:03:43 -07:00
Ronald S. Bultje	982deebb5e	Change name of common top-level structures from VP8 to VP9. This change encompasses VP8_PTR, VP8_COMP, VP8D_COMP, VP8_COMMON, VP8Decompressor and VP8Common. Change-Id: I514ef4ad4e682370f36d656af1c09ee20da216ad	2012-10-31 10:15:08 -07:00
Ronald S. Bultje	43da8f147c	Change non-function symbol vp8_ prefixes to vp9_. For local symbols, make them static instead. Change-Id: I13d60947a46f711bc8991e16100cea2a13e3a22e	2012-10-31 10:15:08 -07:00
Ronald S. Bultje	f88558fb1d	Change encoder vp8_ and vp8cx_ public symbol prefixes to vp9_. Change-Id: Ie2e3652591b010ded10c216501ce24fd95d0aec5	2012-10-30 22:07:07 -07:00
Jim Bankoski	818ee904a9	remove fdct invoke macros Remove the fdct invoke macro calls Change-Id: Ica2431c655819fa012133ee7abc75a16761e5fd6	2012-10-29 11:25:56 -07:00
Jim Bankoski	1838d87771	invoke macro removal encodemb Change-Id: I321280abcf48f3dc16e194d29bde2bd3baec6006	2012-10-29 12:36:50 +00:00
Jim Bankoski	118b2fe962	Remove variance vtable from rtcd Change-Id: Idd2722a538423b451e1e3495f89a7141480493d6	2012-10-21 20:47:57 -07:00
Jim Bankoski	7c15c18c5e	removed the recon rtcd invoke macro code (unrevert) This reinstates reverted commit `2113a83157` Change-Id: I9a9af13497d1e58d4f467e3e083fddf06b1b786c	2012-10-16 12:02:31 -07:00
Yunqing Wang	64075c9b01	Encoder denoiser performance improvement The denoiser function was modified to reduce the computational complexity. 1. The denoiser c function modification: The original implementation calculated pixel's filter_coefficient based on the pixel value difference between current raw frame and last denoised raw frame, and stored them in lookup tables. For each pixel c, find its coefficient using filter_coefficient[c] = LUT[abs_diff[c]]; and then apply filtering operation for the pixel. The denoising filter cost about 12% of encoding time when it was turned on, and half of the time was spent on finding coefficients in lookup tables. In order to simplify the process, a short cut was taken. The pixel adjustments vs. pixel diff value were calculated ahead of time. adjustment = filtered_value - current_raw = (filter_coefficient * diff + 128) >> 8 The adjustment vs. diff curve becomes flat very quickly when diff increases. This allowed us to use only several levels to get a close approximation of the curve. Following the denoiser algorithm, the adjustments are further modified according to how big the motion magnitude is. 2. The sse2 function was rewritten. This change made denoiser filter function 3x faster, and improved the encoder performance by 7% ~ 10% with the denoiser on. Change-Id: I93a4308963b8e80c7307f96ffa8b8c667425bf50	2012-08-31 13:48:13 -07:00
Deb Mukherjee	7d0656537b	Merging in the sixteenth subpel uv experiment Merges this experiment in to make it easier to run tests on filter precision, vectorized implementation etc. Also removes an experimental filter. Change-Id: I1e8706bb6d4fc469815123939e9c6e0b5ae945cd	2012-08-08 16:57:43 -07:00
John Koleszar	c6b9039fd9	Restyle code Approximate the Google style guide[1] so that there's a written document to follow and tools to check compliance[2]. [1]: http://google-styleguide.googlecode.com/svn/trunk/cppguide.xml [2]: http://google-styleguide.googlecode.com/svn/trunk/cpplint/cpplint.py Change-Id: Idf40e3d8dddcc72150f6af127b13e5dab838685f	2012-07-17 11:46:03 -07:00
John Koleszar	0164a1cc5b	Fix pedantic compiler warnings Allows building the library with the gcc -pedantic option, for improved portability. In particular, this commit removes usage of C99/C++ style single-line comments and dynamic struct initializers. This is a continuation of the work done in commit `97b766a46`, which removed most of these warnings for decode only builds. Change-Id: Id453d9c1d9f44cc0381b10c3869fabb0184d5966	2012-06-11 15:14:58 -07:00
Stefan Holmer	cd0bf0e407	Fixes a win build issue related to denoising. Change-Id: I912384f526865089aa03ca8875591324e5c1c449	2012-05-31 15:44:28 +02:00
Stefan Holmer	0927a41139	Fixes a clang linking error. Change-Id: I1d2db53129dc6ec068093ad1e5fc0d94110473b3	2012-05-31 10:52:20 +02:00
Stefan Holmer	d850034443	Added another denoising threshold for finding DC shifts. Compares the sum of differences between the input block and the averaged block. If they differ too much the block will not be filtered. Negligible performance hit. Change-Id: Ib1c31a265efd4d100b3abc4a1ea6675038c8ddde	2012-05-30 16:50:21 +02:00
Alpha Lam	0f7e4665ae	Make libvpx Chromium build friendly Add PRIVATE macro for adding private_extern directive for yasm to hide global symbols. This is only enabled if -DCHROMIUM is used with YASM. Also fixed a small problem with rtcd_defs.sh to guard TEMPORAL_DENOISING. Change-Id: I9027fce3ebddcf20078293e4b86b396f21da7857	2012-05-23 18:15:05 -07:00
Christian Duvivier	38ddb426d0	Inline Intrinsic optimized Denoiser Faster version of denoiser, cut cost by 1.7x for C path, by 3.3x for SSE2 path. Change-Id: I154786308550763bc0e3497e5fa5bfd1ce651beb	2012-05-21 07:54:20 -07:00
Attila Nagy	a91b42f022	Makes all global data in entropy.c const Removes all runtime initialization of global data in entropy.c. Precalculated values are used for initializing all entropy related tables. First patch in a series to make sure code is reentrant. Change-Id: I9aac91a2a26f96d73c6470d772a343df63bfe633	2012-04-17 12:12:58 +03:00
Paul Wilkins	c88d335f7d	Only support improved quant Deprecate fast quant and strict_quant code. Small effect on quality as fast was used in first pass but the effect is basically neutral across the derf set. The rationale here is to reduce the number of code paths for now to make experimentation easier. Optimized and fast code options can be re-introduced later along with other encode speed options. Change-Id: Ia30c5daf3dbc52e72c83b277a1d281e3c934cdad	2012-03-21 18:22:33 +00:00
Johann	e50f96a4a3	Move SAD and variance functions to common The MFQE function of the postprocessor depends on these Change-Id: I256a37c6de079fe92ce744b1f11e16526d06b50a	2012-03-05 16:50:33 -08:00
Deb Mukherjee	88b36eb0d9	Bug fix in ssse3 variance computation. Fixes a bug that was introduced in the high precision mv patch. Change-Id: Ieadb433ebe4c3ef3e0e63944dab11528bf8bd73a	2012-02-24 20:24:54 -08:00
Deb Mukherjee	18e90d744e	Supporting high precision 1/8-pel motion vectors This is the initial patch for supporting 1/8th pel motion. Currently if we configure with enable-high-precision-mv, all motion vectors would default to 1/8 pel. Encode and decode syncs fine with the current code. In the next phase the code will be refactored so that we can choose the 1/8 pel mode adaptively at a frame/segment/mb level. Derf results: http://www.corp.google.com/~debargha/vp8_results/enhinterp_hpmv.html (about 0.83% better than 8-tap interpolation) Patch 3: Rebased. Also adding 1/16th pel interpolation for U and V Patch 4: HD results. http://www.corp.google.com/~debargha/vp8_results/enhinterp_hd_hpmv.html Seems impressive (unless I am doing something wrong). Patch 5: Added mmx/sse for bilateral filtering, as well as enforced use of c-versions of subpel filters with 8-taps and 1/16th pel; Also redesigned the 8-tap filters to reduce the cut-off in order to introduce a denoising effect. There is a new configure option sixteenth-subpel-uv which will use 1/16th pel interpolation for uv, if the motion vectors have 1/8 pel accuracy. With the fixes the results are promising on the derf set. The enhanced interpolation option with 8-taps alone gives 3% improvement over the derf set: http://www.corp.google.com/~debargha/vp8_results/enhinterpn.html Results on high precision mv and on the hd set are to follow. Patch 6: Adding a missing condition for CONFIG_SIXTEENTH_SUBPEL_UV in vp8/common/x86/x86_systemdependent.c Patch 7: Cleaning up various debug messages. Patch 8: Merge conflict Change-Id: I5b1d844457aefd7414a9e4e0e06c6ed38fd8cc04	2012-02-23 09:25:21 -08:00
Johann	6b151d436d	Clarify 'max_sad' usage Depending on implementation the optimized SAD functions may return early when the calculated SAD exceeds max_sad. Change-Id: I05ce5b2d34e6d45fb3ec2a450aa99c4f3343bf3a	2012-02-16 15:17:44 -08:00
Paul Wilkins	79d330d7d5	Code simplification Removal of the pickinter.c and .h files and calls to this code. Removal of some code relating to real time and one pass settings though there is more to be done in this regard. However, vp8_set_speed_features() now only supports modes 0 and 1 and speeds up to 3 so rd should always be set. Change-Id: I62c0c1b6154ab499785baef310536080e87bc4d8	2012-02-16 17:21:20 +00:00
Paul Wilkins	9a8204d6ee	Simplification of experimental code base. Removed ~CONFIG_REALTIME_ONLY code. Change-Id: I5fafff29a08acd8928699f9ddce8744787024d8c	2012-02-14 09:03:56 +00:00
Johann	169823428f	Missed some variance casts Change-Id: I9fb510f9421fb3c317a8e32e3058cee977ddf9fa	2012-02-10 11:07:33 -08:00
Johann	fea3556e20	Fix variance overflow In the variance calculations the difference is summed and later squared. When the sum exceeds sqrt(2^31) the value is treated as a negative when it is shifted which gives incorrect results. To fix this we cast the result of the multiplication as unsigned. The alternative fix is to shift sum down by 4 before multiplying. However that will reduce precision. For 16x16 blocks the maximum sum is 65280 and sqrt(2^31) is 46340 (and change). PPC change is untested. Change-Id: I1bad27ea0720067def6d71a6da5f789508cec265	2012-02-09 12:38:31 -08:00
Paul Wilkins	d90f0eb4c5	Removal of SEGFEATURES placeholder comments This commit only involves the removal of placeholder comments //#if CONFIG_SEGFEATURES. Change-Id: I94b350daaf998ee0cfdde5aa25b1d3b0522ab816	2012-02-09 17:25:05 +00:00
John Koleszar	8aae246089	RTCD: finalize removal of old RTCD system This is the final commit in the series converting to the new RTCD system. It removes the encoder csystemdependent files and the remaining global function pointers that didn't conform to the old RTCD system. Change-Id: I9649706f1bb89f0cbf431ab0e3e7552d37be4d8e	2012-01-30 12:10:48 -08:00
John Koleszar	109b69a706	RTCD: add arnr functions This commit continues the process of converting to the new RTCD system. It removes the last of the VP8_ENCODER_RTCD struct references. Change-Id: I2a44f52d7cccf5177e1ca98a028ead570d045395	2012-01-30 12:10:48 -08:00
John Koleszar	0b0bc8d098	RTCD: add motion search functions This commit continues the process of converting to the new RTCD system. Change-Id: Ia5828b7ecc80db55b21916704aa3d54cbb98f625	2012-01-30 12:10:47 -08:00
John Koleszar	be8af188d0	RTCD: add block subtraction functions This commit continues the process of converting to the new RTCD system. Change-Id: Id8a287fdd4bd050ea4452e1582ad85520f3081be	2012-01-30 12:10:47 -08:00
John Koleszar	61311e6103	RTCD: add quantizer functions This commit continues the process of converting to the new RTCD system. Change-Id: Iba9df4c03a508e51c37201c621be43523fae87d9	2012-01-30 12:10:46 -08:00
John Koleszar	510e0ab467	RTCD: add FDCT functions This commit continues the process of converting to the new RTCD system. Change-Id: I3f9c07db65eb206f6363d21bdb80e871570da767	2012-01-30 12:10:42 -08:00
John Koleszar	83a91e789c	RTCD: add variance functions This commit continues the process of converting to the new RTCD system. Change-Id: Ie5c1aa480637e98dc3918fb562ff45c37a66c538	2012-01-30 12:08:30 -08:00
Yunqing Wang	2b2c0c9bda	Improve SSSE3 fast quantizer function Simplified the EOB calculation in the function. Change-Id: I7422f18be40ae270358f5cb0811d66e64436b56f	2011-12-29 12:05:50 -05:00

154 Commits