generic-library/vpx

Author	SHA1	Message	Date
Scott LaVarnway	15ce6bd62e	Removed the loopfilter rtcd invoke macro code Change-Id: I446b2ffcbe732ffb112dbd97a4799272d4c01a84	2012-10-16 16:19:35 -07:00
Jim Bankoski	7c15c18c5e	removed the recon rtcd invoke macro code (unrevert) This reinstates reverted commit 2113a831575d81faeadd9966e256d58b6b2b1633 Change-Id: I9a9af13497d1e58d4f467e3e083fddf06b1b786c	2012-10-16 12:02:31 -07:00
Jim Bankoski	f9d5f86643	Revert "removed the recon. rtcd invoke macro code" This reverts commit 2113a831575d81faeadd9966e256d58b6b2b1633	2012-10-13 20:29:04 -07:00
Jim Bankoski	2113a83157	removed the recon. rtcd invoke macro code Code clean up - removed rtcd Change-Id: Id963ecf53c370b1d99484ef18d6befeed7e0c748	2012-10-13 18:49:44 -07:00
Jim Bankoski	89f060e88a	convert copy16x16 to rtcd Convert copy16x16 from invoke to rtcd. The first in a long string of converts. Change-Id: I296b0aa32f40e9fb649f7a3cb914a4e5300cad63	2012-10-09 17:09:08 -07:00
Christian Duvivier	63ef9c40a4	SSE2 version of vectorized 8-tap filtering. About 20% overall encoder speedup (vs. about 30% for sse4 version). Change-Id: Ibf608a6a1bc94b14ec47e8046d3206b275b5a8bd	2012-08-21 15:26:14 -07:00
Christian Duvivier	525b183910	A few more optimizations, about 1% overall speedup. Unroll horizontal pass, no more intermediate buffer, faster special transpose. Change-Id: I05df75be4e5f01420066cdf3c61a2edf35bedb64	2012-08-16 15:03:29 -07:00
Christian Duvivier	9471bc2e9e	Merge "First partial snapshot of vectorized 8-tap filtering." into experimental	2012-08-15 18:01:18 -07:00
Christian Duvivier	5a34e0eb89	First partial snapshot of vectorized 8-tap filtering. About 3.5x faster, 30% overall encoder speedup. Rest of optimizations will come soon (see TODO section in filter_sse4.c). Change-Id: If18108048bfd5345fc942e8574e4c7f58e0e86e0	2012-08-15 17:55:06 -07:00
Paul Wilkins	77dc5c65f2	Code clean up. Further cases of inconsistent naming convention. Change-Id: Id3411ecec6f01a4c889268a00f0c9fd5a92ea143	2012-08-15 11:00:53 +01:00
Deb Mukherjee	7d0656537b	Merging in the sixteenth subpel uv experiment Merges this experiment in to make it easier to run tests on filter precision, vectorized implementation etc. Also removes an experimental filter. Change-Id: I1e8706bb6d4fc469815123939e9c6e0b5ae945cd	2012-08-08 16:57:43 -07:00
Deb Mukherjee	0ebf548c75	Merging and bug-fix in enhanced_interp experiment Merged the enhanced_interp experiment. Found and fixed a bug in the include files framework, whereby certain encoder files were still using the old INTERP_EXTEND value of 3 instead of 4. The thresholds for mv range mcomp.c need a small adjustment to prevent crashes. The results are more or less unchanged. Change-Id: Iac5008390f1efc97ce1102fbb5f8989c847fb579	2012-07-31 11:45:31 -07:00
Deb Mukherjee	9984a155d6	Merges several experiments The following five experiments are merged: newentropy newupdate adaptive_entropy (also includes a couple of parameter changes that improve results a little in common/entropymode.c and encoder/modecosts.c that were not merged from the internal branch) newintramodes expanded_coef_context Change-Id: I8a142a831786ee9dc936f22be1d42a8bced7d270	2012-07-27 12:12:39 -07:00
John Koleszar	c6b9039fd9	Restyle code Approximate the Google style guide[1] so that there's a written document to follow and tools to check compliance[2]. [1]: http://google-styleguide.googlecode.com/svn/trunk/cppguide.xml [2]: http://google-styleguide.googlecode.com/svn/trunk/cpplint/cpplint.py Change-Id: Idf40e3d8dddcc72150f6af127b13e5dab838685f	2012-07-17 11:46:03 -07:00
Ronald S. Bultje	9c9d6743d4	Sign-extend input argument so it can be used in pointer arithmetic. Change-Id: I6cbd4de96f9dcc783cef170bfd7652f6cbee36a2	2012-06-25 14:16:39 -07:00
Daniel Kang	31fd84d80b	x86inc: Move x86inc to the correct location. Change-Id: I6802731a4d15feef5ce62993dc505ded55c40f7e	2012-06-18 13:36:41 -07:00
Daniel Kang	7a00071576	Adds x86inc.asm and update idct/dequant mmx Updates idct/dequant mmx assembly to work with vpnext instead of vp8. Also adds x86inc.asm Change-Id: I6e147d5e89177ae449271e97e50d082eb11b078e	2012-06-12 15:04:03 -07:00
Deb Mukherjee	c5ddb7f016	Adds new Directional Intra prediction modes. Adds 6 directional intra prediction modes for 16x16 and 8x8 blocks. Change-Id: I25eccc0836f28d8d74922e4e9231568a648b47d1	2012-05-15 08:54:50 -07:00
Deb Mukherjee	18e90d744e	Supporting high precision 1/8-pel motion vectors This is the initial patch for supporting 1/8th pel motion. Currently if we configure with enable-high-precision-mv, all motion vectors would default to 1/8 pel. Encode and decode syncs fine with the current code. In the next phase the code will be refactored so that we can choose the 1/8 pel mode adaptively at a frame/segment/mb level. Derf results: http://www.corp.google.com/~debargha/vp8_results/enhinterp_hpmv.html (about 0.83% better than 8-tap interpolation) Patch 3: Rebased. Also adding 1/16th pel interpolation for U and V Patch 4: HD results. http://www.corp.google.com/~debargha/vp8_results/enhinterp_hd_hpmv.html Seems impressive (unless I am doing something wrong). Patch 5: Added mmx/sse for bilateral filtering, as well as enforced use of c-versions of subpel filters with 8-taps and 1/16th pel; Also redesigned the 8-tap filters to reduce the cut-off in order to introduce a denoising effect. There is a new configure option sixteenth-subpel-uv which will use 1/16th pel interpolation for uv, if the motion vectors have 1/8 pel accuracy. With the fixes the results are promising on the derf set. The enhanced interpolation option with 8-taps alone gives 3% improvement over the derf set: http://www.corp.google.com/~debargha/vp8_results/enhinterpn.html Results on high precision mv and on the hd set are to follow. Patch 6: Adding a missing condition for CONFIG_SIXTEENTH_SUBPEL_UV in vp8/common/x86/x86_systemdependent.c Patch 7: Cleaning up various debug messages. Patch 8: Merge conflict Change-Id: I5b1d844457aefd7414a9e4e0e06c6ed38fd8cc04	2012-02-23 09:25:21 -08:00
John Koleszar	180b0306cc	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/common/defaultcoefcounts.h vp8/common/entropy.c vp8/encoder/bitstream.c Change-Id: Idd4990c80d5b5494ac036254694015fab449bc08	2011-08-25 08:36:19 -04:00
Fritz Koenig	112bd4e2b4	Fix naming of sse2 idct functions. Prepend idct function names with vp8_ so that under profiling they show up associated with libvpx. Change-Id: I4fe357b50236cb7730a4cc00164c0a3487a1d8b4	2011-08-24 10:25:32 -07:00
John Koleszar	67864c5f97	Merge remote branch 'internal/upstream' into HEAD	2011-08-24 00:05:05 -04:00
Johann	85358d04cd	Fix data accesses for simple loopfilters The data that the simple horizontal loopfilter reads is aligned, treat it accordingly. For the vertical, we only use the bottom 4 bytes, so don't read in 16 (and incur the penalty for unaligned access). This shows a small improvement on older processors which have a significant penalty for unaligned reads. postproc_mmx.c is unused Change-Id: I87b29bbc0c3b19ee1ca1de3c4f47332a53087b3d	2011-08-23 20:42:45 -04:00
Fritz Koenig	c5f890af2c	Use local labels for jumps/loops in x86 assembly. Prepend . to local labels in assembly code. This allows non unique labels within a file. Also makes profiling information more informative by keeping the function name with the loop name. Change-Id: I7a983cb3a5ba2413d5dafd0a37936b268fb9e37f	2011-08-23 09:05:29 -07:00
John Koleszar	6901105e99	Merge remote branch 'internal/upstream' into HEAD	2011-07-14 00:05:04 -04:00
Johann	01433c5043	update x86 asm for loopfilter Change-Id: I1ed739522db7c00c189851c7095c1b64ef6412ce	2011-07-08 09:23:38 -04:00
John Koleszar	5380a2215e	Merge remote branch 'internal/upstream' into HEAD	2011-07-02 00:05:10 -04:00
Ronald S. Bultje	c8a23ad3f4	Properly use GET_GOT/RESTORE_GOT when using GLOBAL(). This should fix binaries using PIC on x86-32. Also should fix issue 343. Change-Id: I591de3ad68c8a8bb16054bd8f987a75b4e2bad02	2011-06-30 14:04:27 -07:00
John Koleszar	27331e1377	Merge remote branch 'internal/upstream' into HEAD	2011-05-20 00:05:16 -04:00
Scott LaVarnway	914f7c36d7	Merge "Make hor UV predict ~2x faster (73 vs 132 cycles) using SSSE3."	2011-05-19 11:22:01 -07:00
John Koleszar	65b1648f35	Merge remote branch 'internal/upstream' into HEAD	2011-05-11 00:05:07 -04:00
John Koleszar	6edd07d656	Merge remote branch 'internal/upstream-experimental' into HEAD	2011-05-11 00:05:07 -04:00
Johann	df2023a6cb	set up Global Offset Table in recon global values were being referenced, but the GOT was not being set up. as the GOT is only required for PIC, this issue wasn't caught in the default configuration. Change-Id: I8006e53776139362a76f2c80cf9d0f8458602b2f http://code.google.com/p/webm/issues/detail?id=328	2011-05-10 15:58:56 -04:00
Johann	a7d4d3c550	clean up unused variable warnings Change-Id: I9467d7a50eac32d8e8f3a2f26db818e47c93c94b	2011-05-09 12:56:20 -04:00
John Koleszar	e2990fcc48	Merge remote branch 'internal/upstream' into HEAD	2011-05-03 00:05:05 -04:00
Thijs Vermeir	8942f70cdf	Fix documentation typos Change-Id: I97124670926433bf1593c91660d8b8f8482ea9ce	2011-04-30 09:34:59 +02:00
Ronald S. Bultje	5a23352c03	Make hor UV predict ~2x faster (73 vs 132 cycles) using SSSE3. Change-Id: I658a1df7d825f820573cb2d11ad402f9d2791035	2011-04-29 11:52:09 -07:00
John Koleszar	57afffbcbb	Merge remote branch 'internal/upstream' into HEAD	2011-04-29 00:05:07 -04:00
James Berry	f10732554b	bug fix removed inline from recon_wrapper_sse2.c removed inline from recon_wrapper_sse2.c to build for Visual Studio Change-Id: I74a3482950448e2cdb30e9cd7087145b440d8a22	2011-04-28 15:12:00 -04:00
John Koleszar	e1b90ce862	Merge remote branch 'internal/upstream' into HEAD	2011-04-28 00:05:07 -04:00
Ronald S. Bultje	1e7ded69cf	Use psadbw to get the sum of bytes in a line. Thanks Jason for pointing that out on #vp8. ;-). Change-Id: I5330a753e752a8704b78a409597472628e0b26a5	2011-04-27 13:49:21 -07:00
Ronald S. Bultje	1083fe4999	SSE2/SSSE3 optimizations for build_predictors_mbuv{,_s}(). decoding before 10.425 10.432 10.423 =10.426 after: 10.405 10.416 10.398 =10.406, 0.2% faster encoding before 14.252 14.331 14.250 14.223 14.241 14.220 14.221 =14.248 after 14.095 14.090 14.085 14.095 14.064 14.081 14.089 =14.086, 1.1% faster Change-Id: I483d3d8f0deda8ad434cea76e16028380722aee2	2011-04-27 11:31:27 -07:00
John Koleszar	bbc24a65c4	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/common/alloccommon.c vp8/encoder/rdopt.c Change-Id: Ic34b33577423031e277235ffa6bcaff7b252e5cb	2011-04-26 08:27:39 -04:00
Johann	01527e743f	remove simpler_lpf the decision to run the regular or simple loopfilter is made outside the function and managed with pointers stop tracking the option in two places. use filter_type exclusively Change-Id: I39d7b5d1352885efc632c0a94aaf56b72cc2fe15	2011-04-25 17:37:41 -04:00
John Koleszar	308e31a3ef	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/decoder/onyxd_int.h Change-Id: Icf445b589c2bc61d93d8c977379bbd84387d0488	2011-04-25 09:13:41 -04:00
Johann	4a2b684ef4	modify SAVE_XMM for potential 64bit use the win64 abi requires saving and restoring xmm6:xmm15. currently SAVE_XMM and RESTORE_XMM only allow for saving xmm6:xmm7. allow specifying the highest register used and if the stack is unaligned. Change-Id: Ica5699622ffe3346d3a486f48eef0206c51cf867	2011-04-19 10:42:45 -04:00
Johann	c7cfde42a9	Add save/restore xmm registers in x86 assembly code Went through the code and fixed it. Verified on Windows. Where possible, remove dependencies on xmm[67] Current code relies on pushing rbp to the stack to get 16 byte alignment. This broke when rbp wasn't pushed (vp8/encoder/x86/sad_sse3.asm). Work around this by using unaligned memory accesses. Revisit this and the offsets in vp8/encoder/x86/sad_sse3.asm in another change to SAVE_XMM. Change-Id: I5f940994d3ebfd977c3d68446cef20fd78b07877	2011-04-18 16:30:38 -04:00
John Koleszar	9d75a502c4	Merge remote branch 'internal/upstream' into HEAD	2011-04-16 00:05:07 -04:00
Johann	487c0299c9	remove dead code, add missing RESTORE_XMM vp8_filter_block1d16_h4_ssse3 was never called because UNSHADOW_ARGS moves the stack by 'mov rsp, rbp', the issue was masked. however, if/when win64 used those registers for persistent data, issues could/will arise. Change-Id: I56d6effca0aeba1f86082689771cb10145d39651	2011-04-15 10:11:53 -04:00
John Koleszar	f809f4f93c	Merge remote branch 'internal/upstream' into HEAD	2011-04-12 00:05:08 -04:00

86 Commits