generic-library/vpx

Author	SHA1	Message	Date
James Zern	2908091342	Merge "msvs-build: use msbuild for vs >= 2005"	2013-07-12 10:59:35 -07:00
Deb Mukherjee	94c481f9f1	Some minor cleanups for efficiency Implements some of the helper functions more efficiently with lookups rathers than branches. Modeling function is consolidated to reduce some computations. Also merged the two enums BLOCK_SIZE_TYPES and BlockSize into one because there is no need to keep them separate (even though the semantics are a little different). No bitstream or output change. About 0.5% speedup Change-Id: I7d71a66e8031ddb340744dc493f22976052b8f9f	2013-07-12 10:22:56 -07:00
Dmitry Kovalev	727631873d	Merge "Removing redundant code mostly from vp9_pred_common.{h, c}."	2013-07-12 10:22:30 -07:00
Paul Wilkins	b8ddc9f0d3	Merge "Speed 2 feature adjustment."	2013-07-12 02:14:01 -07:00
Jingning Han	119decdee7	Merge "Cosmetic changes in 16x16 ADST/DCT unit test"	2013-07-11 21:52:39 -07:00
Jingning Han	84c3ac0476	Merge "Remove unnecessary tx_type branch in encode_block"	2013-07-11 21:52:27 -07:00
Dmitry Kovalev	dd150e8ea9	Removing redundant code mostly from vp9_pred_common.{h, c}. Removing redundant function arguments and curly braces. Change-Id: I46e02561f33fe02e84a3b19756f03b9504bd6a1b	2013-07-11 18:39:10 -07:00
Ronald S. Bultje	ee09dd9949	Remove unused function block_error(). Change-Id: I78a79fc51c2d7cc3c261f35b569155397f3dc0c4	2013-07-11 17:14:03 -07:00
James Zern	30bac896f9	Merge "vp9: fix peek_si for version==0"	2013-07-11 15:51:39 -07:00
James Zern	5b11e38aa7	Merge "small update to peek_si/get_si documentation"	2013-07-11 15:47:11 -07:00
Dmitry Kovalev	cae3fb7267	Merge "Calling is_inter_mode() instead of custom code."	2013-07-11 15:20:14 -07:00
Jingning Han	dac5891a1a	Merge "SSE2 4x4 invserse ADST/DCT transform"	2013-07-11 14:17:23 -07:00
Dmitry Kovalev	8c05e59065	Calling is_inter_mode() instead of custom code. Change-Id: Iccd4ab95ea51a6d57ed43947f2fd7ad92e8979cf	2013-07-11 14:14:47 -07:00
Dmitry Kovalev	b55ecafda8	Merge "Making vp9_default_nmv_context static."	2013-07-11 13:58:34 -07:00
James Zern	43dc0f8886	small update to peek_si/get_si documentation correct a doxygen and function reference Change-Id: I525371d64969aa60c464d0f6a133bc29895d7991	2013-07-11 12:23:28 -07:00
James Zern	7645c9ab34	vp9: fix peek_si for version==0 Change-Id: I6bfec4fa50dfc1a953edb1a2aa8e97e6e896bed6	2013-07-11 12:22:39 -07:00
Dmitry Kovalev	c4ad3273c7	Moving segmentation related vars into separate struct. Adding segmentation struct to vp9_seg_common.h. Struct members are from macroblockd and VP9Common structs. Moving segmentation related constants and enums to vp9_seg_common.h. Change-Id: I23fabc33f11a359249f5f80d161daf569d02ec03	2013-07-11 11:57:57 -07:00
Dmitry Kovalev	f70c021d36	Merge "Adding write_compressed_header function."	2013-07-11 11:57:17 -07:00
Dmitry Kovalev	802e57535a	Merge "Removing unused TOKENEXTRA arg from pick_sb_modes function."	2013-07-11 11:46:06 -07:00
Jingning Han	29c45f31ee	Cosmetic changes in 16x16 ADST/DCT unit test Change-Id: Ic649e9e47d14d6f8cae0c443a425ea533a97ad8d	2013-07-11 11:37:38 -07:00
Johann	158c80cbb0	convolve8 optimizations for neon Independent horizontal and vertical implementations. Requires that blocks be built from 4x4 and [xy]_step_q4 == 16 6-10% improvement. CIF improved the least. Change-Id: I137f5ceae4440adc0960bf88e4453e55a618bcda	2013-07-11 11:08:19 -07:00
hkuang	c9b25dcae4	Add neon optimize vp9_dc_only_idct_add. Change-Id: Iae84ab945cc9662a0ddd839aa2b9ca59f2ae5423	2013-07-11 10:30:47 -07:00
Jingning Han	b9381b6faf	Remove unnecessary tx_type branch in encode_block The function encode_block is called only by inter-prediction modes, hence removing the transform type branching there. Change-Id: I34a3172e28ce2388835efd0f8781922211bff857	2013-07-11 09:11:35 -07:00
Jim Bankoski	5000cdf0ff	Merge "Wide loopfilter 16 pix at a time"	2013-07-11 06:44:02 -07:00
Paul Wilkins	5290eeab88	Speed 2 feature adjustment. With sf->auto_mv_step_size on it is questionable whether sf->reduce_first_step_size is worthwhile. At speed 2 it was not having a big impact. Even at speed 2 sf->optimize_coefficients = 0 is not having a big speed imapct so for now I have moved it down into a higher speed setting. Change-Id: I8a54de76d486ad37aabce76474889da2768b14c1	2013-07-11 13:59:12 +01:00
Jingning Han	49b6302044	SSE2 4x4 invserse ADST/DCT transform Enable SSE2 4x4 inverse ADST/DCT transform. The runtime goes from 292 cycles down to 89 cycles. Running bus_cif at 2000 kbps, the overall runtime of speed 0 goes from 301s to 295s (2% speed-up). Change-Id: I24098136e7fee7ab2fbf1c11755bdf2ca37f3628	2013-07-10 20:16:02 -07:00
Jingning Han	aedc7c59b1	Merge "Fix tx_type bug in intra4x4 rd loop"	2013-07-10 20:13:25 -07:00
Ronald S. Bultje	decead7336	Replace copy_memNxM functions with a generic copy/avg function. Change-Id: I3ce849452ed4f08527de9565a9914d5ee36170aa	2013-07-10 18:27:24 -07:00
Ronald S. Bultje	c13e0bcb52	Remove unused fwalsh/fdct x86 SIMD implementations. Change-Id: Ia942e56cf322821d42ba06178672791eeee2847e	2013-07-10 18:22:51 -07:00
Dmitry Kovalev	ac72ad071d	Making vp9_default_nmv_context static. Change-Id: Ia3d5bd45adf288de11ab59c4728266c93c17e275	2013-07-10 17:44:45 -07:00
Ronald S. Bultje	46997bde88	Merge "Remove unused iwalsh4x4 MMX/SSE2 functions."	2013-07-10 17:08:46 -07:00
Ronald S. Bultje	a7ef456453	Merge "Remove unused 16x3/3x16 sad SSE2 functions."	2013-07-10 17:08:43 -07:00
John Koleszar	64f7a4d8cb	Wide loopfilter 16 pix at a time Where possible, do the 16 pixel wide filter while doing the horizontal filtering pass. The same approach can be taken for the mbloop_filter when that's implemented. Doing so on the vertical pass is a little more involved, but possible. Change-Id: I010cb505e623464247ae8f67fa25a0cdac091320	2013-07-10 16:32:44 -07:00
James Zern	73c4e28487	msvs-build: use msbuild for vs >= 2005 allows concurrent builds via the /m command line option Change-Id: I668792ba00276e8626dc175c0a44ddab35fc7114	2013-07-10 16:18:13 -07:00
Dmitry Kovalev	544d8c3316	Removing unused TOKENEXTRA arg from pick_sb_modes function. Change-Id: I0543e72fa092eef3976b65e16bb597197c364873	2013-07-10 15:57:28 -07:00
Jingning Han	18803f9cc4	Fix tx_type bug in intra4x4 rd loop This commit fixed the mis-use of the tx_type for inverse transform in intra4x4 rate-distortion optimization loop. It improves the overall coding performance. Change-Id: I7fe9953175b74890357dbcee33c138573766e980	2013-07-10 15:49:49 -07:00
Deb Mukherjee	7494bba66b	Merge "Prunes out full-rd computation based on modeled rd"	2013-07-10 15:37:11 -07:00
Dmitry Kovalev	140447db5a	Merge "Adding read_compressed_header function."	2013-07-10 15:11:08 -07:00
Dmitry Kovalev	0ac5e4dd58	Adding write_compressed_header function. Change-Id: Ic5257fa8278e9b6297de230e4fd26a1e23ad2bb7	2013-07-10 15:08:34 -07:00
Jim Bankoski	68ef7a6b8a	configure with internal stats not working Change-Id: I5dea4570cb05df27a522abf6e7b695998654284a	2013-07-10 15:07:53 -07:00
Ronald S. Bultje	3f210f10eb	Remove unused iwalsh4x4 MMX/SSE2 functions. Change-Id: I2d22577911a37ed7d8c7e08cac20764842267652	2013-07-10 14:52:47 -07:00
Ronald S. Bultje	48c53233fd	Remove unused 16x3/3x16 sad SSE2 functions. Change-Id: I30a597c0cc366e34c9a3e2afe32d70e044f95ca4	2013-07-10 14:52:47 -07:00
Ronald S. Bultje	e6f955251f	Merge "SSSE3 assembly for 4x4/8x8/16x16/32x32 H intra prediction."	2013-07-10 14:52:23 -07:00
Ronald S. Bultje	6a60249071	Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 TM intra prediction."	2013-07-10 14:52:19 -07:00
Jim Bankoski	865ca76604	Merge "remove warnings when NDEBUG is set"	2013-07-10 14:39:39 -07:00
Jim Bankoski	6591cf2f7e	remove warnings when NDEBUG is set Change-Id: Ie0cb732fdcb98616a422c4463bff80642248d136	2013-07-10 14:27:20 -07:00
Deb Mukherjee	53ff43adc3	Prunes out full-rd computation based on modeled rd Adds a speed feature to eliminate full-rd computation if the modeled rd or rd based on a different parameter in the same mode is already a lot larger than the best rd yet. Specifically, only search the sharp and smooth filters if the modeled rd cost based on the regular filter is within a certain factor of the best rd cost so far. Also, skip full-rd computation of non splitmv inter modes if the modeled rd cost based on pred error is within the same factor of the best rd cost so far. Also adds some enhancements in the rd search for splitmv mode to speed things up by early breakouts. Negligible impact on performance. Resuts on derfraw300: psnr: -0.013% with the splitmv enhancements, -0.24% with the rd breakout feature on. speedup: 6% with splitmv enhancements, 20% with also residual breakout (tested on football sequence at 600 Kbps) Change-Id: I37abc308ea9f110c1679ce649b6a7e73ab1ad5fc	2013-07-10 13:49:49 -07:00
James Zern	82f5935111	Merge "msvc: set a more useful debug format"	2013-07-10 13:02:22 -07:00
James Zern	9a8524d5ba	Merge "test_libvpx: disable pthreads in gtest for win targets"	2013-07-10 13:01:52 -07:00
Jingning Han	114423538f	SSE2 16x16 ADST/DCT hybrid transform This commit enables 16x16 ADST/DCT forward hybrid transform using SSE2 operations. It reduces the runtime from 5433 cycles to 1621 cycles, at no compression performance loss. Change-Id: I75fd7f1984e9e28846af459f810ff0d6ae125230	2013-07-10 12:14:53 -07:00

1 2 3 4 5 ...

5569 Commits