James Zern 
							
						 
					 
					
						
						
							
						
						d5e07a8451 
					 
					
						
						
							
							variance_test: add NEON functions  
						
						... 
						
						
						
						note not all functions have NEON implementations:
- variance4x4_neon
Change-Id: I03c1ba21f3b02aa2482d7ca8feedc3ef74b5947f 
						
						
					 
					
						2014-02-26 19:25:02 -08:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						002ad40897 
					 
					
						
						
							
							test/: remove unnecessary extern "C"s  
						
						... 
						
						
						
						Change-Id: I826655a708010149de231ca31a2e3ba4f1842c0c 
						
						
					 
					
						2014-01-23 19:42:59 -08:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						a0fcbcfa5f 
					 
					
						
						
							
							fix vp8-only build  
						
						... 
						
						
						
						Change-Id: Id9ce44f3364dd57b30ea491d956a2a0d6186be05 
						
						
					 
					
						2013-09-17 18:47:25 -07:00 
						 
				 
			
				
					
						
							
							
								Yaowu Xu 
							
						 
					 
					
						
						
							
						
						afffa3d9b0 
					 
					
						
						
							
							cleanup cpplint warnings  
						
						... 
						
						
						
						Suggested by James Zern to clear out cpplint warnings for all unit
test code.
Change-Id: I731a3fa4d2a257eb9ef733426ba84286fbd7ea34 
						
						
					 
					
						2013-09-06 10:13:49 -07:00 
						 
				 
			
				
					
						
							
							
								Jim Bankoski 
							
						 
					 
					
						
						
							
						
						5b307886fb 
					 
					
						
						
							
							variance x86inc guards  
						
						... 
						
						
						
						also fixed bug in sad calcs
Change-Id: I6571fcbe37556c16ae32be66dc0fd879852aac1d 
						
						
					 
					
						2013-08-06 14:17:13 -07:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						e247ab09a6 
					 
					
						
						
							
							variance_test: add missing ClearSystemState...  
						
						... 
						
						
						
						...to recently added SubpelVarianceTest
Change-Id: I8775e39fd5dbfba81ad42b79b47bf6dd6ca8cc0e 
						
						
					 
					
						2013-06-26 18:32:21 -07:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
							
						
						ac6ea2ab91 
					 
					
						
						
							
							Allocate memory using appropriate expected alignment in unit tests.  
						
						... 
						
						
						
						Fixes crashes of test_libvpx on 32-bit Linux.
Change-Id: If94e7628a86b788ca26c004861dee2f162e47ed6 
						
						
					 
					
						2013-06-21 17:03:57 -07:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						cc774c8bb0 
					 
					
						
						
							
							variance_test: use REGISTER_STATE_CHECK  
						
						... 
						
						
						
						Change-Id: Id54ad9a781634f075e990d5bade5be8490959975 
						
						
					 
					
						2013-06-21 14:30:08 -07:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
							
						
						1e6a32f1af 
					 
					
						
						
							
							SSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance().  
						
						... 
						
						
						
						Encoding of bus @ 1500kbps (first 50 frames) goes from 3min57 to
3min35, i.e. approximately a 10.5% speedup. Note that the SIMD versions
which use a bilinear filter (x_offset & 7 || y_offset & 7) aren't
perfectly interleaved, and can probably be improved further in the
future. I've marked this with a few TODOs/FIXMEs in the code.
Change-Id: I5c9e900c0f0d32e431a50fecae213b510b2549f9 
						
						
					 
					
						2013-06-20 15:59:48 -07:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
							
						
						8fb6c58191 
					 
					
						
						
							
							Implement sse2 and ssse3 versions for all sub_pixel_variance sizes.  
						
						... 
						
						
						
						Overall speedup around 5% (bus @ 1500kbps first 50 frames 4min10 ->
3min58). Specific changes to timings for each function compared to
original assembly-optimized versions (or just new version timings if
no previous assembly-optimized version was available):
sse2   4x4:    99 ->   82 cycles
sse2   4x8:           128 cycles
sse2   8x4:           121 cycles
sse2   8x8:   149 ->  129 cycles
sse2   8x16:  235 ->  245 cycles (?)
sse2  16x8:   269 ->  203 cycles
sse2  16x16:  441 ->  349 cycles
sse2  16x32:          641 cycles
sse2  32x16:          643 cycles
sse2  32x32: 1733 -> 1154 cycles
sse2  32x64:         2247 cycles
sse2  64x32:         2323 cycles
sse2  64x64: 6984 -> 4442 cycles
ssse3  4x4:           100 cycles (?)
ssse3  4x8:           103 cycles
ssse3  8x4:            71 cycles
ssse3  8x8:           147 cycles
ssse3  8x16:          158 cycles
ssse3 16x8:   188 ->  162 cycles
ssse3 16x16:  316 ->  273 cycles
ssse3 16x32:          535 cycles
ssse3 32x16:          564 cycles
ssse3 32x32:          973 cycles
ssse3 32x64:         1930 cycles
ssse3 64x32:         1922 cycles
ssse3 64x64:         3760 cycles
Change-Id: I81ff6fe51daf35a40d19785167004664d7e0c59d 
						
						
					 
					
						2013-06-20 09:34:25 -07:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						5b756748fd 
					 
					
						
						
							
							tests: clear system state after non-API calls  
						
						... 
						
						
						
						add ClearSystemState() to reset MMX registers avoiding corrupting
subsequent tests.
Change-Id: I668deb09aa7aa467709776e5819f936910698bc0 
						
						
					 
					
						2013-06-18 11:32:27 -07:00 
						 
				 
			
				
					
						
							
							
								Yunqing Wang 
							
						 
					 
					
						
						
							
						
						f4fcfe3075 
					 
					
						
						
							
							Optimize variance functions  
						
						... 
						
						
						
						Added SSE2 version of variance functions for super blocks.
Change-Id: Ibeaae8771ca21c99d41dd74067574a51e97b412d 
						
						
					 
					
						2013-05-22 10:29:38 -07:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						1711cf2dbb 
					 
					
						
						
							
							add vp8 variance test  
						
						... 
						
						
						
						Change-Id: I4e94ee2c4e2360d6a11a454c323f2899c1bb6f72 
						
						
					 
					
						2013-02-22 16:25:14 -08:00 
						 
				 
			
				
					
						
							
							
								John Koleszar 
							
						 
					 
					
						
						
							
						
						fcccbcbb39 
					 
					
						
						
							
							Add vp9_ prefix to all vp9 files  
						
						... 
						
						
						
						Support for gyp which doesn't support multiple objects in the same
static library having the same basename.
Change-Id: Ib947eefbaf68f8b177a796d23f875ccdfa6bc9dc 
						
						
					 
					
						2012-11-27 14:12:30 -08:00 
						 
				 
			
				
					
						
							
							
								John Koleszar 
							
						 
					 
					
						
						
							
						
						a9c7597adc 
					 
					
						
						
							
							support building vp8 and vp9 into a single lib  
						
						... 
						
						
						
						Change-Id: Ib8f8a66c9fd31e508cdc9caa662192f38433aa3d 
						
						
					 
					
						2012-11-15 10:46:17 -08:00 
						 
				 
			
				
					
						
							
							
								James Zern 
							
						 
					 
					
						
						
							
						
						984734436d 
					 
					
						
						
							
							Fix variance (signed integer) overflow  
						
						... 
						
						
						
						In the variance calculations the difference is summed and later squared.
When the sum exceeds sqrt(2^31) the value is treated as a negative when
it is shifted which gives incorrect results.
To fix this we force the multiplication to be unsigned.
The alternative fix is to shift sum down by 4 before multiplying.
However that will reduce precision.
For 16x16 blocks the maximum sum is 65280 and sqrt(2^31) is 46340 (and
change).
This change is based on:
1698234fea3556 
						
						
					 
					
						2012-11-06 23:06:44 -08:00