generic-library/vpx

Author	SHA1	Message	Date
James Zern	3197172405	Merge "Update vpx subpixel 1d filter ssse3 asm"	2016-07-02 03:08:17 +00:00
Johann	1b833d63d9	vpx_dsp: remove x86inc.asm distinction BUG=b:29583530 Change-Id: I397d77536b0d3cee0a92cdfe8b76bc4e434d0720	2016-06-29 18:55:58 -07:00
James Zern	3a6a81fc9a	Merge changes I9433d858,Iafd05637,If08ce6ca * changes: tests: remove redundant round() definition remove visual studio < 2010 workarounds configure: remove old visual studio support (<2010)	2016-06-29 23:07:16 +00:00
Linfeng Zhang	6b350766bd	Update vpx subpixel 1d filter ssse3 asm Speed test shows the new vertical filters have degradation on Celeron Chromebook. Added "X86_SUBPIX_VFILTER_PREFER_SLOW_CELERON" to control the vertical filters activated code. Now just simply active the code without degradation on Celeron. Later there should be 2 set of vertical filters ssse3 functions, and let jump table to choose based on CPU type. Change-Id: Iba2f1f2fe059a9d142c396d03a6b8d2d3b981e87	2016-06-29 13:48:41 -07:00
Yaowu Xu	63a37d16f3	Prevent negative variance Due to rounding, hbd variance may become negative. This commit put in check and clamp of negative values to 0. Change-Id: I610d9c8aa2d4eebe7bc5f2c5624a9e3cadad4c94	2016-06-29 11:08:17 -07:00
James Zern	c125f4a594	remove visual studio < 2010 workarounds BUG=b/29583530 Change-Id: Iafd05637eb65f4da54a9c857e79204a77646858a	2016-06-28 20:58:49 -07:00
James Zern	f51f67602e	*.asm: normalize label format add a trailing ':', though it's optional with the tools we support, it's more common to use it to mark a label. this also quiets the orphan-labels warning with nasm/yasm. BUG=b/29583530 Change-Id: I46e95255e12026dd542d9838e2dd3fbddf7b56e2	2016-06-27 19:46:57 -07:00
James Zern	cfd5e0221c	Revert "Update vpx subpixel 1d filter ssse3 asm" This reverts commit 1517fb74fd40eaab67246e8fb81d5c321bb33b06. Fixes a segfault in windows x64 builds. Change-Id: I6a6959cd7e64a28376849a9f2b11fc852a7c1fbe	2016-06-25 11:37:20 -07:00
Linfeng Zhang	bdeb5febe4	Merge "Update vpx subpixel 1d filter ssse3 asm"	2016-06-23 19:08:04 +00:00
Alex Converse	83db21b2fd	vpx_lpf_horizontal_4_sse2: Remove dead load. Change-Id: I51026c52baa1f0881fcd5b68e1fdf08a2dc0916e	2016-06-22 18:17:41 -07:00
Linfeng Zhang	1517fb74fd	Update vpx subpixel 1d filter ssse3 asm Speed test shows the new vertical filters have degradation on Celeron Chromebook. Added "X86_SUBPIX_VFILTER_PREFER_SLOW_CELERON" to control the vertical filters activated code. Now just simply active the code without degradation on Celeron. Later there should be 2 set of vertical filters ssse3 functions, and let jump table to choose based on CPU type. Change-Id: I37e3e9c5694737d9134a6bce6698d3e43f8fc962	2016-06-22 13:15:00 -07:00
Yaowu Xu	543ea3eb3e	Make type conversion explicit This fixes MSVC warnings. Change-Id: I675d8486230b2b74d7973d95720a4995c4750282	2016-06-20 12:05:29 -07:00
James Zern	e34e684059	Merge changes If31d36c8,I10b947e7 * changes: vpx_dsp,add_noise: remove mmx implementation vpx_dsp: remove mmx variance implementations	2016-06-04 00:56:06 +00:00
Linfeng Zhang	b90166665f	Merge "Slow pshufb removal in 3 intra prediction functions."	2016-06-03 16:35:14 +00:00
James Zern	462e0ff88b	vpx_dsp,add_noise: remove mmx implementation a sse2 version exists, this is a reasonable modern baseline. Change-Id: If31d36c8412d25b53f41b4a93cf02f46802c0c33	2016-06-02 23:51:22 -07:00
James Zern	eea8ea88ab	vpx_dsp: remove mmx variance implementations there are sse2 equivalents for all remaining variance implementations Change-Id: I10b947e73fc0067688181f819b59e47966bec3d2	2016-06-02 23:46:16 -07:00
Linfeng Zhang	ad0646cb84	Slow pshufb removal in 3 intra prediction functions. Replaced vpx_d45_predictor_4x4_ssse3(), vpx_d45_predictor_8x8_ssse3() and vpx_d207_predictor_4x4_ssse3() with created vpx_d45_predictor_4x4_sse2(), vpx_d45_predictor_8x8_sse2() and vpx_d207_predictor_4x4_sse2() respectively. It's mostly neutral or slightly worse than ssse3 in good cases and better than ssse3 in the bad cases (but still worse than using the mmx regs). Change-Id: Ib0237ceb71d2c57b8a93fd3170330cfed9d56bdd	2016-06-02 10:55:58 -07:00
Yaowu Xu	46ff1072b3	variance_avx2.c: UBSAN/IOC fix BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1222 Change-Id: Ifb3bedf9b4e1b007b21aebaa4beb9ba50424efef	2016-05-31 16:44:35 -07:00
Linfeng Zhang	0ba9b299e9	Merge "Upgrade vpx_lpf_{vertical,horizontal}_4 mmx to sse2"	2016-05-27 15:47:28 +00:00
Linfeng Zhang	4b5e462d08	Upgrade vpx_lpf_{vertical,horizontal}_4 mmx to sse2 Followed the code style of other lpf fuctions. These 2 functions put 2 rows of data in a single xmm register, so they have similar but not identical filter operations, and cannot share the same macros. Change-Id: I3bab55a5d1a1232926ac8fd1f03251acc38302bc	2016-05-26 14:55:18 -07:00
Scott LaVarnway	9d24fe60f1	Merge "Code clean of sub_pixel_variance4xh -- 2"	2016-05-26 13:20:24 +00:00
Scott LaVarnway	a4f3751be5	Code clean of sub_pixel_variance4xh -- 2 Replace MMX with SSE2. Change-Id: Id8482d2589131f9427e7f36bc64413f058caf31f	2016-05-24 04:44:05 -07:00
James Zern	3fb55d24e8	Revert "Code clean of sub_pixel_variance4xh" This reverts commit 2468163e0770108f5216b65445ce05a8241bca21. causes valgrind errors for overread of buffer in SubpelVarianceTest Change-Id: I448e52c76f815ac199305b71f7d169f2bc167679	2016-05-19 23:37:27 -07:00
Yaowu Xu	d1f0f4cc63	Merge "Clarify integer value ranges"	2016-05-18 23:55:05 +00:00
Yaowu Xu	a564b18d7f	Clarify integer value ranges This commit clarifies integer value range for vairables used in several variance functions, also change to use proper type conversion to reflect the value ranges. Change-Id: Ic3234b83a912ce1ad12d1b254f3378763e15cc5c	2016-05-18 10:25:12 -07:00
Scott LaVarnway	2468163e07	Code clean of sub_pixel_variance4xh Replace MMX with SSE2. Change-Id: Ia8fcba755952804e347d7d7736f57d1f90c988a0	2016-05-18 04:24:41 -07:00
Yaowu Xu	c1e4f5a80d	Merge "Change to use correct check for halfpel"	2016-05-13 01:27:47 +00:00
Linfeng Zhang	2f55beb355	Merge "remove mmx variance functions"	2016-05-11 22:21:23 +00:00
Yaowu Xu	17fae3ad0a	Change to use correct check for halfpel In motion estimation stage for subpel motion, subpel variance is computed use bilinear interpolation. The motion vector precision used is at 1/8 pel and three bits are used to represent the x and y subpel offsets. Based on this, the half pel check should be against 4, not 8. Change-Id: I1f56fa1fa3f2f5e19a20d27983efe628557f170e	2016-05-11 13:52:59 -07:00
Linfeng Zhang	d0ffae825d	remove mmx variance functions there are sse2 equivalents which is a reasonable modern baseline Removed mmx variance functions: vpx_get_mb_ss_mmx() vpx_get8x8var_mmx() vpx_get4x4var_mmx() vpx_variance4x4_mmx() vpx_variance8x8_mmx() vpx_mse16x16_mmx() vpx_variance16x16_mmx() vpx_variance16x8_mmx() vpx_variance8x16_mmx() Change-Id: Iffaf85344c6676a3dd337c0645a2dd5deb2f86a1	2016-05-11 12:39:42 -07:00
Linfeng Zhang	d0e687bf8c	remove mmx sad functions there are sse2 equivalents which is a reasonable modern baseline Change-Id: Ibbe536a5ad1c2cccef6bdcc75c13b3dde35a56ba	2016-05-11 10:50:04 -07:00
Jim Bankoski	da33728f48	vpx_dsp: Rename postproc.c add_noise. Change-Id: I4906d1b79a2951e659995202b9fa97e2ea5cfba0	2016-05-10 06:52:58 -07:00
Scott LaVarnway	c2c5297595	Merge "VPX: refactor vpx_idct16x16_1_add_sse2()"	2016-05-09 22:15:17 +00:00
Scott LaVarnway	1490342be5	VPX: refactor vpx_idct16x16_1_add_sse2() Change-Id: I431ea0d9abe764d110a1ba32a8cb15e2fdac8805	2016-05-09 09:50:00 -07:00
Johann	b23bd2360f	The subfunctions are only defined for sse2 See highbd_subpel_variance_impl_sse2.asm Change-Id: Id13b97f4f6d189ed71cdc6d52b3c4ea63dc1da05	2016-05-06 18:58:49 -07:00
Johann	a761197fbd	Unlike non-hbd variance, opt2 is never used Change-Id: I1d342725df332c4efc6006d9e3dcb7372c41f448	2016-05-06 18:38:04 -07:00
James Zern	2184692c07	vpx_dsp/*.[hc]: add missing vpx_dsp_rtcd.h include Change-Id: I103be7eee36492f8619144ce8325bc916d4975c7	2016-05-04 15:06:44 -07:00
James Bankoski	89f905e5e5	Merge "libvpx: add a unit test for plane_add_noise."	2016-05-04 13:09:05 +00:00
Jim Bankoski	34d5aff747	libvpx: add a unit test for plane_add_noise. In so doing this fixes a couple of bugs: vpx_plane_add_noise.c needed to subtract a clamp instead of add. And the assembly (mmx sse) had assumptions that parameters were continuous in memory which was not true. Change-Id: I76f2c43cf54bfc838eb2edf8a443eaaa7565d7b5	2016-05-03 16:23:06 -07:00
James Bankoski	e755a283dd	Merge "Move vpx_add_plane from codec to vpx_dsp and dedup."	2016-05-03 14:11:57 +00:00
Jim Bankoski	fce3cee8dd	Move vpx_add_plane from codec to vpx_dsp and dedup. Change-Id: I12218d8331c0558c0587a66321e3ca46da7e5cc7	2016-05-02 12:17:39 -07:00
Alex Converse	a68b24fdee	Tweak casts on vpx_sub_pixel_variance to avoid implicit overflow. Change-Id: I481eb271b082fa3497b0283f37d9b4d1f6de270c	2016-04-27 16:37:18 -07:00
Alex Converse	6c4007be1c	Be explicit about overflow in vpx_variance16x16_sse2. The product always fits in uint32_t, but the operands don't. An optimizing compiler should generate the wraparound code. (Verified with clang). Change-Id: I25eb64df99152992bc898b8ccbb01d55c8d16e3c	2016-04-27 15:22:17 -07:00
Alex Converse	ccb894ce73	Remove casts on < 16x16 variance. These blocks will never overflow since max sum is +/-255wh. Change-Id: Ia2c630339fd9cfb411b56b6040ff402095f12a2e	2016-04-27 15:21:58 -07:00
James Zern	38bc1d0f4b	vpx_fdct16x16_1_sse2: improve load pattern load the full row rather than doing 2 8-wide columns Change-Id: I7a1c0cba06b0dc1ae86046410922b1efccb95c95	2016-04-04 16:03:42 -07:00
James Zern	3735def667	vpx_fdctNxN_1_sse2: reduce store size only output[0] needs to be set, store_output is more involved than a movdqa in the high bitdepth case Change-Id: I2cbd85d7cf74688bdf47eb767934fe42e02bff67	2016-04-04 16:02:06 -07:00
Scott LaVarnway	67c4c8244a	VPX: loopfilter_mmx.asm using x86inc 2 This reverts commit 9aa083d164e0d39086aa0c83f0d1a0d0f0d1ba61. Fixes a decoder mismatch with 32bit PIC builds. Change-Id: I94717df662834810302fe3594b38c53084a4e284	2016-03-08 04:24:47 -08:00
James Zern	9aa083d164	Revert "VPX: loopfilter_mmx.asm using x86inc" This reverts commit 15ecdc3970462c15fdf7185d373cb52664f40c0f. breaks 32-bit pic builds Change-Id: I8bb1b9471a293f05ac7423aaba0339d408931b7a	2016-03-04 18:23:45 -08:00
Scott LaVarnway	dd6729f826	VPX: Remove pmin/pmax from subpixel functions. These instructions are unnecessary if the adds are done in the correct order. Change-Id: I4e533b8267c32e610a4b94203ad052dc9fdabd71	2016-02-27 05:47:56 -08:00
Scott LaVarnway	51beb29f52	Merge "VPX: vpx_filter_block1d16_(v8, v8_avg)"	2016-02-27 13:31:18 +00:00
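
Two of the commit messages above describe small pieces of arithmetic: 63a37d16f3 clamps a variance that rounding has pushed below zero, and 17fae3ad0a notes that with 1/8-pel motion vectors stored in three bits, the half-pel offset is 4, not 8. The sketch below illustrates both ideas in C. The helper names `block_variance` and `is_half_pel` are hypothetical and do not appear in libvpx, and the truncating division is a simplifying assumption rather than the rounding the high-bitdepth code uses.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper (not a libvpx function): variance of a w x h block
 * computed from the sum of differences and the sum of squared differences.
 * The arithmetic is done in 64 bits so sum * sum cannot overflow, and a
 * result that lands below zero is clamped to 0. */
static uint32_t block_variance(int64_t sum, uint64_t sse, int w, int h) {
  assert(w > 0 && h > 0);
  const int64_t mean_sq = (sum * sum) / ((int64_t)w * h);
  const int64_t var = (int64_t)sse - mean_sq;
  return var < 0 ? 0 : (uint32_t)var;
}

/* Hypothetical helper: a 3-bit subpel offset takes values 0..7 in units of
 * 1/8 pel, so the half-pel position is 4. The value 8 is never reached. */
static int is_half_pel(int subpel_offset) {
  return subpel_offset == 4;
}
```

For example, `block_variance(17, 16, 4, 4)` computes 16 - 289/16 = 16 - 18, which is negative and is therefore returned as 0.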

196 Commits