generic-library/vpx

Author	SHA1	Message	Date
Johann	f6fcd3410d	sub pel avg variance neon: 4x block sizes BUG=webm:1423 Change-Id: Iaab2b9a183fdb54aae5f717aba95d90dc36a9e3b	2017-05-22 14:40:05 -07:00
Johann	188d58eaa9	sub pel variance neon: 4x block sizes Add optimizations for blocks of width 4 BUG=webm:1423 Change-Id: Idfb458d36db3014d48fbfbe7f5462aa6eb249938	2017-05-22 14:40:01 -07:00
Johann	9b0d306a2f	sub pel avg variance neon: add neon optimizations These are missing an optimized version of vpx_comp_avg_pred BUG=webm:1423 Change-Id: I31fa6ef842e98f7ff3ea079ffed51ae33178e2ed	2017-05-22 13:58:43 -07:00
Johann	7b742da63e	neon variance: process 4x blocks Continue processing sets of 16 values. Plenty of improvement for 4x8 (doubles the speed) but only about 30% for 4x4. BUG=webm:1422 Change-Id: Ib8dd96f75d474f0348800271d11e58356b620905	2017-05-17 17:35:01 -07:00
Luca Barbato	143b21e362	ppc: Add get_mb_ss_vsx Change-Id: I1b54a7a5bb642e4b836d786ea1ae506eed025e3f	2017-05-12 17:23:00 +02:00
Luca Barbato	6d225eb5f9	ppc: Add get4x4sse_cs_vsx Change-Id: I3028bdadf653665d18e781d28e9625f62804b3d8	2017-05-12 17:23:00 +02:00
Johann Koenig	d713ec3c46	Merge changes I92eb4312,Ibb2afe4e * changes: subpel variance neon: add mixed sizes sub pixel variance neon: use generic variance	2017-05-10 18:19:52 +00:00
Johann	2346a6da4a	subpel variance neon: add mixed sizes Add support for everything except block sizes of 4. Performance is better but numbers will improve again when the variance optimizations land. BUG=webm:1423 Change-Id: I92eb4312b20be423fa2fe6fdb18167a604ff4d80	2017-05-04 15:30:01 -07:00
Johann	cb9133c72f	neon variance: add small missing sizes Some of the mixed sizes were missing. They can be implemented trivially using the existing helper function. When comparing the previous 16x8 and 8x16 implementations, the helper function is about 10% faster than the 16x8 version. The 8x16 is very close, but the existing version appears to be faster. BUG=webm:1422 Change-Id: Ib0e856083c1893e1bd399373c5fbcd6271a7f004	2017-05-04 08:59:42 -07:00
Johann	d55724fae9	Remove armv6 target Change-Id: I1fa81cc9cabf362a185fc3a53f1e58de533a41e5	2016-08-04 12:55:06 -07:00
clang-format	33e40cb5db	test: apply clang-format Change-Id: I0d9ab85855eb723f653a7bb09b3d0d31dd6cfd2f	2016-07-27 01:58:52 +00:00
skal	7d72ebaa5c	take II: variance_test partial clean-up remove some (but not all yet!) tuple mis-use, and revamp the code a lot. Factorize some common chores into MainTestClass. Change-Id: Id37b7330eebe80d19b9d12a454f24ff9be6b1116	2016-07-18 16:18:26 -07:00
Johann	0266e70c52	test: remove x86inc.asm distinction BUG=b:29583530 Change-Id: I296a0b81755e3086bc0a40cb126d0200ff03c095	2016-06-30 11:14:10 -07:00
James Zern	eea8ea88ab	vpx_dsp: remove mmx variance implementations there are sse2 equivalents for all remaining variance implementations Change-Id: I10b947e73fc0067688181f819b59e47966bec3d2	2016-06-02 23:46:16 -07:00
Scott LaVarnway	a4f3751be5	Code clean of sub_pixel_variance4xh -- 2 Replace MMX with SSE2. Change-Id: Id8482d2589131f9427e7f36bc64413f058caf31f	2016-05-24 04:44:05 -07:00
Linfeng Zhang	d0ffae825d	remove mmx variance functions there are sse2 equivalents which is a reasonable modern baseline Removed mmx variance functions: vpx_get_mb_ss_mmx() vpx_get8x8var_mmx() vpx_get4x4var_mmx() vpx_variance4x4_mmx() vpx_variance8x8_mmx() vpx_mse16x16_mmx() vpx_variance16x16_mmx() vpx_variance16x8_mmx() vpx_variance8x16_mmx() Change-Id: Iffaf85344c6676a3dd337c0645a2dd5deb2f86a1	2016-05-11 12:39:42 -07:00
Yi Luo	4bdc7d1c88	Fix high bit depth mask and variance reference function - Use arithmetic AND (&) instead of logical AND (&&) to generate correct testing input. - Fix variance reference function to be consistent with our codebase implementation. - Refer to the following issue: https://bugs.chromium.org/p/webm/issues/detail?id=1166 Change-Id: I8c1ebb03e22dc9e1dcd96bdf935fc126cee71307	2016-04-05 09:27:03 -07:00
Alex Converse	ed3df445d9	Revert "Merge "Change highbd variance rounding to prevent negative variance."" This reverts commit `ea48370a50`, reversing changes made to `15939cb2d7`. The commit was insufficiently tested and causes failures. Change-Id: I623d6fc2cd3ae6fd42d0abab1f8eada465ae57a7	2016-01-13 11:19:06 -08:00
Alex Converse	022c848b4d	Change highbd variance rounding to prevent negative variance. Always round sum error and sum square error toward zero in variance calculations. This prevents variance from becoming negative. Avoiding rounding variance at all might be better but would be far more invasive. Change-Id: Icf24e0e75ff94952fc026ba6a4d26adf8d373f1c	2015-11-24 16:32:01 -08:00
James Zern	837cea40fc	variance_test: create fn pointers w/'&' ref this helps some toolchains (vs9) resolve the type of the parameter Change-Id: I8c83b86da53b1783cd18c0f765b67ba33da91d72	2015-11-06 11:04:11 -08:00
Jingning Han	097d59c28c	Cosmetics - Fix header file order in unit tests Change-Id: I9582a8d74990125b71e8fe620f7f3f2585a30798	2015-07-29 20:48:25 -07:00
Parag Salasakar	1d7f1ca7da	mips msa vp9 avg subpel variance optimization rebased Change-Id: Ia21987010dbb688e2a8fa204ca9129d2f34c9581	2015-07-08 12:07:28 +05:30
Johann	6a82f0d7fb	Move sub pixel variance to vpx_dsp Change-Id: I66bf6720c396c89aa2d1fd26d5d52bf5d5e3dff1	2015-07-07 15:51:04 -07:00
Parag Salasakar	fbe67d307a	mips msa vp9 subpel variance optimization Change-Id: If88401bf8c5d8ee58200278734d7a5058d1585d0	2015-07-06 14:59:01 -07:00
James Zern	97946622c0	Revert "mips msa vp9 subpel variance optimization" This reverts commit `a42df86c03`. this change causes MSA/VP9SubpelVarianceTest.Ref and MSA/VP9SubpelVarianceTest.ExtremeRef failures under mips32r5el-msa-linux-gnu and mips64r6el-msa-linux-gnu Change-Id: I40b71a0b774eaeb31f66f795733f95cf360909f7	2015-07-02 12:06:51 -07:00
James Zern	ced982640b	Revert "mips msa vp9 avg subpel variance optimization" This reverts commit `61774ad1c4`. this change causes MSA/VP9SubpelAvgVarianceTest.Ref failures under mips32r5el-msa-linux-gnu and mips64r6el-msa-linux-gnu Change-Id: I7fb520c12b2a3b212d5e84b7619a380a48e49bb0	2015-07-02 12:06:29 -07:00
Parag Salasakar	61774ad1c4	mips msa vp9 avg subpel variance optimization average improvement ~3x-5x Change-Id: Iefbcafc05daab77b38a4e63b551e427867a501a4	2015-07-01 13:46:41 +05:30
Parag Salasakar	a42df86c03	mips msa vp9 subpel variance optimization average improvement ~3x-5x Change-Id: I4cbba2711467b0e205904769ebbb4a1fcbb1a311	2015-07-01 07:51:34 +05:30
Parag Salasakar	2d730a289a	mips msa vpx_dsp variance optimization average improvement ~2x-4x Change-Id: Ia3eef3f390148c2eb5cdc580a94cb26369737f82	2015-06-30 12:22:18 +05:30
James Zern	e0e4045db8	variance_test: fix build w/--disable-vp8-encoder s/CONFIG_VP8\b/CONFIG_VP8_ENCODER/ Change-Id: I616aace9cf8f18d7e83f00f7aef3b8a26fc4c17b	2015-06-11 23:15:30 -07:00
James Zern	47fe535422	disable vp8_sub_pixel_variance8x8_neon fails unit tests: [ FAILED ] NEON/VP8SubpelVarianceTest.ExtremeRef/0, where GetParam() = (3, 3, 0x14e36d, 0) [ FAILED ] NEON/VP8SubpelVarianceTest.Ref/0, where GetParam() = (3, 3, 0x14e36d, 0) the tests were recently enabled in: `eb88b17` Make vp9 subpixel match vp8 the functions likely haven't changed since being converted from assembly Change-Id: I6141717b111b8f735f436c160d74270af53ef722	2015-06-05 20:18:51 -07:00
Johann	eb88b172fe	Make vp9 subpixel match vp8 The only difference between the two was that the vp9 function allowed for every step in the bilinear filter (16 steps) while vp8 only allowed for half of those. Since all the call sites in vp9 (<< 1) the input, it only ever used the same steps as vp8. This will allow moving the subpel variance to vpx_dsp with the rest of the variance functions. Change-Id: I6fa2509350a2dc610c46b3e15bde98a15a084b75	2015-06-03 22:10:51 -07:00
Johann	d90536c1a2	Unify reference variance functions Use uint32_t for all output and make all functions static Change-Id: I2c9c6f6310732dc53444607d1c1a268ac1ab83ba	2015-06-02 15:14:55 -07:00
Johann	fdc549994a	Cast variance reference output The larger internal variables are required for the intermediates but RoundHighBitDepth brings them down to uint32_t/unsigned int. Fixes type warnings in visual studio. Change-Id: I48d35284d6cbde330ccdc1f46b6215a645d5eb00	2015-06-01 10:56:52 -07:00
Johann	a927aec5f8	Merge "Use correct parameters for NEON variance tests"	2015-05-28 19:53:50 +00:00
Johann	efc2e9844e	Use correct parameters for NEON variance tests Change-Id: Ib2949d0a3e9273e7952bbf91956357c1138093f1	2015-05-28 11:28:06 -07:00
Johann	c855ed72a6	Remove conversion warnings from hbd shifts ROUND_POWER_OF_TWO has some poor side effects when used with [u]int64_t such as doing the shifting in 32bits. Change-Id: Ic85a19765cd316fb43657cb21c86f35ceb772773	2015-05-27 17:54:22 -07:00
Johann	c5a7c89e89	Correct case in Get4x4SSEFunc Change-Id: Ie8a7508798fa8e65c579a77cedb8305cee4ddc81	2015-05-27 11:38:43 -07:00
Johann	c3bdffb0a5	Move variance functions to vpx_dsp subpel functions will be moved in another patch. Change-Id: Idb2e049bad0b9b32ac42cc7731cd6903de2826ce	2015-05-26 12:01:52 -07:00
Johann	1d7ccd5325	Relocate memory operations for common code With the sad functions, and hopefully the variance functions soon, moving to the vpx_dsp location, place the defines used in the reference C code in a common location. Change-Id: I4c8ce7778eb38a0a3ee674d2f1c488eda01cfeca	2015-05-13 11:41:15 -07:00
Frank Galligan	ec1d8387e1	Add 64x64 sub_pel_variance Neon function On Nexus 7 speed -5, -6, -7, and -8 saw about a 15% increase in perf for 480p. Speeds -5, -6, -7, and -8 saw about a 10% increase in perf for 720p. Tested on Nexus 7, built with ndk r10d, gcc 4.9. Change-Id: I2fa5315845e3021c9a6e2ea47e52e68b398d8334	2015-01-14 08:36:24 -08:00
Frank Galligan	74d40cd507	Add 64x variance Neon functions Add optimized Neon functions of: vp9_variance32x64 vp9_variance64x32 vp9_variance64x64 On Nexus 7 speed -5 and -6 saw about a 4% increase in perf. Speeds -7 and -8 saw about a 6% increase in perf. Tested on Nexus 7, built with ndk r10d, gcc 4.9. Change-Id: I5a81f13c9897eb927fa39662530f5524a0f768fa	2015-01-13 15:08:13 -08:00
Peter de Rivaz	48032bfcdb	Added sse2 acceleration for highbitdepth variance Change-Id: I446bdf3a405e4e9d2aa633d6281d66ea0cdfd79f (cherry picked from commit `d7422b2b1e`) (cherry picked from commit `6d741e4d76`)	2014-11-14 15:18:53 -08:00
Scott LaVarnway	fe2cc873dc	VP8 encoder for ARMv8 by using NEON intrinsics 1 Add vp8_mse16x16_neon.c - vp8_mse16x16_neon - vp8_get4x4sse_cs_neon Change-Id: I108952f60a9ae50613f0ce3903c2c81df19d99d0 Signed-off-by: James Yu <james.yu@linaro.org>	2014-09-15 12:04:09 -07:00
Dmitry Kovalev	1f19ebbab6	Replacing vp9_get_mb_ss_sse2 asm implementation with intrinsics. Change-Id: Ib4f5dd733eb2939b108070a01e83da5d9990bac0	2014-09-06 00:10:25 -07:00
Dmitry Kovalev	202edb3d23	Actually resetting random generator for all variance test cases. Calling Reset(int) method instead of overloaded operator()(int). Adding underscore at the end of class member name. Change-Id: I01934e7bc056d4b594e5d05d693328febd34ac3c	2014-09-04 12:24:52 -07:00
Dmitry Kovalev	12cd6f421d	Removing variance MMX code. Removed functions: * vp9_mse16x16_mmx * vp9_get_mb_ss_mmx * vp9_get4x4var_mmx * vp9_get8x8var_mmx * vp9_variance4x4_mmx * vp9_variance8x8_mmx * vp9_variance16x16_mmx * vp9_variance16x8_mmx * vp9_variance8x16_mmx They all have SSE2 equivalent. Change-Id: I3796f2477c4f59b35b4828f46a300c16e62a2615	2014-08-29 10:26:42 -07:00
levytamar82	69a5f5ecf7	Fix bug 807 in the sub_pixel_variance function the dst is aligned to 16 bytes and not to 32 bytes - now load unaligned data Change-Id: I2e0b9745543697efc56fefa32857ea10117af135	2014-08-07 18:51:02 -07:00
Scott LaVarnway	98165ec074	Neon version of vp9_sub_pixel_variance8x8(), vp9_variance8x8(), and vp9_get8x8var(). On a Nexus 7, vpxenc (in realtime mode, speed -12) reported a performance improvement of ~1.2%. Change-Id: I8a66ac2a0f550b407caa27816833bdc563395102	2014-08-01 11:35:55 -07:00
Scott LaVarnway	d39448e2d4	Neon version of vp9_sub_pixel_variance32x32(), vp9_variance32x32(), and vp9_get32x32var(). Change-Id: I8137e2540e50984744da59ae3a41e94f8af4a548	2014-07-31 08:00:36 -07:00

1 2

72 Commits