This uses the same sdx4df pointers as vp8_diamond_search_sadx4 and
should therefore target the same optimizations.
See e4ddf9db6a
Change-Id: Ic298e9b25c34bbe6b7a0799509355b0addb56675
The original commit never set any 'specialize' line:
61311e6103
It appears the sadx4 version of the function uses sdx4df calls to speed
up the search. There are no sse3 versions of the sdx4df functions, but
there are sse2 and msa versions.
There is a neon version of vpx_sad16x16x4d but none of the smaller
sizes. Perhaps if they existed, this function could be expanded to use
them.
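
For context, the sdx4df functions compute four SADs per call, which is
what makes the sadx4 search fast. A minimal sketch of the pattern,
assuming the vpx_sad16x16x4d prototype from vpx_dsp (the helper and its
names are illustrative):

    #include <stdint.h>

    /* Prototype as in vpx_dsp: one call produces four SADs. */
    void vpx_sad16x16x4d(const uint8_t *src, int src_stride,
                         const uint8_t *const ref[4], int ref_stride,
                         uint32_t sads[4]);

    /* Score four candidate positions with one sdx4df call instead of
     * four single-block SAD calls, then return the best index. */
    static int best_of_four(const uint8_t *src, int src_stride,
                            const uint8_t *const ref[4], int ref_stride) {
      uint32_t sads[4];
      int i, best = 0;
      vpx_sad16x16x4d(src, src_stride, ref, ref_stride, sads);
      for (i = 1; i < 4; ++i)
        if (sads[i] < sads[best]) best = i;
      return best;
    }
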
Change-Id: I936d7d6b1a3ff6dcd5a4d2322272708c47cdec13
This restores d9dce2f48e
Switched to using a signed shift-and-narrow: the unsigned version was
saturating negative results to 255 instead of clamping them to 0.
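
To illustrate the fix, a minimal sketch (the shift amount is
illustrative; the real filter code differs): the signed shift-and-narrow
saturates negative sums to 0, while narrowing the same bits as unsigned
saturates them to 255.

    #include <arm_neon.h>

    static uint8x8_t narrow_wrong(int16x8_t sum) {
      /* Reinterpreted as unsigned, a negative sum such as -3 becomes
       * 65533, so the unsigned narrowing shift saturates it to 255. */
      return vqrshrn_n_u16(vreinterpretq_u16_s16(sum), 7);
    }

    static uint8x8_t narrow_right(int16x8_t sum) {
      /* Signed shift-and-narrow: negative sums saturate to 0. */
      return vqrshrun_n_s16(sum, 7);
    }
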
BUG=webm:817
BUG=webm:1273
Change-Id: I571095336aa4182e3288b17924fcaaece42b0a49
This reverts commit d9dce2f48e.
Appears to be failing the SixtapPredict tests in some configurations,
and possibly the test vectors as well.
Change-Id: Ica6aa83ebac47d0a76e451846e7da67b1c17a7d7
This function was removed when clang started introducing alignment hints
which caused the 32 bit vld1_lane_u32/vst1_lane_u32 to fail:
https://llvm.org/bugs/show_bug.cgi?id=24421
The load has been rendered safe by a _u8-based implementation that
over-reads slightly, at a performance cost that is nearly
indiscernible. It is still ~5x faster than C in the unaligned case
when running both filters.
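
A sketch of the two loads in question (surrounding filter code omitted;
the caller must guarantee 8 readable bytes for the safe version):

    #include <arm_neon.h>

    /* Unsafe: the cast lets clang attach a 4-byte alignment hint to the
     * 32-bit lane load, which faults when src is unaligned. */
    static uint8x8_t load4_lane(const uint8_t *src) {
      uint32x2_t v = vld1_lane_u32((const uint32_t *)src, vdup_n_u32(0), 0);
      return vreinterpret_u8_u32(v);
    }

    /* Safe: byte loads carry no alignment assumption; this reads 8
     * bytes where only 4 are needed -- the slight over-read above. */
    static uint8x8_t load4_bytes(const uint8_t *src) {
      return vld1_u8(src);
    }
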
BUG=webm:892
BUG=webm:1273
Change-Id: Icf7167189391b46202f47233bb585c24c42bcc36
This function was removed when clang started introducing alignment hints
which caused the 32 bit vld1_lane_u32/vst1_lane_u32 to fail:
https://llvm.org/bugs/show_bug.cgi?id=24421
The load has been rendered safe by a _u8-based implementation that
over-reads slightly, at a performance cost that is nearly
indiscernible.
The store, when unaligned, has a safe version that is ~25% slower when
xoffset = 0 (second pass filter only). When the first pass filter (or
both filters) is in play, the new version is almost identical in speed.
Worst case performance (both filters, unaligned stores) is roughly 3-4x
faster than C.
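
One possible shape of the safe unaligned store described above (a
sketch; the actual code may spill lanes differently):

    #include <arm_neon.h>
    #include <string.h>

    /* Unsafe when dst is not 4-byte aligned: clang may add an alignment
     * hint to the 32-bit lane store. */
    static void store4_lane(uint8_t *dst, uint8x8_t v) {
      vst1_lane_u32((uint32_t *)dst, vreinterpret_u32_u8(v), 0);
    }

    /* Safe: spill to a local buffer with a byte store, then copy out
     * only the 4 bytes that are needed. */
    static void store4_bytes(uint8_t *dst, uint8x8_t v) {
      uint8_t tmp[8];
      vst1_u8(tmp, v);
      memcpy(dst, tmp, 4);
    }
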
BUG=webm:817
BUG=webm:1273
Change-Id: I1e490e94453e0872151fe0dafb05557463f6247d
These implementations rely on casting the pointers to load the data.
Clang implemented optimizations that automatically add alignment hints
to such loads. The 4x4 filters do not guarantee the necessary
alignment, so the resulting assembly is broken.
https://llvm.org/bugs/show_bug.cgi?id=24421
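
In plain C terms, the problematic pattern looks like this (illustrative,
not the literal code):

    #include <stdint.h>

    /* Casting a byte pointer up to uint32_t * promises 4-byte
     * alignment, so clang is entitled to emit an aligned load (e.g. a
     * vld1.32 with an alignment hint); undefined behavior if src is
     * misaligned. */
    static uint32_t load_u32_cast(const uint8_t *src) {
      return *(const uint32_t *)src;
    }
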
BUG=webm:817
BUG=webm:892
Change-Id: I608885299f1f86ff83653b65e0e40d0ae87fb3fe
I've added a few new functions (d45e, d63e, he, ve) to cover the
filtered h/v 4x4 predictors that are vp8-specific, the "correct"
d45 with the correctly filtered bottom-right pixel (as opposed to
the unfiltered version in vp9), and the "broken" d63 with weirdly
filtered bottom-right pixels (which is correctly filtered in vp9).
There may be a minor performance impact on all systems because we
have to do an extra copy of the Above pixel array to incorporate
the topleft pixel in the same array (thus fitting the vpx_dsp API).
In addition, armv6 will see a more serious performance impact because
I removed the armv6/vp8-specific assembly. I'm not sure anyone
cares...
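
The extra copy amounts to staging the top-left pixel and the top row in
one contiguous buffer so the vpx_dsp signature (dst, stride, above,
left) can be used with the top-left at above[-1]. A sketch with
illustrative names (8 bytes copied here for simplicity):

    #include <stddef.h>
    #include <stdint.h>
    #include <string.h>

    void vpx_ve_predictor_4x4_c(uint8_t *dst, ptrdiff_t stride,
                                const uint8_t *above, const uint8_t *left);

    /* vp8 keeps the top-left pixel separately; the vpx_dsp predictors
     * expect it at above[-1], hence the copy. */
    static void predict_ve_4x4(uint8_t *dst, ptrdiff_t stride,
                               uint8_t top_left, const uint8_t *top_row,
                               const uint8_t *left_col) {
      uint8_t above[1 + 8];
      above[0] = top_left;
      memcpy(above + 1, top_row, 8);
      vpx_ve_predictor_4x4_c(dst, stride, above + 1, left_col);
    }
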
Change-Id: I7f9e5ebee11d8e21aca2cd517a69eefc181b2e86
This commit replaces the vp8_-prefixed subtract function with the
common vpx_subtract_block function. It removes the redundant SIMD
optimization code and unit tests.
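
For reference, the shared function's C prototype and a 4x4 call in
place of a codec-specific subtract (strides here are illustrative):

    #include <stddef.h>
    #include <stdint.h>

    /* As in vpx_dsp/subtract.c: diff = src - pred, element-wise. */
    void vpx_subtract_block_c(int rows, int cols, int16_t *diff,
                              ptrdiff_t diff_stride, const uint8_t *src,
                              ptrdiff_t src_stride, const uint8_t *pred,
                              ptrdiff_t pred_stride);

    static void subtract_4x4(int16_t *diff, const uint8_t *src,
                             const uint8_t *pred) {
      vpx_subtract_block_c(4, 4, diff, 4, src, 16, pred, 16);
    }
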
Change-Id: I42e086c32c93c6125e452dcaa6ed04337fe028d9
fails unit tests:
[ FAILED ] NEON/VP8SubpelVarianceTest.ExtremeRef/0, where GetParam() = (3, 3, 0x14e36d, 0)
[ FAILED ] NEON/VP8SubpelVarianceTest.Ref/0, where GetParam() = (3, 3, 0x14e36d, 0)
the tests were recently enabled in:
eb88b17 Make vp9 subpixel match vp8
the functions likely haven't changed since being converted from assembly
Change-Id: I6141717b111b8f735f436c160d74270af53ef722
Clang adds alignment hints when casting up the loads/stores. Although
this should be safe for most paths, it's causing some crashes. Either
the source of the misalignment needs to be determined and adjusted or
the intrinsics need to be rewritten to avoid using the cast to load the
data.
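
If the rewrite route is taken, the standard cast-free idiom is a
memcpy-based load (a sketch, not the actual fix):

    #include <stdint.h>
    #include <string.h>

    /* memcpy expresses the same 4-byte load without promising
     * alignment; compilers lower it to one unaligned load. */
    static uint32_t load_u32_unaligned(const uint8_t *src) {
      uint32_t v;
      memcpy(&v, src, sizeof(v));
      return v;
    }
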
BUG=817,892
Change-Id: Ia3aa824d6a4cd97e14325ff49dc730b6f85ec7e8
Create a new component, vpx_dsp, for code that can be shared
between codecs. Move the SAD code into the component.
This reduces the size of vpxenc/dec by 36k on x86_64 builds.
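
For context, the kind of routine being consolidated: a sum of absolute
differences over a block (a minimal C sketch in the style of the
reference code; names are illustrative):

    #include <stdint.h>
    #include <stdlib.h>

    /* SAD between a source block and a reference block -- the core
     * metric used by motion search. */
    static unsigned int sad(const uint8_t *src, int src_stride,
                            const uint8_t *ref, int ref_stride,
                            int width, int height) {
      unsigned int sum = 0;
      int x, y;
      for (y = 0; y < height; ++y) {
        for (x = 0; x < width; ++x) sum += abs(src[x] - ref[x]);
        src += src_stride;
        ref += ref_stride;
      }
      return sum;
    }
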
Change-Id: I73f837ddaecac6b350bf757af0cfe19c4ab9327a