generic-library/vpx

Author	SHA1	Message	Date
Johann	2057d3ef75	use memcpy for unaligned neon stores Advise the compiler that the store is eventually going to a uint8_t buffer. This helps avoid getting alignment hints which would cause the memory access to fail. Originally added as a workaround for clang: https://bugs.llvm.org//show_bug.cgi?id=24421 Change-Id: Ie9854b777cfb2f4baaee66764f0e51dcb094d51e	2017-05-17 12:11:31 -07:00
James Zern	1396d12103	idct_neon.c: add missing rtcd include + correct declarations as necessary BUG=webm:1294 Change-Id: I719602df9a56e79188a78e7f8b31257c6d3cc11d	2016-09-30 11:41:26 -07:00
Johann Koenig	b165451ad5	Merge "Un-Revert "Restore vp8_sixtap_predict4x4_neon""	2016-09-26 19:11:00 +00:00
Johann	ab0e7a237a	Use shifted value for sinpi8sqrt2. The value 35468 changes sign when stored in int16_t: "implicit conversion from 'int' to 'int16_t' (aka 'short') changes value from 35468 to -30068". This negation requires adding back the original value to compensate. Shifting the value keeps the value positive and saves a post-vqdmulh shift. This technique is used in webp and idct_dequant_full_2x_neon. BUG=b/28027557 Change-Id: I0c5ce09bea170fe08061856c2af6f841a557e0c3	2016-09-23 17:04:18 -07:00
Johann	1d14e42df7	Un-Revert "Restore vp8_sixtap_predict4x4_neon" This restores `d9dce2f48e` Switched to using signed shift-and-narrow. Instead of saturating negative results to 0, it was saturating them to 255. BUG=webm:817 BUG=webm:1273 Change-Id: I571095336aa4182e3288b17924fcaaece42b0a49	2016-09-23 14:58:57 -07:00
James Zern	6ae58fd55e	Merge "Revert "Restore vp8_sixtap_predict4x4_neon""	2016-09-16 06:13:42 +00:00
Johann Koenig	7795e99296	Revert "Restore vp8_sixtap_predict4x4_neon" This reverts commit `d9dce2f48e`. Appears to be failing the SixtapPredict tests in some configurations and possibly test vectors as well. Change-Id: Ica6aa83ebac47d0a76e451846e7da67b1c17a7d7	2016-09-16 06:12:49 +00:00
Johann	43743b1d3e	Restore vp8_bilinear_predict4x4_neon This function was removed when clang started introducing alignment hints which caused the 32 bit vld1_lane_u32/vst1_lane_u32 to fail: https://llvm.org/bugs/show_bug.cgi?id=24421 The load has been rendered safe with an implementation ~indiscernible performance-wise that uses _u8 and over-reads just a touch. It is still ~5x faster than C in the unaligned case and doing both filters. BUG=webm:892 BUG=webm:1273 Change-Id: Icf7167189391b46202f47233bb585c24c42bcc36	2016-09-15 21:16:11 -07:00
Johann	d9dce2f48e	Restore vp8_sixtap_predict4x4_neon This function was removed when clang started introducing alignment hints which caused the 32 bit vld1_lane_u32/vst1_lane_u32 to fail: https://llvm.org/bugs/show_bug.cgi?id=24421 The load has been rendered safe with an implementation ~indiscernible performance-wise that uses _u8 and over-reads just a touch. The store, when unaligned, has a version that is ~25% slower but safe when xoffset = 0 (second pass filter only). When the first pass filter (or both) are in play, the new version is almost identical in speed. Worst case performance (both filters, unaligned stores) is roughly 3-4x faster than C. BUG=webm:817 BUG=webm:1273 Change-Id: I1e490e94453e0872151fe0dafb05557463f6247d	2016-09-15 14:56:47 -07:00
Johann	d55724fae9	Remove armv6 target Change-Id: I1fa81cc9cabf362a185fc3a53f1e58de533a41e5	2016-08-04 12:55:06 -07:00
Jim Bankoski	3e04114f3d	prepend ++ instead of post in for loops. Applied the following regex: search for `(for.\(.;.;) ([a-zA-Z_])\+\+\)`, replace with `\1 ++\2)`. This misses some for loops, e.g. `for (mb_col = 0; mb_col < oci->mb_cols; mb_col++, mi++)`. Change-Id: Icf5f6fb93cced0992e0bb71d2241780f7fb1f0a8	2016-07-18 06:54:50 -07:00
clang-format	81a6739533	vp8: apply clang-format Change-Id: I7605b6678014a5426ceb45c27b54885e0c4e06ed	2016-07-15 19:28:44 -07:00
Johann	ce11055d57	Remove sixtap/bilinear 4x4 neon implementations These implementations rely on casting the pointers to load the data. Clang implemented optimizations which automatically add alignment hints to such loads. The 4x4 filters do not guarantee the necessary alignment so the resulting assembly is broken. https://llvm.org/bugs/show_bug.cgi?id=24421 BUG=webm:817 BUG=webm:892 Change-Id: I608885299f1f86ff83653b65e0e40d0ae87fb3fe	2016-05-06 17:20:15 -07:00
Ronald S. Bultje	c26a9ecaa2	vp8: change build_intra4x4_predictors() to use vpx_dsp. I've added a few new functions (d45e, d63e, he, ve) to cover the filtered h/v 4x4 predictors that are vp8-specific, the "correct" d45 with the correctly filtered bottom-right pixel (as opposed to the unfiltered version in vp9), and the "broken" d63 with weirdly filtered bottom-right pixels (which is correctly filtered in vp9). There may be a minor performance impact on all systems because we have to do an extra copy of the Above pixel array to incorporate the topleft pixel in the same array (thus fitting the vpx_dsp API). In addition, armv6 will have a more serious performance impact b/c I removed the armv6/vp8-specific assembly. I'm not sure anyone cares... Change-Id: I7f9e5ebee11d8e21aca2cd517a69eefc181b2e86	2015-09-30 18:45:49 -04:00
Ronald S. Bultje	7cdcfee82c	vp8: change build_intra_predictors_mbuv_s to use vpx_dsp. Change-Id: I936c2430c3c5b1e0ab5dec0a20110525e925b5e4	2015-09-30 18:45:46 -04:00
Ronald S. Bultje	54d48955f6	vp8: change build_intra_predictors_mby_s to use vpx_dsp. Change-Id: I2000820e0c04de2c975d370a0cf7145330289bb2	2015-09-30 18:45:40 -04:00
Johann	4e5e5fc52b	Rename vp8 loopfilter[_neon.c] Avoid conflict with vpx_dsp version Change-Id: I041b1532a9276400a5547de8dfed1de43ad4e83d	2015-08-18 11:47:00 -07:00
Johann	6a82f0d7fb	Move sub pixel variance to vpx_dsp Change-Id: I66bf6720c396c89aa2d1fd26d5d52bf5d5e3dff1	2015-07-07 15:51:04 -07:00
James Zern	dcf5b7cfdd	loopfiltersimpleverticaledge_neon: quiet uninit var warnings take 2. localize the function parameter to actually remove the warning Change-Id: I23c02061b5e21b0b75bd33c26062d1e531df7b92	2015-06-30 23:23:59 -07:00
James Zern	69c153c4e6	loopfiltersimpleverticaledge_neon: quiet uninit var warnings the vector used in vld_lane_ should be initialized before use Change-Id: Idce95354737915f6fb4e6b5e8980a050e953036d	2015-06-25 20:39:21 -07:00
James Zern	f4d746a3c1	idct_dequant_0_2x_neon: quiet uninit var warnings the vector used in vld_lane_ should be initialized before use Change-Id: I6b791088479fec3bc021ca75cc2af5adcc39d954	2015-06-25 20:29:35 -07:00
James Zern	4bd87a9b9e	vp8_subpixelvariance_neon: right size coeff table only uint8 is required; each use only loads one value as a uint8 quiets a few type conversion warnings Change-Id: I03dc0dc0eb01ac23a6e8673daa2b77c6c57bf1b0	2015-06-23 23:48:12 -07:00
Johann	c3bdffb0a5	Move variance functions to vpx_dsp subpel functions will be moved in another patch. Change-Id: Idb2e049bad0b9b32ac42cc7731cd6903de2826ce	2015-05-26 12:01:52 -07:00
James Zern	fd3658b0e4	replace DECLARE_ALIGNED_ARRAY w/DECLARE_ALIGNED this macro was used inconsistently and only differs in behavior from DECLARE_ALIGNED when an alignment attribute is unavailable. this macro is used with calls to assembly, while generic c-code doesn't rely on it, so in a c-only build without an alignment attribute the code will function as expected. Change-Id: Ie9d06d4028c0de17c63b3a27e6c1b0491cc4ea79	2015-05-07 11:55:08 -07:00
Johann	d5d9289800	Move shared SAD code to vpx_dsp Create a new component, vpx_dsp, for code that can be shared between codecs. Move the SAD code into the component. This reduces the size of vpxenc/dec by 36k on x86_64 builds. Change-Id: I73f837ddaecac6b350bf757af0cfe19c4ab9327a	2015-05-06 16:58:20 -07:00
James Zern	f58011ada5	vpx_mem: remove vpx_memset vestigial. replace instances with memset() which they already were being defined to. Change-Id: Ie030cfaaa3e890dd92cf1a995fcb1927ba175201	2015-04-28 20:00:59 -07:00
Johann	eabb793f3b	Use correct buffer size in vp8 subpixel variance In vp8_sub_pixel_variance8x8_neon the temp2 buffer is only initialized to kHeight8 * kWidth8. However, in the case that xoffset != 0 and yoffset == 0, var_filter_block2d_bil_w8 is called with output_width kHeight8PlusOne. Thanks to cmugurel for diagnosing and yulius for the patch. Change-Id: Ib71ffd96ffad963c92b8b7ca23f303942785b8e0 https://code.google.com/p/webrtc/issues/detail?id=4190	2015-02-03 09:11:05 -08:00
Johann	f6be2f3c87	Clarify GCC version check The version check was incorrectly matching some versions of clang which reported as gcc 4.2 Change-Id: I686d3576e71883fe1463206b56ab5e2aa9bb68a8	2014-09-25 11:53:45 -07:00
Jia Jia	0ae866bd19	vp8/vp9: neon: msvc: move the 'ifdef _MSC_VER' bit to vpx_ports/mem.h. fix compiling warning. Change-Id: If8706a9046436f704c597e4275a6810c76ba7daa	2014-09-14 01:43:54 +08:00
Jia Jia	c97f5e8b86	vp8 common: change 'HAVE_NEON_ASM' to 'HAVE_NEON' for compiling functions of NEON intrinsics. Change-Id: I975e5eac16f8b623ff589f0ec072cdaff2183b04	2014-09-05 12:24:05 +00:00
James Zern	35fadf1d25	bilinearpredict_neon: fix type conversion warnings make bifilter4_coeff[][] uint8_t, no values exceed this range and they're loaded with vdup_n_u8(). Change-Id: I921983e9edd828d29820e40ac30a7801dbe0fb4f	2014-09-04 20:50:42 -07:00
James Zern	f61e00c79d	Merge "arm: Fix building vp8_subpixelvariance_neon.c with MSVC"	2014-09-04 11:00:53 -07:00
Scott LaVarnway	ec94967ffe	Revert "Revert "VP8 for ARMv8 by using NEON intrinsics 10"" This reverts commit `677fb5123e` Compiles with 4.6. Change-Id: I7f87048911b6bc28a61741d95501fa45ee97b819	2014-09-04 08:51:20 -07:00
Martin Storsjo	0002da32e6	arm: Fix building vp8_subpixelvariance_neon.c with MSVC Use the right return values - vget_low_s64 returns int64x1_t, not a normal int64_t. Also make __builtin_prefetch a no-op on MSVC for this file. Change-Id: I4d2fce01d0ba106b98d3d53b137803119c2c2c08	2014-09-04 09:49:30 +03:00
Scott LaVarnway	dcbfacbb98	Neon version of vp8_build_intra_predictors_mby_s() and vp8_build_intra_predictors_mbuv_s(). This patch replaces the assembly version with an intrinsic version. On a Nexus 7, vpxenc (in realtime mode, speed -12) reported a performance improvement of ~2.6%. Change-Id: I9ef65bad929450c0215253fdae1c16c8b4a8f26f	2014-09-03 13:41:27 -07:00
Scott LaVarnway	9293d267d2	VP8 for ARMv8 by using NEON intrinsics 17 Add vp8_subpixelvariance_neon.c - vp8_sub_pixel_variance16x16_neon_func - vp8_variance_halfpixvar16x16_h_neon - vp8_variance_halfpixvar16x16_v_neon - vp8_variance_halfpixvar16x16_hv_neon - vp8_sub_pixel_variance8x8_neon Change-Id: I3e5d85b2eafc26be0eef6a777789b80e4579257b Signed-off-by: James Yu <james.yu@linaro.org>	2014-09-03 13:33:44 -07:00
Johann	5b788c0cbe	Merge "Revert "Revert "VP8 for ARMv8 by using NEON intrinsics 06" This reverts commit `81ad047ee5`. Revert "VP8 for ARMv8 by using NEON intrinsics 15" This reverts commit 727af7cebe3698b8493ba6c1360b0a6606c310fb.""	2014-09-03 13:27:11 -07:00
Scott LaVarnway	652ef29d09	Revert "Revert "VP8 for ARMv8 by using NEON intrinsics 08"" This reverts commit `928ff03889` Compiles with 4.6 now. Change-Id: Ib455da1098bb0e0623248be07579882a425fcbd1	2014-08-29 13:29:36 -07:00
Johann	911e96a4eb	Revert "Revert "VP8 for ARMv8 by using NEON intrinsics 06" This reverts commit `81ad047ee5`. Revert "VP8 for ARMv8 by using NEON intrinsics 15" This reverts commit 727af7cebe3698b8493ba6c1360b0a6606c310fb." This reverts commit `920f803f2e` Change-Id: I410d9036214a1b18427cca70b4bc6d8239740737	2014-08-20 09:41:50 -07:00
James Zern	74ed33cf6e	vp8_bilinear_predict4x4_neon: init src vectors quiets uninitialized warnings on the first load. Change-Id: I58a5af337087d96b4eaea8991a0f85c4ba58aebe	2014-07-11 00:05:25 -07:00
James Zern	5e30127c7a	vp8_sixtap_predict4x4_neon: init src vectors quiets uninitialized warnings on the first load. Change-Id: Ied9b03928537a9ed2cd414b9e8a0be00191b0f32	2014-07-10 23:48:47 -07:00
Johann	f625b2ac93	Correct HAVE_NEON_ASM define These optimizations are currently disabled. Change-Id: I19c58c9cb82d017638b86196641b9e001dfa798b	2014-05-16 08:20:13 -07:00
Johann	2f6f955a17	Remove intermediate step in vp8_dequantize_b With the intrinsics it is no longer necessary to have a stub/helper function. Change-Id: I3695961c3c94f1bb750d3b7b29716e509ebba482	2014-05-14 12:24:18 -07:00
Johann	4dcc6d9707	Build armv7a-only code Allow disabling the more generic NEON code. Use filtered option to disable rtcd code. Change-Id: Icb4500c1a2bac16eed3c5e3ec0c35e92e6bbbb9f	2014-05-14 12:23:33 -07:00
Johann	920f803f2e	Revert "VP8 for ARMv8 by using NEON intrinsics 06" This reverts commit `81ad047ee5`. Revert "VP8 for ARMv8 by using NEON intrinsics 15" This reverts commit `727af7cebe`. This exposes a bug in gcc 4.9 regarding register allocation. Will reland when 4.9 is fixed. Change-Id: I2d8a04e4edde93719280e41550f4c0765608ec4d	2014-05-13 13:21:17 -07:00
Johann	ce23931a3f	Only build neon assembly for armv7 targets Allow selectively building just the intrinsics for armv8 Change-Id: I2f29b2e4508b8b8e5649c2906b3159ad1d4ec477	2014-05-12 08:52:02 -07:00
Johann	4bffb75ba3	Merge "Revert "VP8 for ARMv8 by using NEON intrinsics 10""	2014-05-07 06:47:48 -07:00
Johann	3a695015ad	Merge "arm: Use a correct neon vector type for 64 bit integers"	2014-05-07 06:34:25 -07:00
Martin Storsjo	d5d82a5e1a	arm: Add a no-op define of __builtin_prefetch for MSVC Both GCC and RVCT/ARMCC support __builtin_prefetch, but MSVC doesn't. Change-Id: I44e1eecead61bc88d8fdfd3fef03d76d4f5afe08	2014-05-07 10:43:24 +03:00
Martin Storsjo	82a83c4fe0	arm: Use a correct neon vector type for 64 bit integers This fixes building with MSVC. Change-Id: I763ba8855c8083d82c8b477d3a297e310e93a335	2014-05-07 10:22:40 +03:00
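
Note on ab0e7a237a (sinpi8sqrt2): the arithmetic in that message can be checked without NEON hardware. The block below is a scalar sketch, not the libvpx source. `qdmulh_lane` is a hypothetical one-lane model of `vqdmulh_s16`, and the shift-then-add sequence in `wrapped_constant` is an assumption about how the older code compensated, inferred from the commit text. It suggests that the wrapped constant (-30068) with a shift and an add-back, and the halved constant (17734) with neither, both yield `(x * 35468) >> 16`.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical scalar model of one lane of NEON vqdmulh_s16: saturating
 * doubling multiply that keeps the high 16 bits, i.e. (2 * a * b) >> 16.
 * Assumes >> on a negative int is an arithmetic shift (GCC and Clang). */
static int16_t qdmulh_lane(int16_t a, int16_t b) {
  int32_t product = (int32_t)a * (int32_t)b;
  if (product == 0x40000000) return INT16_MAX; /* INT16_MIN * INT16_MIN only */
  return (int16_t)(product >> 15);             /* same as (2 * product) >> 16 */
}

/* The intended result, computed in 32-bit arithmetic. */
static int16_t reference(int16_t x) {
  return (int16_t)(((int32_t)x * 35468) >> 16);
}

/* Assumed older scheme: 35468 does not fit in int16_t and wraps to
 * 35468 - 65536, so a shift and an add-back are needed afterwards. */
static int16_t wrapped_constant(int16_t x) {
  const int16_t sinpi8sqrt2 = 35468 - 65536; /* -30068 */
  int16_t t = qdmulh_lane(x, sinpi8sqrt2);
  t = (int16_t)(t >> 1);                     /* post-multiply shift */
  return (int16_t)(t + x);                   /* add x back to compensate */
}

/* Scheme described in the commit: halve the constant so it stays positive;
 * the doubling inside the multiply restores the scale. */
static int16_t shifted_constant(int16_t x) {
  const int16_t sinpi8sqrt2_shifted = 35468 >> 1; /* 17734 */
  return qdmulh_lane(x, sinpi8sqrt2_shifted);
}

/* Exhaustively compare both schemes against the reference. */
static void check_all_inputs(void) {
  for (int32_t x = INT16_MIN; x <= INT16_MAX; ++x) {
    assert(wrapped_constant((int16_t)x) == reference((int16_t)x));
    assert(shifted_constant((int16_t)x) == reference((int16_t)x));
  }
}
```

Calling `check_all_inputs()` walks every int16_t input. The shifted form needs one operation where the wrapped form needs three, which is the saving the commit message refers to.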

151 Commits
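
Note on 2057d3ef75 (memcpy for unaligned stores): a minimal portable sketch of the idea in that message, not the libvpx code. The helper names `store_u32_unaligned` and `load_u32_unaligned` are hypothetical. Copying through `memcpy` tells the compiler only that the destination is a byte buffer, so it has no grounds to assume 4-byte alignment the way it might after a `uint32_t *` cast.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: write four packed bytes to a uint8_t destination
 * that may sit at any address. Compilers typically lower a fixed-size
 * memcpy like this to a single unaligned store. */
static void store_u32_unaligned(uint8_t *dst, uint32_t value) {
  memcpy(dst, &value, sizeof(value));
}

/* Hypothetical counterpart for reading four bytes back. */
static uint32_t load_u32_unaligned(const uint8_t *src) {
  uint32_t value;
  memcpy(&value, src, sizeof(value));
  return value;
}
```

In a NEON routine the 32-bit value would first be taken out of the vector register (for example with `vget_lane_u32`) and then passed to a helper of this shape, instead of storing through a pointer cast to `uint32_t *`.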