generic-library/vpx

Author	SHA1	Message	Date
Scott LaVarnway	60624aa53a	Merge "VP9: token decoder expansion"	2016-09-26 12:06:50 +00:00
James Zern	f8c056a895	Merge "vp9_idct: delete dead TODOs"	2016-09-24 01:47:00 +00:00
Johann	ab0e7a237a	Use shifted value for sinpi8sqrt2 The value 35468 changes sign when stored in int16_t: implicit conversion from 'int' to 'int16_t' (aka 'short') changes value from 35468 to -30068 This negation requires adding back the original value to compensate. Shifting the value keeps the value positive and saves a post-vqdmulh shift. This technique is used in webp and idct_dequant_full_2x_neon BUG=b/28027557 Change-Id: I0c5ce09bea170fe08061856c2af6f841a557e0c3	2016-09-23 17:04:18 -07:00
Johann	1d14e42df7	Un-Revert "Restore vp8_sixtap_predict4x4_neon" This restores `d9dce2f48e` Switched to using signed shift-and-narrow. Instead of saturating negative results to 0, it was saturating them to 255. BUG=webm:817 BUG=webm:1273 Change-Id: I571095336aa4182e3288b17924fcaaece42b0a49	2016-09-23 14:58:57 -07:00
Scott LaVarnway	87b689f97a	VP9: token decoder expansion This version is based on Change 267683, but does not use the macros. Change-Id: I0619fa618decf8bdeef250584d75d70318b5d9a7	2016-09-23 06:24:20 -07:00
Scott LaVarnway	ada850786c	Merge "VP9: pass TileWorkerData instead of MACROBLOCKD and vpx_reader."	2016-09-23 11:59:16 +00:00
James Zern	deadda3dea	Merge "vpx_idct32x32_34_add_sse2: rm unneeded transposes"	2016-09-23 02:49:26 +00:00
James Zern	a914ffad97	Merge "variance_neon: sync variance*() w/c,sse2"	2016-09-23 02:18:49 +00:00
Scott LaVarnway	7a34f85955	VP9: pass TileWorkerData instead of MACROBLOCKD and vpx_reader. Change-Id: I869ef0f113c022143b531c44aefa0f1bb267052d	2016-09-22 13:18:36 -07:00
James Zern	fdd1186f97	vpx_idct32x32_34_add_sse2: rm unneeded transposes this change is neutral to mildly positive across various x86-64 platforms Change-Id: I28fb5ae598fc1317b7a42c9a846ac5d57d104784	2016-09-21 19:49:25 -07:00
Angie Chiang	99ef84c65a	Merge "Detect invalid highbd iht input"	2016-09-22 01:06:38 +00:00
James Zern	e372bfd5ac	variance_neon: sync variance*() w/c,sse2 removes some unnecessary casts and adds a few explicit uint32 ones for larger sizes to quiet -Wshorten-64-to-32 warnings Change-Id: I63c5fce8e62c426d5cf5c10a66a113c119a43518	2016-09-21 18:04:45 -07:00
James Zern	fcf281b6a1	Merge "vp8: remove VP8_SET_DBG* control support"	2016-09-22 00:43:35 +00:00
Angie Chiang	80338b91d3	Detect invalid highbd iht input Do nothing in vp9_highbd_iht#x#_##_add_c when input magnitude is beyond 20 bits. Note that, sign bit is not included here. In the 20 bits, we use 12 bits for input signal, 7 bits for forward transform amplification, and 1 bit for contingency in rounding and quantizing BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1286 Change-Id: I332c6f68df4614fc2e7d2dc4c5bb0d0cff8a245c	2016-09-21 17:15:19 -07:00
Johann	2bed8b6acd	Keep vp8 sixtap read within bounds When filtering it needs 6 pixels: 2 prior to the source, the source, and 3 after the source. When filtering 16 wide, that means 21. To accomplish this the SSE2 reads [-2] to [5], [6] to [13], and [14] to [21], a total of 24 bytes (reading in groups of 8 is easy) The filter then shifts this last set to the top half of the register and uses 'or' to combine it with the previous set. Valgrind detected an issue reading pixels [19], [20] and [21]: Address 0x7f581c2 is 434 bytes inside a block of size 441 alloc'd Note: we only need pixels [16], [17], and [18] as context for [15]. To fix this, it now reads 8 bytes starting at [11], which re-loads [11] through [13], but stops at [18] and does not over-read any values. This is shifted by 5 and 'or'd with xmm1. Although the lower bits are not cleared, they overlap directly with [11] through [13], so 'or' produces the correct results. Change-Id: I0c89c03afa660fc9b0108ac055d7bd403e493320	2016-09-21 16:17:07 -07:00
Johann	35ebc1cddf	predict_test: align dst buffer to 16 On 32 bit machines 'new' does not always appear to allocate sufficiently aligned buffers, causing intermittent test failures. Change-Id: I0db4fc73782012e4eef71dc0fb540e74fdbfcebe	2016-09-21 13:35:47 -07:00
James Zern	3f72509587	vp8: remove VP8_SET_DBG* control support the --enable-postproc-visualizer configure option remains as a no-op as do the control names and values for compatibility + remove the corresponding debug flags from vpxdec: --pp-* Change-Id: I4a001cd9962b59560d7d6bda6272d4ff32b8d37c	2016-09-20 20:19:36 -07:00
James Zern	cec6433e41	vp9_idct: delete dead TODOs Change-Id: Icdd5494f557d83026dc078bce37997a76aa288fb	2016-09-20 19:46:27 -07:00
James Zern	b6e686b1ea	Merge changes from topic 'Wshorten' * changes: vp8: convert some uses of unsigned long to size_t vp8/encoder: quiet some -Wshorten-64-to-32 warnings	2016-09-20 23:17:20 +00:00
James Zern	c31d02615d	Merge "variance_avx2: sync variance functions with c-code"	2016-09-20 22:33:39 +00:00
James Zern	2351a73531	Merge "examples: quiet -Wshorten-64-to-32 warnings"	2016-09-20 22:32:58 +00:00
James Zern	feb4313c5f	Merge "vp9_rtcd: remove non-existent highbd convolve fns"	2016-09-20 22:07:09 +00:00
Johann Koenig	8478f97105	Merge "Enable ssse3 bilinear tests"	2016-09-20 21:46:50 +00:00
Johann Koenig	18fd69ee91	Merge "Add vp8_bilinear_filter test"	2016-09-20 20:30:48 +00:00
Alex Converse	0d2687ef87	Merge "Code class0 using vpx_read() / vpx_write()."	2016-09-20 19:19:29 +00:00
James Zern	5841929fde	vp9_rtcd: remove non-existent highbd convolve fns these were moved to vpx_dsp Change-Id: I307b07ae05e2333277d4b7011cba36dcf8409959	2016-09-19 20:01:23 -07:00
James Zern	08b8b6bb8f	examples: quiet -Wshorten-64-to-32 warnings all around usage of strtol/strtoul Change-Id: If907c89f107a068987aa71ddd93cee9a7389e4cd	2016-09-19 19:02:49 -07:00
James Zern	8281da74b9	vp8: convert some uses of unsigned long to size_t similar to changes that were done in vp9 for encoded frame size reporting. has the side-effect of quieting a -Wshorten-64-to-32 warning. Change-Id: I89f74cb617fc29334ee351dc8dfaa3b8cfd4e5af	2016-09-19 18:35:59 -07:00
James Zern	0ce98b423b	vp8/encoder: quiet some -Wshorten-64-to-32 warnings this code is similar to other existing uses and/or vp9 Change-Id: I56e646931379759d9f7332ea6d746060007c75ee	2016-09-19 18:35:59 -07:00
Linfeng Zhang	761e5ec2f6	Refactor lpf (size 4 and 8) NEON intrinsics optimization Also check in 8x8 8-bit transpose NEON intrinsics optimization transpose_u8_8x8() Change-Id: I32d321cf97ea21eab158ac4896990fc9a51681c4	2016-09-19 16:41:37 -07:00
James Zern	6acd061aad	variance_avx2: sync variance functions with c-code add missing int64 -> uint32 cast; quiets -Wshorten-64-to-32 warnings Change-Id: I4850b36e18dc8b399108342be4bfe0b684aefb78	2016-09-19 16:19:29 -07:00
Johann Koenig	0695843a21	Merge "Remove -fno-strict-aliasing flag"	2016-09-19 22:49:23 +00:00
Johann	fad70a358b	Remove -fno-strict-aliasing flag The referenced bug was fixed by saving neon registers. That this had any effect was coincidental. Both chromium and Android build with clang and neither uses this flag. Change-Id: I470247d6fd9226fc207b42a187105581a94badc3	2016-09-19 12:16:03 -07:00
Nathan E. Egge	de7f5ce9e5	Code class0 using vpx_read() / vpx_write(). The vp9_mv_class0_tree is a balanced tree with two leafs and can simply be coded as a boolean with probability class0[0]. Change-Id: If294dac825a5f945371092c74aa8e3f84cd962b6 (cherry picked from commit be8a8ab62ebdd111c6f2e9a33b15630570671eba)	2016-09-19 10:50:39 -07:00
Alex Converse	01e2902521	Zero the whole rd_counts struct rather than the each member Change-Id: I495aa9cec2b2b8f1ae69bdab8b3feeca76358472	2016-09-19 10:04:47 -07:00
James Zern	aa0eb67bf7	loopfilter_mb_neon: remove unused load_8x8() quiets a -Wunused-function warning for arm targets Change-Id: I293a7e3d3d7d61d6af2fbedad5e8c25126c418b6	2016-09-17 11:00:31 -07:00
Linfeng Zhang	5d73639d8f	Merge "Refactor lpf (size 16) NEON intrinsics optimization"	2016-09-17 00:33:30 +00:00
James Zern	112eb54c1b	Merge "vpx_codec_control: return incapable for unmatched control"	2016-09-16 17:30:44 +00:00
Linfeng Zhang	8107368000	Refactor lpf (size 16) NEON intrinsics optimization Extract shared code so later lpf size 4 and 8 functions can reuse. Change-Id: Ibb43ef1fd8651bd2e32fcc4c56cf6fa7ca237401	2016-09-16 09:12:13 -07:00
James Zern	33aef48f29	vpx_subpixel_8t_intrin_avx2: tolerate unversioned clang assume __clang_major__==0 has the latest version of _mm256_broadcastsi128_si256. fixes builds with custom clang toolchains. BUG=b/30970831 Change-Id: I90becd56278e4716bd46e2ba9d910af977e8dfa6	2016-09-16 07:14:17 +00:00
James Zern	7a9e476072	Merge changes from topic 'clang-format' * changes: apply clang-format .clang-format: update to 3.8.1	2016-09-16 07:11:33 +00:00
Johann	e813c2b416	Enable ssse3 bilinear tests The code only has issues when xoffset == 0 and yoffset == 0 which represents a simple copy. Presumably this case does not need to be handled because the issue has existed since 2010. BUG=webm:1287 Change-Id: Ic47e2653f3b729e99b40e53d8d2d8d1501edaaa9	2016-09-15 23:16:26 -07:00
Johann	caf9a7841e	Add vp8_bilinear_filter test Build out the sixtap_predict test because the filters are interchangeable. Add verbose failures and border checking. Change-Id: I962f50041750dca6f8d0cd35a943424cf82ddcb1	2016-09-15 23:16:19 -07:00
James Zern	6ae58fd55e	Merge "Revert "Restore vp8_sixtap_predict4x4_neon""	2016-09-16 06:13:42 +00:00
Johann Koenig	7795e99296	Revert "Restore vp8_sixtap_predict4x4_neon" This reverts commit `d9dce2f48e`. Appears to be failing the SixtapPredict tests in some configurations and possibly test vectors as well. Change-Id: Ica6aa83ebac47d0a76e451846e7da67b1c17a7d7	2016-09-16 06:12:49 +00:00
Johann Koenig	fdbe249991	Merge "Restore vp8_bilinear_predict4x4_neon"	2016-09-16 05:33:50 +00:00
Johann Koenig	102eae06e9	Merge "zero structures completely"	2016-09-16 04:41:22 +00:00
Johann	43743b1d3e	Restore vp8_bilinear_predict4x4_neon This function was removed when clang started introducing alignment hints which caused the 32 bit vld1_lane_u32/vst1_lane_u32 to fail: https://llvm.org/bugs/show_bug.cgi?id=24421 The load has been rendered safe with an implementation ~indiscernible performance-wise that uses _u8 and over-reads just a touch. It is still ~5x faster than C in the unaligned case and doing both filters. BUG=webm:892 BUG=webm:1273 Change-Id: Icf7167189391b46202f47233bb585c24c42bcc36	2016-09-15 21:16:11 -07:00
Johann Koenig	7bc0733c27	Merge "Restore vp8_sixtap_predict4x4_neon"	2016-09-16 04:12:08 +00:00
Johann	d5054504a7	zero structures completely Use vp[89]_zero when possible. Expand the {} set when neither is available or nearby. Change-Id: Ifc1f46f60100916cd798bf7be3a10f09321c99bd	2016-09-16 03:54:11 +00:00

... 3 4 5 6 7 ...

16377 Commits