generic-library/vpx

Author	SHA1	Message	Date
Johann	1f41c0d37f	vp8 mfqe: zero map[] The loop appears to set map[i] with the intention of running the 'j' loop up to that point. However, without zero'ing map[] first the behavior is unpredictable. Fixes a static analysis warning: warning: Branch condition evaluates to a garbage value for (j = 0; j < 4 && map[j]; ++j) { Change-Id: Ifa39353d8aa5cc47b467a7d3d8cdd3b5319fd997	2018-03-14 13:56:39 -07:00
Shiyou Yin	d344ab03cc	vp8: [loongson] fix bug of type conflict. In commit `577d4fa79`, int8_t was used to replace char. This will result in a compilation error, for int8_t was typedefined to signed char, but not char. Change-Id: I5c9837e01b0b58688a7741f5c9a99a76ca887e4a	2018-01-23 14:28:55 +08:00
Johann Koenig	3761254119	Merge changes from topic "clang-format" * changes: clang-format v5.0.0 vp9/ remove spurious comments clang-format v5.0.0 vp8/ clang-format v5.0.0 vpx_dsp/ clang-format v5.0.0 mem_ops.h clang-format v5.0.0 vpx_util/vpx_atomic.h clang-format v5.0.0 y4minput.c clang-format v5.0.0 vpxenc.c clang-format v5.0.0 examples/ clang-format v5.0.0 test/	2018-01-22 19:38:42 +00:00
Johann	f95bf1db50	clang-format v5.0.0 vp8/ Allow*OnASingleLine appears to no longer apply to typedef structs. Adjust closing parenthesis/opening brace on functions. Remove trailing commas to keep multiple elements on one line. Change-Id: I6e535a8ddb15c9b3de8216ce8ddb2a18241af46c	2018-01-18 12:37:58 -08:00
Marco	9debbc2ec7	vp8: Fix to multi-res-encoder for skipping streams. For the vp8 simulcast/multi-res-encoder: Add flags to keep track of the disabling/skipping of streams for the multi-res-encoder. And if the lower spatial stream is skipped for a given stream, disable the motion vector reuse for that stream. Also remove the condition of forcing same frame type across all streams. This fix allows for the skipping/disabling of the base or middle layer streams. Change-Id: Idfa94b32b6d2256932f6602cde19579b8e50a8bd	2018-01-18 11:56:42 -08:00
Johann	f5b2dd2a66	adopt some clang 5.0.0 formatting At least the changes that don't conflict with 4.0.1 Change-Id: I9b6a7c14dadc0738cd0f628a10ece90fc7ee89fd	2018-01-11 12:35:24 -08:00
Shiyou Yin	08a668af32	vp8: [loongson] optimize loopfilter v2. Optimize function vp8_mbloop_filter_vertical_edge_mmi and function vp8_mbloop_filter_horizontal_edge_mmi. Make full use of memory loading delay slot and reduce unnecessary instructions. Change-Id: I61da2c3a44c06044225461f46bf487d83cba6c16	2017-12-15 17:06:47 +08:00
Shiyou Yin	09519a55c7	Merge "vp8: [loongson] optimize sixtab predict v2."	2017-12-15 00:53:21 +00:00
Johann	e4b3f03c64	add copyright to rtcd files Allows them to pass the license check in chromium. BUG=chromium:98319 Change-Id: Iefc1706152a549d8c4ae774c917596bf1c9492d8	2017-12-14 22:50:08 +00:00
Shiyou Yin	f2ad523461	vp8: [loongson] optimize sixtab predict v2. 1. Delete unnecessary zero setting process. 2. Optimize the method of calculating SSE in vpx_varianceWxH. Change-Id: I8bab801416e7f4958c28c6d080e3cf785a50f82b	2017-12-14 16:29:58 +08:00
Johann	bdbecea1ba	explicitly label .text sections nasm should infer .text but does not for windows: https://bugzilla.nasm.us/show_bug.cgi?id=3392451 Change-Id: Ib195465e5f33405f5ff61c4cf88aa2a72640cacb	2017-12-01 14:33:04 -08:00
James Zern	acb9460929	vp8: correct if/else '{' placement swap '{' and c-style comments removing a few redundant ones along the way; covers most leftovers from the clang-tidy run against an x86_64-linux config. Change-Id: I67a45596f80a12389faca49c5be440875092a7df	2017-10-27 12:27:10 -07:00
Shiyou Yin	577d4fa792	vp8: [loongson] optimize idct with mmi 1. vp8_dequant_idct_add_y_block_mmi 2. vp8_dequant_idct_add_uv_block_mmi Change-Id: I9987147be2685ac79d4b045d1d56f6709ee1223c	2017-10-17 03:27:31 +00:00
Shiyou Yin	f70de09f2a	vp8: [loongson] optimize dct with mmi 1. vp8_short_fdct4x4_mmi 2. vp8_short_fdct8x4_mmi 3. vp8_short_walsh4x4_mmi Change-Id: I89a7df25cfd09fae309fac257ad8b6a3dc1c8acb	2017-10-12 08:50:04 +08:00
Shiyou Yin	e8ed2bb762	vp8: [loongson] optimize quantize with mmi 1. vp8_fast_quantize_b_mmi 2. vp8_regular_quantize_b_mmi Change-Id: Ic6e21593075f92c1004acd67184602d2aa5d5646	2017-10-11 16:45:58 +08:00
Shiyou Yin	73102d1ed2	vp8: [loongson] optimize copymen with mmi 1. vp8_copy_mem16x16_mmi 2. vp8_copy_mem8x8_mmi 3. vp8_copy_mem8x4_mmi Change-Id: I3de29a11fa7402df0e48bbb944440b1e66498a65	2017-09-26 08:40:11 +08:00
Shiyou Yin	b81de66171	vp8: [loongson] optimize dequantize with mmi 1. vp8_dequantize_b_mmi 2. vp8_dequant_idct_add_mmi Change-Id: I505f8afb7a444173392b325906e6a4f420f00709	2017-09-14 20:56:06 +08:00
Shiyou Yin	5b558592f5	vp8: [loongson] optimize idctllm with mmi 1. vp8_short_idct4x4llm_mmi 2. vp8_short_inv_walsh4x4_mmi 3. vp8_dc_only_idct_add_mmi Change-Id: I616923681e79d78607a4988608fc39df77b093f4	2017-09-14 16:51:11 +08:00
Shiyou Yin	761f2f5cb4	vp8: [loongson] optimize loopfilter with mmi 1. vp8_loop_filter_horizontal_edge_mmi 2. vp8_loop_filter_vertical_edge_mmi 3. vp8_mbloop_filter_horizontal_edge_mmi 4. vp8_mbloop_filter_vertical_edge_mmi 5. vp8_loop_filter_simple_horizontal_edge_mmi 6. vp8_loop_filter_simple_vertical_edge_mmi Change-Id: Ie34bbff3a16cff64e39a50798afd2b7dac9bcdc3	2017-09-11 11:08:09 +08:00
Shiyou Yin	0095213790	vp8: [loongson] optimize sixtap predict with mmi 1. vp8_sixtap_predict16x16_mmi 2. vp8_sixtap_predict8x8_mmi 3. vp8_sixtap_predict8x4_mmi 4. vp8_sixtap_predict4x4_mmi Change-Id: I186669d1a1d998a0f3ba3a548e25eee8b52c251b	2017-09-02 19:08:20 +00:00
Peter Boström	d42e876164	Add atomics to vp8 synchronization primitives. Fixes issue on iPad Pro 10.5 (and probably other places) where threads are not properly synchronized. On x86 this data race was benign as load and store instructions are atomic, they were being atomic in practice as the program hasn't been observed to be miscompiled. Such guarantees are not made outside x86, and real problems manifested where libvpx reliably reproduced a broken bitstream for even just the initial keyframe. This was detected in WebRTC where this device started using multithreading (as its CPU count is higher than earlier devices, where the problem did not manifest as single-threading was used in practice). This issue was not detected under thread-sanitizer bots as mutexes were conditionally used under this platform to simulate the protected read and write semantics that were in practice provided on x86 platforms. This change also removes several mutexes, so encoder/decoder state is lighter-weight after this change and we do not need to initialize so many mutexes (this was done even on non-thread-sanitizer platforms where they were unused). Change-Id: If41fcb0d99944f7bbc8ec40877cdc34d672ae72a	2017-08-31 17:55:57 -07:00
Kaustubh Raste	39e8b8dac6	Fix mips dspr2 6 tap filter clobber list Change-Id: Ib7c07e6ce00a5c7e59113b16e6661a8369f9e646	2017-08-04 10:56:56 +05:30
Marco	b9577e07fc	vp8: Drop due to overshoot for non-screen content. For 1 pass CBR mode: Apply the logic for dropping (and re-adjusting rate control) due to large overshoot to the case of non-screen content when drop_frames_allowed is enabled. For the non-screen content case: add additional condition that rate correction factor is close to minimum state, and flag to constrain the frequency of the dropping. Also handle the case of temporal layers and multi-res encoding. Add some flags/counters to the layer context for temporal layers. For multi-res: drop due to overshoot is checked on lowest stream, and if overshoot is detected we force drops on all upper streams for that frame. This feature is to avoid large frame sizes on big content changes following low content period. No change in behavior for screen_content_mode = 2. Change-Id: I797ab236cbbf3b15cad439e9a227fbebced632e6	2017-08-02 13:12:48 -07:00
Han Shen	b72d3e8a25	Earmark extra space for VSX. Backend specific optimization for PPC VSX reads 16 bytes, whereas arm neon / sse2 only reads <= 8 bytes. Although the extra bytes read are actually never used, this is not a warrant for groping around. Fixed by allocating more when building for VSX. This is reported by asan. Also note - PPC does have assembly that loads 64-bit content from memory - lxsdx loads one 64-bit doubleword (whereas lxvd2x loads two 64-bit doubleword) from memory. However, we only have "vec_vsx_ld" builtins that mapped to lxvd2x, no builtins to lxsdx. The only way to access lxsdx is through inline assembly, which does not fit well in the origin paradigm. Refer: vsx: vpx_tm_predictor_4x4_vsx @ third_party/libvpx/git_root/vpx_dsp/ppc/intrapred_vsx.c neon: vpx_tm_predictor_4x4_neon @ third_party/libvpx/git_root/vpx_dsp/arm/intrapred_neon_asm.asm sse2: tm_predictor_4x4 @ third_party/libvpx/git_root/vpx_dsp/x86/intrapred_sse2.asm BUG=b/63112600 Tested: asan tests passed. Change-Id: I5f74b56e35c05b67851de8b5530aece213f2ce9d	2017-07-19 13:59:32 -07:00
Jerome Jiang	b0c4d87ac7	vp8: Clean up skinmap debugging codes. Use the computed skinmap. Change-Id: I8aabb5922ef5190ec85b9e01807cb79b4803b925	2017-06-23 14:05:33 -07:00
James Zern	88a302e743	Merge changes from topic 'missing-proto' * changes: onyxd_int.h: add missing prototypes onyxd.h: add vp8dx_references_buffer prototype vp[89],vpx_dsp: add missing includes vp8,encodeframe.h: correct prototypes vp8: add temporal_filter.h add picklpf.h add ethreading.h vp8,bitstream.h: add missing prototypes vp8: remove vp8_fast_quantize_b_mmx vp8,loopfilter_filters: make some functions static vp9_ratectrl: make adjust_gf_boost_lag_one_pass_vbr static vp9_encodeframe: make scale_part_thresh_sumdiff static vp9_alt_ref_aq: correct vp9_alt_ref_aq_create proto tiny_ssim: make some functions static	2017-06-23 05:44:24 +00:00
James Zern	7c0788b07f	onyxd.h: add vp8dx_references_buffer prototype quiets -Wmissing-prototypes Change-Id: I6bee535f3fb67e54a390266d787a5a92127aeadc	2017-06-21 19:00:15 -07:00
James Zern	18335f193d	vp8,loopfilter_filters: make some functions static quiets -Wmissing-prototypes Change-Id: Ie5b00537f64a05e68a38dc558463691523988994	2017-06-21 19:00:14 -07:00
Jerome Jiang	a36017e007	Enable 8x8 skin detection for vp8. If 2 or more 8x8 blocks are identified as skin, the macroblock will be labeled as skin. Change-Id: I596542c81a2df9e96270cab39d920bbfeb02bc6e	2017-06-15 20:53:03 -07:00
James Zern	4f9d852759	vp8_skin_detection: add 'vp8_' prefix to public fns BUG=webm:1438 Change-Id: I5feb31c254d02e116e624cfe702e73ba5a1f7aca	2017-06-12 20:13:28 -07:00
James Zern	98666368ee	rename vp8/common/skin_detection.[hc] -> vp8_* some build systems have trouble with duplicate basenames. vpx_dsp/skin_detection.[hc] were added in: `658e85425` Merge skin detection code in vp8/9. BUG=webm:1438 Change-Id: Ieaa70b40bda409ec23e6d179b47a930ac6243b05	2017-06-12 20:13:23 -07:00
Jerome Jiang	ff2d220d21	Remove duplication on vp8/9_write_yuv_frame. Change-Id: Ib3546032a27c715bf509c0e24d26a189bc829da8	2017-06-09 17:08:26 -07:00
Jerome Jiang	658e854252	Merge skin detection code in vp8/9. BUG=webm:1438 Change-Id: Ie3dc034c7dbb498a0b088a767b1936ddeed4df14	2017-06-07 21:20:34 -07:00
Jerome Jiang	68f035026f	vp8 skin detection: Fix visual studio build failure. Change-Id: I510b755550ebbfa2aaf9b974920d7f1c6454a845	2017-06-01 13:46:46 -07:00
Jerome Jiang	f1a300acc4	vp8: Clean up skin detection. Use only the average of center 2x2 pixels in vp8. Change-Id: I2b23ff19a90827226273e0fca49e90c734eda59b	2017-05-31 14:57:10 -07:00
Jerome Jiang	c39526da8a	Write skin map of vp8 skin detection for debug. Change-Id: Ica1b4e918aa759cd0ce65920f9d88452bbf9e3b4	2017-05-30 10:30:05 -07:00
Jerome Jiang	327c9bb1da	Refactor: Move vp8 skin detection to new files. Change-Id: If760f28cbbf22beac1cc9bd1546f13831e9dd3f0	2017-05-25 16:12:27 -07:00
Johann Koenig	e7cac13016	Merge changes Ib8dd96f7,Ie9854b77 * changes: neon variance: process 4x blocks use memcpy for unaligned neon stores	2017-05-22 17:48:33 +00:00
Johann	2057d3ef75	use memcpy for unaligned neon stores Advise the compiler that the store is eventually going to a uint8_t buffer. This helps avoid getting alignment hints which would cause the memory access to fail. Originally added as a workaround for clang: https://bugs.llvm.org//show_bug.cgi?id=24421 Change-Id: Ie9854b777cfb2f4baaee66764f0e51dcb094d51e	2017-05-17 12:11:31 -07:00
Johann Koenig	2300e16675	Revert "Add visibility="protected" attribute for global variables referenced in asm files." This reverts commit `0d88e15454`. Reason for revert: chromium builds are failing to locate vpx_rv during dlopen() dlopen failed: cannot locate symbol "vpx_rv" referenced by "libstandalonelibwebviewchromium.so" Original change's description: > Add visibility="protected" attribute for global variables referenced in asm files. > > During aosp builds with binutils-2.27, we're seeing linker error > messages of this form: > libvpx.a(subpixel_mmx.o): relocation R_386_GOTOFF against preemptible > symbol vp8_bilinear_filters_x86_8 cannot be used when making a shared > object > > subpixel_mmx.o is assembled from "vp8/common/x86/subpixel_mmx.asm". > Other messages refer to symbol references from deblock_sse2.o and > subpixel_sse2.o, also assembled from asm files. > > This change marks such symbols as having "protected" visibility. This > satisfies the linker as the symbols are not preemptible from outside > the shared library now, which I think is the original intent anyway. > > Change-Id: I2817f7a5f43041533d65ebf41aefd63f8581a452 > TBR=jzern@google.com,johannkoenig@google.com,rahulchaudhry@chromium.org,builds@webmproject.org Change-Id: I0c2ea375aa7ef5fda15b9d9e23e654bb315c941b	2017-05-16 15:54:33 -07:00
Rahul Chaudhry	0d88e15454	Add visibility="protected" attribute for global variables referenced in asm files. During aosp builds with binutils-2.27, we're seeing linker error messages of this form: libvpx.a(subpixel_mmx.o): relocation R_386_GOTOFF against preemptible symbol vp8_bilinear_filters_x86_8 cannot be used when making a shared object subpixel_mmx.o is assembled from "vp8/common/x86/subpixel_mmx.asm". Other messages refer to symbol references from deblock_sse2.o and subpixel_sse2.o, also assembled from asm files. This change marks such symbols as having "protected" visibility. This satisfies the linker as the symbols are not preemptible from outside the shared library now, which I think is the original intent anyway. Change-Id: I2817f7a5f43041533d65ebf41aefd63f8581a452	2017-05-12 11:11:16 -07:00
James Zern	e05f4cf8f4	intrapred: rename d63f to d63e this is consistent with he/ve/d45e Change-Id: I75641ae5667430b0ecd370db86fff6e666cb577d	2017-03-24 20:41:39 -07:00
Johann	ccd23215ed	ppc: include ppc.h for ppc_simd_caps() Change-Id: Idc829eb066cf4e905d062cb9c08424e0f1b7e1a7	2017-03-09 09:26:45 -08:00
Rafael de Lucena Valle	51289302ab	Add support for POWER8/VSX Add ppc, ppc64 and ppc64le on all_platforms and ARCH_LIST Add VSX flags and check for -mvsx Define empty setup_rtcd_internal Add Altivec detection based on: http://freevec.org/function/altivec_runtime_detection_linux Detect VSX at runtime when enabled Change-Id: I304f4d8c5fee0ff19b6483cd2e9cc50d6ddec472 Signed-off-by: Rafael de Lucena Valle <rafaeldelucena@gmail.com>	2017-03-08 20:28:08 +00:00
clang-format	4b402746ca	apply clang-format Change-Id: I75e4a9e0b37bd4586f26c8d6c1fa27f3f6ff1bce	2017-02-14 12:45:52 -08:00
Peter Boström	297dfd8696	Add decoder getters for the last quantizer. To be used for frame stats output of vpxdec. Change-Id: I0739a01bd3635c4b3fedd58f3e27363ce8fb1b1e	2017-01-12 16:16:22 -05:00
Jim Bankoski	318a1ff5ec	vp8 : use threading mutex's for tsan only. To avoid decode performance hit of 2% when running on hyperthreaded cores. This patch only uses the mutex's when we are running tsan. This is safe because 32 bit operations like read and store are atomic on all the platforms we care about. Tsan warns about race situations, but in this case either situation ( read occurs before write or write before read) the worst case is that we go around one extra time in the loop. So the ordering doesn't really matter. That said a few other things have been tried : for instance as per here: webrtc/base/atomicops.h#52 In this patch they use: __atomic_load_n(i, __ATOMIC_ACQUIRE); __atomic_store_n(i, value, __ATOMIC_RELEASE); This code works on gcc, clang ( replacing protected write and read), and avoids tsan errors. Incurring no penalty in performance. In C11 its replaced by straight atomic operands. However there is no equivalent in the visual studio's we support as int32 on all windows platforms is already atomic. To avoid tsan like warnings on windows we'd need to use interlocked exchange and the end result doesn't gain us any thing. Change-Id: I2066e3c7f42641ebb23d53feb1f16f23f85bcf59	2016-12-16 08:50:55 -08:00
Jim Bankoski	85a541a421	Reapply 'Amend and improve VP8 multithreading implementation' Reapply this patch: `ff0107f` Amend and improve VP8 multithreading implementation Amended the patch to add a unit test, and fix an asan error. BUG=webm:851 Change-Id: I6572c03256169c64e80248bf5a5e99f59a2fc93c	2016-12-13 02:11:34 +00:00
Kaustubh Raste	ecc5998bcf	Fix mips dspr2 build warning Change-Id: Ia8fb3ed124f01384e7896e309c9ff22c05b40719	2016-11-22 17:49:17 +05:30
James Zern	ba016b710a	Merge "ppflags.h: remove unused _DEBUG_* enum values"	2016-11-10 20:53:22 +00:00

1 2 3 4 5 ...

764 Commits