generic-library/vpx

Author	SHA1	Message	Date
Johann	188d58eaa9	sub pel variance neon: 4x block sizes Add optimizations for blocks of width 4 BUG=webm:1423 Change-Id: Idfb458d36db3014d48fbfbe7f5462aa6eb249938	2017-05-22 14:40:01 -07:00
Johann	9b0d306a2f	sub pel avg variance neon: add neon optimizations These are missing an optimized version of vpx_comp_avg_pred BUG=webm:1423 Change-Id: I31fa6ef842e98f7ff3ea079ffed51ae33178e2ed	2017-05-22 13:58:43 -07:00
Johann	e0d294c3af	sub pel variance neon: normalize variable names match vpx_dsp/variance.c variable names Change-Id: I228c6f296c183af147b079b7c8bcdf97bd09cf3a	2017-05-22 13:58:43 -07:00
Linfeng Zhang	27beada6d0	Merge "Add vpx_highbd_idct{4x4,8x8,16x16}_1_add_sse2"	2017-05-22 20:58:18 +00:00
Johann	67ac68e399	variance neon: assert overflow conditions Change-Id: I12faca82d062eb33dc48dfeb39739b25112316cd	2017-05-22 11:25:06 -07:00
Linfeng Zhang	c167345ffb	Add vpx_highbd_idct{4x4,8x8,16x16}_1_add_sse2 BUG=webm:1412 Change-Id: Ia338a6057d36f9ed7eaa9cbd4dfbf0c3cbdc6468	2017-05-22 11:24:21 -07:00
Johann	d217c87139	neon variance: special case 4x The sub pixel variance uses a temp buffer which guarantees width == stride. Take advantage of this with the 4x and avoid the very costly lane loads. Change-Id: Ia0c97eb8c29dc8dfa6e51a29dff9b75b3c6726f1	2017-05-22 10:51:31 -07:00
Johann Koenig	e7cac13016	Merge changes Ib8dd96f7,Ie9854b77 * changes: neon variance: process 4x blocks use memcpy for unaligned neon stores	2017-05-22 17:48:33 +00:00
Marco Paniconi	b3bf91bdc6	Merge "vp9: Adjustments to cyclic refresh for high motion."	2017-05-22 06:27:30 +00:00
Marco	2adc0443dd	vp9: Adjustments to cyclic refresh for high motion. For aq-mode=3: refactor the condition for turning off the refresh. Add some adjustments for high motion content. No/little change in RTC metrics, only affects high motion case. Change-Id: I7da8eabfb0e61db014be4562806f72ee5ef4a43b	2017-05-21 22:21:44 -07:00
Marco	ff9395eb3b	vp9: Speed >= 8: Modify condition for low-resoln. No change on RTC metrics. Change-Id: I5abc573cb56572188d900645d13ba479f55a1ea0	2017-05-21 22:14:38 -07:00
Johann Koenig	b5055002d7	Merge "neon 4 byte helper functions"	2017-05-19 17:11:30 +00:00
Johann Koenig	3c603eadb4	Merge "neon fdct: 4x4 implementation"	2017-05-19 17:08:58 +00:00
Paul Wilkins	a7977ece93	Merge "Changes to modified error."	2017-05-19 12:24:32 +00:00
Marco	1205e3207e	vp9: SVC: Modify condition to allow for copy partition. When temporal layers are used, only allow for copy partition on the top temporal enhancement layer frames. Change-Id: I5472abdc0f9f6c8dafa75a7a84c615e08ae22af8	2017-05-18 14:19:31 -07:00
Jerome Jiang	6b6ff9c969	Merge "vp9: Make copy partition work for SVC and dynamic resize."	2017-05-18 19:37:30 +00:00
Marco	2ba4729ef8	vp9: Make copy partition work for SVC and dynamic resize. Only affects speed 8. Make changes to copy partition to fix a bug in setting microblock offset. Avg PSNR shows 0.02% gain on rtc_derf and 0.08% loss on rtc. Change-Id: I61c3e5914dde645331344388e7437e5638acd4f3	2017-05-18 11:33:56 -07:00
paulwilkins	5680b4517f	Changes to modified error. The modified error was a derivative of the "coded_error" that was used to allocate bits between different frames on the assumption that the allocation should be linear in terms of this modified error. I.e. a frame with double the modified error score should all things being equal get double the number of bits. The code also included upper and lower caps derived from input VBR parameters. This patch improves the initial calculation of the clip mean error (now called "mean_mod_score" as it is no longer a prediction error) used as the midpoint for the rate distribution function and normalizes the output "modified scores" scores such that 1.0 indicates a frame in the middle of the distribution. The VBR upper and lower caps are then applied directly to a frame's normalized score. This refactoring is intended to make it easier to drop in alternative distribution functions or to base the rate allocation on a corpus wide midpoint (rather than the clip mean). Change-Id: I4fb09de637e93566bfc4e022b2e7d04660817195	2017-05-18 12:56:02 +01:00
Johann	7b742da63e	neon variance: process 4x blocks Continue processing sets of 16 values. Plenty of improvement for 4x8 (doubles the speed) but only about 30% for 4x4. BUG=webm:1422 Change-Id: Ib8dd96f75d474f0348800271d11e58356b620905	2017-05-17 17:35:01 -07:00
Johann	2057d3ef75	use memcpy for unaligned neon stores Advise the compiler that the store is eventually going to a uint8_t buffer. This helps avoid getting alignment hints which would cause the memory access to fail. Originally added as a workaround for clang: https://bugs.llvm.org//show_bug.cgi?id=24421 Change-Id: Ie9854b777cfb2f4baaee66764f0e51dcb094d51e	2017-05-17 12:11:31 -07:00
Marco Paniconi	a2dfbbd7d6	Merge "vp9: Modify ChangingDropFrameThresh unittest."	2017-05-17 18:42:51 +00:00
Linfeng Zhang	13918a9ccc	Merge "Update partial idct testing code"	2017-05-17 17:53:03 +00:00
Yaowu Xu	bde2c04fb7	Merge "Experiment. Store first pass errors as per MB values."	2017-05-17 17:38:15 +00:00
Marco	4733df333f	vp9: Modify ChangingDropFrameThresh unittest. Add another (lower) bitrate to the test, to cover frame drop behavior at low bitrate range. Change-Id: Iaad003974159daf3d2d65ef3a6575a3e72e498d6	2017-05-17 09:38:21 -07:00
Linfeng Zhang	3210ca6d60	Update partial idct testing code Add PartialIDctTest::PrintDiff() to help debugging. In RunQuantCheck, try all combinations of +/-mask_ input for 4x4 idct. Update PartialIDctTest::InitInput(). Change-Id: I13fd163954a4c1a3a6cfeb5e4a4d3d0e7ff901f4	2017-05-17 09:28:32 -07:00
Johann	105503b839	neon fdct: 4x4 implementation Approximately twice as fast as C implementation. BUG=webm:1424 Change-Id: I3c0307fb08ddc23df42545cd089a78e2ed5c9d3f	2017-05-17 07:38:18 -07:00
paulwilkins	42e5073f94	Experiment. Store first pass errors as per MB values. Most existing first pass stats are stored in a form normalized to a macro-block scale. However the error scores for intra / inter etc were stored as frame level values but mainly used as MB level values. This change fixes that. Normalized per MB values make comparisons between different formats easier and in any case this is usually what is wanted. An change in results should be limited to slight differences in rounding. *** Change after patch 8 +2 requiring new approval. Final pre-submit testing showed one 4K clip with above expected change. Investigation showed this was due to a value used to test for ultra low intra complexity in key frame detection. This was a per frame not per MB value but also did not scale with frame size. Replacement with a small per MB value (based on original per frame value and cif frame size) resolved the KF detection problem. Also converted kf_group_error_left to a double in line with other error values to reduce rounding problems in KF group bit allocation All clips and sets now show nominal (or 0) change as expected. Change-Id: Ic2d57980398c99ade2b7380e3e6ca6b32186901f	2017-05-17 12:00:18 +01:00
Linfeng Zhang	18e8baa5c0	Add transpose_32bit_4x4() and rename transpose_4x4() for vpx_dsp/x86 Change-Id: Ib57377f6cf6573c04720d3cc5dea4285362b4220	2017-05-16 17:46:37 -07:00
Johann Koenig	31cb852a90	Merge "Revert "Add visibility="protected" attribute for global variables referenced in asm files.""	2017-05-16 23:39:37 +00:00
Johann Koenig	2300e16675	Revert "Add visibility="protected" attribute for global variables referenced in asm files." This reverts commit `0d88e15454`. Reason for revert: chromium builds are failing to locate vpx_rv during dlopen() dlopen failed: cannot locate symbol "vpx_rv" referenced by "libstandalonelibwebviewchromium.so" Original change's description: > Add visibility="protected" attribute for global variables referenced in asm files. > > During aosp builds with binutils-2.27, we're seeing linker error > messages of this form: > libvpx.a(subpixel_mmx.o): relocation R_386_GOTOFF against preemptible > symbol vp8_bilinear_filters_x86_8 cannot be used when making a shared > object > > subpixel_mmx.o is assembled from "vp8/common/x86/subpixel_mmx.asm". > Other messages refer to symbol references from deblock_sse2.o and > subpixel_sse2.o, also assembled from asm files. > > This change marks such symbols as having "protected" visibility. This > satisfies the linker as the symbols are not preemptible from outside > the shared library now, which I think is the original intent anyway. > > Change-Id: I2817f7a5f43041533d65ebf41aefd63f8581a452 > TBR=jzern@google.com,johannkoenig@google.com,rahulchaudhry@chromium.org,builds@webmproject.org Change-Id: I0c2ea375aa7ef5fda15b9d9e23e654bb315c941b	2017-05-16 15:54:33 -07:00
Marco Paniconi	baef5486bf	Merge "Revert "Revert "vp8: Real-time mode: reduce mode_check_freq thresh for speed 10."""	2017-05-16 22:50:29 +00:00
Marco Paniconi	13d4a0d011	Revert "Revert "vp8: Real-time mode: reduce mode_check_freq thresh for speed 10."" This reverts commit `3704807805`. Reason for revert: <INSERT REASONING HERE> Does not look to be the cause of the test failures. Original change's description: > Revert "vp8: Real-time mode: reduce mode_check_freq thresh for speed 10." > > This reverts commit `4a7424adba`. > > Reason for revert: <INSERT REASONING HERE> > Possibly causing test failures in roll into chromium. > > Original change's description: > > vp8: Real-time mode: reduce mode_check_freq thresh for speed 10. > > > > Reduces quality regression at speed 10 for real-time mode. > > > > Change-Id: I9f624bea9ca262dab32ce9de7d6d91175d6becc8 > > > > TBR=marpan@google.com,builds@webmproject.org,jianj@google.com > # Not skipping CQ checks because original CL landed > 1 day ago. > > Change-Id: I1defcb74e78a5a3bd29b7d1b21a96a79fa26a457 > TBR=marpan@google.com,builds@webmproject.org,jianj@google.com NOPRESUBMIT=true NOTREECHECKS=true NOTRY=true Change-Id: I13d86a2a68b8aa8c0c7465e6e58cff0e00bc7862	2017-05-16 22:50:19 +00:00
Marco Paniconi	b9987a7c25	Merge "Revert "vp8: Real-time mode: reduce mode_check_freq thresh for speed 10.""	2017-05-16 22:48:39 +00:00
Marco Paniconi	3704807805	Revert "vp8: Real-time mode: reduce mode_check_freq thresh for speed 10." This reverts commit `4a7424adba`. Reason for revert: <INSERT REASONING HERE> Possibly causing test failures in roll into chromium. Original change's description: > vp8: Real-time mode: reduce mode_check_freq thresh for speed 10. > > Reduces quality regression at speed 10 for real-time mode. > > Change-Id: I9f624bea9ca262dab32ce9de7d6d91175d6becc8 > TBR=marpan@google.com,builds@webmproject.org,jianj@google.com # Not skipping CQ checks because original CL landed > 1 day ago. Change-Id: I1defcb74e78a5a3bd29b7d1b21a96a79fa26a457	2017-05-16 22:48:13 +00:00
Johann Koenig	dac3b59721	Merge "'protected' visibility unsupported on macho"	2017-05-15 21:21:45 +00:00
Johann	7498fe2e54	neon 4 byte helper functions When data is guaranteed to be aligned, use helper functions which assert that requirement. Change-Id: Ic4b188593aea0799d5bd8eda64f9858a1592a2a3	2017-05-15 13:42:31 -07:00
Johann	3fbc371e99	'protected' visibility unsupported on macho Mac builds must not specify 'protected' visibility. Then only support 'default' and 'hidden'. https://developer.apple.com/library/content/documentation/DeveloperTools/Conceptual/CppRuntimeEnv/Articles/SymbolVisibility.html Change-Id: I94eccfaa29af0ddcc4a5c1c0e14cf63ef7146462	2017-05-15 11:29:22 -07:00
Johann Koenig	8739a182c8	Merge "move neon load/stores to a new file"	2017-05-15 18:15:27 +00:00
Johann	1088b4f87c	move neon load/stores to a new file Move the tran_low_t helper functions to a new file. Additional load/store functions will be added here. Change-Id: I52bf652c344c585ea2f3e1230886be93f5caefc3	2017-05-15 08:29:43 -07:00
Marco	4a7424adba	vp8: Real-time mode: reduce mode_check_freq thresh for speed 10. Reduces quality regression at speed 10 for real-time mode. Change-Id: I9f624bea9ca262dab32ce9de7d6d91175d6becc8	2017-05-14 18:19:06 -07:00
Alexandra Hájková	bcbc3929ae	ppc: Add vpx_sad64/32/16x64/32/16_avg_vsx Change-Id: Ic9639b1331d8c5cbc207c2a036891ff0137fc56f	2017-05-13 13:13:15 +00:00
Jerome Jiang	6b9d130214	Merge "vp9: speed 8: Fix seg fault in partition copy when drop frames."	2017-05-13 03:20:49 +00:00
Cheng Chen	4c0655f26b	Merge "Speed up encoding by skipping altref recode"	2017-05-13 01:29:59 +00:00
Jerome Jiang	1fcd5cca3c	vp9: speed 8: Fix seg fault in partition copy when drop frames. BUG=webm:1433 Change-Id: I4f3984ef28660d3218d48007d7c977bdbdaf8af6	2017-05-12 15:57:23 -07:00
Rahul Chaudhry	0d88e15454	Add visibility="protected" attribute for global variables referenced in asm files. During aosp builds with binutils-2.27, we're seeing linker error messages of this form: libvpx.a(subpixel_mmx.o): relocation R_386_GOTOFF against preemptible symbol vp8_bilinear_filters_x86_8 cannot be used when making a shared object subpixel_mmx.o is assembled from "vp8/common/x86/subpixel_mmx.asm". Other messages refer to symbol references from deblock_sse2.o and subpixel_sse2.o, also assembled from asm files. This change marks such symbols as having "protected" visibility. This satisfies the linker as the symbols are not preemptible from outside the shared library now, which I think is the original intent anyway. Change-Id: I2817f7a5f43041533d65ebf41aefd63f8581a452	2017-05-12 11:11:16 -07:00
Marco Paniconi	9a66582604	Merge "vp9: Use INTERP_FILTER for filter_type in vp9_rtcd_defs.pl"	2017-05-12 17:02:50 +00:00
James Zern	ac8f58f6ab	Merge changes I1b54a7a5,I3028bdad,I59788cd9 * changes: ppc: Add get_mb_ss_vsx ppc: Add get4x4sse_cs_vsx ppc: Add comp_avg_pred_vsx	2017-05-12 15:24:59 +00:00
Luca Barbato	143b21e362	ppc: Add get_mb_ss_vsx Change-Id: I1b54a7a5bb642e4b836d786ea1ae506eed025e3f	2017-05-12 17:23:00 +02:00
Luca Barbato	6d225eb5f9	ppc: Add get4x4sse_cs_vsx Change-Id: I3028bdadf653665d18e781d28e9625f62804b3d8	2017-05-12 17:23:00 +02:00
Luca Barbato	a7f8bd451b	ppc: Add comp_avg_pred_vsx Change-Id: I59788cd98231e707239c2ad95ae54f67cfe24e10	2017-05-12 17:22:55 +02:00

... 3 4 5 6 7 ...

17482 Commits