generic-library/vpx

Author	SHA1	Message	Date
Marco	a0de2692fc	vp9: Speed 6 adapt_partition for live/vbr usage. Enable adapt_partition for vbr mode for speed 6. This allows the usage of the pickmode-based partition (used in speed 5), but only selectively for superblocks with high source sad, otherwise the faster variance based partition scheme is used. For speed 6 on ytlive set: avgPSNR/SSIM metrics up by ~0.6%, several clips up by ~1.5%. Small/negligible decrease in speed. Change-Id: I12f3efef6b3e059391de330fdbe5a44c2587f1f8	2017-08-25 11:36:34 -07:00
Marco Paniconi	34e48d6115	Merge "vp9: Adjust 16x16 split threshold for variance partition"	2017-08-24 22:26:43 +00:00
Tom Finegan	ccae8da7c6	Make sure diff is present at configure time. This avoids an endless build loop at vpx_version.h creation time when diff is not present. Change-Id: I16ae386dbdaf14f9a2b85e4c5d1aaa6c08f52a45	2017-08-24 12:11:48 -07:00
Johann Koenig	6c21650c0e	Merge "quantize avx: copy 32x32 implementation"	2017-08-24 18:55:03 +00:00
Marco	d14777157e	vp9: Adjust 16x16 split threshold for variance partition For speeds < 7, increase threshold that controls the split of 16x16->8x8 blocks, for resolutions 720p and higher. Minor change for speed 5 (since it uses reference partition scheme which only uses variance partition as first step). For speed 6: ~0.5% increase in avgPSNR/SSIM metrics on ytlive set. No change in speed. Change-Id: I5126580973201538d8ca26a9256b93c4d11d685b	2017-08-24 10:44:05 -07:00
Johann Koenig	258122fdc6	Merge "quantize test: skip block was removed"	2017-08-24 17:43:10 +00:00
Johann	f60d1dcd3d	quantize avx: copy 32x32 implementation Ensure avx and ssse3 stay in sync by testing them against each other. Change-Id: I699f3b48785c83260825402d7826231f475f697c	2017-08-24 10:42:34 -07:00
Johann	1787e7dbe0	quantize ssse3: copy implementation to intrinsics Still does not pass tests. Does match the previous assembly, although saving the sign before multiplying is dubious. Change-Id: Ia163f18c755aba542d6e93f7bf7343184660df5a	2017-08-24 07:47:51 -07:00
Johann	92aafefa1e	quantize test: skip block was removed Change-Id: I1d93698bc27529b0544d79dd7b9fe37afa51ef87	2017-08-24 07:21:42 -07:00
Johann Koenig	2dc0a5132d	Merge "quantize test: set threshold for 32x32"	2017-08-24 14:04:29 +00:00
Shiyou Yin	d080c92524	Merge "vpx_dsp:loongson optimize vpx_mseWxH_c(case 16x16,16X8,8X16,8X8) with mmi."	2017-08-24 00:55:11 +00:00
Marco Paniconi	30c261b1eb	Merge "vp9: SVC: Skip NEWMV for small blocks for (0, 0) base_mv."	2017-08-23 23:09:33 +00:00
Johann	e89344d61a	quantize test: set threshold for 32x32 Change-Id: I77be617c7d7c64929dd51c6077322f4f8ad23897	2017-08-23 15:59:11 -07:00
Johann Koenig	f53b656207	Merge "quantize avx: copy implementation to intrinsics"	2017-08-23 21:14:13 +00:00
Marco	c9ff7b6637	vp9: SVC: Skip NEWMV for small blocks for (0, 0) base_mv. For SVC encoding: average speedup ~1.5%, with small ~0.57 loss in avgPSNR metrics. Change-Id: Icebce6f6ef4e819d7dfcf8db898c583167351de4	2017-08-23 13:08:27 -07:00
Scott LaVarnway	1aad50c092	Merge "vpx_dsp: get32x32var_avx2() cleanup"	2017-08-23 19:59:25 +00:00
Johann Koenig	dfafd10ef5	Merge "quantize neon: round dqcoeff towards zero"	2017-08-23 19:20:53 +00:00
Johann	7c27872164	quantize avx: copy implementation to intrinsics Adds an early exit based on ptest. Slightly slower than ssse3 in the full case because of the extra check, but potentially faster if lots of rows can be skipped. Very close in speed to the assembly. Can run in 32 bit, unlike the assembly. Allows reworking the function prototype to use structs. Change-Id: If80e2b9ba059370a4cad3c973196e82a97b4330e	2017-08-23 09:19:16 -07:00
Johann	2a5aa98a35	quantize neon: round dqcoeff towards zero Add 1 if negative to get dqcoeff to round towards zero. 10-15% faster than converting to positive before shifting. Change-Id: I01a62fd0c9bca786b6885b318bd447bb9229903d	2017-08-23 08:05:50 -07:00
Johann	e83d99d7b8	quantize fp: neon implementation About 4x faster when values are below the dequant threshold and 10x faster if everything needs to be calculated. Both numbers would improve if the division for dqcoeff could be simplified. BUG=webm:1426 Change-Id: I8da67c1f3fcb4abed8751990c1afe00bc841f4b2	2017-08-23 08:01:30 -07:00
Shiyou Yin	59e065b6ed	vpx_dsp:loongson optimize vpx_mseWxH_c(case 16x16,16X8,8X16,8X8) with mmi. Change-Id: I2c782d18d9004414ba61b77238e0caf3e022d8f2	2017-08-23 15:14:15 +08:00
Marco Paniconi	0207f17144	Merge "vp9: Condition lighting change detection on CBR mode."	2017-08-22 22:52:05 +00:00
Johann Koenig	103e4e50a8	Merge changes I53f8a160,I48f282bf * changes: quantize ssse3: copy style from sse2 quantize sse2: copy opts from ssse3	2017-08-22 22:27:56 +00:00
Marco	a31461c853	vp9: Condition lighting change detection on CBR mode. This feature is used for the CBR RTC encoding mode at speed >= 6. This change will exclude it for VBR mode. For speed 6 live encoding (VBR): avgPSNR/SSIM metrics on ytlive set up by ~1% (few clips up by 2/3%). No change in speed. Change-Id: I1a0dd94c334f7df309ab5a48d477d7e25355b798	2017-08-22 14:59:37 -07:00
Johann	b9c1dcc5fa	quantize ssse3: copy style from sse2 Change-Id: I53f8a160e640c674ea035fc112e207b6dca42598	2017-08-22 14:25:27 -07:00
Johann Koenig	7f2993f5e4	Merge "quantize: capture skip block early"	2017-08-22 20:03:02 +00:00
Johann	75752ab7c0	quantize sse2: copy opts from ssse3 Simplify eob calculations based on ssse3 implementation. General clean up and re-scoping. Change-Id: I48f282bf9bd28ee9bc2c7a6779be9d45b5a3a3ee	2017-08-22 13:01:44 -07:00
Johann Koenig	ab27b68693	Merge changes Icfb70687,I9a963e99,Ie8ac00ef,I1272917c * changes: quantize: ignore skip_block in arm quantize: ignore skip_block in x86 quantize fp: ignore skip_block in arm quantize fp: ignore skip_block in x86	2017-08-22 19:19:14 +00:00
Johann	7a178a5631	quantize: capture skip block early This should probably be handled before vp9_regular_quantize_b_4x4 even gets called. Fixes an assert resulting from removing skip_block from the quantize functions. BUG=webm:1459 Change-Id: I7f52b53f959b4654b3d4517ebda31a678f4d0fde	2017-08-22 12:10:55 -07:00
James Zern	419ce36294	Merge "ppc: Add vpx_idct16x16_256_add_vsx"	2017-08-22 00:48:39 +00:00
Shiyou Yin	bff5aa9827	Merge "vpx_dsp:loongson optimize vpx_subtract_block_c (case 4x4,8x8,16x16) with mmi."	2017-08-22 00:37:23 +00:00
Johann	2c56bb97f2	quantize: ignore skip_block in arm Change-Id: Icfb70687476b2edb25d255793ba325b261d40584	2017-08-21 14:37:50 -07:00
Johann	c02fdd0258	quantize: ignore skip_block in x86 Change-Id: I9a963e99f08761f0c8d6a305619270b2f1c4edf8	2017-08-21 14:37:03 -07:00
Johann	b527b47312	quantize fp: ignore skip_block in arm Change-Id: Ie8ac00efa826eead2a227726a1add816e04ff147	2017-08-21 14:34:48 -07:00
Johann	7b13d99b98	quantize fp: ignore skip_block in x86 Change-Id: I1272917c49cf6e6710e52c36535b2fc8c8dced78	2017-08-21 14:33:41 -07:00
Johann	661efeca97	quantize test: test _fp_ version of quantize None of the x86 optimizations pass the tests. Change-Id: Ic67f2ba1977b657e68f2a13b0711fc5fcbafd909	2017-08-21 12:29:41 -07:00
Johann	13eed991f9	Remove skip_block from quantize This condition is handled before this code is reached. The ssse3 version of the function has always crashed when attempting to handle the skip_block condition. Add assert() and comments regarding the usage of skip_block. Removing the parameter is a fairly involved process so leave it be for the moment. Change-Id: Ib299f6fc6589d7ee102262cc74a7aeb60110bc5a	2017-08-21 09:49:04 -07:00
Scott LaVarnway	eab3f5e0cc	vpx_dsp: get32x32var_avx2() cleanup renamed to get32x16var_avx2() BUG=webm:1404 Change-Id: Icb8f3986c9c9c646e13a69430db7235fc7e1a036	2017-08-18 13:44:09 -07:00
Scott LaVarnway	2c5478e383	Merge "vpx_dsp: vpx_get16x16var_avx2() cleanup"	2017-08-18 20:30:59 +00:00
Scott LaVarnway	2f7497f341	vpx_dsp: vpx_get16x16var_avx2() cleanup BUG=webm:1404 Change-Id: I88aceb07f4db4870a06eee21d87296974ce3221a	2017-08-18 12:23:49 -07:00
Johann Koenig	1426f04e91	Merge "quantize: normalize intermediate types"	2017-08-18 16:00:28 +00:00
Shiyou Yin	7d82e57f5b	vpx_dsp:loongson optimize vpx_subtract_block_c (case 4x4,8x8,16x16) with mmi. Change-Id: Ia120ad1064d0b6106d9685cf075bdab373eef19e	2017-08-18 09:06:49 +08:00
James Zern	bb15fd51be	highbd_idct32x32*,idct32_34_4x32_quarter_1_2: fix typo 135 -> 34 fixes unused function warnings for highbd_idct32_34_4x32_quarter_[12] Change-Id: I4f50ff6ea514200af93dd59ff94c7f9717409682	2017-08-17 15:37:38 -07:00
Johann	7f602d6114	quantize: normalize intermediate types Despite abs_coeff being a positive value, all the other implementations treat it as signed which simplifies restoring the sign. HBD builds cast qcoeff to avoid a visual studio warning. Match vp9_quantize.c style of casting the entire expression. Change-Id: I62b539b8df05364df3d7644311e325288da7c5b5	2017-08-17 12:34:28 -07:00
James Zern	e038d1610e	inv_txfm_sse2.h: correct idct/iadst prototypes fixes mismatch between prototypes and definitions Change-Id: Ib5e7dfcce244dbb8401815be2cdd183d96792652	2017-08-16 23:06:09 -07:00
Paul Wilkins	f64e14047d	Merge "Prevent parameters that can cause invalid ARF groups."	2017-08-16 18:25:57 +00:00
Paul Wilkins	372336d1e5	Merge "Fix corrupt arf groups due to low "lag_in_frames""	2017-08-16 18:25:29 +00:00
Linfeng Zhang	f95686895b	Merge changes I08b562b6,Ia275940a,I51106e90 * changes: Add vpx_highbd_idct32x32_{34, 135, 1024}_add_{sse2, sse4_1} Update highbd idct x86 optimizations. Update 32x32 idct sse2 and ssse3 optimizations.	2017-08-16 16:36:37 +00:00
paulwilkins	b814e2d898	Prevent parameters that can cause invalid ARF groups. Having a very low "lag_in_frames" value could cause the encoder to create incorrect / corrupt ARF groups including displayed frames that update the ARF buffer and false overlay frames that are coded at low rate but are not actually overlays of a real ARF frame. This is linked to a reported unit test "slow down" where the chosen parameters (lag of 3 frames) gave rise to such "broken" ARF group(s). See also BUG=webm:1454 Change-Id: If52d0236243ed5552537d1ea9ed3fed8c867232c	2017-08-16 14:33:59 +01:00
paulwilkins	48110d0f79	Fix corrupt arf groups due to low "lag_in_frames" Having a very small value for "lag_in_frames" can result in corrupt arf groups including displayed frames that update the arf buffer and fake overlay frames that are not in fact overlays of real arfs but are nevertheless starved of bits. Leaving lag_in_frames at the default of 25 for these 5 frame two pass VBR tests should now give rise to a valid ARF coding pattern as follows:- K(ey), A(rf), N(ormal), N, N, O(verlay). This change is part of a response to BUG=webm:1454 where broken arf groups interacted badly with a change that corrects for large rate misses. However, it may still in some cases increase encode time by virtue of the fact that the unit test now codes a correct coding pattern with "hidden" ARF frames. Change-Id: Ifd0246a4c1d0be247247c754024d7a4ed5f66a6b	2017-08-16 14:07:24 +01:00

17744 Commits