generic-library/vpx

Author	SHA1	Message	Date
Marco	5d6c1c2d8f	vp9: Adjust noise estimation for 360p. Change-Id: Ib76875232491b14f7114061e8e913e87004427a0	2017-07-31 17:12:58 -07:00
Marco Paniconi	ebb023deb6	Merge "Revert "Revert "vp9: Speed feature to adapt partition based on source_sad."""	2017-07-31 14:58:15 +00:00
Marco	999bd6ea84	vp9: Fix denoising condition when pickmode partition is used. When the superblock partition is based on the nonrd-pickmode, we need to avoid the denoising. Current condition was based on the speed level. This change is to make the condition at the superblock level, as the switch in partitioning may be done at sb level based on source_sad (e.g., in speed 6). Change-Id: I12ece4f60b93ed34ee65ff2d6cdce1213c36de04	2017-07-30 23:16:38 -07:00
Jerome Jiang	f027908ad0	Revert "Revert "vp9: Speed feature to adapt partition based on source_sad."" This reverts commit c9266b85476aadf078238b7bde3c36bf7953e11c. Disable source_sad when resolution > 1080P. The test should pass now. BUG=webm:1452 Change-Id: I72dde88e66590ff9e41da5e5dd83f5550a83f082	2017-07-30 19:49:31 -07:00
James Zern	facb124941	Merge "Revert "vp9: Speed feature to adapt partition based on source_sad.""	2017-07-30 03:26:10 +00:00
James Zern	c9266b8547	Revert "vp9: Speed feature to adapt partition based on source_sad." This reverts commit 064fc570ff8399536563e3846500fd99b273b034. This causes an assertion failure in vp9_mcomp.c when running gtest_filter=VP9/MotionVectorTestLarge.OverallTest/41: `mv->col >= -((1 << (11 + 1 + 2)) - 1) && mv->col < ((1 << (11 + 1 + 2)) - 1)' Change-Id: I449e777bf18b661cb3f1d82253610c55c51687f6	2017-07-29 11:36:58 -07:00
James Zern	d35b627340	Revert "Rewrite vpx_highbd_idct8x8_{12,64}_add_sse2" This reverts commit aa1c4cd140007ea5b4be99732fbb23d1fd8cf2b5. This fails the following tests with extreme input coefficients: SSE2/InvTrans8x8DCT.CompareReference/0 SSE2/InvTrans8x8DCT.CompareReference/2 previously the optimized path was skipped in this range Change-Id: I9af015a46eba96208834a219fafd651d37556a80	2017-07-29 11:12:27 -07:00
Marco Paniconi	5d0bef4763	Merge "vp9: Adjust logic in source sad for screen content."	2017-07-29 01:46:58 +00:00
Marco Paniconi	e48dfcead1	Merge "vp9: Speed feature to adapt partition based on source_sad."	2017-07-29 01:45:19 +00:00
Jerome Jiang	ac211fe23e	vp9: Adjust logic in source sad for screen content. Change-Id: I917d106f4c95ea44e413e23881f6303982e1a6a3	2017-07-28 17:25:41 -07:00
Marco	064fc570ff	vp9: Speed feature to adapt partition based on source_sad. Move the source_sad feature to speed 6 (from speed 7), and add speed feature to switch from the variance-based partition to reference_partition (which uses nonrd-pickmode for bsize selection) if source_sad is high. Currently used only for speed 6 for resoln <= 360p. About 4-5% improvement on 360p in RTC set. Some speed slowdown, but still ~30% faster than speed 5. Change-Id: Ib0330ee5fe9fdd2608aed91359a2a339d967491c	2017-07-29 00:20:26 +00:00
Urvang Joshi	7105e66d19	Remove the DP version of vp9_optimize_b(). The greedy version was already enabled by default here: https://chromium-review.googlesource.com/c/546848/ And the speed+compression gains from greedy version were already mentioned here: https://chromium-review.googlesource.com/c/531675/ Change-Id: Iad9f7d03490c845ad1e230af028c9d39edddca97	2017-07-28 23:12:57 +00:00
Linfeng Zhang	75653b7032	Merge changes Ia0e20f5f,I28150789,I35df041b,I221dff34 * changes: Update vpx_idct16x16_10_add_sse2() Add vpx_idct16x16_38_add_sse2() Rewrite vpx_highbd_idct8x8_{12,64}_add_sse2 Refactor highbd idct 4x4 and 8x8 x86 functions	2017-07-28 22:43:00 +00:00
James Zern	3c73e587d1	Revert "quantize ssse3: declare all variables" This reverts commit 03f5e300d69d368290305e19cc66bac8b0ea1ff8. This causes test failures under OSX: SSSE3/VP9QuantizeTest.EOBCheck/0 SSSE3/VP9QuantizeTest.OperationCheck/0 Change-Id: I122732717ead1f7af5b04c529a6948e382e5e59b	2017-07-28 01:22:16 -07:00
Linfeng Zhang	5232e35bc2	Update vpx_idct16x16_10_add_sse2() Change-Id: Ia0e20f5fa47382af5785221eebb05212b40bd35c	2017-07-27 18:03:25 -07:00
Linfeng Zhang	7f4acf8700	Add vpx_idct16x16_38_add_sse2() Change-Id: I28150789feadc0b63d2fadc707e48971b41f9898	2017-07-27 18:02:43 -07:00
Linfeng Zhang	aa1c4cd140	Rewrite vpx_highbd_idct8x8_{12,64}_add_sse2 BUG=webm:1412 Change-Id: I35df041b757d42278ac7a5cdbd909e8ffcee1455	2017-07-27 18:02:36 -07:00
Linfeng Zhang	9c43d81bc2	Refactor highbd idct 4x4 and 8x8 x86 functions BUG=webm:1412 Change-Id: I221dff34dd5f71b390b5e043d0a137ccb0a01dec	2017-07-27 18:01:03 -07:00
Johann Koenig	a83e1f1d53	Merge "quantize ssse3: declare all variables"	2017-07-27 21:18:35 +00:00
Jerome Jiang	905b8ec27f	Merge "vp8: Remove isolated skin & non skin blocks."	2017-07-27 20:24:08 +00:00
Jerome Jiang	56d95b77f5	vp8: Remove isolated skin & non skin blocks. Neutral on RTC metrics and speed on Pixel. Change-Id: I26b907483fe133e6e4c1009d147631f0d0e0f2fb	2017-07-26 14:44:36 -07:00
James Zern	1c666465af	inv_txfm_{sse2,ssse3}: clear conversion warnings visual studio reports tran_high_t (int64) -> short in calls to _mm_set1_epi16 Change-Id: Icb8d1baee77ad3d45edb1477a443d3e648f0b745	2017-07-25 20:13:49 -07:00
James Zern	62682ac8ad	highbd_idct_sse.c: clear conversion warnings visual studio reports tran_high_t (int64) -> int in calls to _mm_setr_epi32 Change-Id: Ic2247c8e3800991202151790d78bd94c4f4aed05	2017-07-25 20:11:09 -07:00
James Zern	85736e616e	vpx_variance16x16_sse2: correct cast order allow the right shift to operate on 64-bits, this matches the rest of the implementations previously: b0f1ae147 vpx_get16x16var_avx2: correct cast order Change-Id: I632ee5e418f3f9b30e79ecd05588eb172b0783aa	2017-07-25 16:45:40 -07:00
James Zern	b0f1ae1475	vpx_get16x16var_avx2: correct cast order allow the right shift to operate on 64-bits, this matches the rest of the implementations missed in: 6acd061aa variance_avx2: sync variance functions with c-code Change-Id: Icae436b881251ccb9f9ed64fcbf8d358c58a4617	2017-07-24 16:29:44 -07:00
James Zern	8836e46ffd	set_var_thresh_from_histogram: prevent negative variance For 8-bit the subtrahend is small enough to fit into uint32_t. For 10/12-bit apply: 63a37d16f Prevent negative variance previously: 47b9a0912 Resolve -Wshorten-64-to-32 in highbd variance. c0241664a Resolve -Wshorten-64-to-32 in variance. Change-Id: I181c85f0b9a03da37c2e8b89482d48aa3dbc0aee	2017-07-22 13:27:32 -07:00
Marco	8c7a60e04d	vp8: Fix compile warning in vp8_multi_resolution_encoder.c Change-Id: I49c960179dfc1902aa5e5c99915789878c06bc3d	2017-07-20 14:19:43 -07:00
Johann Koenig	e8bd534c42	Merge "quantize test: promote RandRange() result to signed"	2017-07-20 19:46:05 +00:00
Johann Koenig	0c30b75f40	Merge "quantize test: lowbd functions do not pass in highbd"	2017-07-20 19:45:59 +00:00
Jerome Jiang	494188505b	Merge "vp9: Removed unused skin detection function."	2017-07-20 16:58:01 +00:00
Johann	af08fbb444	quantize test: promote RandRange() result to signed Avoid unsigned overflow warning: unsigned integer overflow: 19974 - 32703 cannot be represented in type 'unsigned int' Change-Id: Ifebee014342e4c6f3b53306c0cad6ae0b465ac12	2017-07-20 08:17:48 -07:00
Johann	c782f27ead	quantize test: lowbd functions do not pass in highbd qcoeff output looks OK but dqcoeff is no good. BUG=webm:1448 Change-Id: I07211db8a8b74f1f45fdd059852e2de0e5ee18fd	2017-07-20 08:17:48 -07:00
Johann Koenig	4702bb26be	Merge "quantize test: eob is output"	2017-07-20 15:17:26 +00:00
Johann Koenig	e1809501d0	Merge "Earmark extra space for VSX."	2017-07-19 21:35:57 +00:00
Jerome Jiang	9dd992b6f0	Merge "Roll libwebm: Fix android build failure with NDK r15b."	2017-07-19 21:30:21 +00:00
Johann	bde2e4aa36	quantize test: eob is output eob values are generated by the function. Change-Id: I8ce92100e83022bff99888a5a7e6ef378c49fda3	2017-07-19 14:17:19 -07:00
Han Shen	b72d3e8a25	Earmark extra space for VSX. Backend specific optimization for PPC VSX reads 16 bytes, whereas arm neon / sse2 only reads <= 8 bytes. Although the extra bytes read are actually never used, this is not a warrant for groping around. Fixed by allocating more when building for VSX. This is reported by asan. Also note - PPC does have assembly that loads 64-bit content from memory - lxsdx loads one 64-bit doubleword (whereas lxvd2x loads two 64-bit doubleword) from memory. However, we only have "vec_vsx_ld" builtins that mapped to lxvd2x, no builtins to lxsdx. The only way to access lxsdx is through inline assembly, which does not fit well in the origin paradigm. Refer: vsx: vpx_tm_predictor_4x4_vsx @ third_party/libvpx/git_root/vpx_dsp/ppc/intrapred_vsx.c neon: vpx_tm_predictor_4x4_neon @ third_party/libvpx/git_root/vpx_dsp/arm/intrapred_neon_asm.asm sse2: tm_predictor_4x4 @ third_party/libvpx/git_root/vpx_dsp/x86/intrapred_sse2.asm BUG=b/63112600 Tested: asan tests passed. Change-Id: I5f74b56e35c05b67851de8b5530aece213f2ce9d	2017-07-19 13:59:32 -07:00
Johann Koenig	89a116f4cb	Merge "variance: call C comp_avg_pred"	2017-07-19 20:34:13 +00:00
Jerome Jiang	8ad9338e2e	Roll libwebm: Fix android build failure with NDK r15b. BUG=webm:1447 Change-Id: I8defe45cb94eb9c209ba72ce446786f24c14c0b8	2017-07-18 16:52:46 -07:00
Jerome Jiang	4526644615	vp9: Removed unused skin detection function. Change-Id: I6702b7b11aa4ac9aac5fd54deef4377cdcb29c64	2017-07-18 14:52:04 -07:00
Jerome Jiang	59e461db1f	Merge "vp9: Allocate alt-ref in denoiser for SVC."	2017-07-18 21:30:04 +00:00
Jerome Jiang	babef23a5f	Merge "vp9: Remove isolated skin & non-skin blocks."	2017-07-18 20:48:32 +00:00
Johann Koenig	56d3f1573a	Merge changes I62c2e313,Ibd7a0337,I94e1d886 * changes: quantize test: test sse2 and avx optimizations quantize test: extend arrays quantize test: restrict and correct input	2017-07-18 20:42:39 +00:00
Johann	4b9a848bb3	variance: call C comp_avg_pred Keep optimized code out of the reference implementation. This matches the style of the other sub calls. Change-Id: I3da6acd4f2c647b029c420e22ac9410a18259689	2017-07-18 20:22:53 +00:00
Jerome Jiang	fd216268ad	vp9: Allocate alt-ref in denoiser for SVC. When SVC is used, allocate alt-ref in denoiser. Change-Id: I1b17221b55b9444cd23b97d481b54ff8d296d857	2017-07-18 13:22:47 -07:00
Johann	03f5e300d6	quantize ssse3: declare all variables Copy missing line from avx implementation. Change-Id: I9755c5b4d4034867de6fa9f741c24bf49dce3a27	2017-07-18 12:32:57 -07:00
Johann	101981b736	quantize test: test sse2 and avx optimizations ssse3 does not pass either of the tests. avx 32x32 does not pass. Change-Id: I62c2e31336fd2327327afaa0da896ad79a3def44	2017-07-18 12:08:16 -07:00
Jerome Jiang	adbfc4308a	vp9: Remove isolated skin & non-skin blocks. 0.007% regression on rtc and 0.004% gain on rtc_derf. 1 thread on QVGA,VGA and HD has ~0.2% speed regression while 2 threads has ~0.2% speed gain on Google Pixel. Change-Id: Ia4a6ec904df670d7001e35e070b01e34149d23dc	2017-07-18 11:29:14 -07:00
Johann	c7ebe82253	quantize test: extend arrays Officially the quant structures are 8 elements, with one dc element and 7 repeated ac elements. The low bit depth optimizations take advantage of this to fill the xmm registers. The high bit depth version manually duplicates the values. If all the optimizations were unified, the structure sizes could be greatly reduced. Change-Id: Ibd7a0337a7832ce2a1a05ee433c310077e1059ae	2017-07-18 09:55:47 -07:00
Johann	cb61ba02f4	quantize test: restrict and correct input Use only valid values for quantize inputs. These were determined by looping over vp9_init_quantizer and looking for max and min values. This allows extending the test to the low bit depth functions which were not designed to handle all possible inputs but only valid inputs. Change-Id: I94e1d8863a49ac227845b65c6b50130e10e6319e	2017-07-18 09:40:45 -07:00

1 2 3 4 5 ...

17618 Commits