generic-library/vpx

Author	SHA1	Message	Date
James Zern	5da2e500d7	inv_txfm_sse2: clear conversion warning in hbd build tran_high -> tran_low in return from dct_const_round_shift() Change-Id: I2fe06c4b604823b1d1fe40a487017c3c2819a440	2017-03-17 01:16:38 -07:00
James Zern	2882778310	Merge "Add vpx_highbd_idct32x32_135_add_neon()"	2017-03-17 07:26:52 +00:00
Linfeng Zhang	65e9fb65e8	Add vpx_highbd_idct32x32_135_add_neon() BUG=webm:1301 Change-Id: I58c2d65d385080711c3666d6d8f9d241dac7b21a	2017-03-16 22:37:55 -07:00
James Zern	68efc64b72	Merge "Clean vpx_idct32x32_1024_add_neon()"	2017-03-17 05:24:58 +00:00
Marco	02975a604c	vp9: Fix speed 8 condition for enabling copy_partition. Change-Id: I2c090e6ba853a30fef1957b620853315f9471753	2017-03-16 17:08:37 -07:00
Gabriel Marin	976ddb61d3	Add a vector form of routine vp9_model_rd_from_var_lapndz Add routine vp9_model_rd_from_var_lapndz_vec and call it from model_rd_for_sb to model the rate and distortion for MAX_MB_PLANE Laplacian sources in parallel. The caller ensures that all sources have non-zero variance. Measured an 18% to 25% reduction in retired instructions, and a 17% to 24% reduction in instruction execution cost with different compilers for the Laplacian modeling. No change in behavior. TEST=Verified that encoded files match bit for bit, with and without this change. BUG=b/33678225 Change-Id: I6b76947f21c659a349adb896e13e99f6e3f951e6	2017-03-16 22:19:44 +00:00
Marco Paniconi	83ba1880bf	Merge "vp9: Fixes in non-rd pickmode for denoising with SVC."	2017-03-16 21:53:38 +00:00
Johann Koenig	eeeb71ed97	Merge "Remove ppc-linux-gcc target"	2017-03-16 21:53:17 +00:00
Johann Koenig	cd3d7cf4ac	Merge "Add Hadamard for Power8"	2017-03-16 21:52:15 +00:00
Marco	bc7d4935bb	vp9: Fixes in non-rd pickmode for denoising with SVC. Don't denoise spatial layer frames whose base layer is a key frame. Disallow golden reference for SVC with denoising on frames that will be denoised (highest layer), as this removes bad artifact. Will re-enable when issue is resolved. Change-Id: I87a6597812330500966458172acfce54af65f70f	2017-03-16 12:59:41 -07:00
Marco	ba8bfaafa7	vpx_codec.h: include vpx/*.h -> ./*.h This matches the other includes and also fixes a compile issue in chromium. Change-Id: I45e00a1454f7ed948aa3b96b04cc5946b1d02985	2017-03-16 16:55:56 +00:00
Jerome Jiang	bf40776aa4	Merge "Refactor: Change cpi->resize_state to enum values."	2017-03-16 16:43:42 +00:00
Marco Paniconi	ec73bf53a5	Merge "vp8: Fix compiler warning in vp8 pickinter.c"	2017-03-16 05:13:38 +00:00
Rafael de Lucena Valle	405b94c661	Add Hadamard for Power8 Change-Id: I3b4b043c1402b4100653ace4869847e030861b18 Signed-off-by: Rafael de Lucena Valle <rafaeldelucena@gmail.com>	2017-03-15 23:46:18 -03:00
Marco Paniconi	cd47c1942e	Merge "vp9: Fix some issues with denoiser and SVC."	2017-03-16 02:42:55 +00:00
Marco	a340c64a79	vp9: Fix some issues with denoiser and SVC. Fix the update of the denoiser buffer when the base spatial layer is a key frame. And allow for better/lower QP on high spatial layers when their base layer is key frame. Change-Id: I96b2426f1eaa43b8b8d4c31a68b0c6d68c3024a2	2017-03-15 17:19:17 -07:00
Jerome Jiang	b5f7f7737a	Refactor: Change cpi->resize_state to enum values. Change-Id: Iab1409b0fc1175bc5a14afc4749a08c536c98c41	2017-03-15 17:16:17 -07:00
Marco	2c8430e223	vp9: Turn off ml_partition_search_early_termination. Fails on nightly ubsan, valgrind tests. Enabled on commit:6701014 Change-Id: Ied3f5cb38e39cba54ac134f4514107cdfdfce159	2017-03-15 15:00:38 -07:00
Marco	deea4ede59	vp8: Fix compiler warning in vp8 pickinter.c Change-Id: I0e5714538fe53d885a2201d808846901ae8fc288	2017-03-15 11:50:14 -07:00
Linfeng Zhang	e54231d613	Clean vpx_idct32x32_1024_add_neon() Change-Id: I05921e16d6a3e4e7e5b00a90624735050a186636	2017-03-15 11:24:31 -07:00
Yi Luo	8440cc4817	Merge "Improve idct32x32_1024_add SSSE3 intrinsics performance"	2017-03-15 02:32:52 +00:00
Linfeng Zhang	d9a9a4ffea	Merge "Fix overflow issue in 32x32 idct NEON intrinsics"	2017-03-15 00:38:17 +00:00
Jerome Jiang	27d5a57072	Merge "vp9: Using source sad for speedup for dynamic resizing."	2017-03-15 00:03:52 +00:00
Linfeng Zhang	c756eb01c8	Fix overflow issue in 32x32 idct NEON intrinsics Similar issue as Change bc1c18e. The PartialIDctTest.ResultsMatch test on vpx_idct32x32_135_add_neon() in high bit-depth mode exposes 16-bit overflow in final stage of pass 2, when changing the test number from 1,000 to 1,000,000. Change to use saturating add/sub for vpx_idct32x32_34_add_neon(), vpx_idct32x32_135_add_neon and vpx_idct32x32_1024_add_neon() in high bit-depth mode. Change-Id: Iaec0e9aeab41a3fdb4e170d7e9b3ad1fda922f6f	2017-03-14 16:59:14 -07:00
Jerome Jiang	2fa7092808	Merge "vp9: Enable row multithreading for SVC in real-time mode."	2017-03-14 23:29:46 +00:00
Jerome Jiang	02463273c9	vp9: Using source sad for speedup for dynamic resizing. Only for speed >= 7. Change-Id: I3ac85fbb4023cf7e6f8333806b345b0174382a09	2017-03-14 15:47:19 -07:00
Yi Luo	fedcf83f33	Improve idct32x32_1024_add SSSE3 intrinsics performance - Function level speed improves ~12%. Change-Id: I9b7dbddabf08c7d0f6b25264e6074d5ccbe39290	2017-03-14 14:04:08 -07:00
James Zern	1b91f41935	Merge "vp9/encoder: fix segfault on win32 using vs < 2015"	2017-03-14 19:21:42 +00:00
Yunqing Wang	c3e290963d	Merge "Apply machine learning-based early termination in VP9 partition search"	2017-03-14 18:07:05 +00:00
Marco Paniconi	78a6946904	Merge "vp9: Speed >= 8: Enable simple_block_yrd speed feature."	2017-03-14 17:50:17 +00:00
Marco	c0c789ab50	vp9: Adjust copy partition threshold, for speed 8. Reduce it from 5 to 4, small/no change in metrics or speed. Small reduction in dragging artifact near moving head. Change-Id: Ic3bc5ca67c70bf0c89fc2ed14454840a28ae5b6a	2017-03-14 09:18:53 -07:00
Marco	c216c8d6f2	vp9: Speed >= 8: Enable simple_block_yrd speed feature. Enable speed feature for resolutions > VGA. avgPSNR on RTC down by ~1.7%. Speedup on ARM: ~5%. Change-Id: I7a3fe5f7425aa8df3f4a2eced1afa355bc0d4c95	2017-03-14 09:10:28 -07:00
Marco Paniconi	507204316a	Merge "vp9: Fix to source_sad feature for SVC."	2017-03-13 19:18:31 +00:00
Linfeng Zhang	b0bfcc368c	Merge "Add vpx_highbd_idct32x32_135_add_c()"	2017-03-13 18:49:01 +00:00
Marco	f0a22b23fe	vp9: Fix to source_sad feature for SVC. Allow speed feature sf->use_source_sad to be used on highest spatial layer for SVC. Change-Id: I260eb0478902764f49f83e43b17024fe86ff3b22	2017-03-13 11:00:40 -07:00
Yunqing Wang	670101439f	Apply machine learning-based early termination in VP9 partition search This patch was based on Yang Xian's intern project code. Further modifications were done. 1. Moved machine-learning related parameters into the context structure. 2. Corrected the calculation of sum_eobs. 3. Removed unused parameters and calculations. 4. Made it work with multiple tiles. 5. Added a speed feature for the machine-learning based partition search early termination. 6. Re-organized the code. The patch was rebased to the top-of-tree. Borg test BDRATE result: 4k set: PSNR: +0.144%; SSIM: +0.043%; hdres set: PSNR: +0.149%; SSIM: +0.269%; midres set: PSNR: +0.127%; SSIM: +0.257%; Average speed gain result: 4k clips: 22%; hd clips: 23%; midres clips: 15%. Change-Id: I0220e93a8277e6a7ea4b2c34b605966e3b1584ac	2017-03-13 09:54:18 -07:00
Marco Paniconi	b39f7c3364	Merge "vp9: Fix condition for intra search in non-rd pickmode."	2017-03-13 06:11:13 +00:00
Marco	8c18df7fcd	vp9: Fix condition for intra search in non-rd pickmode. Fixes an issue when LAST and golden are not used as references, in which case it's possible that no encoding mode is set (since intra may be skipped under certain conditions). The fix is to make sure intra is searched if no inter mode is checked. The issue can happen for temporal layer pattern #7 in vpx_temporal_svc_encoder.c Change-Id: I5ab4999b2f9dbd739044888e0916b5ec491d966b	2017-03-12 22:30:39 -07:00
James Zern	48fca113d1	inv_txfm_ssse3,butterfly: fix win32 abi compatibility only the first 3 parameters can be aligned to 16 as required by __m128i, make them all pointers for consistency. since: 07c48ccfe Improve idct32x32_34_add SSSE3 intrinsics performance BUG=webm:1384 Change-Id: I0324f701e723a27cb470036a180693ba8829d01d	2017-03-10 19:57:17 -08:00
James Zern	c09b290cea	vp9/encoder: fix segfault on win32 using vs < 2015 shift the bsse[] member of the macroblock struct to the front to avoid an incorrect offset (0) to the upper half of bsse[0] which leads to a negative resulting in a crash. restrict this to visual studio versions before 2015 (the bug was observed with 2013, fixed in 2015) to avoid any potential cache impact on other platforms. https://connect.microsoft.com/VisualStudio/feedback/details/2396360/bad-structure-offset-in-32-bit-code BUG=webm:1054 Change-Id: I40f68a1d421ccc503cc712192263bab4f7dde076	2017-03-10 17:37:17 -08:00
Marco Paniconi	0af189c00d	Merge "vp9: Sample encoder vpx_temporal_svc_encoder: enable row-mt"	2017-03-10 18:26:06 +00:00
Marco	169c846575	vp9: Sample encoder vpx_temporal_svc_encoder: enable row-mt Enable row-mt in the sample encoder vpx_temporal_svc_encoder.c, under certain conditions. Change-Id: Ic103ee81a9d80be5bf6e5778cc21fc3199db909d	2017-03-10 10:11:39 -08:00
Yi Luo	018290a344	Merge "Improve idct32x32_135_add SSSE3 intrinsics performance"	2017-03-10 17:14:30 +00:00
Marco	ffb3c50da1	vp9: Enable row multithreading for SVC in real-time mode. Enable row-mt for SVC for real-time mode, speed >=5. Add the controls to the sample encoders, but keep it off for now. Add the control and enable it for the 1 pass CBR unittests. For speed 7, 3 layer SVC, 2 threads, row-mt enabled gives about ~5% speedup. Change-Id: Ie8e77323c17263e3e7a7b9858aec12a3a93ec0c1	2017-03-10 01:01:07 +00:00
Yi Luo	327add990f	Improve idct32x32_135_add SSSE3 intrinsics performance - Split the inv txfm into three parts to avoid stack spillover. - Function level speed improves ~12%. - Use function and macro to remove some repeated code. Change-Id: I14f5f072334fd766808cb52bf648df792e7379ee	2017-03-09 16:17:54 -08:00
Johann Koenig	f951881e8c	Merge "ppc: include ppc.h for ppc_simd_caps()"	2017-03-09 23:12:37 +00:00
James Zern	cb60e66085	Merge "move vp9_scale_and_extend_frame_c to vp9_frame_scale.c"	2017-03-09 22:51:08 +00:00
Johann	94655569fe	Remove ppc-linux-gcc target Change-Id: Iec2430966f54e2e5ba79f6bb703f47adde46479f	2017-03-09 11:33:33 -08:00
Johann	ccd23215ed	ppc: include ppc.h for ppc_simd_caps() Change-Id: Idc829eb066cf4e905d062cb9c08424e0f1b7e1a7	2017-03-09 09:26:45 -08:00
James Zern	2f31a16445	move vp9_scale_and_extend_frame_c to vp9_frame_scale.c this is similar to the x86 configuration and helps mitigate an issue with a circular dependency between this function and the ssse3 variant causing an outsized increase in binary size (~300K for chrome) chrome.dll: .text 255B000 -> 252B000 .data 7B000 -> 75000 -221184 bytes BUG=chromium:697956 Change-Id: Ic95b142ecd62dd4f1795788aa27dd8fab59b708c	2017-03-08 21:13:50 -08:00
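
Note on c756eb01c8 (Fix overflow issue in 32x32 idct NEON intrinsics): the message describes replacing plain 16-bit add/sub with saturating add/sub in the final stage of pass 2. The scalar sketch below only illustrates what saturation means for an add/sub butterfly on int16_t values; it is not the libvpx code. The helper names sat_add_s16, sat_sub_s16 and butterfly_sat_s16 are hypothetical, and the assumption is that the NEON version gets the same per-lane behavior from saturating intrinsics such as vqaddq_s16 and vqsubq_s16.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: add two int16_t values and clamp the result to
 * [INT16_MIN, INT16_MAX] instead of letting it wrap around. */
static int16_t sat_add_s16(int16_t a, int16_t b) {
  const int32_t sum = (int32_t)a + (int32_t)b; /* widen so the sum is exact */
  if (sum > INT16_MAX) return INT16_MAX;
  if (sum < INT16_MIN) return INT16_MIN;
  return (int16_t)sum;
}

/* Hypothetical helper: subtract with the same clamping. */
static int16_t sat_sub_s16(int16_t a, int16_t b) {
  const int32_t diff = (int32_t)a - (int32_t)b;
  if (diff > INT16_MAX) return INT16_MAX;
  if (diff < INT16_MIN) return INT16_MIN;
  return (int16_t)diff;
}

/* Hypothetical butterfly stage: sum[i] = a[i] + b[i], diff[i] = a[i] - b[i],
 * both saturated, for n elements. */
static void butterfly_sat_s16(const int16_t *a, const int16_t *b,
                              int16_t *sum, int16_t *diff, int n) {
  assert(n >= 0);
  for (int i = 0; i < n; ++i) {
    sum[i] = sat_add_s16(a[i], b[i]);
    diff[i] = sat_sub_s16(a[i], b[i]);
  }
}
```

With inputs that stay in range, the saturating and wrapping forms give the same result; they differ only for extreme inputs like those the commit says the enlarged test run exposed.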


16965 Commits