generic-library/vpx

Author	SHA1	Message	Date
Yue Chen	4ff6d13771	Merge "Cosmetics for vp10/common/vp10_rtcd_defs.pl" into nextgenv2	2016-07-12 01:21:33 +00:00
James Zern	08bd57ef0d	vp10_convolve_ssse3.c: make some functions static quiets -Wmissing-prototypes warnings BUG=b/29584271 Change-Id: I4d2eb7f4b45d7b829421976641b3212bcf29e7dd	2016-07-11 16:52:10 -07:00
James Zern	9bf5a1ab46	vp10/common/idct.h: add some missing prototypes quiets the warning of the same name BUG=b/29584271 Change-Id: I220cd58e1060f77e3910472fed1b167add3a08f8	2016-07-11 16:52:08 -07:00
James Zern	bc4341fd94	vp10: add some missing includes quiets some -Wmissing-prototypes warnings BUG=b/29584271 Change-Id: I9174728459fcabb6d9ac0028ae58029e52c0da92	2016-07-11 16:52:07 -07:00
Yue Chen	68e19472c1	Cosmetics for vp10/common/vp10_rtcd_defs.pl Change-Id: Iaf8c6f0b1e340f0406df2871a3dc2ded19b7009a	2016-07-11 23:41:30 +00:00
Debargha Mukherjee	6770c7361e	Merge "Optimize and cleanup supertx predictor." into nextgenv2	2016-07-11 22:30:16 +00:00
Debargha Mukherjee	6bbadfb303	Merge "Improve vpx_blend_* functions." into nextgenv2	2016-07-11 19:30:04 +00:00
Geza Lore	cd489264e1	Optimize and cleanup supertx predictor. Use vpx_blend_a64_hmask and vpx_blend_a64_vmask to speed up computing the supertx predictor. Decoder speedup of up to 4% has been observed. Change-Id: I255a5ba4cc24f78dc905d25b6e2f7fbafac13253	2016-07-11 18:14:21 +00:00
Geza Lore	bfa59b4a5f	Improve vpx_blend_* functions. - Made source buffers pointers to const. - Renamed vpx_blend_mask6b to vpx_blend_a64_mask. This is more indicative that the function does alpha blending. The 6, or 6b suffix was misleading, as the max mask value (64) does not fit into 6 bits. - Added VPX_BLEND_* macros to use when needing to blend scalars. - Use VPX_BLEND_A256 in combine_interintra to be more explicit about the operation being done. - Added versions of vpx_blend_a64_* which take 1D horizontal/vertical masks directly and apply them to all rows/columns (vpx_blend_a64_hmask and vpx_blend_a64_vmask). The SSE4.1 optimzied horizontal version now falls back on the 2D version. This can be improved upon if it show up high enough in a profile. - All vpx_blend_a64_* functions now support block sizes down to 1x1 (ie: a single pixel). This is for usage convenience. The SSE4.1 optimized versions fall back on the C implementation if w <= 2 or h <= 2. This can again be improved if it becomes hot code. Change-Id: I13ab3835146ffafe3e1d74d8e9cf64a5abe4144d	2016-07-11 19:05:17 +01:00
Pascal Massimino	e5fb2d4e93	remove ROUNDZ_* macros in favor of just ROUND_* ones Change-Id: I263088be8d71018deb9cc6a9d2c66307770b824d	2016-07-11 06:27:41 -07:00
Debargha Mukherjee	5d28183fcf	Merge "Refactor and clean up on blend_mask6" into nextgenv2	2016-07-09 06:50:32 +00:00
Yue Chen	5b25323c25	Merge "Fix assertion failures in mips+msa setting" into nextgenv2	2016-07-09 01:07:27 +00:00
Yue Chen	4ab19eac62	Fix assertion failures in mips+msa setting Directly call c functions, otherwise when EXT_TX is enabled, hybrid transform other than combination of DCT/ADST has not been implemented, thus will cause assertion failures in the switch loops in vp10_fhtnxn_msa() and vp10_ihtnxn_nxn_add_msa(). BUG=webm:1239 Change-Id: I2379a07e5406f9489edcd2f3205682f679c9b091	2016-07-08 17:13:52 -07:00
Debargha Mukherjee	72ef6d7704	Refactor and clean up on blend_mask6 Change-Id: Ie9188471e7dc07ab9c95b22f258b1662e895c533	2016-07-08 15:02:57 -07:00
Sarah Parker	6c56def33e	Merge "Make new_quant bin widths to be uniform" into nextgenv2	2016-07-08 17:40:55 +00:00
Sarah Parker	88faa2b348	Make new_quant bin widths to be uniform Change-Id: Iceeca8ecbc43919b43189352a307479d666d1dad	2016-07-07 16:22:32 -07:00
Geza Lore	fc28be3b23	Clean up build_wedge_inter_predictor_from_buf Change-Id: I715f8ffa3e81056a74ca8ac94793009afb781221	2016-07-07 13:12:57 +01:00
Debargha Mukherjee	e5e37e310b	Merge "Reject ext-inter compound modes based on modelled RD." into nextgenv2	2016-06-30 18:18:53 +00:00
Jingning Han	3d8cde6618	Merge "Remove unused BITDEPTH_10 definition" into nextgenv2	2016-06-30 16:26:25 +00:00
Geza Lore	532304e468	Reject ext-inter compound modes based on modelled RD. Reject ext-inter compound modes before doing full rate distortion evaluation, if the corresponding single reference modes had a lower modelled RD. ext-inter speedup up to TBD. Coding performance: TBD Change-Id: I358bfb879c5ebe5e7afbf6f540cc784f8de14857	2016-06-30 09:56:17 +01:00
Jingning Han	e6c8e35dec	Remove unused BITDEPTH_10 definition Change-Id: Ic11f32db352e1ff7b3ed140654ee1a6016ba516f	2016-06-29 16:43:54 -07:00
Debargha Mukherjee	a35597fc7f	Various cosmetics on the new_quant experiment Also extends quant profiles to include quality range. Change-Id: Ia96e45b6425e1d42ca61fc401f63d4fd7214e448	2016-06-29 13:18:52 -07:00
Yi Luo	dd2064a0ac	Merge "Fix bugs in convolution filter optimization" into nextgenv2	2016-06-27 21:33:45 +00:00
Yi Luo	8404253f81	Fix bugs in convolution filter optimization - Fix the over-writing bug in horizontal filtering as width = 2. - Fix 10-tap vertical filtering which no longer reads one row of pixel above the block. - Fix 10-tap filter zero padding. - Encoder speed slow down ~4.0%, compared to, 81ad953 Convolution vertical filter SSSE3 optimization Change-Id: I9bb294a4529300081c29bf284e6bc6eb081cc536	2016-06-27 10:23:38 -07:00
Sarah Parker	fbe6fb2773	Add multiple quantization profiles to new_quant experiment Add the ability to pick between 3 quantization profiles. The profile is chosen based on the entropy context at the block level. Change-Id: Iaea0485798441b7d635962c2563f3a477f582dac	2016-06-24 16:16:13 -07:00
Yi Luo	81ad95363a	Convolution vertical filter SSSE3 optimization - Apply 8-pixel vertical filtering direction parallelism. - Add unit tests to verify bit exact. - Encoder speed improves ~29% (enable EXT_INTERP) on Xeon E5-2680. - Combinational cycle count of vp10_convolve() drops from 26.06% to 6.73%. Change-Id: Ic1ae48f8fb1909991577947a8c00d07832737e57	2016-06-23 12:56:47 -07:00
Jingning Han	b605de074d	Refactor reference frame type defs Move the reference frame type definitions to common/enums.h file. Replace hard coded numbers. Combine repeated definitions. Change-Id: I288e079a03e448014cc181bcdb3f88ee8ec8d139	2016-06-22 12:34:44 -07:00
Debargha Mukherjee	78842b2870	Merge "Reinstate "Optimize wedge partition selection." without tests." into nextgenv2	2016-06-22 16:59:40 +00:00
Jingning Han	c2195c5b7e	Make drl support bi-directional reference frames This commit refactors the reference frame structure used in the dynamic motion vector referencing system, and makes it support the bi-directional reference frames. This resolves unit test failure (enc/dec mismatch) when both are turned on. The compression performance (ref-mv + ext-refs) is improved by 0.2% for lowres. Change-Id: I233624d8fccc1f69e82295f94de984ff056365dc	2016-06-21 17:39:30 -07:00
Geza Lore	135d663159	Reinstate "Optimize wedge partition selection." without tests. This reinstates commit efda2831e5f758b4f350679b5c55c0b9282449b0 without the tests and with fixes for 32 bit x86 builds. Change-Id: I34be4fe1e8a67686d26ba256fd7efe0eb6a569e8	2016-06-21 20:31:50 +01:00
Yi Luo	f1a50db2d1	Merge "Convolution horizontal filter SSSE3 optimization" into nextgenv2	2016-06-20 20:06:02 +00:00
Yi Luo	229690a95c	Convolution horizontal filter SSSE3 optimization - Apply signal direction/4-pixel vertical/8-pixel vertical parallelism. - Add unit test to verify the bit exact result. - Overall encoding time improves ~24% on Xeon E5-2680 CPU. Change-Id: I104dcbfd43451476fee1f94cd16ca5f965878e59	2016-06-20 11:10:30 -07:00
Zoe Liu	5805a14ca6	Merge bi-predictive frames to EXT_REFS This patch removed the experiment of BIDIR_PRED and merged the feature into the experiment of EXT_REFS: (1) Each frame now has up to 6 reference frames, namely LAST_FRAME, LAST2_FRAME, LAST3_FRAME, GOLDEN_FRAME, (forward) and BWDREF_FRAME, ALTREF_FRAME (backward); LAST4_FRAME has been removed; (2) First pass still keeps the 8 updates: KF_UPDATE, LF_UPDATE, GF_UPDATE, ARF_UPDATE, OVERLAY_UPDATE, and BRF_UPDATE, LAST_BIPRED_UPDATE, BI_PRED_UPDATE; (3) show_existing_frame==1 is supported in the experiment of EXT_REFS; (4) New encoding modes are added for both single-ref and compound cases, through the use of the 2 extra forward references (LAST2 & LAST3) and the 1 extra backward reference (BWDREF). RD performance wise, using Overall PSNR: Avg/BDRate Bipred only Prev EXT_REFS Current EXT_REFS with bipred lowres: -3.474/-3.324 -1.748/-1.586 -4.613/-4.387 derflr: -2.097/-1.353 -1.439/-1.215 -3.120/-2.252 midres: -2.129/-1.901 -1.345/-1.185 -2.898/-2.636 If in vp10/encoder/firstpass.h, change BFG_INTERVAL from 2 to 3, i.e. to use 2 bi-predictive frames than 1, a further improvement may be obtained: Current EXT_REFS with bipred 1 bi-predictive frame 2 bi-predictive frames lowres: -4.613/-4.387 -4.675/-4.465 derflr: -3.120/-2.252 -3.333/-2.516 midres: -2.898/-2.636 -3.406/-3.095 Change-Id: Ib06fe9ea0a5cfd7418a1d79b978ee9d80bf191cb	2016-06-17 12:43:39 -07:00
Debargha Mukherjee	e20a29d3b0	Merge "Select segment based loopfilter strength for supertx blocks." into nextgenv2	2016-06-15 16:49:34 +00:00
Debargha Mukherjee	52c71f749d	Merge "Rework supertx segment handling and adaptive quantization." into nextgenv2	2016-06-15 16:47:58 +00:00
Jingning Han	48f5125749	Merge "Fix enc/dec mismatch in non-420 settings" into nextgenv2	2016-06-14 21:54:08 +00:00
Geza Lore	44b91a0e76	Select segment based loopfilter strength for supertx blocks. Segment based loopfilter strength for supertx coded blocks is now selected based on the minimum of all segment IDs within a supertx coded block (same as the quantiser settings). Change-Id: Ib056bd0d05f6a1d3b512a76deb4e2ad4db0f7dc4	2016-06-14 16:07:51 +01:00
Geza Lore	7dd90c9d22	Rework supertx segment handling and adaptive quantization. Segment level quantizer settings for supertx coded blocks are now selected based on the minimum of all segment IDs within a supertx coded block. This also fixes the 3 adaptive quantization modes with supertx. Change-Id: Ib5db099539d4f82f240e1d745d6e5264f8b34cde	2016-06-14 16:07:50 +01:00
Jingning Han	a4ea8fd8b8	Fix enc/dec mismatch in non-420 settings This commit makes the dual filter experiment work with non-420 settings. It fixes unit test failure in EndToEndTestLarge. Change-Id: I04f7afdee78f91389d9ff72947efa152098af930	2016-06-14 00:21:48 +00:00
Debargha Mukherjee	902ee5060c	A crash fix for supertx / ext-inter combination. Change-Id: I9860376c98aa3b25f5bf86ed13d4a7631fa6b153	2016-06-13 13:57:30 -07:00
Debargha Mukherjee	81f8b3f31c	Merge "Some refactoring to support warped motion mode" into nextgenv2	2016-06-10 23:18:39 +00:00
Zoe Liu	1d1286bfb4	Fix one typo in the comment Change-Id: Ie98fd60426b18980ec85572f3cfc9ce0b97a5361	2016-06-10 15:58:30 -07:00
Debargha Mukherjee	03be30ba3e	Some refactoring to support warped motion mode Change-Id: I15d54a3ae48b2b33082668116792c6595bdb3ddb	2016-06-10 12:04:18 -07:00
Debargha Mukherjee	8b118faa61	Merge "Adds higher precision for homography model 3rd row" into nextgenv2	2016-06-10 18:47:17 +00:00
Sarah Parker	e14f61b924	Merge "Move new quant experiment from nextgen" into nextgenv2	2016-06-10 17:26:33 +00:00
Jingning Han	b77dfccf00	Merge "Add MIN_TX_SIZE definition" into nextgenv2	2016-06-10 16:04:38 +00:00
Sarah Parker	a21afd421b	Move new quant experiment from nextgen This experiment implements non-uniform quantization where the width of the bins increases gradually to more closely match a laplacian distribution of the coeficcients. Performance Gain: derflr: 0.15% hevcmr: 0.675% Change-Id: I25234244e3bcd94b87c1f77cf682190b61c8ef94	2016-06-10 08:06:22 -07:00
Angie Chiang	95340fccb3	Revert "Optimize wedge partition selection." This reverts commit efda2831e5f758b4f350679b5c55c0b9282449b0. This commit causes segmentation fault at SSE2/SumSquares2DTest.RandomValues/0 Change-Id: I171937e4daf6f15323e8206418773deb03bd8c53	2016-06-09 19:17:37 -07:00
Aamir Anis	de2a20b411	Merge "Updated loop restoration" into nextgenv2	2016-06-09 20:57:09 +00:00
Debargha Mukherjee	697bcef677	Add a couple of missing WRAPLOW checks To make coefficient checking consistent with the VP9 spec sections 8.7.1.6 and 8.7.1.1. Change-Id: I92e38e89a41d1e482317bb478c48ffa608d2d6ee	2016-06-09 12:58:27 -07:00

1 2 3 4 5 ...

678 Commits