generic-library/vpx

Author	SHA1	Message	Date
Linfeng Zhang	af7fb17c09	Upgrade fwht4x4_mmx() to fwht4x4_sse2() for vp9 and vp10. Function level timing test shows about 27% time saving on a Xeon E5-2680 v2 desktop. Rename vp9_dct_sse2.c to vp9_dct_intrin_sse2.c for vp9 and rename dct_sse2.c to dct_intrin_sse2.c for vp10 to avoid duplicate basenames. Actually vp9_fwht4x4_mmx/sse2() and vp10_fwht4x4_mmx/sse2() are identical. TODO: They should be unified later if there is no intention to keep a duplicate. Change-Id: I3e537b7bbd9ba417c606cd7c68c4dbbfa583f77d	2016-05-27 09:51:16 -07:00
Hui Su	e717ece4ab	Merge "Add a quick path in build_intra_predictors" into nextgenv2	2016-05-26 22:12:53 +00:00
hui su	e5f47d4334	ext-intra: refactor mode info. writing and reading No performance changes. Change-Id: I001068330ea217a993aee9b79d7ffead0d23100e	2016-05-26 14:56:40 -07:00
Hui Su	88eaf5d6ce	Merge "Skip unnecessary calculations in ext-intra" into nextgenv2	2016-05-26 18:03:02 +00:00
hui su	bad6e169bf	Add a quick path in build_intra_predictors For the cases where no reference data is available. Change-Id: Ibf1ac9b7073acc2c7fc44da893f3d608dc74bc1e	2016-05-25 15:21:57 -07:00
Yi Luo	bfe4c0ae07	Integrate HBD inverse HT flip types sse4.1 optimization - tx_size: 4x4, 8x8, 16x16. - tx_type: FLIPADST_DCT, DCT_FLIPADST, FLIPADST_FLIPADST, ADST_FLIPADST, FLIPADST_ADST. - Encoder speed improvement: park_joy_1080p_12: ~11%, crowd_run_1080p_12: ~7%. - Add unit test cases for bit-exact against C. Change-Id: Ia69d069031fa76c4625e845bfbfe7e6f6ed6e841	2016-05-25 12:32:10 -07:00
Yi Luo	cb507ff29a	Merge "HBD inverse HT 8x8 and 16x16 sse4.1 optimization" into nextgenv2	2016-05-24 22:06:07 +00:00
Zoe Liu	cf5083d4cd	Added an experiment "bidir_pred" for backward prediction Major parts have been implemented as follows: (1) Added BRF_UPDATE, LASTNRF_UPDATE, and NRF_UPDATE in firstpass.c; (2) Added the handling for the scenario of "cpi->common.show_existing_frame == 1" at the encoder; (3) Added a new reference frame of BWDREF_FRAME; (4) Have bwd-ref work with upsampled references. Note that when the experiment of "ext_refs" turned on, this experiment will be turned off automatically currently. RD performance in Overall PSNR has been improved, compared against the VP10 baseline: lowres: Avg -3.312; BDRate -3.154 derflr: Avg -1.927; BDRate -1.176 midres: Avg -2.149; BDRate -2.001 hdres : Avg -0.567; BDRate -0.588 Change-Id: I4c06ff51cc20194bffbd4d2346e57ba3dcf6b62c	2016-05-24 13:55:57 -07:00
Yi Luo	28cdee448d	HBD inverse HT 8x8 and 16x16 sse4.1 optimization - Covers tx_type: DCT_DCT, DCT_ADST, ADST_DCT, ADST_ADST. - Encoding speed improves ~27% on crowd_run_1080p_12. - Merge 4x4, 8x8, 16x16 unit tests in one test file. Change-Id: I058ef5254d068a9523a826480c78ebbdd231824c	2016-05-24 12:55:30 -07:00
Debargha Mukherjee	89f5b6a0b6	Merge "Remove redundant memcpy from wedge predictor." into nextgenv2	2016-05-24 17:33:54 +00:00
Debargha Mukherjee	416da08102	Merge "Pick up bit-depth from the right place" into nextgenv2	2016-05-24 17:33:22 +00:00
Geza Lore	2935b4db0e	Remove redundant memcpy from wedge predictor. Removing redundant calls to memcpy from build_wedge_inter_predictor_from_buf yields a net 4% encoder speedup with ext-inter only. The output is identical. Change-Id: If97d4e323a5c8aca90c84a25a72085e006b05446	2016-05-24 11:31:18 +01:00
Geza Lore	62b6331753	Pick up bit-depth from the right place Change-Id: Icbdb036d7927b77b84bd78e8348ec8b5be88df08	2016-05-24 11:08:23 +01:00
hui su	4a741a5d5c	Skip unnecessary calculations in ext-intra Around 5% speedup. Change-Id: I1c552e4e58fbf5637c0b5a97dd2cc4f83a1ca201	2016-05-23 17:24:19 -07:00
Zoe Liu	a63147ae77	Fix --test-decode=warn to test mismatch This patch always compares the most recent show frames between the encoder and the decoder to test the mismatch. Change-Id: I68a91ad0996a598231450debfd616e24992419b5	2016-05-23 17:01:53 -07:00
Debargha Mukherjee	fb65f9b54b	Merge "Add optimized vpx_blend_mask6" into nextgenv2	2016-05-23 23:43:52 +00:00
Geza Lore	a661bc87c4	Add optimized vpx_blend_mask6 This is to replace vp10/common/reconinter.c:build_masked_compound. Functionality is equivalent, but the interface is slightly more generic. Total encoder speedup with ext-inter: ~7.5% Change-Id: Iee18b83ae324ffc9c7f7dc16d4b2b06adb4d4305	2016-05-23 16:28:58 +01:00
Debargha Mukherjee	fa5022978d	Merge "Wedge refactoring to handle signs better" into nextgenv2	2016-05-20 23:19:39 +00:00
Debargha Mukherjee	e5de2ad632	Wedge refactoring to handle signs better Mostly refactoring. Handles signs better though results are more or less neutral. Change-Id: If499537c8f8da4f34d104ebfda072eb4c85fb12f	2016-05-20 14:12:52 -07:00
Yaowu Xu	93921097a6	Merge "Properly handle the filter extension in highbd setting" into nextgenv2	2016-05-20 20:00:51 +00:00
Yaowu Xu	7fd0e1b991	Merge "Port change to highbitdepth code path" into nextgenv2	2016-05-20 19:59:41 +00:00
Yaowu Xu	ba794ea356	Port change to highbitdepth code path This fixes the crash in encoder when configure with both highbitdepth and dual-filter. Change-Id: Ie06cc528094f4b31b7fc0ba75e7b15cae031d707	2016-05-20 11:30:37 -07:00
Hui Su	83713e7059	Merge "Use standard rounding in intra filters." into nextgenv2	2016-05-20 16:36:04 +00:00
Jingning Han	f1c283f4de	Merge "Rework sub8x8 chroma component inter predictor" into nextgenv2	2016-05-20 15:50:32 +00:00
Yaowu Xu	a9fc1cc257	Fix a build issue When both obmc and dual_filter is enabled. Change-Id: I56b127573a6cca31469bb357cf7a6a9c3df64071	2016-05-19 14:24:41 -07:00
Yue Chen	a33e3d12cb	Merge "Fix obmc + ext-interp interference" into nextgenv2	2016-05-19 18:08:52 +00:00
Jingning Han	d84a2e7dc0	Properly handle the filter extension in highbd setting This commit makes the filter extension in highbd aware of the dual filter and ext-interp experiments to prevent enc/dec mismatch when both experiments are turned on. Change-Id: I11ac1f041bd5f73d61e839d6386d9c5d008da3f7	2016-05-19 09:59:48 -07:00
Yi Luo	5fec33012e	Merge "Fix to conform Google's coding convention" into nextgenv2	2016-05-19 16:07:01 +00:00
Jingning Han	0f513752a0	Rework sub8x8 chroma component inter predictor This commit makes the sub8x8 chroma component inter predictor operate at 2x2 block level. This allows one to use the actual motion vector associated with each individal pixel block. It improves the compression performance lowres 0.40% midres 0.25% hdres 0.15% Change-Id: Ia40e07cc7fde463dbf660018850e024932136c4f	2016-05-19 09:03:57 -07:00
Jingning Han	936ed0804d	Merge "Account sub8x8 block reference filter type for prob context" into nextgenv2	2016-05-19 15:04:31 +00:00
Jingning Han	300083da27	Merge "Re-structure probability model context for prediction filter type" into nextgenv2	2016-05-19 15:04:18 +00:00
Geza Lore	fa63b5514a	Use standard rounding in intra filters. Use the same rounding method that is used throughout the codebase, where the halfway value is rounded up rather than down. Change-Id: Ie969ed7eb9fcc88a93a90c7e274fd82f336c7e4d	2016-05-19 13:16:42 +01:00
Geza Lore	009bd1153e	Fix obmc + ext-interp interference With ext-interp, a switchable interpolation filter is coded iff the motion vector uses fractional pixel movement (ie, true subpixel movement). With ext-interp and obmc enabled at the same time, the RD search proceeds as: 1. Do motion search 2. Do interpolation filter search iff subpixel motion, otherwise use EIGHTTAP_REGULAR 3. Evaluate obmc=0 4. Evaluete obmc=1 - This involves another motion search If the motion search in step 4 yields an integer motion vector, while the search in step 1 did not, then an interp_filter value other than EIGHTTAP_REGULAR is invalid, and will cause an assertion failure at output time, or a mismatch if not using --enable-debug. The fix sets the interp_filter to EIGHTTAP_REGULAR if obmc=1 is picked with an integer motion vector. Change-Id: I4685d1ad537f41d833dc9eb64845956b67886cca	2016-05-19 11:30:07 +01:00
Yi Luo	346d2449f0	Fix to conform Google's coding convention - Confirm input coeff buffer is 16-byte aligned. - sizeof() prefer variable name instead of type. - Fix function name (Capital first letter then Pascal case). - Long base class name uses a newline (with colon and 4 space indent). - Remove a unnecessary reference function variable. - Method declaration precedes variable declaration in class definition. Change-Id: I317f7e679926b5219f58c5f7d14512e94985e7fe	2016-05-18 18:15:53 -07:00
Zoe Liu	011f020447	Refactor on getting upsampled reference frame Reused a function that has been used in getting the normal reference frames. Change-Id: Ic4f7dac5c396d689a72699ab79fd580747f8bd65	2016-05-18 16:00:23 -07:00
Jingning Han	9161464f6c	Account sub8x8 block reference filter type for prob context If a reference block is coded with sub8x8 block size, and if it has sub-pixel level motion vectors, its prediction filter type should be used as context information. The coding performance gains of dual filter type coding scheme are lowres 0.57% hdres 0.88% Change-Id: I68b98f2518d02f11c29d0256aeb45b2580fe5cac	2016-05-18 12:35:31 -07:00
Angie Chiang	6f28581b26	Turn on flip in inverse txfm2d Fix build failed Reduce txfm test time Change-Id: Ieaf6b27f3a272d06286f817f01230413fa8adcf6	2016-05-18 11:26:57 -07:00
Jingning Han	27d44a1843	Re-structure probability model context for prediction filter type This commit reworks the probability model contexts used in the prediction filter type entropy coding. Change-Id: I7abc68cb469248d0d7ca1046da3c086ecb7b066a	2016-05-18 11:11:43 -07:00
Yi Luo	18ecb16c30	Merge "Integrate HBD row/column flip fwd txfm SSE4.1 optimization" into nextgenv2	2016-05-18 17:45:45 +00:00
Debargha Mukherjee	f1ddf6eb04	Merge "Reducing computation of interintra modes" into nextgenv2	2016-05-18 17:21:15 +00:00
Yi Luo	1d307368a9	Integrate HBD row/column flip fwd txfm SSE4.1 optimization - Integrate 5 flip transform types for each 4x4, 8x8, and 16x16 block, for experiment, EXT_TX. - Encoder speed improves about 12%-15%. - Update the unit tests for bit-exact result against C. Change-Id: Idf27c87f1e516ca5b66c7b70142477a115404ccb	2016-05-18 03:48:01 +00:00
Jingning Han	9f55543c06	Merge "Silience compiler warnings in unsigned int" into nextgenv2	2016-05-18 01:18:46 +00:00
Jingning Han	436f78fab7	Silience compiler warnings in unsigned int Add suffix u to clarify the unsigned int constant when the value is above 2^31. Change-Id: Ic712096285b7bf37deaeb5ad1b6b117fc0d67093	2016-05-17 16:46:42 -07:00
Debargha Mukherjee	049dbe7786	Reducing computation of interintra modes Use model for interintra mode search. Speed-up about 5-10% with about 0.04 drop in efficiency. lowres: -2.60% Change-Id: I825bf0ba8a46eb7f19fc528c25b8df066fb8ea95	2016-05-17 07:28:06 -07:00
James Zern	a81a75184c	Merge "vp10/rdopt,rd_pick_intra4x4block: port tsan fix from vp9" into nextgenv2	2016-05-17 03:04:00 +00:00
James Zern	8eba4ac46e	vp10/rdopt,rd_pick_intra4x4block: port tsan fix from vp9 minus the non-existent nonrd portion. original change: commit d642294b1c57a5adacb1038ff45766c38bae8a6d Author: Jingning Han <jingning@google.com> Date: Thu Feb 11 12:36:49 2016 -0800 Fix tsan error in VP9 sub8x8 intra mode search This commit fixes issue 1141. The issue was triggered in multi-tile encoding. The change properly saves and restores the block context information in the real-time mode selection process. It removes several redundant memcpy operations in sub8x8 intra block mode search. Change-Id: I35c9ad197f4bd500ec39b5fc833f052f19eee010 Change-Id: Idfa38c54c9e645479f6870d46f71fb1e91c071da	2016-05-16 17:20:29 -07:00
Jingning Han	4677e1a718	Unify the per directional filter type system for compound modes For the current stage, we assume a single prediction filter type per direction in the settings of compound inter prediction modes. Change-Id: I12a1afdd364b93fcee870bd11ad01fc40ab48cff	2016-05-16 14:41:08 -07:00
Jingning Han	d567e14e81	Enable per motion component filter type selection Change-Id: I73823fc94f296d225dece7156de71b30bae3fcb7	2016-05-16 14:38:43 -07:00
Jingning Han	c4e7fde68a	Merge "Properly handle 2D filter boundary extension" into nextgenv2	2016-05-16 21:34:28 +00:00
Yi Luo	ceabb00704	Merge "HBD inverse HT 4x4 SSE4.1 optimization" into nextgenv2	2016-05-16 21:15:08 +00:00

... 5 6 7 8 9 ...

1473 Commits