generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	025fa11c75	Take out skip_recode speed feature The assumption doesn't hold true in the current codebase. Remove this speed feature to simplify the codebase. Change-Id: I9b69f484c9b7cd612b825047cc5b2fce63ee0af7	2016-06-08 18:27:36 +00:00
Jingning Han	0d6980d7a1	Remove swap buffer speed feature The inter prediction residual can undergo different transform types during the rate-distortion optimization search. The assumption used in this speed feature no longer holds true. This commit removes the related code to clean up the codebase and clear out unit test failure in higher speed setting. Change-Id: I7f7cd4df2345ed3e607c9fae75b38cd2dbde0cac	2016-06-08 11:27:00 -07:00
Jingning Han	33dafdb58b	Add tx type speed feature to recursive transform block partitioning Change-Id: I45440a72b4287d98cbe21b72defc67138a8eb953	2016-06-07 11:34:30 -07:00
Jingning Han	9a858e868c	Rework the tx type speed feature This commit re-works the transform type speed feature. It moves the transform type selection outside of the coding mode loop. This avoids repeated motion search if the best prediction mode is chosen as NEWMV. It improves the speed performance for clips that contain more motion activities. For mobile_cif at 1000 kbps, this makes the baseline encoding 7% faster and makes the encoding with dynamic motion vector referencing scheme enabled 10% faster. Change-Id: I93e2714b3e461303372c4b66a4134ee212faffd1	2016-06-07 11:32:27 -07:00
Jingning Han	3713949b6d	Merge "Make ref-mv experiment support ActiveMap" into nextgenv2	2016-06-06 16:06:41 +00:00
Debargha Mukherjee	b85d0adadf	Merge "Always include the cost of tx size in rate for Y." into nextgenv2	2016-06-03 22:57:17 +00:00
Debargha Mukherjee	33c57e6223	Merge "Check if sub8x8 rd stats are valid before reusing them." into nextgenv2	2016-06-03 22:38:56 +00:00
Debargha Mukherjee	fc61d92bf8	Merge "Compute rate of partition type accurately for edge blocks." into nextgenv2	2016-06-03 22:37:33 +00:00
Jingning Han	27d8a948c1	Make ref-mv experiment support ActiveMap Reset the ref_mv_idx and predicted motion vector when the coding block belongs to skip segment. Change-Id: I5746ab315a436b829b64a1a25121989d3c11c995	2016-06-03 15:04:18 -07:00
Geza Lore	b87078d51e	Always include the cost of tx size in rate for Y. The transform can only be skipped if both Y and U/V can be skipped, so we always include the cost of tx size in the rate for Y. This will get later subtracted if the transform is actually skipped. Change-Id: I136a223e5596f18b69bb9f743e7e08438183a215	2016-06-03 11:51:35 -07:00
Geza Lore	d9870c32a9	Check if sub8x8 rd stats are valid before reusing them. Change-Id: I5d49f15a07de58c226d4003b4691e001abf1f3f8	2016-06-03 11:47:34 -07:00
Geza Lore	8ee640f979	Compute cost of UV mode accurately for intra blocks. We used to cache the cost of the UV mode from the search with a different previously tried Y mode, but the UV mode is contexted on the Y mode, so caching the cost is inaccurate. Change-Id: Ib003510afb6fc9befb7808b67b0be64f1c0a0804	2016-06-03 11:13:51 -07:00
Geza Lore	1354c6942c	Compute rate of partition type accurately for edge blocks. This patch factors in the different partition coding syntax used for right and bottom edge blocks when doing RD search. Change-Id: I2f31650512b6a4a7a2c03352414693aff6fbf87b	2016-06-03 06:43:34 -07:00
Debargha Mukherjee	353930d212	Merge "Add 1D version of vpx_sum_squares_i16" into nextgenv2	2016-06-03 13:27:50 +00:00
Debargha Mukherjee	5590c48937	Merge "Move template specializations into .cc from .h" into nextgenv2	2016-06-03 13:27:43 +00:00
Debargha Mukherjee	cfa03374f8	Merge "Factor out x86 SIMD intrinsic synonyms" into nextgenv2	2016-06-03 13:27:29 +00:00
Debargha Mukherjee	1e160ce559	Merge "Factor out model_rd_from_sse" into nextgenv2	2016-06-03 13:27:22 +00:00
Debargha Mukherjee	cbf51c5ba0	Merge "Pre-compute and use contiguous wedge masks." into nextgenv2	2016-06-03 13:27:02 +00:00
Geza Lore	f19700fe52	Add 1D version of vpx_sum_squares_i16 Change-Id: I0d7bda2fe6f995a9e88a9f66540b4979b3f7fab1	2016-06-03 09:34:55 +01:00
Geza Lore	5a69ee0e11	Move template specializations into .cc from .h Change-Id: I6d8775c1fa228fde25016a401e3c22a8e3da42f9	2016-06-03 09:34:55 +01:00
Geza Lore	9ebca46933	Factor out x86 SIMD intrinsic synonyms Change-Id: Idc4ac3ccd2ba19087cdb74a3e4a6774ac50386aa	2016-06-03 09:34:55 +01:00
Geza Lore	73bc3119be	Factor out model_rd_from_sse Change-Id: Ia60ff0ecc8d083870fadbfe07d494d1e2c080489	2016-06-03 09:34:55 +01:00
Geza Lore	ab29978e9f	Pre-compute and use contiguous wedge masks. This is purely a refactoring patch and has no functional effect. Uses of these masks can be arranged such that all input blocks are contiguous in memory (stride == block width). In this case 1D versions of operations can be used. 1D vector operations have superior performance over 2D block equivalents as they are more processor cache friendly and they can do away with a second loop overhead. Change-Id: I2b76c9888aea2c857cc497e8a4b2841fd3dad54e	2016-06-03 00:16:22 -07:00
Debargha Mukherjee	17c4f1c7f5	Merge "Use standard rounding in combine_interintra." into nextgenv2	2016-06-02 19:29:16 +00:00
Debargha Mukherjee	7534a15c3a	Merge "Warped motion functions added" into nextgenv2	2016-06-02 19:28:03 +00:00
Geza Lore	888e90e823	Use standard rounding in combine_interintra. Use the same rounding method that is used throughout the codebase, where the halfway value is rounded up rather than down. Change-Id: I04e92850bc69a7d7a07b06e3d2ce97f6f2ada321	2016-06-02 16:26:05 +01:00
Alex Converse	380c4ee32d	Merge "segmentation: Don't use uninitialized probability data." into nextgenv2	2016-06-01 17:50:37 +00:00
Alex Converse	6bae20ca43	Merge "Replace some vpxbool calls with entropy coder agnostic calls." into nextgenv2	2016-05-31 23:58:00 +00:00
Alex Converse	7a6cb59dbb	segmentation: Don't use uninitialized probability data. BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1224 Change-Id: I17b76fcf0d8c191850350d5aa50dcc007b8b0cdc	2016-05-31 16:42:29 -07:00
Hui Su	afaefc89eb	Merge "ext-intra: speed up keyframe encoding" into nextgenv2	2016-05-31 23:21:03 +00:00
Hui Su	118167a47d	Merge "Add a speed feature for inter tx type search" into nextgenv2	2016-05-31 23:20:57 +00:00
Hui Su	60b52a1334	Merge "Add a speed feature for intra tx type search" into nextgenv2	2016-05-31 23:20:52 +00:00
James Zern	1d9cf262f7	Merge "vp10_inv_txfm2d_test: fix memory leak" into nextgenv2	2016-05-31 23:19:47 +00:00
Alex Converse	aee0091161	Replace some vpxbool calls with entropy coder agnostic calls. Change-Id: Ifbcd0714fcf994c43b69255185456c7a255df66c	2016-05-31 15:42:19 -07:00
Debargha Mukherjee	faf3c2cd38	Warped motion functions added Change-Id: I5064ef1421e17c3ecafe70e7ff1fc7db0c16cc8f	2016-05-31 14:03:23 -07:00
hui su	fa933553da	ext-intra: speed up keyframe encoding 130% speed increase for keyframe encoding, with 0.4% compression loss. When kf-max-dist=150, 1.5% speed increase with 0.03% compression loss. Change-Id: I4cf7314ab95b9eb6dd17f314aca8955522c82676	2016-05-31 10:34:44 -07:00
hui su	f523d7b540	Add a speed feature for inter tx type search Separate prediction mode and tx type search for inter modes. Enabled for speed >=1. baseline: speed increase 40% compression drop 0.30%/0.29% on lowres/midres ext-tx: speed increase 160% compression drop 1.08%/0.95% on lowres/midres Change-Id: Ieb34b1ee80df6980d16e26a5783e08cc0deae55b	2016-05-31 10:34:35 -07:00
hui su	38e6dd71bb	Add a speed feature for intra tx type search Add a speed feature to separate prediction mode and tx type search for intra modes: search for best intra prediction mode with fixed default tx type first, then choose the best tx type for the selected mode. Coding performance drop: baseline lowres 0.10% midres 0.08% hdres 0.14% with ext-tx lowres 0.14% midres 0.25% hdres 0.20% Speed improvement is 20% for baseline and 17% for ext-tx. It is turned on for speed >= 1. Change-Id: Ia5e8d39e8a4e2e42c521bfde938f8b6a98ab24f9	2016-05-31 10:33:56 -07:00
Zoe Liu	e89ca180c2	Make the bi-predictive frame group interval adjustable This is for the bidir-pred experiment. Previously the length of the bi-predictive frame group interval is fixed at 2, i.e. one bi-predictive frame may be inserted every other frame. This patch makes the length adjustable, i.e. any positive number may be specified, but the use of the backward ref will be turned off if the bi-predictive frame group interval is larger than the golden frame group. Further, an additional rate factor level has been added: INTER_LOW , which applies to LAST_BIPRED_UPDATE frames that are not used as references. Change-Id: I5514d34a64dd486bbb5756c2d0612946f598a789	2016-05-28 16:46:45 -07:00
Hui Su	6fd7f7dd3e	Merge "ext-intra: refactor mode info. writing and reading" into nextgenv2	2016-05-28 04:34:59 +00:00
James Zern	5d237f0986	vp10_inv_txfm2d_test: fix memory leak input_, ref_input_ and output_ were being allocated with new[] followed by vpx_memalign, remove the former Change-Id: Ia16d0f9b9317042a24445095ad3c284f4e7bb481	2016-05-26 20:04:59 -07:00
Hui Su	e717ece4ab	Merge "Add a quick path in build_intra_predictors" into nextgenv2	2016-05-26 22:12:53 +00:00
hui su	e5f47d4334	ext-intra: refactor mode info. writing and reading No performance changes. Change-Id: I001068330ea217a993aee9b79d7ffead0d23100e	2016-05-26 14:56:40 -07:00
Hui Su	88eaf5d6ce	Merge "Skip unnecessary calculations in ext-intra" into nextgenv2	2016-05-26 18:03:02 +00:00
hui su	bad6e169bf	Add a quick path in build_intra_predictors For the cases where no reference data is available. Change-Id: Ibf1ac9b7073acc2c7fc44da893f3d608dc74bc1e	2016-05-25 15:21:57 -07:00
Yi Luo	469d002f4e	Merge "Integrate HBD inverse HT flip types sse4.1 optimization" into nextgenv2	2016-05-25 21:35:14 +00:00
Yi Luo	bfe4c0ae07	Integrate HBD inverse HT flip types sse4.1 optimization - tx_size: 4x4, 8x8, 16x16. - tx_type: FLIPADST_DCT, DCT_FLIPADST, FLIPADST_FLIPADST, ADST_FLIPADST, FLIPADST_ADST. - Encoder speed improvement: park_joy_1080p_12: ~11%, crowd_run_1080p_12: ~7%. - Add unit test cases for bit-exact against C. Change-Id: Ia69d069031fa76c4625e845bfbfe7e6f6ed6e841	2016-05-25 12:32:10 -07:00
James Zern	008f27e70a	Merge "add vp10 ActiveMap/ActiveMapRefreshTest" into nextgenv2	2016-05-25 19:05:02 +00:00
Yi Luo	cb507ff29a	Merge "HBD inverse HT 8x8 and 16x16 sse4.1 optimization" into nextgenv2	2016-05-24 22:06:07 +00:00
Zoe Liu	cf5083d4cd	Added an experiment "bidir_pred" for backward prediction Major parts have been implemented as follows: (1) Added BRF_UPDATE, LASTNRF_UPDATE, and NRF_UPDATE in firstpass.c; (2) Added the handling for the scenario of "cpi->common.show_existing_frame == 1" at the encoder; (3) Added a new reference frame of BWDREF_FRAME; (4) Have bwd-ref work with upsampled references. Note that when the experiment of "ext_refs" turned on, this experiment will be turned off automatically currently. RD performance in Overall PSNR has been improved, compared against the VP10 baseline: lowres: Avg -3.312; BDRate -3.154 derflr: Avg -1.927; BDRate -1.176 midres: Avg -2.149; BDRate -2.001 hdres : Avg -0.567; BDRate -0.588 Change-Id: I4c06ff51cc20194bffbd4d2346e57ba3dcf6b62c	2016-05-24 13:55:57 -07:00
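
Commit ab29978e9f above notes that when a block is contiguous in memory (stride == block width), a 1D operation can replace its 2D equivalent and drop the second loop. The sketch below illustrates that idea for a sum of squares over int16 samples. It is not the code from the commit: the function names sum_squares_2d and sum_squares_1d and their signatures are hypothetical, chosen only for this illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical 2D helper: sum of squares over a w x h block whose rows
 * are `stride` samples apart. Needs an outer row loop. */
static uint64_t sum_squares_2d(const int16_t *src, size_t stride,
                               size_t w, size_t h) {
  uint64_t sum = 0;
  for (size_t r = 0; r < h; ++r) {
    for (size_t c = 0; c < w; ++c) {
      const int32_t v = src[r * stride + c];
      sum += (uint64_t)(v * v); /* |v| <= 32768, so v * v fits in int32 */
    }
  }
  return sum;
}

/* Hypothetical 1D helper: sum of squares over n consecutive samples.
 * A single linear pass, with no stride arithmetic. */
static uint64_t sum_squares_1d(const int16_t *src, size_t n) {
  uint64_t sum = 0;
  for (size_t i = 0; i < n; ++i) {
    const int32_t v = src[i];
    sum += (uint64_t)(v * v);
  }
  return sum;
}
```

When stride == w, sum_squares_2d(src, w, w, h) and sum_squares_1d(src, w * h) visit exactly the same samples, so a caller that arranges its blocks contiguously can use the 1D form. With a padded stride the two differ, because the 1D pass would also read the padding.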


16443 Commits