generic-library/vpx

Author	SHA1	Message	Date
Zoe Liu	1af28f0230	Code refactoring on Macros related to ref frame numbers We have renamed following Macros to avoid name confusion: REFS_PER_FRAME --> INTER_REFS_PER_FRAME (= ALTREF_FRAME - LAST_FRAME + 1) MAX_REF_FRAMES --> TOTAL_REFS_PER_FRAME (= ALTREF_FRAME - INTRA_FRAME + 1) INTER_REFS_PER_FRAME specifies the maximum number of reference frames that each Inter frame may use. TOTAL_REFS_PER_FRAME is equal to INTER_REFS_PER_FRAME + 1, which counts the INTRA_FRAME. Further, at the encoder side, since REF_FRAMES specifies the maximum number of the reference frames that the encoder may store, REF_FRAMES is usually larger than INTER_REFS_PER_FRAME. For example, in the ext-refs experiment, REF_FRAMES == 8, which allows the encoder to store maximum 8 reference frames in the buffer, but INTER_REFS_PER_FRAME equals to 6, which allows each Inter frame may use up to 6 frames out of the 8 buffered frames as its references. Hence, in order to explore the possibility to store more reference frames in future patches, we modified a couple of array sizes to accomodate the case that the number of buffered reference frames is not always equal to the number of the references that are being used by each Inter frame. Change-Id: I19e42ef608946cc76ebfd3e965a05f4b9b93a0b3	2016-08-04 11:21:28 -07:00
hui su	9a4702417a	Extra round of subpel MV search around second best full-pixel MV Keep track of the best and second best full pixel motion vector candidates, and do subpel search around both of them. Compression improvement: lowres 0.22% midres 0.23% hdres 0.18% No noticeable encoding speed changes observed on lowres test clips. Change-Id: I5f4df2a03d1db061cfdfdba6138b27e9ea91f089	2016-07-18 12:25:24 -07:00
Debargha Mukherjee	5f8ea94c1f	Remove unused zcoeff_blk from PICK_MODE_CONTEXT and MACROBLOCK Change-Id: I42f98ce51871948244bdcaaaeb3d0191622116ae	2016-07-14 12:36:03 -07:00
hui su	581636d767	Refactor codes about motion search 1. Add "best_mv" in MACROBLOCK to store the best motion vector during motion search, so that we don't need to pass its pointer to various motion search functions. 2. Declare some functions as static when possible. 3. Fix some indents. Change-Id: I0778146c0866cbc55e245988c59222577ea8260e	2016-07-13 10:12:37 -07:00
Debargha Mukherjee	0eefe6edb4	Remove use_quant_fp speed feature Change-Id: I22f1299545d4c75d80e72d479be66f66ea142ef1	2016-06-29 13:58:53 -07:00
Geza Lore	92922be83c	Remove skip_txfm optimization. Commit 0d6980d7a1caa592058f8d5d618b012c160772f7 removed some use of the skip_txfm optimization, and the rest are not productive. The current use of this optimization is only used with --good and --cpu-used >= 3, however the overhead of this is higher than the speedup it yields. Removing this, and subsequently simplifying model_rd_for_sb yields a net encoder speedup: --cpu-used=0 ~1.5% faster --cpu-used=3 ~2.0% faster The code simplification is also significant. Change-Id: I1dd668c32de15a2e912c59c42379d0f9e1032ff8	2016-06-28 10:03:03 +01:00
Sarah Parker	fbe6fb2773	Add multiple quantization profiles to new_quant experiment Add the ability to pick between 3 quantization profiles. The profile is chosen based on the entropy context at the block level. Change-Id: Iaea0485798441b7d635962c2563f3a477f582dac	2016-06-24 16:16:13 -07:00
Yue Chen	02596589e7	(Cosmetics) Remove unnecessary new parameters in obmc experiment pred_variance in obmc experiment is equivalant to recon_variance in baseline Change-Id: Iba8fb9bd973898be5a0d87a507ceaf65c75bdc51	2016-06-22 06:24:32 +00:00
Geza Lore	2a588555bb	Pass segment id explicitly to quantizer init. This is purely refactoring in preparation of fixing supertx segment handling Change-Id: I74bcae34241fdf2b592e1cd45b67af77b9e16c9a	2016-06-14 16:07:37 +01:00
Sarah Parker	a21afd421b	Move new quant experiment from nextgen This experiment implements non-uniform quantization where the width of the bins increases gradually to more closely match a laplacian distribution of the coeficcients. Performance Gain: derflr: 0.15% hevcmr: 0.675% Change-Id: I25234244e3bcd94b87c1f77cf682190b61c8ef94	2016-06-10 08:06:22 -07:00
Jingning Han	025fa11c75	Take out skip_recode speed feature The assumption doesn't hold true in the current codebase. Remove this speed feature to simplify the codebase. Change-Id: I9b69f484c9b7cd612b825047cc5b2fce63ee0af7	2016-06-08 18:27:36 +00:00
hui su	f523d7b540	Add a speed feature for inter tx type search Seperate prediction mode and tx type search for inter modes. Enabled for speed >=1. baseline: speed increase 40% compression drop 0.30%/0.29% on lowres/midres ext-tx: speed increase 160% compression drop 1.08%/0.95% on lowres/midres Change-Id: Ieb34b1ee80df6980d16e26a5783e08cc0deae55b	2016-05-31 10:34:35 -07:00
hui su	38e6dd71bb	Add a speed feature for intra tx type search Add a speed feature to seperate prediction mode and tx type search for intra modes: search for best intra prediction mode with fixed default tx type first, then choose the best tx type for the selected mode. Coding performance drop: baseline lowres 0.10% midres 0.08% hdres 0.14% with ext-tx lowres 0.14% midres 0.25% hdres 0.20% Speed improvement is 20% for baseline and 17% for ext-tx. It is turned on for speed >= 1. Change-Id: Ia5e8d39e8a4e2e42c521bfde938f8b6a98ab24f9	2016-05-31 10:33:56 -07:00
Jingning Han	ec2ffda599	Handle zero motion vector residual This commit handles the zero motion vector residuals for single and compound reference modes, respectively. It improves the coding performance by 0.13% with no additional encoding complexity. Change-Id: I16075a836025bd2746da2ff4698fb9261e4b08c1	2016-04-18 18:14:01 -07:00
Jingning Han	c8312daad1	Refactor rd_variance_adjustment function Compute the reconstruction variance in the prediction mode search. Change-Id: Id9c7635a9c9f5383e61c0e427e95234211834301	2016-04-18 09:37:34 -07:00
Alex Converse	bb0e692151	Convert palette from double to float. About 20% less time spent coding in vp10_k_means(). Change-Id: I5cf7605cde869a269776197bace70de353b07d83	2016-04-07 15:17:30 -07:00
Geza Lore	511da8cbe5	Rename MI_BLOCK_SIZE and MI_MASK macros. Rename MI_BLOCK_SIZE.* -> MAX_MIB_SIZE.* (MIB is for MI Block). Rename MI_MASK.* -> MAX_MIB_MASK.* There are no functional changes. This is in preparation for coding the superblock size at the frame level, which will require some of these constants to become variables. The new names better reflect future semantics, and hence make the code clearer. Change-Id: Iee08d97554cf4cc16a5dc166a3ffd1ab91529992	2016-03-31 09:57:41 +01:00
Geza Lore	552d5cd715	Extend superblock size fo 128x128 pixels. If --enable-ext-partition is used at build time, the superblock size (sometimes also referred to as coding unit (CU) size) is extended to 128x128 pixels. Change-Id: Ie09cec6b7e8d765b7555ff5d80974aab60803f3a	2016-03-30 18:23:06 +01:00
hui su	8a128c2a72	Fixes for Palette mode This patch fixes 2 issues in Palette mode: 1. More memory is needed in PALETTE_BUFFER for 444 video format. 2. A merge issue caused by https://chromium-review.googlesource.com/#/c/333940/7 Change-Id: I2aedc7dfdfb6b66fbd600189ec6e1e2cc6120d40	2016-03-25 18:16:44 -07:00
Geza Lore	f8cfb72a32	Refactor bsse and skip_txfm in MACROBLOCK. Simple refactoring to 2 dimensional arrays, in preparation for 128 wide superblocks. Change-Id: I40d447bd9fbd4f755534ea3cc82fc8f4676cea07	2016-03-18 15:30:10 +00:00
Jingning Han	7174d637e8	Properly restore transform block skip flag in RD search This commit fixes an encoding issue related to var-tx and ref-mv experiments that causes the codec to use random values for transform block skip flag. Change-Id: I8daa6d6b88ea45b5bbeb81b43dd0eeff545c8e5a	2016-03-03 13:52:49 -08:00
Yue Chen	02e734168c	Merge "Optimizing obmc rd decision by checking the real rd cost" into nextgenv2	2016-02-23 23:05:06 +00:00
Yue Chen	a614262edb	Optimizing obmc rd decision by checking the real rd cost Instead of using model_rd_for_sb() to estimate the cost and make the decision on bmc/obmc, we use super_block_yrd/uvrd() to calculate and compare the real rd costs of bmc and obmc. Average bit-rate reduction(%) of obmc experiment: derflr/derfhd/hevcmr/hevchd 2.353/TBD/TBD/TBD Before the optimization, the coding gain was: 1.582/1.109/1.600/1.164 Note: there is still some mysterious bug because that compared to the previous version, the performance at low bit rate drops a lot. Change-Id: I8dbee04a272190f10516a3953c1ae690f8136766	2016-02-23 14:16:12 -08:00
Jingning Han	fec5988657	Unify motion vector cost system This commit unifies the motion vector cost buffers for full pixel and sub-pixel motion search. The new motion vector coding system provides 0.5% coding gains for 720p and above sequences and 0.2% for lower resolution sets. Change-Id: I927ec81eadc39d11a3c12b375221a1ddd2e8bf24	2016-02-21 22:21:28 -08:00
Jingning Han	03c01bc3c0	Account context based prob model for motion vector cost estimate This commit accounts for the context based probability model for motion vector cost estimate in rate-distortion optimization. Change-Id: Ia068a9395dcb4ecc348f128b17b8d24734660b83	2016-02-19 16:32:51 -08:00
Alex Converse	b3ad81288f	Port switch to 9-bit rate cost to vp10. Brings the following commits to vp10: 269428e Tie the bit cost scale to a define. d13385c Switch to 9-bit rate cost constants built on a 256 probability denominator. ad43a73 Fix a signed overflow in vp9 motion cost. 1c9b091 Fix some interger overflow errors fac947d Restore previous motion search bit-error scale. Change-Id: I598ba7ee7efcde18439c31dfa96b86cbf297a580	2016-02-11 09:54:24 -08:00
Yue Chen	968bbc7bb2	Adding new compound modes to EXT_INTER experiment Combinations of different mv modes for two reference frames are allowed in compound inter modes. 9 options are enabled, including NEAREST_NEARESTMV, NEAREST_NEARMV, NEAR_NEARESTMV, NEAREST_NEWMV, NEW_NEARESTMV, NEAR_NEWMV, NEW_NEARMV, ZERO_ZEROMV, and NEW_NEWMV. This experiment is mostly deported from the nextgen branch. It is made compatible with other experiments Coding gain of EXT_INTER(derflr/hevcmr/hevchd): 0.533%/0.728%/0.639% Change-Id: Id47e97284e6481b186870afbad33204b7a33dbb0	2016-01-22 13:52:16 -08:00
Jingning Han	33cc1bd21d	Generate compound reference motion vector This commit allows the codec to add motion vector pairs into the candidate list. It further improves the compression performance by 0.1% across derf, hevcmr, stdhd, and hevchr sets without adding encode/decode time. Change-Id: I88d36da25a2a89bb506d411844af667081eba98b	2016-01-12 15:28:47 -08:00
Jingning Han	6f1f0d896a	Merge "Enable adaptive prediction mode coding" into nextgenv2	2015-12-15 04:38:15 +00:00
Angie Chiang	30ee689da3	Merge "Refactor vp10_xform_quant" into nextgenv2	2015-12-11 20:29:04 +00:00
Yaowu Xu	f07d73b9bf	Merge branch 'master' into nextgenv2 Change-Id: Id0b784b115602e2502b42fa972a5ae210435a3be	2015-12-11 08:58:40 -08:00
Jingning Han	aa5d53eb17	Enable adaptive prediction mode coding This commit allows the codec to analyze the reference motion vector candidate list and adaptively reduce the size of inter prediction mode set. Change-Id: Ied6a403843b860d66f26ed485c1825c05c71bdfc	2015-12-10 09:02:32 -08:00
paulwilkins	4e692bbee2	Changes to exhaustive motion search. This change has been imported from VP9 and alters the nature and use of exhaustive motion search. Firstly any exhaustive search is preceded by a normal step search. The exhaustive search is only carried out if the distortion resulting from the step search is above a threshold value. Secondly the simple +/- 64 exhaustive search is replaced by a multi stage mesh based search where each stage has a range and step/interval size. Subsequent stages use the best position from the previous stage as the center of the search but use a reduced range and interval size. For example: stage 1: Range +/- 64 interval 4 stage 2: Range +/- 32 interval 2 stage 3: Range +/- 15 interval 1 This process, especially when it follows on from a normal step search, has shown itself to be almost as effective as a full range exhaustive search with step 1 but greatly lowers the computational complexity such that it can be used in some cases for speeds 0-2. This patch also removes a double exhaustive search for sub 8x8 blocks which also contained a bug (the two searches used different distortion metrics). For best quality in my test animation sequence this patch has almost no impact on quality but improves encode speed by more than 5X. Restricted use in good quality speeds 0-2 yields significant quality gains on the animation test of 0.2 - 0.5 db with only a small impact on encode speed. On most natural video clips, however, where the step search is performing well, the quality gain and speed impact are small. Change-Id: Iac24152ae239f42a246f39ee5f00fe62d193cb98	2015-12-08 16:54:42 +00:00
hui su	c93e5cc3e9	Bring palette back to nextgenv2 It was removed by the master branch merge. Change-Id: I4b2a524c9e052e41063359afcb4ba22bf78344cf	2015-12-07 18:24:15 -08:00
Yaowu Xu	69f4930041	Merge branch 'master' into nextgenv2 Conflicts: vp10/common/blockd.h vp10/common/entropymode.h vp10/common/reconintra.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encoder.h vp10/encoder/rd.c vp10/encoder/rdopt.c vp10/encoder/tokenize.h Change-Id: Ic4891839b6f0474026d6d69821e38edec9632df1	2015-12-07 11:37:14 -08:00
Angie Chiang	88cae8b422	Refactor vp10_xform_quant 1) Add facade to quantize b/fp/dc version so that their interface are the same. 2) Merge vp10_xform_quant b/fp/dc version to one function so that the code flow in encodemb.c is clear Change-Id: Ib62d6215438fc2d07f4e7e72393f964832d6746f	2015-12-03 15:28:11 -08:00
hui su	5d3327e891	Remove palette from VP10 Store it in nextgenv2 for now. Change-Id: Iab0af0e15246758e3b6e8bde4a74b13c410576fc	2015-12-03 12:30:47 -08:00
Jingning Han	e5c57c580a	Integrate motion vector stack into codec This commit ports the motion vector stack from motion field analyzer to the encoding and decoding pipeline. Change-Id: Ie283c1e1a15b4c17a1c7c175ce322bf053bb7840	2015-11-25 01:14:44 +00:00
Jingning Han	bfeac5e19c	Support per transform block skip coding Allow the encoder to drop individual transform block coding. Change-Id: I2c2b2985254cb92baf891f03daa33f067279373b	2015-10-30 08:55:17 -07:00
hui su	5d011cb278	VP10: Add palette mode part 1 Add palette mode for keyframe luma channel. Palette mode is enabled when using "--tune-content=screen" in encoding config parameters. on screen_content testset: +6.89% on derlr : +0.00% Design doc (WIP): https://goo.gl/lD4yJw Change-Id: Ib368b216bfd3ea21c6c27436934ad87afdaa6f88	2015-10-12 10:02:17 -07:00
Ronald S. Bultje	bab8d38f7f	vp10: remove MACROBLOCK.{highbd_,}itxfm_add function pointer. This is preparatory work for allowing per-segment lossless coding. See issue 1035. Change-Id: I9487d02717ee3e766aee61a487780056bb35d2d3	2015-09-25 19:30:46 -04:00
Ronald S. Bultje	c74b33a413	vp10: remove MACROBLOCK.fwd_txm4x4 function pointer. This is preparatory work for allowing per-segment lossless coding. See issue 1035. Change-Id: Idd72e2a42d90fa7319c10122032d1a7c7a54dc05	2015-09-25 19:30:46 -04:00
Jingning Han	c3bf837572	Refactor mbmi_ext structure This commit removes mbmi_ext_base pointer from MACROBLOCK struct. Its use case can be fully covered by cpi->mbmi_ext_base pointer. Change-Id: I155351609336cf5b6145ed13c21b105052727f30	2015-09-17 09:51:45 -07:00
Jingning Han	f137697c32	Take out skip_encode speed feature in vp10 Change-Id: Ic39d4523e78863c816b0fc85f56ea5ae5e0b3310	2015-09-10 12:45:39 -07:00
Yaowu Xu	2dcefd9c7f	Correct guard macros in header files Change-Id: Ifce12a95c1cdc36dc6ac5a72759249a17407da9e	2015-08-13 09:25:39 -07:00
Jingning Han	54d66ef165	Remove vp9_ prefix from vp10 files Remove the vp9_ prefix from vp10 file names. Change-Id: I513a211b286a57d6126fc1b0fbfd6405120014f1	2015-08-11 21:24:08 -07:00

46 Commits