generic-library/vpx

Author	SHA1	Message	Date
Zoe Liu	1af28f0230	Code refactoring on Macros related to ref frame numbers We have renamed following Macros to avoid name confusion: REFS_PER_FRAME --> INTER_REFS_PER_FRAME (= ALTREF_FRAME - LAST_FRAME + 1) MAX_REF_FRAMES --> TOTAL_REFS_PER_FRAME (= ALTREF_FRAME - INTRA_FRAME + 1) INTER_REFS_PER_FRAME specifies the maximum number of reference frames that each Inter frame may use. TOTAL_REFS_PER_FRAME is equal to INTER_REFS_PER_FRAME + 1, which counts the INTRA_FRAME. Further, at the encoder side, since REF_FRAMES specifies the maximum number of the reference frames that the encoder may store, REF_FRAMES is usually larger than INTER_REFS_PER_FRAME. For example, in the ext-refs experiment, REF_FRAMES == 8, which allows the encoder to store maximum 8 reference frames in the buffer, but INTER_REFS_PER_FRAME equals to 6, which allows each Inter frame may use up to 6 frames out of the 8 buffered frames as its references. Hence, in order to explore the possibility to store more reference frames in future patches, we modified a couple of array sizes to accomodate the case that the number of buffered reference frames is not always equal to the number of the references that are being used by each Inter frame. Change-Id: I19e42ef608946cc76ebfd3e965a05f4b9b93a0b3	2016-08-04 11:21:28 -07:00
Yaowu Xu	b06147de6b	Cherry pick from AOMedia 5b5fbad VP9LfSync->VP10LfSync b752848 vp8_yv12 -> vpx_yv12 e5068cd VP9->VPX for reference frame flags Change-Id: Ia36860499c81a5aca8cd6190e7370ec404c0df0f	2016-08-02 16:24:41 -07:00
Debargha Mukherjee	e5848dea5a	Rectangular transforms 4x8 & 8x4 Added a new expt rect-tx to be used in conjunction with ext-tx. [rect-tx is a temporary config flag and will eventually be merged into ext-tx once it works correctly with all other experiments]. Added 4x8 and 8x4 tranforms for use initially with rectangular sub8x8 y blocks as part of this experiment. There is about a -0.2% BDRATE improvement on lowres, others pending. When var-tx is on rectangular transforms are currently not used. That will be enabled in a subsequent patch. Change-Id: Iaf3f88ede2740ffe6a0ffb1ef5fc01a16cd0283a	2016-07-21 10:46:41 -07:00
Jingning Han	b605de074d	Refactor reference frame type defs Move the reference frame type definitions to common/enums.h file. Replace hard coded numbers. Combine repeated definitions. Change-Id: I288e079a03e448014cc181bcdb3f88ee8ec8d139	2016-06-22 12:34:44 -07:00
Zoe Liu	5805a14ca6	Merge bi-predictive frames to EXT_REFS This patch removed the experiment of BIDIR_PRED and merged the feature into the experiment of EXT_REFS: (1) Each frame now has up to 6 reference frames, namely LAST_FRAME, LAST2_FRAME, LAST3_FRAME, GOLDEN_FRAME, (forward) and BWDREF_FRAME, ALTREF_FRAME (backward); LAST4_FRAME has been removed; (2) First pass still keeps the 8 updates: KF_UPDATE, LF_UPDATE, GF_UPDATE, ARF_UPDATE, OVERLAY_UPDATE, and BRF_UPDATE, LAST_BIPRED_UPDATE, BI_PRED_UPDATE; (3) show_existing_frame==1 is supported in the experiment of EXT_REFS; (4) New encoding modes are added for both single-ref and compound cases, through the use of the 2 extra forward references (LAST2 & LAST3) and the 1 extra backward reference (BWDREF). RD performance wise, using Overall PSNR: Avg/BDRate Bipred only Prev EXT_REFS Current EXT_REFS with bipred lowres: -3.474/-3.324 -1.748/-1.586 -4.613/-4.387 derflr: -2.097/-1.353 -1.439/-1.215 -3.120/-2.252 midres: -2.129/-1.901 -1.345/-1.185 -2.898/-2.636 If in vp10/encoder/firstpass.h, change BFG_INTERVAL from 2 to 3, i.e. to use 2 bi-predictive frames than 1, a further improvement may be obtained: Current EXT_REFS with bipred 1 bi-predictive frame 2 bi-predictive frames lowres: -4.613/-4.387 -4.675/-4.465 derflr: -3.120/-2.252 -3.333/-2.516 midres: -2.898/-2.636 -3.406/-3.095 Change-Id: Ib06fe9ea0a5cfd7418a1d79b978ee9d80bf191cb	2016-06-17 12:43:39 -07:00
Debargha Mukherjee	81f8b3f31c	Merge "Some refactoring to support warped motion mode" into nextgenv2	2016-06-10 23:18:39 +00:00
Debargha Mukherjee	03be30ba3e	Some refactoring to support warped motion mode Change-Id: I15d54a3ae48b2b33082668116792c6595bdb3ddb	2016-06-10 12:04:18 -07:00
Jingning Han	68cd946994	Add MIN_TX_SIZE definition Change-Id: I399d601d40827ac383a6687cbeaec59e9a9c63e4	2016-06-08 11:29:02 -07:00
Zoe Liu	cf5083d4cd	Added an experiment "bidir_pred" for backward prediction Major parts have been implemented as follows: (1) Added BRF_UPDATE, LASTNRF_UPDATE, and NRF_UPDATE in firstpass.c; (2) Added the handling for the scenario of "cpi->common.show_existing_frame == 1" at the encoder; (3) Added a new reference frame of BWDREF_FRAME; (4) Have bwd-ref work with upsampled references. Note that when the experiment of "ext_refs" turned on, this experiment will be turned off automatically currently. RD performance in Overall PSNR has been improved, compared against the VP10 baseline: lowres: Avg -3.312; BDRate -3.154 derflr: Avg -1.927; BDRate -1.176 midres: Avg -2.149; BDRate -2.001 hdres : Avg -0.567; BDRate -0.588 Change-Id: I4c06ff51cc20194bffbd4d2346e57ba3dcf6b62c	2016-05-24 13:55:57 -07:00
Sarah Parker	3da61efe3b	Add 1D tx set that corresponds to reduced ext tx inter sets This is the set of 1D transforms that are used in each ext_tx_used_inter set. The 1D sets will help speed up the ext tx pruning functions. Change-Id: Ib46ad26be2df60b3bfcd2f22d96e7f38ae286df5	2016-05-04 11:42:32 -07:00
Debargha Mukherjee	7ff7943455	Brings back near-near compound mode into ext-inter lowres: improves by 0.1% Change-Id: I245019916bf47c6e24bc8c3953b86715ab0193c9	2016-04-28 11:34:13 -07:00
Geza Lore	c50aaf3049	Make ext-refs respect encoding flags. The VP8_EFLAG_NO_UPD_LAST and VP8_EFLAG_NO_REF_LAST flags can be passed to the encoder to signal that it should not update/reference the LAST ref frame when encoding the current frame. With --enable-ext-refs turned on, the new LAST2 LAST3 and LAST4 ref frames could still be used or updated, which causes the VP10/ErrorResilienceTestLarge.DropFramesWithoutRecovery/{0,1,2} tests to fail. With this patch, if --enable-ext-refs is used, then VP8_EFLAG_NO_UPD_LAST and VP8_EFLAG_NO_REF_LAST also applies to the new LAST2 LAST3 and LAST4 ref frames, as well as the LAST ref frame. Change-Id: If482b1c09bbaf914eca8e0348a2367bff261661d	2016-04-13 12:03:58 +01:00
Geza Lore	f2be4f6058	Refactor PC_TREE root handling. Change-Id: Id8b16c1b18bd6f909e72aae3fd582dd3503c88c6	2016-04-08 17:01:00 +01:00
Geza Lore	454989ff32	Make superblock size variable at the frame level. The uncompressed frame header contains a bit to signal whether the frame is encoded using 64x64 or 128x128 superblocks. This can vary between any 2 frames. vpxenc gained the --sb-size={64,128,dynamic} option, which allows the configuration of the superblock size used (default is dynamic). 64/128 will force the encoder to always use the specified superblock size. Dynamic would enable the encoder to choose the sb size for each frame, but this is not implemented yet (dynamic does the same as 128 for now). Constraints on tile sizes depend on the superblock size, the following is a summary of the current bitstream syntax and semantics: If both --enable-ext-tile is OFF and --enable-ext-partition is OFF: The tile coding in this case is the same as VP9. In particular, tiles have a minimum width of 256 pixels and a maximum width of 4096 pixels. The tile width must be multiples of 64 pixels (except for the rightmost tile column). There can be a maximum of 64 tile columns and 4 tile rows. If --enable-ext-tile is OFF and --enable-ext-partition is ON: Same constraints as above, except that tile width must be multiples of 128 pixels (except for the rightmost tile column). There is no change in the bitstream syntax used for coding the tile configuration if --enable-ext-tile is OFF. If --enable-ext-tile is ON and --enable-ext-partition is ON: This is the new large scale tile coding configuration. The minimum/maximum tile width and height are 64/4096 pixels. Tile width and height must be multiples of 64 pixels. The uncompressed header contains two 6 bit fields that hold the tile width/heigh in units of 64 pixels. The maximum number of tile rows/columns is only limited by the maximum frame size of 65536x65536 pixels that can be coded in the bitstream. This yields a maximum of 1024x1024 tile rows and columns (of 64x64 tiles in a 65536x65536 frame). If both --enable-ext-tile is ON and --enable-ext-partition is ON: Same applies as above, except that in the bitstream the 2 fields containing the tile width/height are in units of the superblock size, and the superblock size itself is also coded in the bitstream. If the uncompressed header signals the use of 64x64 superblocks, then the tile width/height fields are 6 bits wide and are in units of 64 pixels. If the uncompressed header signals the use of 128x128 superblocks, then the tile width/height fields are 5 bits wide and are in units of 128 pixels. The above is a summary of the bitstream. The user interface to vpxenc (and the equivalent encoder API) behaves a follows: If --enable-ext-tile is OFF: No change in the user interface. --tile-columns and --tile-rows specify the base 2 logarithm of the desired number of tile columns and tile rows. The actual number of tile rows and tile columns, and the particular tile width and tile height are computed by the codec ensuring all of the above constraints are respected. If --enable-ext-tile is ON, but --enable-ext-partition is OFF: No change in the user interface. --tile-columns and --tile-rows specify the WIDTH and HEIGHT of the tiles in unit of 64 pixels. The valid values are in the range [1, 64] (which corresponds to [64, 4096] pixels in increments of 64. If both --enable-ext-tile is ON and --enable-ext-partition is ON: If --sb-size=64 (default): The user interface is the same as in the previous point. --tile-columns and --tile-rows specify tile WIDTH and HEIGHT, in units of 64 pixels, in the range [1, 64] (which corresponds to [64, 4096] pixels in increments of 64). If --sb-size=128 or --sb-size=dynamic: --tile-columns and --tile-rows specify tile WIDTH and HEIGHT, in units of 128 pixels in the range [1, 32] (which corresponds to [128, 4096] pixels in increments of 128). Change-Id: Idc9beee1ad12ff1634e83671985d14c680f9179a	2016-04-07 10:34:25 +01:00
Debargha Mukherjee	2a6389bb8b	Merge "Fix interpolation values and decouple interintra" into nextgenv2	2016-03-31 21:47:10 +00:00
Debargha Mukherjee	2be211e971	Fix interpolation values and decouple interintra Decouples interintra modes and probability models from regular intra modes, to enable creating/optimizing new interintra modes. Also, fixes interpolation values for 128x128 interintra and obmc. Change-Id: I5c2016db49b8f029164e5fe84c6274d4e02ff90e	2016-03-31 12:12:51 -07:00
Geza Lore	511da8cbe5	Rename MI_BLOCK_SIZE and MI_MASK macros. Rename MI_BLOCK_SIZE.* -> MAX_MIB_SIZE.* (MIB is for MI Block). Rename MI_MASK.* -> MAX_MIB_MASK.* There are no functional changes. This is in preparation for coding the superblock size at the frame level, which will require some of these constants to become variables. The new names better reflect future semantics, and hence make the code clearer. Change-Id: Iee08d97554cf4cc16a5dc166a3ffd1ab91529992	2016-03-31 09:57:41 +01:00
Geza Lore	552d5cd715	Extend superblock size fo 128x128 pixels. If --enable-ext-partition is used at build time, the superblock size (sometimes also referred to as coding unit (CU) size) is extended to 128x128 pixels. Change-Id: Ie09cec6b7e8d765b7555ff5d80974aab60803f3a	2016-03-30 18:23:06 +01:00
Geza Lore	490ba1ad25	Port large scale tile coding features from nextgen. If configured with --enable-ext-tile, the codec uses an alternative tile coding syntax in the bitstream. Changes include:: - The maximum number of tile rows and columns is extended to 1024 each. - The minimum tile width/height is 64 pixels (1 superblock). - A tile copy mode is added where a tile directly reuse the coded data of a previous tile - The meaning of the tile-columns and tile-rows codec parameters are overloaded to mean tile-width and tile-height in units of 64 pixels. - All tiles should now be independent, including rows within the same columns, so large scale parallel, or independent decoding is possible. - vpxdec also gained the options to decode only a particular tile, tile row, or tile column. Changes without --enable-ext-tile: - All tiles should now be independent, including rows within the same columns, so large scale parallel, or independent decoding is possible. - vpxenc default tile configuration changed to use 1 tile column. Change-Id: I0cd08ad550967ac18622dae5e98ad23d581cb33e	2016-03-24 09:26:05 +00:00
Debargha Mukherjee	7a3bae768e	Merge "Porting ext_partition experiment from nextgen" into nextgenv2	2016-03-23 04:58:38 +00:00
Julia Robson	5cce322a09	Porting ext_partition experiment from nextgen This has been ported under ext_partition_types because it is due to be combined with the coding_unit_size experiment which is already being ported under ext_partition Change-Id: I47af869ae123ddf0aa99160dac644059d14266ee	2016-03-22 12:29:01 -07:00
Jingning Han	bfdcccd8a1	Merge "Rework the DRL syntax entropy coding system" into nextgenv2	2016-03-22 00:07:36 +00:00
Debargha Mukherjee	1b17559327	Adds 1D transforms for ADST/FlipADST to make 16 Makes a set of 16 transforms total, adding all 1D combinations of ADST and FlipADST, and removng all DST transforms. lowres, midres both improve by about 0.1% and hdres by -0.378% in BDRATE but with fewer transforms that are also simpler. Further experiments to continue later. Change-Id: I7348a4c0e12078fdea5ae3a2d36a89a319ffcc6e	2016-03-21 11:19:36 -07:00
Jingning Han	5c9d315572	Rework the DRL syntax entropy coding system This commit re-designs the probability model for the syntax elements of the dynamic motion vector referencing system. Change-Id: Icfb8203c7e8f64e10e99f5890e25e6f6b15fe5d1	2016-03-21 09:52:33 -07:00
Debargha Mukherjee	f34deab243	Adds compound wedge prediction modes Incorporates wedge compound prediction modes. Change-Id: Ie73b54b629105b9dcc5f3763be87f35b09ad2ec7	2016-03-10 07:19:54 -08:00
Jingning Han	a8dc9694a4	Hybrid 1-D/2-D transform coding This commit enables a hybrid 1-D/2-D transform coding scheme and the accompany entropy coding system. It currently uses hybrid 1-D/2-D DCT transform coding. It provides coding performance gains: lowres_all 0.55% hdres_all 0.43% Change-Id: I2b30dcafd21eb2bb3371f6e854cbab440a4dfa78	2016-03-07 09:27:46 -08:00
Debargha Mukherjee	3287f5519e	Merge "Hooks to use 32x32 masked transforms for ext-tx" into nextgenv2	2016-02-26 20:54:37 +00:00
Debargha Mukherjee	da2d4a7afc	Hooks to use 32x32 masked transforms for ext-tx Adds hooks to use 32x32 ext-tx. Also adds scan orders for the masked transforms for 32x32. Make macro USE_MSKTX_FOR_32X32 1 in blockd.h to support 32x32 masked transforms for ext-tx. Change-Id: Ie6564830266651fcafae2d536c274dafd664ce17	2016-02-24 13:08:37 -08:00
Jingning Han	47bc2a5741	Enable context based motion vector entropy coding This commit enables a context based motion vector entropy coding conditioned on dynamic reference motion vector list. This (along with the previous CL) imporves the coding gains due to dynamic motion vector referencing based entropy coding: derf 0.1% hevcmr 0.2% stdhd 0.7% hevchr 0.4% No encoding time change was observed. Change-Id: I179c723844079195f6952a12582996a3ca9e9914	2016-02-24 09:02:32 -08:00
Jingning Han	df59bb8986	Vectorize motion vector probability models This commit converts the scalar motion vector probability model into vector format for later precise estimate. Change-Id: I7008d047ecc1b9577aa8442b4db2df312be869dc	2016-02-19 16:20:41 -08:00
Debargha Mukherjee	1badceada8	Code cleanup: remove redundant DST1 code Removes the USE_DST2 flag that was on by default. DST2 performs slightly better that DST1 and is faster to compute. Change-Id: Ifb788f3f0a0e1995d7625230cec144b876f01206	2016-02-16 10:36:02 -08:00
Jingning Han	4958987b2a	Entropy coding for dynamic ref mv modes This commit enables entropy coding for dynamic reference motion vector modes. The probability model is contexted on the ranking categories of the reference motion vector candidates. Change-Id: I09b58d98a409d63ec1a407331e29f8945b7ef17d	2016-02-08 17:05:24 -08:00
Yue Chen	968bbc7bb2	Adding new compound modes to EXT_INTER experiment Combinations of different mv modes for two reference frames are allowed in compound inter modes. 9 options are enabled, including NEAREST_NEARESTMV, NEAREST_NEARMV, NEAR_NEARESTMV, NEAREST_NEWMV, NEW_NEARESTMV, NEAR_NEWMV, NEW_NEARMV, ZERO_ZEROMV, and NEW_NEWMV. This experiment is mostly deported from the nextgen branch. It is made compatible with other experiments Coding gain of EXT_INTER(derflr/hevcmr/hevchd): 0.533%/0.728%/0.639% Change-Id: Id47e97284e6481b186870afbad33204b7a33dbb0	2016-01-22 13:52:16 -08:00
Yue Chen	1ac858794a	EXT_INTER experiment NEW2MV is enabled, representing a new motion vector predicted from NEARMV. It is mostly ported from nextgen, where it was named NEW_INTER. A few fixes are done for sub8x8 RDO to correct some misused mv references in the original patch. A 'bug-fix' for encoding complexity is done, reducing the additional encoding time from 50% to 20%. In sub8x8 case, the old patch did motion search for every interpolation filter (vp9 only searches once). This fix also slightly improves the coding gain. This experiment has been made compatible with REF_MV and EXT_REFS. Coding gain (derflr/hevcmr/hevchd): 0.267%/0.542%/0.257% Change-Id: I9a94c5f292e7454492a877f65072e8aedba087d4	2016-01-15 14:47:02 -08:00
Yaowu Xu	0367f32ea8	Merge branch 'master' into nextgenv2 Manually resovled the following conflicts: vp10/common/blockd.h vp10/common/entropy.h vp10/common/entropymode.c vp10/common/entropymode.h vp10/common/enums.h vp10/common/thread_common.c vp10/decoder/decodeframe.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encodeframe.c vp10/encoder/rd.c vp10/encoder/rdopt.c Change-Id: I15d20ce5292b70f0c2b4ba55c1f1318181481596	2016-01-13 13:18:06 -08:00
Debargha Mukherjee	a0900fd0db	Remove experimental flag for ext_tx Also includes a bug fix. Change-Id: Ia49ed00f8ffd1531c10bcf89b1f497310ee7cb82	2016-01-08 13:48:24 -08:00
Debargha Mukherjee	f7dfa4ece7	Modifies inter/intra coding to allow all tx types The nominal tx_type for a given mode is used as a context to encode the actual tx_type for intra. Results: derflr: -0.241% BDRATE hevcmr: -0.366% BDRATE Change-Id: Icfe7b0a58d79bc6497a06e3441779afec6e01e21	2016-01-08 11:13:46 -08:00
Jingning Han	387a10e3dc	Enable context analyzer for inter mode entropy coding It allows the codec to account for certain corner cases when processing inter prediction mode entropy coding. Change-Id: Ied451f4fff26ba579f6556554b8381ff2ccd0003	2016-01-08 10:27:27 -08:00
Debargha Mukherjee	3787b17439	Super transform - ported from nextgen branch Various additional changes were made to make the experiment compatible with misc_fixes. derflr: +0.979% hevcmr: +0.865% Speed-wise with --enable-supertx the encoder is only about 10% slower than without. Decoding impact is about 30% slowdown. Note this does not work with ext-tx or var-tx yet. That is a TODO. Change-Id: If25af4241a7a9efbd28f58eda3c4f044c7a7ef4b	2016-01-04 22:12:57 -08:00
Debargha Mukherjee	8b9efaa161	Merge "Replace DST1 in ext_tx experiment with DST2" into nextgenv2	2015-12-16 23:47:28 +00:00
Debargha Mukherjee	49d9730f60	Replace DST1 in ext_tx experiment with DST2 The DST2 is implemented by input alternate sign-flip, followed by DCT, followed by output reversal. Results are roughly the same, but it should be easier to optimize the DST2. [Interestingly a mtrix multuiply implementation is about 0.1% better]. Change-Id: If9ae5fdba87767fb0e6c163a62b77ee66a8d3afc	2015-12-15 11:30:48 -08:00
Jingning Han	aa5d53eb17	Enable adaptive prediction mode coding This commit allows the codec to analyze the reference motion vector candidate list and adaptively reduce the size of inter prediction mode set. Change-Id: Ied6a403843b860d66f26ed485c1825c05c71bdfc	2015-12-10 09:02:32 -08:00
Jingning Han	0d65cae638	Allow precise classification for refmv mode context Combine the nearest ref mv count and the total ref mv count for mode context. Change-Id: I342a2b126bf7d2d30c344911260d9769a923026b	2015-12-10 02:03:32 +00:00
Jingning Han	1dc18077b8	Re-design motion compensated prediction mode entropy coding system This commit re-works the entropy coding scheme of the motion compensated prediction modes. It allows more flexible hyperplane partition for precise classification. Change-Id: Iba5035c76691946cf1386b6c495e399c3d9c8fc5	2015-12-09 18:02:20 -08:00
hui su	c93e5cc3e9	Bring palette back to nextgenv2 It was removed by the master branch merge. Change-Id: I4b2a524c9e052e41063359afcb4ba22bf78344cf	2015-12-07 18:24:15 -08:00
Yaowu Xu	69f4930041	Merge branch 'master' into nextgenv2 Conflicts: vp10/common/blockd.h vp10/common/entropymode.h vp10/common/reconintra.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encoder.h vp10/encoder/rd.c vp10/encoder/rdopt.c vp10/encoder/tokenize.h Change-Id: Ic4891839b6f0474026d6d69821e38edec9632df1	2015-12-07 11:37:14 -08:00
hui su	5d3327e891	Remove palette from VP10 Store it in nextgenv2 for now. Change-Id: Iab0af0e15246758e3b6e8bde4a74b13c410576fc	2015-12-03 12:30:47 -08:00
hui su	d7c8bc77c6	Speed up angle search in intra mode selection Estimate angle histogram using gradient analysis, then skip those angles that are unlikely to be chosen. On ext-intra experiment, turning off filter-intra modes: for all-key-frame setting, computation overhead is reduced by about 40%, coding gain dropped from +2.08% to +1.96% (derflr); with kf-max-dist=150, computation overhead is reduced by about 60%, coding gain dropped from +0.58% to +0.49% (derflr). Change-Id: I36687410fb10561b8e1a8eebb1528cf17755bd5b	2015-12-01 11:15:47 -08:00
Jingning Han	11bac096f2	Merge "Analyze motion field to produce reference motion vectors" into nextgenv2	2015-11-25 01:14:12 +00:00
Jingning Han	254d3e172a	Analyze motion field to produce reference motion vectors This commit allows the codec to analyze the motion field in the avaiable above and left neighboring area to produce a set of reference motion vectors for each reference frame. These reference motion vectors are ranked according to the likelihood that it will be picked. Change-Id: I82e6cd990a7716848bb7b6f5f2b1829966ff2483	2015-11-24 15:52:55 -08:00

1 2

67 Commits