generic-library/vpx

Author	SHA1	Message	Date
Debargha Mukherjee	81f8b3f31c	Merge "Some refactoring to support warped motion mode" into nextgenv2	2016-06-10 23:18:39 +00:00
Debargha Mukherjee	03be30ba3e	Some refactoring to support warped motion mode Change-Id: I15d54a3ae48b2b33082668116792c6595bdb3ddb	2016-06-10 12:04:18 -07:00
Sarah Parker	a21afd421b	Move new quant experiment from nextgen This experiment implements non-uniform quantization where the width of the bins increases gradually to more closely match a laplacian distribution of the coeficcients. Performance Gain: derflr: 0.15% hevcmr: 0.675% Change-Id: I25234244e3bcd94b87c1f77cf682190b61c8ef94	2016-06-10 08:06:22 -07:00
Zoe Liu	e89ca180c2	Make the bi-predictive frame group interval adjustable This is for the bidir-pred experiment. Previously the length of the bi-predictive frame group interval is fixed at 2, i.e. one bi-predictive frame may be inserted every other frame. This patch makes the length adjustable, i.e. any positive number may be specified, but the use of the backward ref will be turned off if the bi-predictive frame group interval is larger than the golden frame group. Further, an additional rate factor level has been added: INTER_LOW , which applies to LAST_BIPRED_UPDATE frames that are not used as references. Change-Id: I5514d34a64dd486bbb5756c2d0612946f598a789	2016-05-28 16:46:45 -07:00
Zoe Liu	cf5083d4cd	Added an experiment "bidir_pred" for backward prediction Major parts have been implemented as follows: (1) Added BRF_UPDATE, LASTNRF_UPDATE, and NRF_UPDATE in firstpass.c; (2) Added the handling for the scenario of "cpi->common.show_existing_frame == 1" at the encoder; (3) Added a new reference frame of BWDREF_FRAME; (4) Have bwd-ref work with upsampled references. Note that when the experiment of "ext_refs" turned on, this experiment will be turned off automatically currently. RD performance in Overall PSNR has been improved, compared against the VP10 baseline: lowres: Avg -3.312; BDRate -3.154 derflr: Avg -1.927; BDRate -1.176 midres: Avg -2.149; BDRate -2.001 hdres : Avg -0.567; BDRate -0.588 Change-Id: I4c06ff51cc20194bffbd4d2346e57ba3dcf6b62c	2016-05-24 13:55:57 -07:00
Zoe Liu	a63147ae77	Fix --test-decode=warn to test mismatch This patch always compares the most recent show frames between the encoder and the decoder to test the mismatch. Change-Id: I68a91ad0996a598231450debfd616e24992419b5	2016-05-23 17:01:53 -07:00
Zoe Liu	011f020447	Refactor on getting upsampled reference frame Reused a function that has been used in getting the normal reference frames. Change-Id: Ic4f7dac5c396d689a72699ab79fd580747f8bd65	2016-05-18 16:00:23 -07:00
Yaowu Xu	102cdf94ed	Move count buffers from stack to heap This fixes the stack overflow issue on MSVC build. Change-Id: Icb0a78e5992a097d2192979ec2432546eaa452dd	2016-05-10 14:49:26 -07:00
Zoe Liu	a912c6ec31	Make LAST_FRAME always point to the newly coded frame in ext-refs This patch changes the encoder only for the ext-refs experiment. For each newly coded frame to refresh the LAST_FRAME, the decoder is notified that the LAST4_FRAME is to be refreshed, and read out the updated reference frame buffer vitural indexes for the next coded frame in a way that: LAST4_FRAME => LAST_FRAME, LAST_FRAME => LAST2_FRAME, LAST2_FRAME => LAST3_FRAME, and LAST3_FRAME => LAST4_FRAME. Compared against the original ext-refs experiment in TOT, a small gain is achieved in overall PSNR: lowres Avg: -0.154 lowres BDRate: -0.044 Change-Id: I648810c146a3cd915b408274a9373b7d38324864	2016-05-07 00:27:51 -07:00
Alex Converse	5d2b0f93b9	Use an exponential growth approach for the ANS reversal buffer. Memory constrained hardware can window the data via our standard windowing mechanism, tiles. Change-Id: Ib1cfd157604a8c9d9f9a9f2b0ba3bc2fd0643082	2016-04-13 15:16:29 -07:00
Hui Su	9e8cad3be7	Merge "Add vp10_ prefix to full_to_model_counts and fill_token_costs" into nextgenv2	2016-04-12 23:38:47 +00:00
hui su	0792748646	Add vp10_ prefix to full_to_model_counts and fill_token_costs Change-Id: I5e6c644fb09f7a80c88142dfdfa05cf5be260241	2016-04-12 11:06:47 -07:00
Geza Lore	61af8981b0	Extend variance based partitioning to 128x128 superblocks Change-Id: I41edf266d5540a9b070a5e65bc397dd3da210507	2016-04-12 09:40:11 +01:00
Geza Lore	f2be4f6058	Refactor PC_TREE root handling. Change-Id: Id8b16c1b18bd6f909e72aae3fd582dd3503c88c6	2016-04-08 17:01:00 +01:00
Geza Lore	454989ff32	Make superblock size variable at the frame level. The uncompressed frame header contains a bit to signal whether the frame is encoded using 64x64 or 128x128 superblocks. This can vary between any 2 frames. vpxenc gained the --sb-size={64,128,dynamic} option, which allows the configuration of the superblock size used (default is dynamic). 64/128 will force the encoder to always use the specified superblock size. Dynamic would enable the encoder to choose the sb size for each frame, but this is not implemented yet (dynamic does the same as 128 for now). Constraints on tile sizes depend on the superblock size, the following is a summary of the current bitstream syntax and semantics: If both --enable-ext-tile is OFF and --enable-ext-partition is OFF: The tile coding in this case is the same as VP9. In particular, tiles have a minimum width of 256 pixels and a maximum width of 4096 pixels. The tile width must be multiples of 64 pixels (except for the rightmost tile column). There can be a maximum of 64 tile columns and 4 tile rows. If --enable-ext-tile is OFF and --enable-ext-partition is ON: Same constraints as above, except that tile width must be multiples of 128 pixels (except for the rightmost tile column). There is no change in the bitstream syntax used for coding the tile configuration if --enable-ext-tile is OFF. If --enable-ext-tile is ON and --enable-ext-partition is ON: This is the new large scale tile coding configuration. The minimum/maximum tile width and height are 64/4096 pixels. Tile width and height must be multiples of 64 pixels. The uncompressed header contains two 6 bit fields that hold the tile width/heigh in units of 64 pixels. The maximum number of tile rows/columns is only limited by the maximum frame size of 65536x65536 pixels that can be coded in the bitstream. This yields a maximum of 1024x1024 tile rows and columns (of 64x64 tiles in a 65536x65536 frame). If both --enable-ext-tile is ON and --enable-ext-partition is ON: Same applies as above, except that in the bitstream the 2 fields containing the tile width/height are in units of the superblock size, and the superblock size itself is also coded in the bitstream. If the uncompressed header signals the use of 64x64 superblocks, then the tile width/height fields are 6 bits wide and are in units of 64 pixels. If the uncompressed header signals the use of 128x128 superblocks, then the tile width/height fields are 5 bits wide and are in units of 128 pixels. The above is a summary of the bitstream. The user interface to vpxenc (and the equivalent encoder API) behaves a follows: If --enable-ext-tile is OFF: No change in the user interface. --tile-columns and --tile-rows specify the base 2 logarithm of the desired number of tile columns and tile rows. The actual number of tile rows and tile columns, and the particular tile width and tile height are computed by the codec ensuring all of the above constraints are respected. If --enable-ext-tile is ON, but --enable-ext-partition is OFF: No change in the user interface. --tile-columns and --tile-rows specify the WIDTH and HEIGHT of the tiles in unit of 64 pixels. The valid values are in the range [1, 64] (which corresponds to [64, 4096] pixels in increments of 64. If both --enable-ext-tile is ON and --enable-ext-partition is ON: If --sb-size=64 (default): The user interface is the same as in the previous point. --tile-columns and --tile-rows specify tile WIDTH and HEIGHT, in units of 64 pixels, in the range [1, 64] (which corresponds to [64, 4096] pixels in increments of 64). If --sb-size=128 or --sb-size=dynamic: --tile-columns and --tile-rows specify tile WIDTH and HEIGHT, in units of 128 pixels in the range [1, 32] (which corresponds to [128, 4096] pixels in increments of 128). Change-Id: Idc9beee1ad12ff1634e83671985d14c680f9179a	2016-04-07 10:34:25 +01:00
Debargha Mukherjee	2be211e971	Fix interpolation values and decouple interintra Decouples interintra modes and probability models from regular intra modes, to enable creating/optimizing new interintra modes. Also, fixes interpolation values for 128x128 interintra and obmc. Change-Id: I5c2016db49b8f029164e5fe84c6274d4e02ff90e	2016-03-31 12:12:51 -07:00
Geza Lore	552d5cd715	Extend superblock size fo 128x128 pixels. If --enable-ext-partition is used at build time, the superblock size (sometimes also referred to as coding unit (CU) size) is extended to 128x128 pixels. Change-Id: Ie09cec6b7e8d765b7555ff5d80974aab60803f3a	2016-03-30 18:23:06 +01:00
Geza Lore	490ba1ad25	Port large scale tile coding features from nextgen. If configured with --enable-ext-tile, the codec uses an alternative tile coding syntax in the bitstream. Changes include:: - The maximum number of tile rows and columns is extended to 1024 each. - The minimum tile width/height is 64 pixels (1 superblock). - A tile copy mode is added where a tile directly reuse the coded data of a previous tile - The meaning of the tile-columns and tile-rows codec parameters are overloaded to mean tile-width and tile-height in units of 64 pixels. - All tiles should now be independent, including rows within the same columns, so large scale parallel, or independent decoding is possible. - vpxdec also gained the options to decode only a particular tile, tile row, or tile column. Changes without --enable-ext-tile: - All tiles should now be independent, including rows within the same columns, so large scale parallel, or independent decoding is possible. - vpxenc default tile configuration changed to use 1 tile column. Change-Id: I0cd08ad550967ac18622dae5e98ad23d581cb33e	2016-03-24 09:26:05 +00:00
Hui Su	daf2fb42e6	Merge "Add "entropy" experiment" into nextgenv2	2016-03-23 17:50:57 +00:00
Debargha Mukherjee	7a3bae768e	Merge "Porting ext_partition experiment from nextgen" into nextgenv2	2016-03-23 04:58:38 +00:00
Julia Robson	5cce322a09	Porting ext_partition experiment from nextgen This has been ported under ext_partition_types because it is due to be combined with the coding_unit_size experiment which is already being ported under ext_partition Change-Id: I47af869ae123ddf0aa99160dac644059d14266ee	2016-03-22 12:29:01 -07:00
Jingning Han	5c9d315572	Rework the DRL syntax entropy coding system This commit re-designs the probability model for the syntax elements of the dynamic motion vector referencing system. Change-Id: Icfb8203c7e8f64e10e99f5890e25e6f6b15fe5d1	2016-03-21 09:52:33 -07:00
hui su	83b47af18d	Add "entropy" experiment This patch added two features to improve entropy coding efficiency for coefficient tokens. 1. Choose 1 of 4 default probability tables based on q-index for key-frames. It is ported from nextgen branch: https://chromium-review.googlesource.com/#/c/280586/ 2. Do backward update after each superblock (64X64) row using subframe token counts. Coding gain: 0.1% on lowres; 0.42% on midres; 0.36% on hdres. Much larger gain for key-frames: 2.6%, 2.3%, 1.7%. Design doc: go/huisu-entropy Change-Id: Ia3b6a615636be09247d70e4c520405637561532b	2016-03-16 11:55:50 -07:00
Angie Chiang	b6fef12481	remove filter_cache PSNR test lowres hdres lowbd -0.013 0.067 highbd -0.044 0.039 Change-Id: Iefdb1e966bd004b2027456778185b675e8fb9b81	2016-03-15 14:40:59 -07:00
Yunqing Wang	e6e2d886d3	Add high-precision sub-pixel search as a speed feature Using the up-sampled reference frames in sub-pixel motion search is enabled as a speed feature for good-quality mode speed 0 and speed 1. Change-Id: Ieb454bf8c646ddb99e87bd64c8e74dbd78d84a50	2016-03-11 16:32:11 -08:00
Debargha Mukherjee	ce4b35d510	Merge "Adds compound wedge prediction modes" into nextgenv2	2016-03-10 17:44:45 +00:00
Debargha Mukherjee	f34deab243	Adds compound wedge prediction modes Incorporates wedge compound prediction modes. Change-Id: Ie73b54b629105b9dcc5f3763be87f35b09ad2ec7	2016-03-10 07:19:54 -08:00
hui su	954e560f9e	Refactor entropy coding of transform size No performance change. Change-Id: If35125fed909d89235b303514f77a33183bb36b3	2016-03-08 16:46:00 -08:00
Yunqing Wang	84f982080a	Minor fix in header files Move functions to be included in extern "C". Change-Id: If57fa5eb7955763cf99e6839dde4d7221fad75ea	2016-03-01 13:16:03 -08:00
Yunqing Wang	342a368fd4	Do sub-pixel motion search in up-sampled reference frames Up-sampled the reference frames to 8 times in each dimension using the 8-tap interpolation filter. In sub-pixel motion search, use the up-sampled reference frames to find the best matching blocks. This largely improved the motion search precision, and thus, improved the compression quality. There was no change in decoder side. Borg test and speed test results: 1. On derflr set, Overall PSNR gain: 1.306%, and SSIM gain: 1.512%. Average speed loss on derf set was 6.0%. 2. On stdhd set, Overall PSNR gain: 0.754%, and SSIM gain: 0.814%. On hevchd set, Overall PSNR gain: 0.465%, and SSIM gain: 0.527%. Speed loss on HD clips was 3.5%. Change-Id: I300ebaafff57e88914f3dedc8784cb21d316b04f	2016-02-29 12:14:47 -08:00
Jingning Han	a10814e11e	Merge "Account context based prob model for motion vector cost estimate" into nextgenv2	2016-02-22 17:37:42 +00:00
Jingning Han	03c01bc3c0	Account context based prob model for motion vector cost estimate This commit accounts for the context based probability model for motion vector cost estimate in rate-distortion optimization. Change-Id: Ia068a9395dcb4ecc348f128b17b8d24734660b83	2016-02-19 16:32:51 -08:00
Yaowu Xu	7823fbb45c	Merge "Move PSNR related functions into vpx_dsp/psnr.c" into nextgenv2	2016-02-18 01:00:54 +00:00
Yaowu Xu	7538501ad1	Move PSNR related functions into vpx_dsp/psnr.c This makes all metric computation to locate at some place, also gets rid of duplicate code between vp9 and vp10. Change-Id: I24a2707d183a2419cd18a8343010adae185ffcd4	2016-02-17 13:05:34 -08:00
Yue Chen	d1cad9c3f5	Overlapped block motion compensation experiment In this experiment, an obmc inter prediction mode is enabled for >= 8X8 inter blocks. When the obmc flag is on, the regular block- based motion compensation will be refined by using predictors of the above and left blocks. Fixed some compatibility issues with vp9_highbitdepth, supertx, ref_mv, and ext_interp. Coding gain (%) on derflr/hevcmr/hevchd OBMC: 1.047/1.022/0.708 OBMC + SUPERTX: 1.652/1.616/1.137 SUPERTX: 0.862/0.779/0.630 Change-Id: I5d8d3c4729c6d3ccb03ec7034563107893103b7f	2016-02-12 13:36:25 -08:00
Yaowu Xu	1a69cb286f	Refactor internal stats code Also removed the use of postprocessing in computing internal stats. Change-Id: Ib8fdbdfe7b7ca05cd1a034a373aa7762fa44323c	2016-02-12 07:31:29 -08:00
Jingning Han	4958987b2a	Entropy coding for dynamic ref mv modes This commit enables entropy coding for dynamic reference motion vector modes. The probability model is contexted on the ranking categories of the reference motion vector candidates. Change-Id: I09b58d98a409d63ec1a407331e29f8945b7ef17d	2016-02-08 17:05:24 -08:00
Yaowu Xu	204e77e059	Remove a flavor of SSIM that is never really used. Change-Id: I61ea7f63acbcfeecd3f7dba5a5a38b980efc802b	2016-02-08 11:22:08 -08:00
hui su	3b1c766802	Add 8-tap interpolation filter options for intra prediction BD-rate performance improvement (on top of ext-intra): derflr 0.22% hevclr 0.36% hevcmr 0.48% hevchr 0.37% stdhd 0.19% Average speed impact on some derf clips is about 40% slower (on top of ext-intra). Speed improvment is a to-do. Change-Id: I8fe3fe8c5e4f60d0462778adbcc15c84dfbe7a25	2016-02-03 14:19:20 -08:00
Yue Chen	968bbc7bb2	Adding new compound modes to EXT_INTER experiment Combinations of different mv modes for two reference frames are allowed in compound inter modes. 9 options are enabled, including NEAREST_NEARESTMV, NEAREST_NEARMV, NEAR_NEARESTMV, NEAREST_NEWMV, NEW_NEARESTMV, NEAR_NEWMV, NEW_NEARMV, ZERO_ZEROMV, and NEW_NEWMV. This experiment is mostly deported from the nextgen branch. It is made compatible with other experiments Coding gain of EXT_INTER(derflr/hevcmr/hevchd): 0.533%/0.728%/0.639% Change-Id: Id47e97284e6481b186870afbad33204b7a33dbb0	2016-01-22 13:52:16 -08:00
Debargha Mukherjee	84ca7a9f0f	Loop restoration filter Current implementation is a bilateral filter whose parameters are transmitted in the bitstream. derflr: -0.647% BDRATE hevcmr: -0.794% BDRATE This is a prelimary patch. Various other variations are to be investigated next, that will hopefully be less expensive on the decoder side. Change-Id: I50634ae8f5014ad0bf7432306348908a349d81e1	2016-01-20 17:59:46 -08:00
Yue Chen	1ac858794a	EXT_INTER experiment NEW2MV is enabled, representing a new motion vector predicted from NEARMV. It is mostly ported from nextgen, where it was named NEW_INTER. A few fixes are done for sub8x8 RDO to correct some misused mv references in the original patch. A 'bug-fix' for encoding complexity is done, reducing the additional encoding time from 50% to 20%. In sub8x8 case, the old patch did motion search for every interpolation filter (vp9 only searches once). This fix also slightly improves the coding gain. This experiment has been made compatible with REF_MV and EXT_REFS. Coding gain (derflr/hevcmr/hevchd): 0.267%/0.542%/0.257% Change-Id: I9a94c5f292e7454492a877f65072e8aedba087d4	2016-01-15 14:47:02 -08:00
Yaowu Xu	0367f32ea8	Merge branch 'master' into nextgenv2 Manually resovled the following conflicts: vp10/common/blockd.h vp10/common/entropy.h vp10/common/entropymode.c vp10/common/entropymode.h vp10/common/enums.h vp10/common/thread_common.c vp10/decoder/decodeframe.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encodeframe.c vp10/encoder/rd.c vp10/encoder/rdopt.c Change-Id: I15d20ce5292b70f0c2b4ba55c1f1318181481596	2016-01-13 13:18:06 -08:00
Debargha Mukherjee	a0900fd0db	Remove experimental flag for ext_tx Also includes a bug fix. Change-Id: Ia49ed00f8ffd1531c10bcf89b1f497310ee7cb82	2016-01-08 13:48:24 -08:00
Debargha Mukherjee	f7dfa4ece7	Modifies inter/intra coding to allow all tx types The nominal tx_type for a given mode is used as a context to encode the actual tx_type for intra. Results: derflr: -0.241% BDRATE hevcmr: -0.366% BDRATE Change-Id: Icfe7b0a58d79bc6497a06e3441779afec6e01e21	2016-01-08 11:13:46 -08:00
hui su	1559afda85	Use precise rate cost for intra modes in inter frames derflr +0.021% hevclr +0.207% hevcmr +0.035% stdhd +0.042% Change-Id: Ic750df93bcc0a261a66a9b19d939a5cd61a6b516	2016-01-04 08:35:29 -08:00
Zoe Liu	ec36a2b061	Restore the flexibility for the new 3 references For the experiment of EXT_REFS, removed the previous special handling on the new last 3 references, i.e. LAST2_FRAME, LAST3_FRAME, and LAST4_FRAME, at the decoder, so that these new last references are treated the same way as the other 3 references (LAST_FRAME, GOLDEN_FRAME, and ALTREF_FRAME). Encoder changes have been made accordingly to realize this flexibility. Change-Id: Ic6546f9443b4377bb7e7b101bfa3e70a8b8d1c65	2015-12-17 16:34:02 -08:00
Yaowu Xu	f07d73b9bf	Merge branch 'master' into nextgenv2 Change-Id: Id0b784b115602e2502b42fa972a5ae210435a3be	2015-12-11 08:58:40 -08:00
Jingning Han	1dc18077b8	Re-design motion compensated prediction mode entropy coding system This commit re-works the entropy coding scheme of the motion compensated prediction modes. It allows more flexible hyperplane partition for precise classification. Change-Id: Iba5035c76691946cf1386b6c495e399c3d9c8fc5	2015-12-09 18:02:20 -08:00
paulwilkins	4e692bbee2	Changes to exhaustive motion search. This change has been imported from VP9 and alters the nature and use of exhaustive motion search. Firstly any exhaustive search is preceded by a normal step search. The exhaustive search is only carried out if the distortion resulting from the step search is above a threshold value. Secondly the simple +/- 64 exhaustive search is replaced by a multi stage mesh based search where each stage has a range and step/interval size. Subsequent stages use the best position from the previous stage as the center of the search but use a reduced range and interval size. For example: stage 1: Range +/- 64 interval 4 stage 2: Range +/- 32 interval 2 stage 3: Range +/- 15 interval 1 This process, especially when it follows on from a normal step search, has shown itself to be almost as effective as a full range exhaustive search with step 1 but greatly lowers the computational complexity such that it can be used in some cases for speeds 0-2. This patch also removes a double exhaustive search for sub 8x8 blocks which also contained a bug (the two searches used different distortion metrics). For best quality in my test animation sequence this patch has almost no impact on quality but improves encode speed by more than 5X. Restricted use in good quality speeds 0-2 yields significant quality gains on the animation test of 0.2 - 0.5 db with only a small impact on encode speed. On most natural video clips, however, where the step search is performing well, the quality gain and speed impact are small. Change-Id: Iac24152ae239f42a246f39ee5f00fe62d193cb98	2015-12-08 16:54:42 +00:00

1 2

82 Commits