generic-library/vpx

Author	SHA1	Message	Date
Debargha Mukherjee	8b0a5b8718	Adding loop wiener restoration Adds a wiener filter based restoration scheme in loop which can be optionally selected instead of the bilateral filter. The LMMSE filter generated per frame is a separable symmetric 7 tap filter. Three parameters for each of horizontal and vertical filters are transmitted in the bitstream. The fourth parameter is obtained assuming the sum is normalized to 1. Also integerizes the bilateral filters, along with other refactoring necessary in order to support the new switchable restoration type framework. derflr: -0.75% BDRATE [A lot of videos still prefer bilateral, however since many frames now use the simpler separable filter, the decoding speed is much better]. Further experiments to follow, related to replacing the bilateral. Change-Id: I6b1879983d50aab7ec5647340b6aef6b22299636	2016-02-12 09:56:24 -08:00
Yaowu Xu	1a69cb286f	Refactor internal stats code Also removed the use of postprocessing in computing internal stats. Change-Id: Ib8fdbdfe7b7ca05cd1a034a373aa7762fa44323c	2016-02-12 07:31:29 -08:00
Yaowu Xu	bb8ca08816	Enable computing PSNRHVS for hbd build This commit adds computation of PSNRHVS for highbitdepth build, it also adds tests to make sure the calculation of psnrhvs metric for 10 and 12 bit correct. Change-Id: Iac8a8073d2b3e3ba5d368829d770793212fa63b6	2016-02-11 13:17:59 -08:00
Yaowu Xu	c0874f2441	Enable computing of FastSSIM for HBD build This commit adds the computation of fastSSIM for highbitdepth build, it also modifies the hbdmetric test to be more generic and applicable for fastSSIM. The 255 used for calculating ssim constants c1 and c2 is not exactly scaled by 4x and 16x to 1023 and 4095, therefore requries the metric test to have a thresold more tolerant than 0, currently at 0.03dB. Change-Id: I631829da7773de400e77fc36004156e5e126c7e0	2016-02-10 17:11:58 -08:00
Yaowu Xu	bb5f9e431f	Fix a bug in HBD buffer size computation The value of use_highbitdepth flag is used for compute the size for high bit depth buffer allocation, which should take value 0 or 1 depending on if the buffer is used for high bit depth or not. Previously, the values is set to 8 or 0, this commit fixes the issue and properly set the value for this flag to 1 or 0. This cuts the size of highbitdepth buffer memory allocation to 2/9 of the size prior to the fix. Change-Id: I401518b5a6147e5d8a973e54f7ca6bc1892065e0	2016-02-08 18:52:08 -08:00
Yaowu Xu	090eaadf20	Change to use local variables consistently This commit does not change the computation, nor results. Change-Id: I1a7bb47050220d970f075458b507c5e55d93b22e	2016-02-08 11:38:04 -08:00
Yaowu Xu	204e77e059	Remove a flavor of SSIM that is never really used. Change-Id: I61ea7f63acbcfeecd3f7dba5a5a38b980efc802b	2016-02-08 11:22:08 -08:00
Debargha Mukherjee	f0a4485e54	Refactor to separate restoration from loop filter Change-Id: Iab517862d957f3aa2a664e9349d57bbf424febb3	2016-01-29 15:39:23 -08:00
Debargha Mukherjee	3eb10fcf21	Cosmetic changes to loop restoration Also adds a normalized filtering function to be used later. Change-Id: I30e2140e664db635602f26a73b81ce8e008dff5e	2016-01-27 17:33:36 -08:00
Debargha Mukherjee	eef57c1e99	Fixes ext-interp experiment Fixes integer pel MV usage for the sub8x8 case, which fixes a rare mismatch issue. Also adds some other minor missing code related to filter threshes. Change-Id: I6b07e6cf9b287ba4b5bd6599af4a7412e50b3bdc	2016-01-27 09:24:48 -08:00
Debargha Mukherjee	84ca7a9f0f	Loop restoration filter Current implementation is a bilateral filter whose parameters are transmitted in the bitstream. derflr: -0.647% BDRATE hevcmr: -0.794% BDRATE This is a prelimary patch. Various other variations are to be investigated next, that will hopefully be less expensive on the decoder side. Change-Id: I50634ae8f5014ad0bf7432306348908a349d81e1	2016-01-20 17:59:46 -08:00
Yaowu Xu	727ca802bf	Merge "Merge branch 'master' into nextgenv2" into nextgenv2	2016-01-14 00:26:45 +00:00
Yaowu Xu	0367f32ea8	Merge branch 'master' into nextgenv2 Manually resovled the following conflicts: vp10/common/blockd.h vp10/common/entropy.h vp10/common/entropymode.c vp10/common/entropymode.h vp10/common/enums.h vp10/common/thread_common.c vp10/decoder/decodeframe.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encodeframe.c vp10/encoder/rd.c vp10/encoder/rdopt.c Change-Id: I15d20ce5292b70f0c2b4ba55c1f1318181481596	2016-01-13 13:18:06 -08:00
Jingning Han	33cc1bd21d	Generate compound reference motion vector This commit allows the codec to add motion vector pairs into the candidate list. It further improves the compression performance by 0.1% across derf, hevcmr, stdhd, and hevchr sets without adding encode/decode time. Change-Id: I88d36da25a2a89bb506d411844af667081eba98b	2016-01-12 15:28:47 -08:00
Debargha Mukherjee	f7dfa4ece7	Modifies inter/intra coding to allow all tx types The nominal tx_type for a given mode is used as a context to encode the actual tx_type for intra. Results: derflr: -0.241% BDRATE hevcmr: -0.366% BDRATE Change-Id: Icfe7b0a58d79bc6497a06e3441779afec6e01e21	2016-01-08 11:13:46 -08:00
Jingning Han	387a10e3dc	Enable context analyzer for inter mode entropy coding It allows the codec to account for certain corner cases when processing inter prediction mode entropy coding. Change-Id: Ied451f4fff26ba579f6556554b8381ff2ccd0003	2016-01-08 10:27:27 -08:00
Zoe Liu	9581f3d49a	Replaced a hard-coded value with the macro Change-Id: I2aec63d8a600e319d037b764b0609092bce1e483	2015-12-30 17:16:51 -08:00
Zoe Liu	ec36a2b061	Restore the flexibility for the new 3 references For the experiment of EXT_REFS, removed the previous special handling on the new last 3 references, i.e. LAST2_FRAME, LAST3_FRAME, and LAST4_FRAME, at the decoder, so that these new last references are treated the same way as the other 3 references (LAST_FRAME, GOLDEN_FRAME, and ALTREF_FRAME). Encoder changes have been made accordingly to realize this flexibility. Change-Id: Ic6546f9443b4377bb7e7b101bfa3e70a8b8d1c65	2015-12-17 16:34:02 -08:00
Yaowu Xu	dab7515aa4	Merge branch 'master' into nextgenv2 With a few manual fixes of merge conflicts. Change-Id: I0dd65ff90f9fa8606e5563f528659e2607b12376	2015-12-16 09:00:57 -08:00
paulwilkins	99309004bf	Fixed interval, fixed Q 1 pass test patch. For testing implemented a fixed pattern and delta, 1 pass, fixed Q, low delay mode. This has not in any way been tuned or optimized. Change-Id: Icf9b57c3bb16cc5c0726d5229009212af36eb6d9	2015-12-15 15:33:25 +00:00
Yaowu Xu	f07d73b9bf	Merge branch 'master' into nextgenv2 Change-Id: Id0b784b115602e2502b42fa972a5ae210435a3be	2015-12-11 08:58:40 -08:00
paulwilkins	4e692bbee2	Changes to exhaustive motion search. This change has been imported from VP9 and alters the nature and use of exhaustive motion search. Firstly any exhaustive search is preceded by a normal step search. The exhaustive search is only carried out if the distortion resulting from the step search is above a threshold value. Secondly the simple +/- 64 exhaustive search is replaced by a multi stage mesh based search where each stage has a range and step/interval size. Subsequent stages use the best position from the previous stage as the center of the search but use a reduced range and interval size. For example: stage 1: Range +/- 64 interval 4 stage 2: Range +/- 32 interval 2 stage 3: Range +/- 15 interval 1 This process, especially when it follows on from a normal step search, has shown itself to be almost as effective as a full range exhaustive search with step 1 but greatly lowers the computational complexity such that it can be used in some cases for speeds 0-2. This patch also removes a double exhaustive search for sub 8x8 blocks which also contained a bug (the two searches used different distortion metrics). For best quality in my test animation sequence this patch has almost no impact on quality but improves encode speed by more than 5X. Restricted use in good quality speeds 0-2 yields significant quality gains on the animation test of 0.2 - 0.5 db with only a small impact on encode speed. On most natural video clips, however, where the step search is performing well, the quality gain and speed impact are small. Change-Id: Iac24152ae239f42a246f39ee5f00fe62d193cb98	2015-12-08 16:54:42 +00:00
hui su	c93e5cc3e9	Bring palette back to nextgenv2 It was removed by the master branch merge. Change-Id: I4b2a524c9e052e41063359afcb4ba22bf78344cf	2015-12-07 18:24:15 -08:00
Yaowu Xu	69f4930041	Merge branch 'master' into nextgenv2 Conflicts: vp10/common/blockd.h vp10/common/entropymode.h vp10/common/reconintra.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encoder.h vp10/encoder/rd.c vp10/encoder/rdopt.c vp10/encoder/tokenize.h Change-Id: Ic4891839b6f0474026d6d69821e38edec9632df1	2015-12-07 11:37:14 -08:00
hui su	5d3327e891	Remove palette from VP10 Store it in nextgenv2 for now. Change-Id: Iab0af0e15246758e3b6e8bde4a74b13c410576fc	2015-12-03 12:30:47 -08:00
Zoe Liu	3ec1601e37	Added 3 more reference frames for inter prediction. Under the experiment of EXT_REFS: LAST2_FRAME, LAST3_FRAME, and LAST4_FRAME. Coding efficiency: derflr +1.601%; hevchr +1.895% Speed: Encoder slowed down by ~75% Change-Id: Ifeee5f049c2c1f7cb29bc897622ef88897082ecf	2015-11-20 17:00:24 -08:00
hui su	66f2f65ef7	Merge MISC_FIXES Remove MISC_FIXES flags except for the changes on MV precision, which has a 0.1% performance drop. On derflr, the impact is -0.012%. Change-Id: I0a74e5a212dd0cb827192a318c92a714c9681e45	2015-11-17 15:06:08 -08:00
Debargha Mukherjee	85514c40ae	New interpolation experiment Adds a new interpolation experiment. Improves entropy coding to send the filter type only if the motion vectors have subpel components. Adds one new 8-tap smooth filter, and tweaks the others. derflr: +0.695% hevcmr: +0.305% About 5% encode slowdown. No visible impact for decoding. Also makes the interpolation framework flexible to support both strictly interpolating filters as well as non-interpolating filters that filter integer offsets. This is mainly for further experimentation and if not found useful the code will be removed. Change-Id: I8db9cde56ca916be771fe54a130d608bf10786e6	2015-11-06 09:51:34 -08:00
Jingning Han	6727943ceb	Refactor loop filter mask This commit refactors the loop filter selection process to support variable transform block sizes based filter mask. It disables the multi-thread loop filter implementation to simplify the experiments. The speed impact on speed 0 encoding is negligible. Change-Id: Ia470b6da9ad833fe6eb72d2cbeda9296b21910ec	2015-10-30 15:25:16 -07:00
Yaowu Xu	4ac2ae3a4d	Merge branch 'masterbase' into nextgenv2 Conflicts: configure test/vp9_encoder_parms_get_to_decoder.cc vp10/common/blockd.h vp10/common/entropymode.c vp10/common/entropymode.h vp10/common/idct.c vp10/decoder/decodeframe.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encodeframe.c vp10/encoder/encodemb.c vp10/encoder/encoder.c vp10/encoder/encoder.h vp10/encoder/rd.c vp10/encoder/rdopt.c vp10/encoder/tokenize.c vp10/encoder/tokenize.h vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_aq_cyclicrefresh.c vp9/encoder/vp9_encoder.h vp9/vp9_cx_iface.c vpx/vp8cx.h vpx_dsp/x86/vpx_subpixel_8t_intrin_ssse3.c vpx_scale/yv12config.h Change-Id: I604a329d38badec7a11e8ede16ca1404476e9b93	2015-10-22 11:40:44 -07:00
Ronald S. Bultje	60c58b5284	vp10: per-segment lossless coding. Some more testing of this patch would probably be useful, but I think the basics of it should work fine now. See issue 1035. Change-Id: I4a36d58f671c5391cb09d564581784a00ed26245	2015-10-16 19:30:39 -04:00
Ronald S. Bultje	6e5a1165be	vp10: make segmentation probs use generic probability model. Locate them (code-wise) in frame_context, and have them be updated as any other probability using the subexp forward and adaptive bw updates. See issue 1040 point 1. TODOs: - real-world default probabilities - why is counts sometimes NULL in the decoder? Does that mean bw adaptivity updates only work on some frames? (I haven't looked very closely yet, maybe this is a red herring.) Change-Id: I23b57b4e5e7574b75f16eb64823b29c22fbab42e	2015-10-16 19:30:38 -04:00
hui su	aaf6f6215f	Fix palette mode in multi-thread encoding setting Fix a couple of memory related errors. Also fix thread test failures. Change-Id: I0103995f832cecf1dd2380000321ac7204f0cfc0	2015-10-15 15:00:57 -07:00
Hui Su	b9e31b5163	Merge "VP10: Add palette mode part 1"	2015-10-13 17:34:27 +00:00
Ronald S. Bultje	5f589826f3	vp10: allow bw adaptivity for skip/tx probabilities in keyframes. See issue 1040 point 3. Change-Id: Ieef6d326b7fb50ceca5936525b7c688225a11fd1	2015-10-12 17:51:01 -04:00
hui su	5d011cb278	VP10: Add palette mode part 1 Add palette mode for keyframe luma channel. Palette mode is enabled when using "--tune-content=screen" in encoding config parameters. on screen_content testset: +6.89% on derlr : +0.00% Design doc (WIP): https://goo.gl/lD4yJw Change-Id: Ib368b216bfd3ea21c6c27436934ad87afdaa6f88	2015-10-12 10:02:17 -07:00
Yaowu Xu	7c514e2dfd	Merged branch 'master' into nextgenv2 Resolved Conflicts in the following files: configure vp10/common/idct.c vp10/encoder/dct.c vp10/encoder/encodemb.c vp10/encoder/rdopt.c Change-Id: I4cb3986b0b80de65c722ca29d53a0a57f5a94316	2015-09-29 16:17:32 -07:00
Ronald S. Bultje	cc5dd3ec10	Merge "vp9/10: improve support for render_width/height."	2015-09-28 16:25:28 +00:00
Ronald S. Bultje	3db5721e21	Merge "Rename display_{size,width,height} to render_*."	2015-09-28 16:25:20 +00:00
Ronald S. Bultje	812945a8f1	vp9/10: improve support for render_width/height. In the decoder, map this to the output variable vpx_image_t.r_w/h. This is intended as an improved version of VP9D_GET_DISPLAY_SIZE, which doesn't work with parallel frame decoding. In the encoder, map this to a codec control func (VP9E_SET_RENDER_SIZE) that takes a w/h pair argument in a int[2] (identical to VP9D_GET_DISPLAY_SIZE). Also add render_size to the encoder_param_get_to_decoder unit test. See issue 1030. Change-Id: I12124c13602d832bf4c44090db08c1009c94c7e8	2015-09-25 22:18:22 -04:00
James Zern	db2056f341	Merge "vp9/10 encoder: prevent NULL access on failure"	2015-09-26 01:52:52 +00:00
Ronald S. Bultje	36ffe64498	Rename display_{size,width,height} to render_. The name "display_" (or "d_") is used for non-compatible information (that is, the cropped frame dimensions in pixels, as opposed to the intended screen rendering surface size). Therefore, continuing to use display_ would be confusing to end users. Instead, rename the field to render_*, so that struct vpx_image can include it. Change-Id: Iab8d2eae96492b71c4ea60c4bce8121cb2a1fe2d	2015-09-25 21:34:29 -04:00
Ronald S. Bultje	bab8d38f7f	vp10: remove MACROBLOCK.{highbd_,}itxfm_add function pointer. This is preparatory work for allowing per-segment lossless coding. See issue 1035. Change-Id: I9487d02717ee3e766aee61a487780056bb35d2d3	2015-09-25 19:30:46 -04:00
Ronald S. Bultje	c74b33a413	vp10: remove MACROBLOCK.fwd_txm4x4 function pointer. This is preparatory work for allowing per-segment lossless coding. See issue 1035. Change-Id: Idd72e2a42d90fa7319c10122032d1a7c7a54dc05	2015-09-25 19:30:46 -04:00
James Zern	cf8f6559ce	vp9/10 encoder: prevent NULL access on failure Change-Id: I1fc8e0b3d48675cd5428b7b36f7cc28ab32cbf71	2015-09-23 17:55:51 -07:00
Debargha Mukherjee	09ff5f2792	Merge remote-tracking branch 'origin/master' into nextgenv2 Periodic merge to get master changes into nextgenv2. Change-Id: I6f0e4b470f193da03f1a8cb8e6a93ae39395699a	2015-09-17 16:33:18 -07:00
Jingning Han	c3bf837572	Refactor mbmi_ext structure This commit removes mbmi_ext_base pointer from MACROBLOCK struct. Its use case can be fully covered by cpi->mbmi_ext_base pointer. Change-Id: I155351609336cf5b6145ed13c21b105052727f30	2015-09-17 09:51:45 -07:00
Ronald S. Bultje	eeb5ef0a24	Add support for color-range. In decoder, export (eventually) into vpx_image_t.range field. In encoder, use oxcf->color_range to set it (same way as for color_space). See issue 1059. Change-Id: Ieabbb2a785fa58cc4044bd54eee66f328f3906ce	2015-09-16 06:41:46 -04:00
James Zern	c667593e1e	Merge changes from topic 'fix-vp9-bitstream-test' * changes: vp9_encoder_parms_get_to_decoder: cosmetics vp9...parms_get_to_decoder: remove unneeded func vp9...parms_get_to_decoder: fix EXPECT param order vp9_encoder_parms_get_to_decoder: delete dead code fix BitstreamParms test vp9_encoder_parms_get_to_decoder: remove vp10 yuvconfig2image(): add explicit cast to avoid conv warning vp9/10 decoder_init: add missing alloc cast vp9/10: set color_space on preview frame vp10: add extern "C" to headers vp9: add extern "C" to headers	2015-09-15 23:14:34 +00:00
Ronald S. Bultje	d1474f02aa	vp10: merge frame_parallel_decoding_mode and refresh_frame_context. See issue 1030. The value of frame_parallel_decoding_mode was ignored in vp9 if refresh_frame_context was 0, so instead make it a 3-member enum where the dependency is obviously stated. Change-Id: I37f0177e5759f54e2e6cc6217023d5681de92438	2015-09-11 19:33:46 -04:00

1 2

68 Commits