generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	1faf288798	Rework transform quantization pipeline This commit reworks the transform and quantization unit. It enables the use of adaptive quantization for intra modes. This further improves the compression performance: lowres 0.36% midres 0.79% hdres 0.73% The key frame coding performance is improved: lowres 1.7% midres 1.9% hdres 3.3% The overall coding gains are: lowres 1.1% midres 1.8% hdres 2.3% Change-Id: Iaec1a3a4c1d5eac883ab526ed076d957060479dd	2016-06-14 16:32:04 -07:00
Jingning Han	a9a8c5993b	Refactor the trellis optimization process Speed up the trellis optimization unit by 10%. Change-Id: If055f6c0589a405c008d2900bb8fbc11b1246f66	2016-06-13 12:19:57 -07:00
Jingning Han	25ca322957	Trellis based adaptive quantization This commit combines uniform quantizer with trellis based coefficient level optimization. It improves the codebase compression performance: lowres 0.8% midres 1.0% hdres 1.6% Note that the current trellis optimization unit is using C code. This will make the cost of the overall quantization process slower. A number of optimizations will come up next. Change-Id: Id441dd238e4844409d0f08f82604be777f3f5282	2016-06-10 12:56:14 -07:00
Sarah Parker	a21afd421b	Move new quant experiment from nextgen This experiment implements non-uniform quantization where the width of the bins increases gradually to more closely match a laplacian distribution of the coeficcients. Performance Gain: derflr: 0.15% hevcmr: 0.675% Change-Id: I25234244e3bcd94b87c1f77cf682190b61c8ef94	2016-06-10 08:06:22 -07:00
Jingning Han	025fa11c75	Take out skip_recode speed feature The assumption doesn't hold true in the current codebase. Remove this speed feature to simplify the codebase. Change-Id: I9b69f484c9b7cd612b825047cc5b2fce63ee0af7	2016-06-08 18:27:36 +00:00
Yaowu Xu	0d7dc0cae1	Change to use proper type in vp10_token_state "qc" in vp10_token_state is used to save quantized coefficients, this commit changes the type from short to tran_low_t to properly reflect the value range for highbitdepth build. This fixes an out-of-range bug when optimize_b is used in highbitdepth build. Change-Id: I914c6fd3d3f4b9d061f9ed7cc5f08a883ab59dcd	2016-05-04 11:59:10 -07:00
Alex Converse	8f2fa04181	Unbreak the non-var_tx build. Change-Id: I76cc3d88122de42f035fbf6508bdf3fd7c995012	2016-04-21 13:27:19 -07:00
Debargha Mukherjee	53968c3917	Merge "Fix uninitialized blk_skip for VAR TX." into nextgenv2	2016-04-21 19:56:17 +00:00
hui su	ad59b08f76	Adjust optimize_b RD parameters Coding gain: lowres 0.44% midres 0.24% hdres 0.32% Change-Id: Ie558203b2b2bf5c16cd49b114df3d696c4f35049	2016-04-19 09:54:08 -07:00
hui su	e43c21112d	Enable optimize_b for intra blocks Coding gain: lowres 0.05% midres 0.10% hdres 0.18% Change-Id: I508b150c02588f911a8ddddfe73c770f0819fe10	2016-04-19 09:50:45 -07:00
Geza Lore	7aa95be980	Fix uninitialized blk_skip for VAR TX. x->blk_skip used to be uninitialized (leftover from encoding the previous block), if cm->tx_mode != TX_MODE_SELECT (which is used with higher --cpu-used or --rt options). This resulted in degraded coding performance when using cm->tx_mode != TX_MODE_SELECT. This fixes the VP10/EndToEndTestLarge.EndtoEndPSNRTest/40 unit test. Also fixed an edge effect where encode_block in encodemb.c used the formal width of the block (without cropping at the right edge), to look up blk_skip, while select_tx_block in rdopt.c used the cropped width to set blk_skip. Change-Id: I76d0f49ac5ab3ab54203573e0d7fcfcc1c6aa10d	2016-04-19 17:00:20 +01:00
Geza Lore	8d64b53dc8	Revert "Fix uninitialized blk_skip for VAR TX." This reverts commit `e7b89d8835`.	2016-04-19 15:41:56 +01:00
Geza Lore	e7b89d8835	Fix uninitialized blk_skip for VAR TX. x->blk_skip used to be uninitialzied (leftover from encoding the previous block), if cm->tx_mode != TX_MODE_SELECT (which is used with higher --cpu-used or --rt options). This resulted in degraded coding performance when uning cm->tx_mode != TX_MODE_SELECT. This fixes the VP10/EndToEndTestLarge.EndtoEndPSNRTest/40 unit test. Change-Id: If39062927446798c626fc93694b4e6a4f35fa5da	2016-04-19 14:22:48 +01:00
Angie Chiang	027d12b7d6	Merge changes I359aa49c,Ic8ca5afb into nextgenv2 * changes: Generalize txfm scale in highbd quantizer Parameterize transform scale for quantizer	2016-04-12 18:02:05 +00:00
Geza Lore	511da8cbe5	Rename MI_BLOCK_SIZE and MI_MASK macros. Rename MI_BLOCK_SIZE.* -> MAX_MIB_SIZE.* (MIB is for MI Block). Rename MI_MASK.* -> MAX_MIB_MASK.* There are no functional changes. This is in preparation for coding the superblock size at the frame level, which will require some of these constants to become variables. The new names better reflect future semantics, and hence make the code clearer. Change-Id: Iee08d97554cf4cc16a5dc166a3ffd1ab91529992	2016-03-31 09:57:41 +01:00
Angie Chiang	64413a6ca7	Parameterize transform scale for quantizer This is to facilitate changing transform scale later Change-Id: Ic8ca5afba57d2489ebd191ccc40c1b31605a0d8c	2016-03-30 15:25:26 -07:00
Geza Lore	552d5cd715	Extend superblock size fo 128x128 pixels. If --enable-ext-partition is used at build time, the superblock size (sometimes also referred to as coding unit (CU) size) is extended to 128x128 pixels. Change-Id: Ie09cec6b7e8d765b7555ff5d80974aab60803f3a	2016-03-30 18:23:06 +01:00
Angie Chiang	46b234478f	Use vp10_[fwd/inv]_txfm2d_add_32x32 for bd 10 Change-Id: I996c48a90d7d71b52594a91a35cb8712c7fc212e	2016-03-28 11:08:40 -07:00
Angie Chiang	d9a0cbb1b7	Use vp10_[fwd/inv]_txfm2d_add_#x# for bd 10 Change-Id: Ie35bdbd7aafae693e3106d7ccbbdd8e65ee8800c	2016-03-23 12:05:12 -07:00
Geza Lore	f8cfb72a32	Refactor bsse and skip_txfm in MACROBLOCK. Simple refactoring to 2 dimensional arrays, in preparation for 128 wide superblocks. Change-Id: I40d447bd9fbd4f755534ea3cc82fc8f4676cea07	2016-03-18 15:30:10 +00:00
Geza Lore	efe7d4e5a2	Refactor mbmi->inter_tx_size to 2D array. This is in preparation of increasing the superblock size. Change-Id: I9197e397399fbe8aec1178a45ea0337dd90412d7	2016-03-18 15:30:09 +00:00
Alex Converse	b3ad81288f	Port switch to 9-bit rate cost to vp10. Brings the following commits to vp10: `269428e` Tie the bit cost scale to a define. `d13385c` Switch to 9-bit rate cost constants built on a 256 probability denominator. `ad43a73` Fix a signed overflow in vp9 motion cost. `1c9b091` Fix some interger overflow errors `fac947d` Restore previous motion search bit-error scale. Change-Id: I598ba7ee7efcde18439c31dfa96b86cbf297a580	2016-02-11 09:54:24 -08:00
hui su	2778b1cbb9	Fix a bug with ext-intra when skip_recode is enabled Change-Id: I906945d61254149b315a6de81ac6373ed31791e6	2016-01-19 14:54:31 -08:00
Debargha Mukherjee	3787b17439	Super transform - ported from nextgen branch Various additional changes were made to make the experiment compatible with misc_fixes. derflr: +0.979% hevcmr: +0.865% Speed-wise with --enable-supertx the encoder is only about 10% slower than without. Decoding impact is about 30% slowdown. Note this does not work with ext-tx or var-tx yet. That is a TODO. Change-Id: If25af4241a7a9efbd28f58eda3c4f044c7a7ef4b	2016-01-04 22:12:57 -08:00
Angie Chiang	0919edd4d2	Refactor vp10_encode_block_intra 1) Add VP10_XFORM_QUANT_SKIP_QUANT mode for vp10_xform_quant 2) Let encode_block call vp10_xform_quant so that its code flow is clear Change-Id: I122d5cf6a089f444ae018f3e4bf844be847e17ee	2015-12-11 14:30:24 -08:00
Angie Chiang	88cae8b422	Refactor vp10_xform_quant 1) Add facade to quantize b/fp/dc version so that their interface are the same. 2) Merge vp10_xform_quant b/fp/dc version to one function so that the code flow in encodemb.c is clear Change-Id: Ib62d6215438fc2d07f4e7e72393f964832d6746f	2015-12-03 15:28:11 -08:00
Angie Chiang	a245d9f88c	Add facade to inverse txfm Add inv_txfm and highbd_inv_txfm as facades of inverse transform such that the code flow in encodemb.c can be simpler Change-Id: Iea45fd22dd8b173f8eb3919ca6502636f7bcfcf7	2015-11-25 13:50:40 -08:00
Angie Chiang	96baa73ed9	Create hybrid_fwd_txfm.c Move txfm functions from encodemb to hybrid_twd_txfm.c to make encodemb's code flow clear Change-Id: If174d8ddb490d149c103e5127d30ef19adfbed13	2015-11-25 12:51:25 -08:00
Angie Chiang	30e325a94b	merge txfm_#x#_1 into txfm_#x# Change-Id: I9f539491fe676898246976c91d5ac4804a155803	2015-11-24 18:21:27 -08:00
Geza Lore	01bb4a318d	Eliminate copying for FLIPADST in fwd transforms. This patch eliminates the copying of data when using FLIPADST forward transforms, by incorporating the necessary data flipping into the load_buffer_* functions of the SSE2 optimized forward transforms. The load_buffer_* functions are normally inlined, so the overhead of copying the data is removed and the overhead of flipping is minimized. Left to right flipping is still not free, as the columns need to be shuffled in registers. To preserve identity between the C and SSE2 implementations, the appropriate C implementations now also do the data flipping as part of the transform, rather than relying on the caller for flipping the input. Overall speedup is about 1.5-2% in encode on my tests. Note that these are only the forward transforms. Inverse transforms to come in a later patch. There are also a few code hygiene changes: - Fixed some indents of switch statements. - DCT_DCT transform now always use vp10_fht* functions, which dispatch to vpx_fdct* for DCT_DCT (some of them used to call vpx_fdct* directly, some of them used to call vp10_fht*). Change-Id: I93439257dc5cd104ac6129cfed45af142fb64574	2015-11-03 17:10:55 +00:00
Jingning Han	bfeac5e19c	Support per transform block skip coding Allow the encoder to drop individual transform block coding. Change-Id: I2c2b2985254cb92baf891f03daa33f067279373b	2015-10-30 08:55:17 -07:00
Debargha Mukherjee	8a4292441f	Refactoring tx-types to add more flexibility Allows inter and intra tx_types to have different sets of transforms for different tx_size/sb_type combinations. Change-Id: Ic0ac1daef7a9fb15c4210271e4d04cd36e5cec8e	2015-10-28 23:31:32 -07:00
Debargha Mukherjee	f1c4b79d72	Build fix for ext-tx Change-Id: Ifab43f85f6ae1be6b9f95521f79ba49055353b5f	2015-10-23 21:38:50 +00:00
Yaowu Xu	5a27b3bb85	Fix merge defects This commit fixes the merge conflicts between master and nextgenv2 and disable early termination in choose_tx_size() to avoid failure in test. The test failures are pre-existing, some of the issue were fixed in masterbase already, so will have another merge to introduce the fixes. Change-Id: Ib71889661955e73aedbb4db49d8be70425281dcb	2015-10-22 18:25:41 -07:00
Yaowu Xu	4ac2ae3a4d	Merge branch 'masterbase' into nextgenv2 Conflicts: configure test/vp9_encoder_parms_get_to_decoder.cc vp10/common/blockd.h vp10/common/entropymode.c vp10/common/entropymode.h vp10/common/idct.c vp10/decoder/decodeframe.c vp10/decoder/decodemv.c vp10/encoder/bitstream.c vp10/encoder/encodeframe.c vp10/encoder/encodemb.c vp10/encoder/encoder.c vp10/encoder/encoder.h vp10/encoder/rd.c vp10/encoder/rdopt.c vp10/encoder/tokenize.c vp10/encoder/tokenize.h vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_aq_cyclicrefresh.c vp9/encoder/vp9_encoder.h vp9/vp9_cx_iface.c vpx/vp8cx.h vpx_dsp/x86/vpx_subpixel_8t_intrin_ssse3.c vpx_scale/yv12config.h Change-Id: I604a329d38badec7a11e8ede16ca1404476e9b93	2015-10-22 11:40:44 -07:00
Ronald S. Bultje	60c58b5284	vp10: per-segment lossless coding. Some more testing of this patch would probably be useful, but I think the basics of it should work fine now. See issue 1035. Change-Id: I4a36d58f671c5391cb09d564581784a00ed26245	2015-10-16 19:30:39 -04:00
Ronald S. Bultje	c7dc1d78bf	vp10: add extended-intra prediction edges experiment. This experiment allows using full above/right edges for all transform sizes whenever available (for d45/d63), and adds bottom/left edges for d207. See issue 1043. Change-Id: I5cf7f345e783e8539bb6b6d2c9972fb1d6d0a78b	2015-10-16 19:30:39 -04:00
Jingning Han	2cdc12742d	Rate-distortion optimization for recursive transform block coding This commit enables the rate-distortion optimization for recursive transform block coding scheme. Change-Id: Id6a8336ca847bb3af1e94cbfb51db1f4da12d38f	2015-10-13 12:49:03 -07:00
Jingning Han	a8dad55c82	Make chroma component RD estimate support transform partition This commit makes the rate-distortion optimization for chroma component support the recursive transform block coding scheme. Change-Id: I1bfed6d05b0ebb3905cb625222401e2ccbae10f3	2015-10-08 18:04:03 -07:00
Jingning Han	704985e65a	Add decoder support to recursive transform block partition This commit allows the decoder to recursively parse and rebuild the pixel blocks. Change-Id: I510f3a30ae7cdad5b70725c66882b00a0594e96f	2015-10-08 16:16:41 -07:00
Jingning Han	52bb9dd45c	Make tokenization process support recursive transform block coding This commit makes the transform, quantization, tokenization and their corresponding inverse operations support recursive transform block coding process. Change-Id: I71f2ef3a7c2d3db7cfc63c1fd3f1337e8e0360b5	2015-10-08 08:46:02 -07:00
Jingning Han	ebc48efe37	Use explicit block position in foreach_transformed_block Add the row and column index to the argument list of unit functions called by foreach_transformed_block wrapper. This avoids the repeated internal parsing according to the block index. Change-Id: I42b3578eac258ebaba7a7c74f684de9abab521a6	2015-10-07 16:32:19 -07:00
Jingning Han	00ca5c1c98	Simplify vp10_xform_quant index parsing Change-Id: Id7f7a9b2e53fc0074b55d58143f296afad6b844e	2015-10-06 17:19:23 -07:00
hui su	2afe7320c8	Add identity transform to ext-tx experiment ext-tx on derflr: +1.756% (was +1.648) Change-Id: I8a87970fa589e8f5f96db7aa68ec9b6c98e20188	2015-09-30 18:47:46 -07:00
Yaowu Xu	7c514e2dfd	Merged branch 'master' into nextgenv2 Resolved Conflicts in the following files: configure vp10/common/idct.c vp10/encoder/dct.c vp10/encoder/encodemb.c vp10/encoder/rdopt.c Change-Id: I4cb3986b0b80de65c722ca29d53a0a57f5a94316	2015-09-29 16:17:32 -07:00
Ronald S. Bultje	bab8d38f7f	vp10: remove MACROBLOCK.{highbd_,}itxfm_add function pointer. This is preparatory work for allowing per-segment lossless coding. See issue 1035. Change-Id: I9487d02717ee3e766aee61a487780056bb35d2d3	2015-09-25 19:30:46 -04:00
Ronald S. Bultje	c74b33a413	vp10: remove MACROBLOCK.fwd_txm4x4 function pointer. This is preparatory work for allowing per-segment lossless coding. See issue 1035. Change-Id: Idd72e2a42d90fa7319c10122032d1a7c7a54dc05	2015-09-25 19:30:46 -04:00
Debargha Mukherjee	09ff5f2792	Merge remote-tracking branch 'origin/master' into nextgenv2 Periodic merge to get master changes into nextgenv2. Change-Id: I6f0e4b470f193da03f1a8cb8e6a93ae39395699a	2015-09-17 16:33:18 -07:00
Jingning Han	481b834842	Fix vp10 high bit-depth build Change-Id: Ie3daed0b282b43ef81d2f8797ac1f6e8bde7d65e	2015-09-11 08:56:29 -07:00
Jingning Han	f137697c32	Take out skip_encode speed feature in vp10 Change-Id: Ic39d4523e78863c816b0fc85f56ea5ae5e0b3310	2015-09-10 12:45:39 -07:00

1 2

57 Commits