generic-library/vpx

Author	SHA1	Message	Date
Yunqing Wang	10e83b0717	Enable disable_splitmv feature for other speeds Added disable_splitmv feature at other speed levels. For speed 3 or above, always turn it on. Change-Id: Ibb36f0a7ef12a34b4f8d0f9cb6193eab43b34360	2013-07-17 10:25:49 -07:00
Ronald S. Bultje	83c7e13a6b	Do a skip-block check for sub8x8 partitions also. +0.2% SSIM and glbPSNR on derfraw300. Change-Id: I9cba0bca55e606a22f557c7732b064f738efe84d	2013-07-17 09:46:47 -07:00
Yunqing Wang	df90d58f4f	Speed up motion estimation using small partitions' result(experiment) Current partition checking starts from small sizes, and then goes up to large sizes. This experiment uses the small partitions' motion estimation result, which is already available, to speed up the large partition's motion estimation. We can decide to skip some patition checkings if they are unlikely choices. We could use the motion vector(MV) result as current partition's prediction MV, limit the search range and reference frame. Current result at speed 1: psnr loss: 1.19% for stdhd, 0.287% for derf. speed gain: 14% for sunflower(hd), 11% for akiyo. Further improvement will be done later. Change-Id: I5abfd070e9cace2e91e2a0247d1325df313887ab	2013-07-17 09:11:47 -07:00
Paul Wilkins	d66eab15dd	Merge "Move uv intra mode selection in rd loop."	2013-07-17 05:19:26 -07:00
Paul Wilkins	154c34a3ee	Merge "Limit transform sizes searched for uv intra."	2013-07-17 03:40:11 -07:00
Paul Wilkins	2ee338ce3b	Move uv intra mode selection in rd loop. Use an estimate based on DC_PRED for intra uv cost within the rd loop then only do a full uv mode analysis if an intra mode is chosen. Significant speed gains in some cases. Currently only enabled for speed 2 pending speed/quality tests. Change-Id: Ie851a12400d5483bce47ec0e3ccb8516041e91c0	2013-07-17 11:11:21 +01:00
Paul Wilkins	6c667f0ffe	Limit transform sizes searched for uv intra. Apply limit if search_method == USE_LARGESTALL to the range of UV tx sizes searched. Change-Id: I6db29f0dd237285ffc50d75a37e8b68151ad821c	2013-07-17 11:08:55 +01:00
Paul Wilkins	5f4722c75f	Merge "Minor cleanup in code to fine uv tx_size."	2013-07-17 02:50:09 -07:00
Dmitry Kovalev	6638b6f63f	Merge "Removing MV_GROUP_UPDATE define and corresponding code."	2013-07-16 21:09:00 -07:00
Jingning Han	0b58fa80a0	Merge "Skip redundant motion search in 4x4 level rd loop"	2013-07-16 20:54:25 -07:00
Jingning Han	a142d6fc93	Skip redundant motion search in 4x4 level rd loop This commit makes the encoder to perform motion search only once per reference frame type for each 4x4/4x8/8x4 block. For bus_cif at 2000 kbps, the runtime goes from 253812ms -> 217817ms (14% speed-up) for speed 0. Change-Id: I5f17599ccc8cfaf93ccb4f98fcb6008af6d79e92	2013-07-16 17:21:11 -07:00
Dmitry Kovalev	3997da0d35	Removing MV_GROUP_UPDATE define and corresponding code. Change-Id: I4884cdc2557d25d50c7c4f7e19b1ad8bdb93cd63	2013-07-16 15:03:00 -07:00
Dmitry Kovalev	9482a0bf10	Cleaning up tile code. Removing tile_rows and tile_columns from VP9Common, removing redundant constants MIN_TILE_WIDTH and MAX_TILE_WIDTH, changing signature of vp9_get_tile_n_bits. Change-Id: I8ff3104a38179b2c6900df965c144c1d6f602267	2013-07-16 14:47:15 -07:00
James Zern	39ce4b13d5	Merge "use consistent framerate naming"	2013-07-16 14:22:52 -07:00
James Zern	9581eb6e8a	use consistent framerate naming s/frame_rate/framerate/g Change-Id: I6fc3e088e419c5f46e3a9390dd8a2cad2677a2fc	2013-07-16 14:12:47 -07:00
Dmitry Kovalev	5de96b3ce6	Merge "Rewriting vp9_set_pred_flag_{seg_id, mbskip}."	2013-07-16 13:34:42 -07:00
James Zern	5baa416b6c	Merge "vp9: remove frames_{since,till}.. from MACROBLOCKD"	2013-07-16 13:00:14 -07:00
James Zern	3a7c2665d0	Merge "yv12config: remove YUV_TYPE"	2013-07-16 12:16:04 -07:00
Dmitry Kovalev	863138a2ad	Rewriting vp9_set_pred_flag_{seg_id, mbskip}. Making implementation of vp9_set_pred_flag_{seg_id, mbskip} consistent with vp9_get_segment_id without using confusing sub(a, b) macro. Passing mi_row and mi_col to functions explicitly instead of replying on mb_to_right_edge and mb_to_bottom_edge. Change-Id: I54c1087dd2ba9036f8ba7eb165b073e807d00435	2013-07-16 10:44:48 -07:00
Paul Wilkins	30d2ea45ce	Minor cleanup in code to fine uv tx_size. Change-Id: I94b97a966b5efbc9a243048f1f5ddbbdc4b1846e	2013-07-16 18:27:33 +01:00
Jingning Han	dd97c62ab8	Merge "Skip inter-coded block reconstruction in rd loop"	2013-07-16 09:03:38 -07:00
Dmitry Kovalev	e8e7620a1f	Merge "Removing and moving around constant definitions."	2013-07-16 00:52:53 -07:00
Yaowu Xu	c5b0cd8405	Merge "Change to extend full border only when needed"	2013-07-15 21:35:32 -07:00
Yaowu Xu	5b915ebd92	Change to extend full border only when needed This is a short term optimization till we work out a decoder implementation requiring no frame border extension. Change-Id: I02d15bfde4d926b50a4e58b393d8c4062d1be70f	2013-07-15 20:52:13 -07:00
Dmitry Kovalev	ca75f1255f	Removing and moving around constant definitions. Removing unused and duplicated constants, moving them from .h to .c if possible. Change-Id: Ief4d6b984a3ca2e9b38504f0d855ed072cf7133f	2013-07-15 19:26:30 -07:00
Johann	6eae37f45c	Merge "Remove print_nmvcounts"	2013-07-15 18:43:41 -07:00
Ronald S. Bultje	1ff94fea56	Inline vp9_quantize() in xform_quant(). Cycle times: 4x4: 151 to 131 cycles (15% faster) 8x8: 334 to 306 cycles (9% faster) 16x16: 1401 to 1368 cycles (2.5% faster) 32x32: 7403 to 7367 cycles (0.5% faster) Total encode time of first 50 frames of bus @ 1500kbps (speed 0) goes from 1min39.2 to 1min38.6, i.e. a 0.67% overall speedup. Change-Id: I799a49460e5e3fcab01725564dd49c629bfe935f	2013-07-15 17:30:57 -07:00
Ronald S. Bultje	6fb418741f	Inline xform_quant() in encode_block_intra(). Also inline some of the block calculations to assist the compiler to not do silly things like calculating the same offset (or converting between raster/transform block offset or block, mi and pixel unit) many, many, many times. Cycle times: 4x4: 584 -> 505 cycles (16% faster) 8x8: 1651 -> 1560 cycles (6% faster) 16x16: 7897 -> 7704 cycles (2.5% faster) 32x32: 16096 -> 15852 cycles (1.5% faster) Overall, this saves about 0.5 seconds (1min49.8 -> 1min49.3) on the first 50 frames of bus (speed 0) @ 1500kbps, i.e. 0.5% overall. Change-Id: If3dd62453f8e2ab9d4ee616bc4ea956fb8874b80	2013-07-15 16:00:42 -07:00
Jingning Han	043e0f9dad	Skip inter-coded block reconstruction in rd loop Skip the inverse transform and reconstruction of inter-mode coded blocks in the rate-distortion optimization loop, when skip_encode_sb feature is turned on. This provides about 1% speed-up at speed 0, and 1.5% speed-up at speed 1. No performance change in both settings. Change-Id: I2932718bf4d007163702b61b16b6ff100cf9d007	2013-07-15 11:32:14 -07:00
Jingning Han	faff6ed0fb	Skip duplicate block encoding in the rd loop This speed feature allows the encoder to largely remove the spatial dependency between blocks inside a 64x64 superblock, thereby removing the need to repeatedly encode superblocks per partition type in the rate-distortion optimization loop. A major challenge lies in the intra modes tested in the rate-distortion optimization loop. The subsequent blocks do not have access to the reconstructed boundary pixels without the intermediate coding steps. This was resolved by using the original pixels for intra prediction in the rd loop, followed by an appropriately designed distortion modeling on the quantization parameters. Experiments also suggested that the performance impact is more discernible at lower bit-rate/psnr settings. Hence a quantizer dependent threshold is applied to deactivate skip of block coding. For bus_cif at 2000 kbps, speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB performance loss. speed 1: runtime 65312ms -> 61536ms, (7% speed-up) at 0.04dB performance loss. This operation is currently turned on in settings of speed 1. Change-Id: Ib689741dfff8dd38365d8c1b92860a3e176f56ec	2013-07-15 11:08:58 -07:00
James Zern	dc1d2331f6	vp9: remove frames_{since,till}.. from MACROBLOCKD frames_since_golden / frames_till_alt_ref_frame are unused. Change-Id: I348e7689d4d75412cf4de7703d885be942e4a26b	2013-07-13 18:02:11 -07:00
Dmitry Kovalev	429070987a	Using vp9_copy and vp9_zero instead of custom code. Change-Id: Id9b6ceeddca3f9b34bfada5c499b1e7a2f42c30b	2013-07-12 18:07:43 -07:00
Yaowu Xu	cdea4a7c66	Merge "Fix a build issue"	2013-07-12 16:17:22 -07:00
James Zern	4fc6c88e9c	yv12config: remove YUV_TYPE this was never fleshed out in the context of VP8, for which it was added. for VP9 it has no meaning. Change-Id: Iba2ecc026d9e947067b96690245d337e51e26eff	2013-07-12 15:25:48 -07:00
Dmitry Kovalev	cc662dd768	Adding struct tx_probs and struct tx_counts to cleanup the code. Also removing unused declarations from vp9_entropymode.h file. Change-Id: Ib9c5826db3584a32f6bb3297a76c522b99d83402	2013-07-12 15:22:38 -07:00
Yaowu Xu	fb754b182f	Fix a build issue Change-Id: I23a75c495ed7ea917d7f312bef0990e20a6b53d9	2013-07-12 11:38:44 -07:00
James Zern	0195fb53cb	vp9: consistent 'log2' variable naming lg2 -> log2 Change-Id: I0602ddff49e42c9c40c29c084d04b7592b9f8edf	2013-07-12 11:37:43 -07:00
Deb Mukherjee	94c481f9f1	Some minor cleanups for efficiency Implements some of the helper functions more efficiently with lookups rathers than branches. Modeling function is consolidated to reduce some computations. Also merged the two enums BLOCK_SIZE_TYPES and BlockSize into one because there is no need to keep them separate (even though the semantics are a little different). No bitstream or output change. About 0.5% speedup Change-Id: I7d71a66e8031ddb340744dc493f22976052b8f9f	2013-07-12 10:22:56 -07:00
Dmitry Kovalev	727631873d	Merge "Removing redundant code mostly from vp9_pred_common.{h, c}."	2013-07-12 10:22:30 -07:00
Paul Wilkins	b8ddc9f0d3	Merge "Speed 2 feature adjustment."	2013-07-12 02:14:01 -07:00
Jingning Han	84c3ac0476	Merge "Remove unnecessary tx_type branch in encode_block"	2013-07-11 21:52:27 -07:00
Dmitry Kovalev	dd150e8ea9	Removing redundant code mostly from vp9_pred_common.{h, c}. Removing redundant function arguments and curly braces. Change-Id: I46e02561f33fe02e84a3b19756f03b9504bd6a1b	2013-07-11 18:39:10 -07:00
Johann	e6ab476dd4	Remove print_nmvcounts For some reason iOS builds take a really long time to sort this function out. It's not used anywhere so remove it. Change-Id: Ia5c8513a0d9c7eb32641cca58ca1c1113e2dd9f4	2013-07-11 17:22:03 -07:00
Ronald S. Bultje	ee09dd9949	Remove unused function block_error(). Change-Id: I78a79fc51c2d7cc3c261f35b569155397f3dc0c4	2013-07-11 17:14:03 -07:00
Dmitry Kovalev	8c05e59065	Calling is_inter_mode() instead of custom code. Change-Id: Iccd4ab95ea51a6d57ed43947f2fd7ad92e8979cf	2013-07-11 14:14:47 -07:00
Dmitry Kovalev	c4ad3273c7	Moving segmentation related vars into separate struct. Adding segmentation struct to vp9_seg_common.h. Struct members are from macroblockd and VP9Common structs. Moving segmentation related constants and enums to vp9_seg_common.h. Change-Id: I23fabc33f11a359249f5f80d161daf569d02ec03	2013-07-11 11:57:57 -07:00
Dmitry Kovalev	f70c021d36	Merge "Adding write_compressed_header function."	2013-07-11 11:57:17 -07:00
Dmitry Kovalev	802e57535a	Merge "Removing unused TOKENEXTRA arg from pick_sb_modes function."	2013-07-11 11:46:06 -07:00
Jingning Han	b9381b6faf	Remove unnecessary tx_type branch in encode_block The function encode_block is called only by inter-prediction modes, hence removing the transform type branching there. Change-Id: I34a3172e28ce2388835efd0f8781922211bff857	2013-07-11 09:11:35 -07:00
Paul Wilkins	5290eeab88	Speed 2 feature adjustment. With sf->auto_mv_step_size on it is questionable whether sf->reduce_first_step_size is worthwhile. At speed 2 it was not having a big impact. Even at speed 2 sf->optimize_coefficients = 0 is not having a big speed imapct so for now I have moved it down into a higher speed setting. Change-Id: I8a54de76d486ad37aabce76474889da2768b14c1	2013-07-11 13:59:12 +01:00

1 2 3 4 5 ...

1266 Commits