generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	f60a3910c4	Move token_cache from cost_coeffs to MACROBLOCK This commit moves token_cache buffer into macroblock struct, instead of defining as a local variable in cost_coeffs. This avoids repeatedly re-allocating memory space in the rate-distortion optimization loop. The runtime at speed 0 reduces: bus 2000kbps, 161692ms to 159951ms football 600kbps, 229505ms to 225821ms Change-Id: If7da6b0b6d8c5138a16271a33c4548fba33d8840	2013-10-14 10:45:56 -07:00
Yaowu Xu	8b175679be	Masking intra mode choice adaptively The commit changes to mask available intra prediction modes for test based on prediction block size. With this patch, encoding time of CpuUsed 2 reduces from 10% to 20% for HD clips with a compression drop of 0.2% Change-Id: I65f320f1237c0f5ae3a355bf7caf447f55625455	2013-10-11 10:29:53 -07:00
Jingning Han	54e702b5d7	Merge "Restore mode skip feature in sub8x8 rd loop"	2013-10-11 09:21:06 -07:00
Yaowu Xu	e2d6e37a54	Merge "change to avoid out-of-range computation"	2013-10-10 13:38:16 -07:00
Jingning Han	09aca3089f	Merge "Re-design rate-distortion cost tracking buffers"	2013-10-10 12:57:31 -07:00
Jingning Han	fc19243ced	Re-design rate-distortion cost tracking buffers This commit re-designs the per transformed block rate-distortion costs tracking buffers. It removes redundant buffer usage, makes the needed context memory allocation per VP9_COMP instance and reuses the same buffer sets inside the rate-distortion optimization search loop, thereby avoiding repeatedly requiring memory space. It reduces speed 0 runtime: bus at 2000 kbps from 166763ms to 158967ms, football at 600 kbps from 246614ms to 234257ms. Both about 5% speed-up. Local tests suggest about 2% to 5% speed-up for speed 1 and 2 settings. This does not change compression performance. Change-Id: I363514c5276b5cf9a38c7251088ffc6ab7f9a4c3	2013-10-10 11:03:44 -07:00
Yaowu Xu	b47cef056e	change to avoid out-of-range computation Change-Id: Id5e31833a0ef40de9f64c2f5674af7083233bf14	2013-10-10 11:01:50 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Deb Mukherjee	2b055dfe3f	Merge "Adjustment to mv cost parameters"	2013-10-10 09:08:58 -07:00
Jingning Han	be6ae20510	Merge "Fix intra dist model of skip_encode feature"	2013-10-10 09:00:20 -07:00
Deb Mukherjee	e4b0fce41c	Adjustment to mv cost parameters Increases these parameters. There is a small efficiency gain. Change-Id: Ie5f0ddb39c907d335e0dafa5eb112365a81f4542 derfraw300: +0.091% stdhdraw250: +0.238%	2013-10-09 23:14:25 -07:00
Jingning Han	013db649fa	Fix intra dist model of skip_encode feature The intra mode distortion adjustment for skip_encode feature was broken in the refactoring cc91851. This commit fixes it and tunes the distortion models used therein. Change-Id: I0d676e82f8e855536a90cf9b3e3fdefafcd886c6	2013-10-09 16:05:50 -07:00
Deb Mukherjee	d6aae4d456	Merge "Clean-ups in rdopt.c"	2013-10-09 12:10:20 -07:00
Deb Mukherjee	eb8b1cd764	Clean-ups in rdopt.c Some minor cleanups in preparation for experimentation with some encode parameters and thresholds Change-Id: I449d66da97eae0a7acdf4aae374e2f9111342056	2013-10-09 11:32:03 -07:00
Jingning Han	03fe08ca30	Deprecate the use of PARTITION_INFO from encoder Use b_mode_info to store the inter prediction mode of sub8x8 block, in replacement of the use of partition_info. Remove redundant buffer update for partition_info. For bus_cif at 2000 kbps, this seem to make speed 0 about 1% faster. Change-Id: Id1b3be45e75a24fb4b42335ac480c23e440978f6	2013-10-09 09:23:52 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Dmitry Kovalev	8d3ef287a2	Merge "Removing redundant vp9_pt_energy_class declarations."	2013-10-08 10:54:48 -07:00
Jim Bankoski	08feefbe7b	easy to fix cpplint issue in rdopt.c Change-Id: Id093816146de0d100f0c6ae2542aaa427dbab2d8	2013-10-07 17:03:29 -07:00
Jingning Han	c8f481fa3d	Restore mode skip feature in sub8x8 rd loop This commit restores the mode skip feature in the sub8x8 rd loop. Change-Id: I5496ee32053f572b8961b549e9ecd4f1360824de	2013-10-07 14:20:34 -07:00
Dmitry Kovalev	23cc1cd8e6	Removing redundant vp9_pt_energy_class declarations. Declaring vp9_pt_energy_class in vp9_entropy.h instead of many external places. Change-Id: I66e8a3fc119a43f88d130d0dae4133c825a047a3	2013-10-07 14:11:01 -07:00
Dmitry Kovalev	272adbbec4	Using inter_mode_offset_function instead of duplicated code. Change-Id: I8de865cd1deca07b5c92c225782f0867367e9a11	2013-10-07 13:18:46 -07:00
Jingning Han	1ab60f7bfb	Merge "Remove redundant second_ref_frame check in sub8x8"	2013-10-04 09:04:11 -07:00
Paul Wilkins	8abd92f12f	Remove mode_skip_start and mask code for sub 8x8 This code serves no purpose in the re-factored sub 8x8 code. Change-Id: I5364986224d1a28b71bcb046ec8557a3d14aaa47	2013-10-04 14:26:17 +01:00
Dmitry Kovalev	d975804e9a	Merge "Replacing duplicated code with get_scan_and_band call."	2013-10-03 18:58:40 -07:00
Dmitry Kovalev	8b34437522	Replacing duplicated code with get_scan_and_band call. Change-Id: I2cc3684f416a63dc99b9303109f9850f34a470d5	2013-10-03 17:46:28 -07:00
Jingning Han	2952b7d1fb	Remove redundant second_ref_frame check in sub8x8 This commit removes the redundant second reference frame check in the rate-distortion optimization loop for sub8x8 blocks. Change-Id: I13a57a6f624c4a9bcef02ff2a867fa30d8b44a93	2013-10-03 14:02:12 -07:00
Jingning Han	b9daef91d8	Use vp9_zero in sub8x8 RD optimiazion loop Change-Id: Ic23a705e48cadaa7151f2bd8536d56636cb973e3	2013-10-03 12:34:25 -07:00
Jingning Han	4093192ec9	Change b_mode_info definition from union to struct This commit defines b_mode_info as a struct type. This will allow us to further remove the use of PARTITION_INFO in the encoding process. Change-Id: I975b0f7d557b5e0f66545a61b472def76b671cce	2013-10-03 12:34:11 -07:00
Jingning Han	793c2d8429	Remove unused variables in inter_mode rd loops Remove redundant variable definition/use in rate-distortion search loop for regular and sub8x8 blocks, respectively. Change-Id: Ic0eb3660bb6851ba2eb8d702ba9fd11595000d01	2013-10-03 12:34:11 -07:00
Jingning Han	a55625873f	Merge "Refactor inter mode rate-distortion search"	2013-10-03 12:19:53 -07:00
Jingning Han	11abab356e	Refactor inter mode rate-distortion search This commit separates the rate-distortion optimization loop of superblocks from that of sub8x8 blocks. This allows better design rate-distortion optimization search loop for each setting. It also removes the use of SPLITMV and I4X4_PRED therein. No performance change in speed 0 settings. For bus@CIF at 2000kbps, the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster). The overall compression performance on derf changed by -0.021%. Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the overall coding efficiency goes up by 1.629% for derf, 1.236% for yt. Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4	2013-10-03 11:36:49 -07:00
Dmitry Kovalev	9250d1529c	Using vp9_zero instead of vpx_memset. Change-Id: I9a0d0e9c3459954aa7b9c68f92cc5d56385ebd18	2013-10-03 10:59:36 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to insure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Dmitry Kovalev	b927620231	Merge "Using is_inter_block and has_second_ref functions."	2013-09-29 12:14:41 -07:00
Dmitry Kovalev	29815ca729	Merge "Moving from int_mv* to MV* (3)."	2013-09-29 12:13:16 -07:00
Dmitry Kovalev	7343681675	Merge "Removing vp9_get_coef_neighbors_handle function."	2013-09-29 12:01:36 -07:00
Dmitry Kovalev	209c6cbf8f	Removing vp9_get_coef_neighbors_handle function. Change-Id: I6be72c8b048d1ccc7ef43764cf84c32360098970	2013-09-27 14:11:13 -07:00
Guillaume Martres	2b426969c3	Simplify RDMULT and RDDIV derivation Don't divide RDMULT and RDDIV by 100 when RDMULT > 1000. This was probably done to avoid overflow when the rd cost was stored in a 32 bits integer but this is not the case anymore. This change will make it easier to support multiple quantizers per frame. derf compression gain at speed 0: 0.037% Change-Id: Ibeeb9b7cfa1a132a7af41bc90fc07a3bba0857f6	2013-09-26 13:55:16 -07:00
Dmitry Kovalev	eda4e24c0d	Using is_inter_block and has_second_ref functions. Change-Id: I60dee58a4fd24d3c4f3c101a49d30e217309f43a	2013-09-25 19:03:04 -07:00
Dmitry Kovalev	8266da1cd1	Moving from int_mv* to MV* (3). Change-Id: I9795d0937bc07793c13d067281995e0750f694d9	2013-09-25 16:44:19 -07:00
Dmitry Kovalev	f9e2140cab	Merge "Moving from int_mv* to MV* (2)."	2013-09-25 16:12:13 -07:00
Dmitry Kovalev	2b5670238b	Merge "Replacing txfm with tx."	2013-09-25 15:57:56 -07:00
Dmitry Kovalev	d445945a84	Adding vp9_get_entropy_contexts function. Change-Id: Ife0dd29fb4ad65c7e12ac5f1db8cea4ed81de488	2013-09-24 17:26:05 -07:00
Dmitry Kovalev	d0365c4a2c	Replacing txfm with tx. Renaming txfm_stepdown_count to tx_stepdown_count and max_txfm_size to max_tx_size. Change-Id: Ifc173e22c78240e561a57c4c741b64b1b8fc6fef	2013-09-24 17:24:35 -07:00
Dmitry Kovalev	b87696ac37	Moving from int_mv* to MV* (2). Updating fractional_mv_step_fp and fractional_mv_step_comp_fp function types. Change-Id: I601c4378bc39ac3ffd4e295d9cbd8e1f74829d46	2013-09-24 12:48:12 -07:00
Dmitry Kovalev	30888742f4	Merge "Moving from int_mv to MV."	2013-09-24 12:25:56 -07:00
Yaowu Xu	71cfaaa689	Merge "Replace memcpy with vpx_memcpy"	2013-09-24 11:35:03 -07:00
Yaowu Xu	9be0bb19df	Replace memcpy with vpx_memcpy Also removed obselete comment Change-Id: Iae1664777d76383639c637ee786e0d50fc45819a	2013-09-24 10:56:06 -07:00
Yaowu Xu	ff1ae7f713	Prevent using uninitialized value in RD decision INT64_MAX may be assigned as RDCOST when RDCSOST computation is skipped for speed, this commit to prevent INT64_MAX from being used as real RDCOST in transform size decision. Change-Id: I89a945134191bbdea1f1431ade70424ac079eaac	2013-09-24 10:53:01 -07:00
Jingning Han	9bcd750565	Merge "Enable per transformed block zero coeffs forcing"	2013-09-24 09:18:17 -07:00

1 2 3 4 5 ...

655 Commits