generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	f60a3910c4	Move token_cache from cost_coeffs to MACROBLOCK This commit moves token_cache buffer into macroblock struct, instead of defining as a local variable in cost_coeffs. This avoids repeatedly re-allocating memory space in the rate-distortion optimization loop. The runtime at speed 0 reduces: bus 2000kbps, 161692ms to 159951ms football 600kbps, 229505ms to 225821ms Change-Id: If7da6b0b6d8c5138a16271a33c4548fba33d8840	2013-10-14 10:45:56 -07:00
Dmitry Kovalev	ac468dde46	Consistent names for inverse hybrid transforms (2 of 2). Renames: vp9_iht_add -> vp9_iht4x4_add vp9_iht_add_8x8 -> vp9_iht8x8_add vp9_iht_add_16x16 -> vp9_iht16x16_add Change-Id: I8f1a2913e02d90d41f174f27e4ee2fad0dbd4a21	2013-10-11 15:49:05 -07:00
Dmitry Kovalev	107897cf05	Merge "Consistent names for inverse hybrid transforms (1 of 2)."	2013-10-11 15:33:00 -07:00
Dmitry Kovalev	e765aade0b	Merge "Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB."	2013-10-11 14:15:46 -07:00
Deb Mukherjee	c222b96bfd	Merge "Change in rddiv parameter to make it a power of 2"	2013-10-11 13:53:59 -07:00
Dmitry Kovalev	7ef573914d	Consistent names for inverse hybrid transforms (1 of 2). Renames: vp9_short_iht4x4_add -> vp9_iht4x4_16_add vp9_short_iht8x8_add -> vp9_iht8x8_64_add vp9_short_iht16x16_add_c -> vp9_iht16x16_256_add Change-Id: Ibca7a188fd062b196787ac5efc1ea545e7f166c0	2013-10-11 13:31:32 -07:00
Dmitry Kovalev	1ab7eb1406	Merge "Adding const to the input argument of all 1D transforms."	2013-10-11 13:20:57 -07:00
Yaowu Xu	4c20bff9d2	Merge "Masking intra mode choice adaptively"	2013-10-11 11:25:52 -07:00
Dmitry Kovalev	44195fda71	Adding const to the input argument of all 1D transforms. Also adding static to iadst16_1d and fadst16 functions. Change-Id: I13c7df3b776f0f8efc6e80099bdb0a2f6d29edaf	2013-10-11 11:19:58 -07:00
Dmitry Kovalev	4a0f9478ef	Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB. Values of MODE_UPDATE_PROB and VP9_COEF_UPDATE_PROB are equal, so replacing them with one constant. Inlining appropriate arguments for functions: vp9_cond_prob_diff_update (encoder) vp9_diff_update_prob (decoder) Change-Id: I1255a1cb477743b799b3bfbbcd8de6b32b067338	2013-10-11 10:47:22 -07:00
Dmitry Kovalev	6e21ca7635	Merge "Removing vp9_tree_p typedef."	2013-10-11 10:44:04 -07:00
Deb Mukherjee	d9655e42b8	Change in rddiv parameter to make it a power of 2 Converts the constant rddiv parameter to 128 (from 100) and implements RDCOST with bit-shift rather than multiplication. Other parameters are also adjusted to roughly keep the same balance between Rate and Distortion. There is a slight speed-up of about 0.5-1% (at speed 0) as testted on football_cif. There is a slight change in performance due to small change in the parameters. derfraw300: +0.033% stdhdraw250; +0.102% Change-Id: I70ac69f58fa71c83108f68fe41796cd19d1fc760	2013-10-11 10:43:02 -07:00
Yaowu Xu	8b175679be	Masking intra mode choice adaptively The commit changes to mask available intra prediction modes for test based on prediction block size. With this patch, encoding time of CpuUsed 2 reduces from 10% to 20% for HD clips with a compression drop of 0.2% Change-Id: I65f320f1237c0f5ae3a355bf7caf447f55625455	2013-10-11 10:29:53 -07:00
Jingning Han	54e702b5d7	Merge "Restore mode skip feature in sub8x8 rd loop"	2013-10-11 09:21:06 -07:00
Paul Wilkins	704028d435	Experimental rate control change. When the codec in VBR (or cq) mode hits its max q limits and is struggling to hit a target bandwidth, the bit target per frame collapses. In the first instance normal frames cap out at the maximum allowed Q and then the ARF and GFs do the same. This latter behavior is not generally desirable as GFs and ARFs are only effective from a quality and data rate perspective if they have at lease some level of -Q delta compared to the surrounding frames. In this patch I define a separate max Q for GFs and ARFs that is derived from but somewhat lower than that defined for normal frames. In effect there is a minimum Q delta that will always be available for GFs and ARFs regardless of the target rate and MAXQ setting. This may of course mean that the absolute lowest rate obtainable for a given clip is somewhat higher. Change-Id: I268868b28401900d0cd87e51e609cd3b784ab54a	2013-10-11 13:40:54 +01:00
Paul Wilkins	8b989f5b23	Disable recode loop. For VBR coding disable the recode loop for speeds > 0. Results pending. Change-Id: I2cd9a87c3fcbe39c05b954798d0671a4ca62c37f	2013-10-11 13:38:52 +01:00
Dmitry Kovalev	98400c1bc4	Removing vp9_tree_p typedef. It is used only two times and it is more clear to use real type instead of typedef. Change-Id: Idc25c16504c3da4d040e0cdb33a2987631bb6a5b	2013-10-10 17:16:20 -07:00
Dmitry Kovalev	2be3b84aed	Merge "Giving consistent names to IDCT 32x32 functions."	2013-10-10 15:31:25 -07:00
Dmitry Kovalev	3309b040c8	Merge "Consistent names for FDCT functions."	2013-10-10 15:29:29 -07:00
Adrian Grange	61c607fd79	Merge "Fix typo in comment message"	2013-10-10 14:05:51 -07:00
Yaowu Xu	e2d6e37a54	Merge "change to avoid out-of-range computation"	2013-10-10 13:38:16 -07:00
Jingning Han	09aca3089f	Merge "Re-design rate-distortion cost tracking buffers"	2013-10-10 12:57:31 -07:00
Guillaume Martres	b364176c08	Prevent accidental changes to the previous frame mode_infos This is needed to fix mbgraph but shouldn't affect anything else Change-Id: I2f515052f62e348cd3794b7ff0c139802225ea95	2013-10-10 12:18:12 -07:00
Jingning Han	f0772dc5b8	Fix typo in comment message Change-Id: Ifef756a3a91423bb9f5411f06fa092027be21ecf	2013-10-10 12:17:10 -07:00
Dmitry Kovalev	fc82dbb434	Consistent names for FDCT functions. Renames: fdct4_1d -> fdct4 fadst4_1d -> fadst4 fdct8_1d -> fdct8 fadst8_1d -> fadst8 fdct16_1d -> fdct16 fadst16_1d -> fadst16 "_1d" suffix is redundant, so removing it. The same will happen with idct in the next change sets. Change-Id: Ibf421cd2f569146c6079269df7a31819c098265e	2013-10-10 11:53:55 -07:00
Dmitry Kovalev	1e766b50e2	Giving consistent names to IDCT 32x32 functions. Renames: vp9_short_idct32x32_add -> vp9_idct32x32_1024_add vp9_short_idct32x32_1_add -> vp9_idct32x32_1_add vp9_idct_add_32x32 -> vp9_idct32x32_add Change-Id: Id85306f5814bac6c47463a6b5901a93082510666	2013-10-10 11:27:39 -07:00
Jingning Han	fc19243ced	Re-design rate-distortion cost tracking buffers This commit re-designs the per transformed block rate-distortion costs tracking buffers. It removes redundant buffer usage, makes the needed context memory allocation per VP9_COMP instance and reuses the same buffer sets inside the rate-distortion optimization search loop, thereby avoiding repeatedly requiring memory space. It reduces speed 0 runtime: bus at 2000 kbps from 166763ms to 158967ms, football at 600 kbps from 246614ms to 234257ms. Both about 5% speed-up. Local tests suggest about 2% to 5% speed-up for speed 1 and 2 settings. This does not change compression performance. Change-Id: I363514c5276b5cf9a38c7251088ffc6ab7f9a4c3	2013-10-10 11:03:44 -07:00
Yaowu Xu	b47cef056e	change to avoid out-of-range computation Change-Id: Id5e31833a0ef40de9f64c2f5674af7083233bf14	2013-10-10 11:01:50 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Dmitry Kovalev	419c3f6fba	Merge "Giving consistent names to IDCT 16x16 functions."	2013-10-10 10:43:14 -07:00
Deb Mukherjee	2b055dfe3f	Merge "Adjustment to mv cost parameters"	2013-10-10 09:08:58 -07:00
Jingning Han	be6ae20510	Merge "Fix intra dist model of skip_encode feature"	2013-10-10 09:00:20 -07:00
Jingning Han	4793324c16	Merge "Allow sub8x8 intra modes test for alt frame coding"	2013-10-10 09:00:08 -07:00
Paul Wilkins	c317fbd6cf	Merge "Disable MODE_TEST_HIT_STATS"	2013-10-10 05:52:06 -07:00
Deb Mukherjee	e4b0fce41c	Adjustment to mv cost parameters Increases these parameters. There is a small efficiency gain. Change-Id: Ie5f0ddb39c907d335e0dafa5eb112365a81f4542 derfraw300: +0.091% stdhdraw250: +0.238%	2013-10-09 23:14:25 -07:00
Jingning Han	80f215198f	Merge "Simplifying and inlining k_cvtlo_epi16 and k_cvthi_epi16"	2013-10-09 16:08:42 -07:00
Jingning Han	013db649fa	Fix intra dist model of skip_encode feature The intra mode distortion adjustment for skip_encode feature was broken in the refactoring cc91851. This commit fixes it and tunes the distortion models used therein. Change-Id: I0d676e82f8e855536a90cf9b3e3fdefafcd886c6	2013-10-09 16:05:50 -07:00
Deb Mukherjee	d6aae4d456	Merge "Clean-ups in rdopt.c"	2013-10-09 12:10:20 -07:00
Deb Mukherjee	eb8b1cd764	Clean-ups in rdopt.c Some minor cleanups in preparation for experimentation with some encode parameters and thresholds Change-Id: I449d66da97eae0a7acdf4aae374e2f9111342056	2013-10-09 11:32:03 -07:00
Jingning Han	03fe08ca30	Deprecate the use of PARTITION_INFO from encoder Use b_mode_info to store the inter prediction mode of sub8x8 block, in replacement of the use of partition_info. Remove redundant buffer update for partition_info. For bus_cif at 2000 kbps, this seem to make speed 0 about 1% faster. Change-Id: Id1b3be45e75a24fb4b42335ac480c23e440978f6	2013-10-09 09:23:52 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Dmitry Kovalev	8d3ef287a2	Merge "Removing redundant vp9_pt_energy_class declarations."	2013-10-08 10:54:48 -07:00
Jim Bankoski	ee6b7c1b6c	Merge "easy to fix cpplint issue in rdopt.c"	2013-10-08 10:28:30 -07:00
Yaowu Xu	e29137df05	Change to allow less rectangular partion check For CpuUsed 1 & 2, this commit allow to skip retangular partition check when NONE is better than SPLIT. It also changed to allow such logic on alt ref frame coding rather than use square partition all them. The change has gain compressio about .3% on yt and ythd for both 1&2, It helped .6% compression on cif and stdhd for both CpuUsed 1&2. Change-Id: I814b653baf89f59acd20e042629a12938a1bd4e5	2013-10-08 08:12:56 -07:00
Deb Mukherjee	9390862702	Merge "Rate control parameter adjustment"	2013-10-07 19:16:53 -07:00
Jim Bankoski	08feefbe7b	easy to fix cpplint issue in rdopt.c Change-Id: Id093816146de0d100f0c6ae2542aaa427dbab2d8	2013-10-07 17:03:29 -07:00
Jim Bankoski	9d4c6fab44	cpplint issue missed in first pass for vp9_bitstream.c Change-Id: Ia725748acbc2a3f825f0d208f26522a0412301fa	2013-10-07 15:54:20 -07:00
Jim Bankoski	9603989c72	Merge "cpplint vp9_variance_sse2.c"	2013-10-07 15:44:50 -07:00
Deb Mukherjee	f43c3199bd	Rate control parameter adjustment Adjusts the bits per mb parameter about 10% smaller. Results at speed 0: fullderfraw: +0.255% fullstdhdraw: +0.262% stdhdraw250: +0.291% Change-Id: I2b7317ac3f61737bc77eb5470aad870cade83fa5	2013-10-07 15:08:40 -07:00
Jim Bankoski	2b491c19b8	Merge "cpplint errors in vp9_onyx_if.h"	2013-10-07 14:47:21 -07:00
Dmitry Kovalev	b096c5a336	Giving consistent names to IDCT 16x16 functions. Renames: vp9_short_idct16x16_add -> vp9_idct16x16_256_add vp9_short_idct16x16_10_add -> vp9_idct16x16_10_add vp9_short_idct16x16_1_add -> vp9_idct16x16_1_add vp9_idct_add_16x16 -> vp9_idct16x16_add Change-Id: Ief8a3904de78deab0f4ede944c4d0339c228cfc3	2013-10-07 14:31:10 -07:00
Jingning Han	c8f481fa3d	Restore mode skip feature in sub8x8 rd loop This commit restores the mode skip feature in the sub8x8 rd loop. Change-Id: I5496ee32053f572b8961b549e9ecd4f1360824de	2013-10-07 14:20:34 -07:00
Dmitry Kovalev	2ae93a776b	Merge "Giving consistent names to IDCT 8x8 functions."	2013-10-07 14:19:50 -07:00
Dmitry Kovalev	23cc1cd8e6	Removing redundant vp9_pt_energy_class declarations. Declaring vp9_pt_energy_class in vp9_entropy.h instead of many external places. Change-Id: I66e8a3fc119a43f88d130d0dae4133c825a047a3	2013-10-07 14:11:01 -07:00
Jim Bankoski	7eb7dd2fed	cpplint errors in vp9_onyx_if.h Slightly bigger change -> broke up encode_frame_to_datarate, lots of line length fixes. Change-Id: I7c53325e954de130f3fe1a6656626efc6705be82	2013-10-07 13:57:20 -07:00
Dmitry Kovalev	272adbbec4	Using inter_mode_offset_function instead of duplicated code. Change-Id: I8de865cd1deca07b5c92c225782f0867367e9a11	2013-10-07 13:18:46 -07:00
Adrian Grange	18a2617126	Merge "cpplint issues resolved vp9_ratectrl.c"	2013-10-07 10:54:17 -07:00
Jim Bankoski	31b7a912d1	cpplint issues resolved vp9_ratectrl.c Change-Id: Iae7674b0c946a5ac01617840b3f62965c654d920	2013-10-07 09:21:29 -07:00
Jim Bankoski	92519a005a	Merge "cpplint problems resolved with vp9_firstpass.c"	2013-10-07 09:16:46 -07:00
Jim Bankoski	ccc5a483f4	Merge "cpplint issues resolved in vp9_mcomp.c"	2013-10-07 09:14:35 -07:00
Paul Wilkins	65f0cc7f4b	Disable MODE_TEST_HIT_STATS This flag is for stats generation and testing and should not be checked in as enabled by default. Change-Id: I4ea57dbcf49790f14777f598ddd3dc37dcc7a6bb	2013-10-07 02:54:19 -07:00
Dmitry Kovalev	c6ad70d5f1	Giving consistent names to IDCT 8x8 functions. Renames: vp9_short_idct8x8_add -> vp9_idct8x8_64_add vp9_short_idct8x8_1_add -> vp9_idct8x8_1_add vp9_short_idct8x8_10_add -> vp9_idct8x8_10_add vp9_idct_add_8x8 -> vp9_idct8x8_add Change-Id: Ifb8d3a45b4c0397aa805b30463f3d14581bf72c1	2013-10-06 00:24:09 -07:00
Dmitry Kovalev	9dba044be2	Merge "Giving consistent names to IDCT/IWHT functions."	2013-10-05 23:44:05 -07:00
Jim Bankoski	bf21ce63ee	encodemb cpplint issues revisited. Change-Id: Id5f25b74e2207bf44b6f6c8ffe548fa30fd78b4d	2013-10-05 17:24:51 -07:00
Jim Bankoski	30dee8adfc	cpplint problems resolved with vp9_firstpass.c Change-Id: Ic7b7014a0d857585bfd4baaea1d5c27ffe355642	2013-10-05 17:10:54 -07:00
Jim Bankoski	c9f3f9ed70	Merge "unused typedef in vp9_variance.h"	2013-10-05 16:49:13 -07:00
Jim Bankoski	7fd13472ae	Merge "cpplint issues with vp9_boolhuff.c resolved"	2013-10-05 16:48:28 -07:00
Jim Bankoski	f59cb3eacc	Merge "added nolint to function that doesn't seem easy to breakup"	2013-10-05 16:47:23 -07:00
Jim Bankoski	4410bbbf88	Merge "cpplint issues in vp9_lookahead.c"	2013-10-05 16:46:11 -07:00
Jim Bankoski	b79b7c354d	cpplint issues resolved in vp9_mcomp.c Change-Id: I2c2f83f4dfa2782fc6b0aa6db3ba2c4e6e423ffa	2013-10-05 16:44:40 -07:00
Jim Bankoski	6a7b1fb754	Merge changes Idbfabe42,I788f1a30 * changes: cpplint issues resolved in vp9_variance_mmx.c cpplint issues in vp9_ssim.c	2013-10-05 16:32:50 -07:00
Jim Bankoski	2dba2eb46a	Merge "cpplint issues in vp9_picklpf.c"	2013-10-05 16:32:00 -07:00
Jingning Han	0d0ed6a29b	Allow sub8x8 intra modes test for alt frame coding This commit allows sub8x8 intra modes test in the rate-distortion loop for hd sequences in speed 1 and 2. For sequence y90n of hd set at 8000 kbps, speed 2 runtime goes from 207s to 210s. For ped_1080p at 3000 kbps, speed 2 runtim goes from 336s to 337s. Both are running with 300 frames. This improves compression performance by 0.24% for stdhd and 0.32% for hd. Change-Id: I173ca38a6411565ae6cfadd184c42b2070c5de1f	2013-10-04 19:13:00 -07:00
Jim Bankoski	0500cf429f	cpplint issues with vp9_boolhuff.c resolved Change-Id: I6990c9ab838323d8770dd1f49a25bf3acc4c05c7	2013-10-04 17:20:58 -07:00
Jim Bankoski	a36045fb3b	Merge "cpplint issues with vp9_temporal_filter.c"	2013-10-04 17:17:02 -07:00
Jim Bankoski	cac3e1588e	cpplint issues in vp9_picklpf.c Change-Id: I62e631ca95fefbb1a993479a5e3926dc81359fe7	2013-10-04 17:08:41 -07:00
Jim Bankoski	eead4bb89e	Merge "lint issue in vp9_psnr.c"	2013-10-04 16:42:30 -07:00
Jim Bankoski	e2d73897d0	Merge "vp9_encodeframe.c cpplint issues resolved"	2013-10-04 16:42:06 -07:00
Jim Bankoski	6e161a26e3	Merge "cpp lint issues resolved in vp9_encodeintra.c"	2013-10-04 16:41:58 -07:00
Jim Bankoski	5f80d2ad33	Merge "cpplint vp9_dct.c issues resolved"	2013-10-04 16:41:46 -07:00
Jim Bankoski	38f6a3cdc7	Merge "cpplint issues vp9_tokenize.c resolved"	2013-10-04 16:41:23 -07:00
Jim Bankoski	d07545b7b8	cpplint issues with vp9_temporal_filter.c Change-Id: I695a990689c79d160227975116125b140875aed1	2013-10-04 15:49:30 -07:00
Yaowu Xu	d129eea9fa	Merge "Further clean up of speed 4"	2013-10-04 14:45:21 -07:00
Jim Bankoski	de5cb8b140	vp9_encodeframe.c cpplint issues resolved Change-Id: Id9d837e062d9c4a94def4b4ed1f49a67c75d3618	2013-10-04 14:37:31 -07:00
Jim Bankoski	02f28bac29	cpp lint issues resolved in vp9_encodeintra.c Change-Id: Ib6a8360d24f44eeaec12c5055568382a105dc235	2013-10-04 14:35:01 -07:00
Jim Bankoski	9c2b3744c9	cpplint issues in vp9_lookahead.c Change-Id: I2a98995f0df77d99dc47bda5e41886f014d8843f	2013-10-04 14:24:19 -07:00
Jim Bankoski	5b4f836148	cpplint issues resolved in vp9_variance_mmx.c Change-Id: Idbfabe427fbeab44210f13fec8b6f63f7a4eb0dd	2013-10-04 14:22:08 -07:00
Jim Bankoski	eb5b7ac27b	added nolint to function that doesn't seem easy to breakup Change-Id: I5489b116aea7c510ea5ebbed3c1445f321b05f3e	2013-10-04 14:17:47 -07:00
Dmitry Kovalev	3a0602578e	Giving consistent names to IDCT/IWHT functions. The idea is to have the following names for each transform size: vp9_idct4x4_add vp9_idct4x4_1_add vp9_idct4x4_10_add vp9_idct4x4_16_add vp9_idct8x8_add vp9_idct8x8_1_add vp9_idct8x8_10_add vp9_idct8x8_64_add etc for 16x16, 32x32 The actual list of renames in this patch: vp9_idct_add_lossless -> vp9_iwht4x4_add vp9_short_iwalsh4x4_add -> vp9_iwht4x4_16_add vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add vp9_idct_add -> vp9_idct4x4_add vp9_short_idct4x4_add -> vp9_idct4x4_16_add vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1	2013-10-04 14:17:06 -07:00
Jim Bankoski	25ecb1f0b3	cpplint vp9_variance_sse2.c Change-Id: Ifce8f5b57a1ea8952e8a67c5b92a127a061899fa	2013-10-04 14:15:06 -07:00
Jim Bankoski	f3e6a35cdb	cpplint issues in vp9_ssim.c Change-Id: I788f1a3004643347ca08d08fc3cb2bb8f0b134d9	2013-10-04 14:08:37 -07:00
Jim Bankoski	424c74e736	cpplint vp9_dct.c issues resolved Change-Id: Ia21653a447040f1b472d21ebd19103b0558c4b16	2013-10-04 13:47:59 -07:00
Jim Bankoski	c6960b6086	cpplint issues vp9_tokenize.c resolved Change-Id: Id4ec0084641d2ad4def95fb05239455fbc25f9b9	2013-10-04 13:42:58 -07:00
Jim Bankoski	660dcfe6a2	Merge "cpplint issues vp9_encodemv.c"	2013-10-04 12:55:46 -07:00
Jim Bankoski	19641c40f9	Merge "cpplint issues vp9_mbgraph"	2013-10-04 12:55:26 -07:00
Guillaume Martres	014a2c17df	Fix first pass for non-square blocks Change-Id: Ic049f0a6ce190f33859118e7b8cfcfe305979102	2013-10-04 12:04:15 -07:00
Dmitry Kovalev	042c475a8f	Merge "Moving all idct/iht functions in one place."	2013-10-04 12:01:42 -07:00
Jim Bankoski	d9215a6616	cpplint issues vp9_mbgraph Change-Id: Iedf9ac460edb31d7c072e2bebd26f2afe8e6089b	2013-10-04 11:22:22 -07:00
Jim Bankoski	19e227561a	cpplint issues vp9_encodemv.c Change-Id: Icda1d2d7cbfb176884fa6c7d9366a2d60e2994e9	2013-10-04 11:19:06 -07:00
Jim Bankoski	916f803175	lint issue in vp9_psnr.c Change-Id: Ifc7ffc02cfedb47230571298622602609a4e8a70	2013-10-04 11:01:49 -07:00
Jingning Han	1ab60f7bfb	Merge "Remove redundant second_ref_frame check in sub8x8"	2013-10-04 09:04:11 -07:00
Paul Wilkins	44e039b4f5	Further clean up of speed 4 Speed 4 still does not give a big gain over speed 3. This just cleans it up a little from the last patch and comments out features that do not seem to be giving much benefit. Change-Id: I5f366e6160e1dbe5dc45cf5eb90cc02712baa1b6	2013-10-04 16:57:24 +01:00
Paul Wilkins	8abd92f12f	Remove mode_skip_start and mask code for sub 8x8 This code serves no purpose in the re-factored sub 8x8 code. Change-Id: I5364986224d1a28b71bcb046ec8557a3d14aaa47	2013-10-04 14:26:17 +01:00
Paul Wilkins	de6ecc5ac3	Selective masking of split modes. Allow selective masking of individual split modes rather than just a single on / off flag. For speed 2 recovers the large speed loss seen for some derf clips in change Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 and a small quality gain. For speed 1 10 % speed increase observed locally on some derf clips for minimal quality change. Change-Id: If86191087b93cbc05351c26c60c7933e2149e485	2013-10-04 14:20:58 +01:00
Paul Wilkins	03dd2818e4	Missing threshold case for disable split. In relation to change: Refactor inter mode rate-distortion search Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 sf->thresh_mult_sub8x8[THR_INTRA] = INT_MAX missing; Change-Id: Ia86b68a5073368a3e2ca124a27b632243b525c8b	2013-10-04 11:54:24 +01:00
Dmitry Kovalev	d975804e9a	Merge "Replacing duplicated code with get_scan_and_band call."	2013-10-03 18:58:40 -07:00
Dmitry Kovalev	8b34437522	Replacing duplicated code with get_scan_and_band call. Change-Id: I2cc3684f416a63dc99b9303109f9850f34a470d5	2013-10-03 17:46:28 -07:00
Jingning Han	2952b7d1fb	Remove redundant second_ref_frame check in sub8x8 This commit removes the redundant second reference frame check in the rate-distortion optimization loop for sub8x8 blocks. Change-Id: I13a57a6f624c4a9bcef02ff2a867fa30d8b44a93	2013-10-03 14:02:12 -07:00
Jingning Han	b9daef91d8	Use vp9_zero in sub8x8 RD optimiazion loop Change-Id: Ic23a705e48cadaa7151f2bd8536d56636cb973e3	2013-10-03 12:34:25 -07:00
Jingning Han	4093192ec9	Change b_mode_info definition from union to struct This commit defines b_mode_info as a struct type. This will allow us to further remove the use of PARTITION_INFO in the encoding process. Change-Id: I975b0f7d557b5e0f66545a61b472def76b671cce	2013-10-03 12:34:11 -07:00
Jingning Han	793c2d8429	Remove unused variables in inter_mode rd loops Remove redundant variable definition/use in rate-distortion search loop for regular and sub8x8 blocks, respectively. Change-Id: Ic0eb3660bb6851ba2eb8d702ba9fd11595000d01	2013-10-03 12:34:11 -07:00
Jingning Han	a55625873f	Merge "Refactor inter mode rate-distortion search"	2013-10-03 12:19:53 -07:00
Jingning Han	11abab356e	Refactor inter mode rate-distortion search This commit separates the rate-distortion optimization loop of superblocks from that of sub8x8 blocks. This allows better design rate-distortion optimization search loop for each setting. It also removes the use of SPLITMV and I4X4_PRED therein. No performance change in speed 0 settings. For bus@CIF at 2000kbps, the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster). The overall compression performance on derf changed by -0.021%. Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the overall coding efficiency goes up by 1.629% for derf, 1.236% for yt. Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4	2013-10-03 11:36:49 -07:00
Dmitry Kovalev	9250d1529c	Using vp9_zero instead of vpx_memset. Change-Id: I9a0d0e9c3459954aa7b9c68f92cc5d56385ebd18	2013-10-03 10:59:36 -07:00
Paul Wilkins	b03d3da9c1	Merge "Speed setting review."	2013-10-03 09:49:00 -07:00
Paul Wilkins	fa71882e63	Merge "make use last partition consider motion"	2013-10-03 09:48:49 -07:00
Dmitry Kovalev	6cb6987d4d	Merge "BITSTREAM - RESTORING BILINEAR INTERPOLATION FILTER SUPPORT"	2013-10-03 09:34:26 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to insure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Jim Bankoski	f1d3e5e4d6	make use last partition consider motion This commit causes use last partition to consider whether a 64x64 has motion that might make a new partitioning worth while. Change-Id: I3a57bedef4f3cd961fadbfa96651c206fa36da4a	2013-10-03 10:22:39 +01:00
Paul Wilkins	ece99b3da0	Merge "Improved auto_partition_range."	2013-10-03 02:06:13 -07:00
Dmitry Kovalev	68a3e4a888	BITSTREAM - RESTORING BILINEAR INTERPOLATION FILTER SUPPORT Adding appropriate test vector vp90-2-06-bilinear.webm. Change-Id: Ia3bbf57318e0cc61a1b724fe751e3f9c7e11b337	2013-10-02 18:04:12 -07:00
A.Mahfoodh	5215b83aea	Simplifying and inlining k_cvtlo_epi16 and k_cvthi_epi16 Simplify the k_cvtlo_epi16 and k_cvthi_epi16 to only two instructions. Then inlined them. quoting from intel MMX_App_Compute_16bit_Vector.pdf‎ "The PMADDWD instruction multiplies four pairs of 16-bit numbers and produces partial sums of the results and can do so once per clock (with a three-clock latency)." so I am assuming that there will be three clock overhead after the last _mm_madd_pi16 command. Even with the overhead the number of clocks in general should be smaller. I am not sure though becasue I could not find information about number of clocks required for instructions in k_cvtlo_epi16 and k_cvthi_epi16. I will run a test and compare the execution time. Change-Id: Ieda4aa338f69ad3dd196ac6e7892da3cf1b47ea7	2013-10-02 20:02:03 -04:00
Dmitry Kovalev	a88a0e88a4	Merge "Moving get_token_alloc function from common to the encoder."	2013-10-02 16:26:00 -07:00
Jim Bankoski	f5bcc372c9	unused typedef in vp9_variance.h Change-Id: I15f79c9de34c723c1dd419b8da96c3ff948c5e03	2013-10-02 15:59:31 -07:00
Dmitry Kovalev	be7eec79be	Moving all idct/iht functions in one place. Moving functions from vp9_idct_blk to vp9_idct because these functions are used from both encoder and decoder. Removing duplicated code from vp9_encodemb.c and reusing existing functions. Change-Id: Ia0a6782f8c4c409efb891651b871dd4bf22d5fe8	2013-10-02 14:13:33 -07:00
Jingning Han	54bc73151b	Deprecate unused mode count variables Remove mode_check_freq and mode_test_hit_counts from VP9_COMP. Change-Id: Iabfd9f841444cd9bf19ac761a9795f140082ce0b	2013-10-02 11:07:14 -07:00
Jim Bankoski	825b7c301d	Merge "vp9_block.h cpplint issues resolved"	2013-10-01 16:14:58 -07:00
Jim Bankoski	691177842c	Merge "cpplint issue in vp9_rdopt.h"	2013-10-01 15:45:35 -07:00
Jim Bankoski	5491a1f33e	vp9_block.h cpplint issues resolved Change-Id: Icc6a76a5be77f3e19918155bab3998e0aa32ccf5	2013-10-01 15:17:39 -07:00
Jim Bankoski	c4627a9ff1	cpplint issues in vp9_onyx_int.h Change-Id: I6c4058aebe834e1a12b7a3fb10484b9ebe60b349	2013-10-01 15:14:39 -07:00
Jim Bankoski	b6e2f9b752	cpplint issue in vp9_rdopt.h Change-Id: I84209d382ca5dfc537ee533cd792d8caa0e25cee	2013-10-01 15:09:32 -07:00
Dmitry Kovalev	0a5e9ee054	Moving get_token_alloc function from common to the encoder. Also renaming mb_row -> mi_row, mb_col -> mi_col arguments and calculate mb_rows/mb_cols values from mi_rows/mi_cols. Change-Id: I6919a279f560648e23bc9a12f507d17c21ffd5d7	2013-10-01 11:54:10 -07:00
Jingning Han	195061feda	Fix rectangular partition check in speed 1 Make encoder skip rectangular partition check in speed 1 and above, when early termination was triggered in partition split. Thanks Guillaume (gmartres@) for catching this issue. This change makes bus_cif at 2000kbps speed 1 runtime goes down from 25612ms to 23438ms (about 9% speed-up), at the expense of -0.235% performance down. Change-Id: I98613fad081a261d30d5fa206f934ca70601c180	2013-09-30 12:14:36 -07:00
Paul Wilkins	d12a502ef9	Merge "Alter Speed 3."	2013-09-30 09:12:28 -07:00
Deb Mukherjee	fad3d07df3	Merge "Some minor changes/cleanups in rate control"	2013-09-30 06:50:56 -07:00
Paul Wilkins	65b93c7e52	Improved auto_partition_range. The code now takes into account temporal and spatial information to determine the partition size range, but the frequency counts have been removed. The net effect is similar in quality but about 10% faster. Change-Id: I39a513fb79cec9177b73b2a7218f0da70963ae95	2013-09-30 11:32:57 +01:00
Paul Wilkins	a76caa7ff4	Alter Speed 3. This patch deletes the variance based speed three partitioning. Speed 3 now uses the same partitioning method as speed 2 but with some stricter conditions. The speed and quality are now somewhere between speeds 2 and 4 whereas before it was worse in both than speed 4. Change-Id: Ia142e7007299d79db3ceee6ca8670540db6f7a41	2013-09-30 11:26:46 +01:00
Dmitry Kovalev	b927620231	Merge "Using is_inter_block and has_second_ref functions."	2013-09-29 12:14:41 -07:00
Dmitry Kovalev	29815ca729	Merge "Moving from int_mv* to MV* (3)."	2013-09-29 12:13:16 -07:00
Dmitry Kovalev	4ab01fb5f7	Merge "Reusing FRAME_CONTEXT struct to simplify the code."	2013-09-29 12:02:26 -07:00
Dmitry Kovalev	b3d3578ee4	Merge "Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add."	2013-09-29 12:01:50 -07:00
Dmitry Kovalev	7343681675	Merge "Removing vp9_get_coef_neighbors_handle function."	2013-09-29 12:01:36 -07:00
Dmitry Kovalev	efbacc9f89	Merge "Removing vp9_subpelvar.h from common."	2013-09-29 12:00:46 -07:00
Dmitry Kovalev	bd9c057433	Reusing FRAME_CONTEXT struct to simplify the code. Change-Id: Ia455c1900d84a3221e3681e31e15ca86bd03f89d	2013-09-27 16:41:20 -07:00
Guillaume Martres	ceaa3c37a9	Merge "Simplify RDMULT and RDDIV derivation"	2013-09-27 16:32:54 -07:00
Dmitry Kovalev	3fab2125ff	Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add. Making name consistent with vp9_short_idct8x8 and vp9_short_idct8x8_1. Change-Id: I99e0be040ec893f9571dcf090e18f98dc58339f5	2013-09-27 15:26:27 -07:00
Dmitry Kovalev	209c6cbf8f	Removing vp9_get_coef_neighbors_handle function. Change-Id: I6be72c8b048d1ccc7ef43764cf84c32360098970	2013-09-27 14:11:13 -07:00
Deb Mukherjee	80d582239e	Some minor changes/cleanups in rate control Some small changes to the quantizer mapping functions. Also includes some cleanups. Change-Id: I9dea29b24015f6e6697012a0e4d8983049d8e5c7 Results: derfraw300: +0.106% stdhdraw250: +0.139%	2013-09-27 13:57:42 -07:00
Dmitry Kovalev	15a36a0a0d	Renaming vp9_short_idct10_16x16 to vp9_short_idct16x16_10. Making function name consistent with vp9_short_idct16x16 and vp9_short_idct16x16_1. Change-Id: I70e54be9e6b9a1dddab0de470686591e96d05517	2013-09-26 14:01:25 -07:00
Guillaume Martres	2b426969c3	Simplify RDMULT and RDDIV derivation Don't divide RDMULT and RDDIV by 100 when RDMULT > 1000. This was probably done to avoid overflow when the rd cost was stored in a 32 bits integer but this is not the case anymore. This change will make it easier to support multiple quantizers per frame. derf compression gain at speed 0: 0.037% Change-Id: Ibeeb9b7cfa1a132a7af41bc90fc07a3bba0857f6	2013-09-26 13:55:16 -07:00
Dmitry Kovalev	eda4e24c0d	Using is_inter_block and has_second_ref functions. Change-Id: I60dee58a4fd24d3c4f3c101a49d30e217309f43a	2013-09-25 19:03:04 -07:00
Guillaume Martres	7755b9dada	Merge "Correctly set the segment_id prediction flag and context"	2013-09-25 18:04:21 -07:00
Yaowu Xu	0c02bfcc2a	Merge "Limit mv search range for first pass and mbgraph"	2013-09-25 17:21:13 -07:00
Dmitry Kovalev	8266da1cd1	Moving from int_mv* to MV* (3). Change-Id: I9795d0937bc07793c13d067281995e0750f694d9	2013-09-25 16:44:19 -07:00
Dmitry Kovalev	f9e2140cab	Merge "Moving from int_mv* to MV* (2)."	2013-09-25 16:12:13 -07:00
Dmitry Kovalev	64eff7f360	Removing vp9_subpelvar.h from common. Moving all code from that file to vp9_variace_c.c in the encoder. Change-Id: Ic803d5b4c78d5191e4d25541b3df97337878fc3e	2013-09-25 16:10:43 -07:00
Dmitry Kovalev	2b5670238b	Merge "Replacing txfm with tx."	2013-09-25 15:57:56 -07:00
Dmitry Kovalev	87a214c277	Merge "Adding vp9_get_entropy_contexts function."	2013-09-25 15:43:55 -07:00
Dmitry Kovalev	9cd14ea6ed	Merge "Removing redundant 'extern' keyword."	2013-09-25 15:42:48 -07:00
Dmitry Kovalev	d445945a84	Adding vp9_get_entropy_contexts function. Change-Id: Ife0dd29fb4ad65c7e12ac5f1db8cea4ed81de488	2013-09-24 17:26:05 -07:00
Dmitry Kovalev	d0365c4a2c	Replacing txfm with tx. Renaming txfm_stepdown_count to tx_stepdown_count and max_txfm_size to max_tx_size. Change-Id: Ifc173e22c78240e561a57c4c741b64b1b8fc6fef	2013-09-24 17:24:35 -07:00
Dmitry Kovalev	450cbfe53a	Cleaning up vp9_update_nmv_count function. Using best_mv[2] array instead of two separate variables. Change-Id: Iefa0a41f5c42c42f2c66cef26750da68405f0f25	2013-09-24 15:55:49 -07:00
Dmitry Kovalev	12d57a9409	Removing redundant 'extern' keyword. Change-Id: Ie51306689c0dc527a8aa12d3984389dd8f360dea	2013-09-24 15:13:09 -07:00
Guillaume Martres	57272e41dd	Correctly set the segment_id prediction flag and context This fix a bug introduced by `ac6093d179` Change-Id: I0700a4daf7a6a2471074f81a4596352287fb2ac9	2013-09-24 14:18:27 -07:00
Yaowu Xu	35c5d79e6b	Limit mv search range for first pass and mbgraph Both first pass and mbgraph search use block size 16x16 for motion estimation. This commit put a limit of motion vector range. The effective range allows the entire 16x16 with required subpel interpolation input to be completely outside image border, but not any further away from image border. Change-Id: Id70a5ed08be49e70959f064859d72adc7d775d08	2013-09-24 13:47:29 -07:00
Dmitry Kovalev	b87696ac37	Moving from int_mv* to MV* (2). Updating fractional_mv_step_fp and fractional_mv_step_comp_fp function types. Change-Id: I601c4378bc39ac3ffd4e295d9cbd8e1f74829d46	2013-09-24 12:48:12 -07:00
Dmitry Kovalev	30888742f4	Merge "Moving from int_mv to MV."	2013-09-24 12:25:56 -07:00
Yaowu Xu	71cfaaa689	Merge "Replace memcpy with vpx_memcpy"	2013-09-24 11:35:03 -07:00
Yaowu Xu	9be0bb19df	Replace memcpy with vpx_memcpy Also removed obselete comment Change-Id: Iae1664777d76383639c637ee786e0d50fc45819a	2013-09-24 10:56:06 -07:00
Yaowu Xu	6037f17942	Rename defined constants The change is to better reflect the nature of the constants. Change-Id: Icabac6e9bceefbdb3f03f8218f88ef75943c30fb	2013-09-24 10:53:01 -07:00
Yaowu Xu	ff1ae7f713	Prevent using uninitialized value in RD decision INT64_MAX may be assigned as RDCOST when RDCSOST computation is skipped for speed, this commit to prevent INT64_MAX from being used as real RDCOST in transform size decision. Change-Id: I89a945134191bbdea1f1431ade70424ac079eaac	2013-09-24 10:53:01 -07:00
Yaowu Xu	fe533c9741	Merge "Change to prevent invalid memory access"	2013-09-24 10:37:17 -07:00
Dmitry Kovalev	f24b9b4f87	Merge "Adding best_mv[2] array instead of two variables."	2013-09-24 10:17:53 -07:00
Deb Mukherjee	f1a627e8a2	Merge "Small tweak in the constant quality parameter"	2013-09-24 09:51:08 -07:00
Jingning Han	9bcd750565	Merge "Enable per transformed block zero coeffs forcing"	2013-09-24 09:18:17 -07:00
Jingning Han	24ad692572	Merge "Calculate rd cost per transformed block"	2013-09-24 09:18:03 -07:00
Deb Mukherjee	b7a93578e5	Small tweak in the constant quality parameter Improves results a little. Change-Id: I7bcac02dbb65b43a993445cf557c520197114e5c	2013-09-24 09:09:35 -07:00
Yunqing Wang	bacb5925ff	Merge "Number of instructions in fdct4_1d_sse2 reduced by two."	2013-09-24 08:40:56 -07:00
Yaowu Xu	92a29c157f	Change to prevent invalid memory access After change of MI context storage , mi_8x8[] pointer may be null for a block outside of image border. The commit changes to access the data only after validation of mi_row and mi_col. Change-Id: I039c4eb486a228ea9d8e5f35ab9ae6717d718bf3	2013-09-24 08:36:59 -07:00
A.Mahfoodh	13c7715a75	Number of instructions in fdct4_1d_sse2 reduced by two. Mathematically the results are the same. Change-Id: I1c5126cd3ca64e8515ca6331e0989c6f7dd651a0	2013-09-23 17:23:27 -07:00
Yaowu Xu	838eae3961	Correct 3 step search site initialziation `39c7b01d` accidently reverted the row/col initialization, which broke mv clamps, which is dependent on the sites for valid motion vector range. This commit fixed the issue. Change-Id: Ibcce0226e0360b1ef483fe760b2e33f1af4bf494	2013-09-23 16:11:49 -07:00
Jingning Han	a517343ca3	Enable per transformed block zero coeffs forcing This commit enables forcing all coefficients zero per transformed block, when its rate-distortion cost is lower than regular coeff quantization. The overall performance improvement (including its parent patch on calculating rd cost per transformed block) at speed 1: derf: 0.298% yt: 0.452% hd: 0.741% stdhd: 0.006% Change-Id: I66005fe0fd7af192c3eba32e02fd6d77952accb5	2013-09-23 10:39:35 -07:00
Jingning Han	54c87058bf	Merge "Remove redundant mv_pred use for sub8x8 blocks"	2013-09-23 08:47:21 -07:00
Deb Mukherjee	d11221f433	Improves constant qual, constrained qual turned on Adds modeled functions to decide the qp for altref frames in constant q mode similar to other functions in use in bitrate mode. Also turns on the constrained quality mode (end-usage=2) option which was turned off before. Basic testing shows the mode works in principle, to cap bitrate to the target-bitrate specified, while allowing lower bitrate depending on the cq-level specified. The mode will need to be improved over time. Results for constant quality vs bitrate control mode: derfraw300/fullderfraw: +3.0% at constant quality over bitrate control. fullstdhdraw: +4.341% stdhdraw250: +5.361% Change-Id: If5027c9ec66c8e88d33e47062c6cb84a07b1cda9	2013-09-22 23:04:50 -07:00
Jingning Han	78fbb10642	Calculate rd cost per transformed block This commit makes the rate-distortion optimization loop evaluate the rd costs of regular quantization and all zero coeffs, per transformed block. It improves speed 1 compression performance: derf: 0.245% yt: 0.515% For a large partition that consists multiple transformed blocks, this allows more flexibility to selectively force a portion of them coded as all zero coeffs, as well be continued in the next patches. Change-Id: I211518be4179747b57375696f017d1160cc91851	2013-09-20 12:40:17 -07:00
Dmitry Kovalev	bb5e2bf86a	Adding best_mv[2] array instead of two variables. Change-Id: I584fe50f73879f6a72fada45714ef80893b6d549	2013-09-20 17:08:53 +04:00
Dmitry Kovalev	e51e7a0e8d	Moving from int_mv to MV. Converting vp9_mv_bit_cost, mv_err_cost, and mvsad_err_cost functions for now. Change-Id: I60e3cc20daef773c2adf9a18e30bc85b1c2eb211	2013-09-20 13:52:43 +04:00
Dmitry Kovalev	39c7b01d3c	Cleanup in vp9_init3smotion_compensation. Change-Id: Ie47f53e76bc9530475c8c6d24e9b7a5a0189de56	2013-09-20 12:54:14 +04:00
Dmitry Kovalev	24df77e951	Merge "Adding get_scan_and_band function."	2013-09-20 00:15:06 -07:00
Jingning Han	44b708b4c4	Remove redundant mv_pred use for sub8x8 blocks The sub8x8 blocks has its own motion vector reference scheme. The mv_pred is only used blocks of sizes 8x8 and above, to find the starting point for motion search. This change does not change any coding behavior. It makes the encoding process slightly faster. (0.5% speed-up for local test on speed 1.) Change-Id: I746ee6ef0eac19aa3621be014afa12be8d82cbb9	2013-09-19 10:32:44 -07:00
Yaowu Xu	79af591368	change to avoid invalid memory read. The fake token EOSB may cause invaild memory read in pack token, this commit reworked the loop to avoid such invalid read. Change-Id: I37fdfce869b44a7f90003f82a02f84c45472a457	2013-09-19 08:22:10 -07:00
Yaowu Xu	014acfa2af	fix integer overflow errors Change-Id: I76f440a917832c02d7a727697b225bac66b99f56	2013-09-19 08:14:26 -07:00
Dmitry Kovalev	a23c2a9e7b	Adding get_scan_and_band function. Extracting get_scan_and_band function from get_entropy_context to remove duplicated code. Change-Id: I5da1f5a60263017e887da68bc834317b5f084cb2	2013-09-19 16:53:48 +04:00
Dmitry Kovalev	1600707d35	Merge "Removing redundant code from vp9_mcomp.c."	2013-09-19 00:30:18 -07:00
Dmitry Kovalev	cda802ac86	Merge "Removing redundant coef calculation + cleanup."	2013-09-19 00:28:31 -07:00
Dmitry Kovalev	98cf0145b1	Removing redundant coef calculation + cleanup. Adding temp variable for &x->plane[0], inlining src_diff values. Change-Id: I24c08a5425a6da6fd66f5b0278f2fce74f9989b2	2013-09-18 16:20:10 +04:00
Dmitry Kovalev	72fd127f8c	Removing redundant code from vp9_mcomp.c. Replacing ((1 << MV_MAX_BITS) - 1) with MV_MAX, adding const qualifiers, reusing computed values. Change-Id: I7b46d47f6c644b079d9c3478116a9de465a9baec	2013-09-18 13:11:38 +04:00
Dmitry Kovalev	245ca04bab	Fixing typo in the encoder. Change-Id: I168efdc366eecf638694f357ccad2f4eba7e2fdb	2013-09-18 12:02:22 +04:00
Yaowu Xu	85fd8bdb01	Merge "Silence a bunch of MSVC warnings"	2013-09-17 17:10:58 -07:00
Jingning Han	c437bbcde0	Clean up second ref check in sub8x8 rd loop This commit cleans up the second reference check in the rate-distortion optimization loop of sub8x8 blocks. Change-Id: Ife68feaa4cddbfad2878c9b44d3012788d634f97	2013-09-17 15:59:49 -07:00
Yaowu Xu	a783da80e7	Silence a bunch of MSVC warnings Change-Id: I16633269582a640809dca27572bbe99efa6369fc	2013-09-17 12:08:51 -07:00
Paul Wilkins	84758960db	Merge "Minor clean up."	2013-09-17 03:39:24 -07:00
Paul Wilkins	90a52694f3	Merge "Adjustment to mode_skip_start."	2013-09-17 03:39:15 -07:00
Yaowu Xu	eeae6f946d	fix a problem where an invalid mv used in search The commit added reset of pred_mv at the beginning of each SB64x64 partition mv search, also limited the usage of pred_mv only when search on the largest partition is already done. This is to fix a crash at speed 1/2 encoder where an invalid mv is used in mv search. Change-Id: I39010177da76d054e3c90b7899a44feb2e3a5b1b	2013-09-16 12:49:27 -07:00
Paul Wilkins	cb50dc7f33	Minor clean up. Removed some unused code and minor cleanup / reordering. Change-Id: I4083ae56aeb8edfe9b85aa2f42a16aa28d19da94	2013-09-16 13:45:20 +01:00
Paul Wilkins	3b01778450	Adjustment to mode_skip_start. Corrected values relating to modified mode order. Change-Id: I24fccba3af4bc16721d5e7e51888a66305bfa7fe	2013-09-16 13:44:48 +01:00
Jingning Han	e8a967d960	Merge "Adaptive motion search control"	2013-09-13 14:43:23 -07:00
Jingning Han	c4826c5941	Adaptive motion search control This commit enables adaptive constraint on motion search range for smaller partitions, given the motion vectors of collocated larger partition as a candidate initial search point. It makes speed 0 runtime of bus at CIF and 2000 kbps goes from 167s down to 162s (3% speed-up), at 0.01dB performance gains. In the settings of speed 1, this makes the runtime goes from 33687 ms to 32142 ms (4.5% speed-up), at 0.03dB performance gains. Compression performance wise, it gains at speed 1: derf 0.118% yt 0.237% hd 0.203% stdhd 0.438% Change-Id: Ic8b34c67810d9504a9579bef2825d3fa54b69454	2013-09-13 13:58:10 -07:00
Deb Mukherjee	0c3038234d	Merge "Clean up of the search best filter speed feature"	2013-09-13 11:03:59 -07:00
Paul Wilkins	5d8642354e	Merge "Fix VP9_mode_order[]"	2013-09-13 09:19:31 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Paul Wilkins	1407cf8588	Fix VP9_mode_order[] Mis-merge of the following change managed to break mode order and delete two mode options (new alt ref and near alt ref) It also created a situation where we could test two undefined modes off the end of the VP9_mode_order[] data structure. "clang warnings : remove split and i4x4_pred fake modes" "Change Id: I8ef3c*" Initial testing on Akiyo at speed 2. 101.35 44.567 44.447 improves to 96.82 44.915 44.815 Approx 0.3-0.4db gain and 2.5% size reduction Change-Id: Icff813e7c0778d140ad4f0eea18cf1ed203c4e34	2013-09-13 13:33:26 +01:00
Jim Bankoski	324ebb704a	Merge "fix clang warning in rdopt"	2013-09-12 16:39:05 -07:00
Jim Bankoski	9ee9918dad	fix clang warning in rdopt either missed this or it crept back in Change-Id: I6cc1519d09e558be7250254c25bde2ae720555ea	2013-09-12 06:39:42 -07:00
Jim Bankoski	cddde51ec5	Merge "clang warnings : remove split and i4x4_pred fake modes"	2013-09-12 06:20:45 -07:00
Paul Wilkins	66755abff4	Merge "Changes in speed 2 settings"	2013-09-12 02:22:45 -07:00
Jim Bankoski	7fb42d909e	clang warnings : remove split and i4x4_pred fake modes Change-Id: I8ef3c7c0f08f0f1f4ccb8ea4deca4cd8143526ee	2013-09-11 16:34:55 -07:00
Deb Mukherjee	b964646756	Clean up of the search best filter speed feature Removes this speed feature since it is very slow and unlikely to be used in practice. This cleanup removes a bunch of unnecessary complications in the outer encode loop. Change-Id: I3c66ef1ca924fbfad7dadff297c9e7f652d308a1	2013-09-11 15:16:36 -07:00
Jim Bankoski	d09abfa9f7	Merge "resolve clang issue : implicit convert tx_mode -> tx_size"	2013-09-11 13:40:11 -07:00
Deb Mukherjee	69fe840ec4	Changes in speed 2 settings Propose some changes to the speed 2 settings to improve quality. In particular, turns off the adjust_thresholds_by_speed feature which improves results by 6%. Also removes the code for adjust_thresholds_by_speed since it conflicts with the adaptive rd thresh feature. Overall, with this change speed 2 is -15.2% from speed 0 settings, on derf, which is significantly better than -21.6% down before. Change-Id: I6e90a563470979eb0c258ec32d6183ed7ce9a505	2013-09-11 10:54:07 -07:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), eliminating unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Jingning Han	65fe7d7605	Merge "Remove redundant condition check in 32x32 quant"	2013-09-10 16:39:18 -07:00
Jingning Han	cb24406da5	Merge "Remove the use of uninitialized_safe in encode_sb_"	2013-09-10 12:05:22 -07:00
Jingning Han	5d93feb6ad	Remove redundant condition check in 32x32 quant The c code implementation of 32x32 quantization does the zbin check of all coefficients prior to the quant/dequant loop, hence removing the redundant zbin check inside the loop. This only affects the c code version. SSSE3 version does not separate the zbin check out. Change-Id: Ic197a7d61d0b25fcac3cc092987651378cb56e4e	2013-09-10 12:04:33 -07:00
Deb Mukherjee	3d22d3ae0c	Merge "Small tweaks on the constant quality mode"	2013-09-10 11:16:47 -07:00
Deb Mukherjee	09830aa0ea	Small tweaks on the constant quality mode Improves results a little. derf is now +1.078% over bitrate control. Change-Id: I4812136f3e67be21d14ec089419976a32a841785	2013-09-10 10:16:19 -07:00
Yunqing Wang	0607abc3dd	Stop partition checking when distortion is small If the current obtained distortion is very small, which happens for static image case, we pick the current partition type without further split checking. This won't affect regular videos. For static videos, we got 10%~12% encoding speed gain. PSNR was better for some clips, and worse for others. Overall it was even. Change-Id: If787a57bedf46fc595ca4f5ded2b0c0a69e9fdef	2013-09-10 10:13:24 -07:00
Yunqing Wang	939791a129	Modify encode breakout for static frames Thank Paul for the suggestions. While turning on static-thresh for static-image videos, a big jump on bitrate was seen. In this patch, we detected static frames in the video using first-pass stats. For different cases, disable encode breakout or reduce encode breakout threshold to limit the skipping. More modification need be done to break incorrect partition picking pattern for static frames while skipping happens. Change-Id: Ia25f47041af0f04e229c70a0185e12b0ffa6047f	2013-09-10 09:06:03 -07:00
Paul Wilkins	4f660cc018	Modified mode skip functionality. A previous speed feature skipped modes not used in earlier partitions but this not longer worked as intended following changes to the partition coding order and in conjunction with some other speed features (Especially speed 2 and above). This modified mode skip feature sets a mask after the first X modes have been tested in each partition depending on the reference frame of the current best case. This patch also makes some changes to the order modes are tested to fit better with this skip functionality. Initial testing suggests speed and rd hit count improvements of up to 20% at speed 1. Quality results. (derf -1.9%, std hd +0.23%). Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1	2013-09-10 13:30:10 +01:00
Paul Wilkins	901c495482	Added extra check to rd_auto_partition_range() Added check that the returned max and minimum are valid in bottom and right border cases. Change-Id: I2d6cdc9b5f04c7d0ff512ddcf3228331e028bf9b	2013-09-10 13:29:23 +01:00
Ivan Maltz	20abe595ec	Merge "API extensions and sample app for spacial scalable encoder"	2013-09-09 16:57:01 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spacial scalable encoder Sample app: vp9_spatial_scalable_encoder vpx_codec_control extensions: VP9E_SET_SVC VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER VP9E_SET_MIN_Q, VP9E_SET_MAX_Q expanded buffer size for vp9_convolve modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS) Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
Jingning Han	18c780a0ff	Remove the use of uninitialized_safe in encode_sb_ Initialize the probability model context with default value in encode_sb. Change-Id: Id826114024dfc21c7ef41aea9f4a0316d4a5cb95	2013-09-09 15:41:16 -07:00
James Zern	c1913c9cf4	Merge "Revert "New mode_info_context storage""	2013-09-09 14:38:01 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Paul Wilkins	740acd6891	Merge "Enable kf restrictions at speed 4"	2013-09-09 05:39:13 -07:00
Jim Bankoski	9faa7e8186	resolve clang issue : implicit convert tx_mode -> tx_size Change-Id: Ifc9da470358f58e800e3d0d70a565b61e5f7834a	2013-09-08 07:17:12 -07:00
Jim Bankoski	e378566060	Merge "New mode_info_context storage"	2013-09-08 07:16:25 -07:00
Jingning Han	09bc942b47	Fix overflow issue in 16x16 quantization SSSE3 The 16x16 transform unit test suggested that the peak coefficient value can reach 32639. This could cause potential overflow issue in the SSSE3 implmentation of 16x16 block quantization. This commit fixes this issue by replacing addition with saturated addition. Change-Id: I6d5bb7c5faad4a927be53292324bd2728690717e	2013-09-06 21:06:10 -07:00
Paul Wilkins	f15cdc7451	Enable kf restrictions at speed 4 Change-Id: I453409d3be3f5fe118b15affde45cb52184aef20	2013-09-06 11:16:04 -07:00
Deb Mukherjee	e378a89bd6	Support a constant quality mode in VP9 Adds a new end-usage option for constant quality encoding in vpx. This first version implemented for VP9, encodes all regular inter frames using the quality specified in the --cq-level= option, while encoding all key frames and golden/altref frames at a quality better than that. The current performance on derfraw300 is +0.910% up from bitrate control, but achieved without multiple recode loops per frame. The decision for qp for each altref/golden/key frame will be improved in subsequent patches based on better use of stats from the first pass. Further, the qp for regular inter frames may also be varied around the provided cq-level. Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3	2013-09-06 10:30:53 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of a pointer to a MODE_INFO struct and a "in the image" flag. The MODE_INFO structs are now stored as a stream, eliminating unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Jingning Han	1c263d6918	Merge "Use saturated addition in SSSE3 of 32x32 quant"	2013-09-05 14:09:40 -07:00
Jingning Han	458c2833c0	Use saturated addition in SSSE3 of 32x32 quant The 32x32 forward transform can potentially reach peak coefficient value close to 32700, while the rounding factor can go upto 610. This could cause overflow issue in the SSSE3 implementation of 32x32 quantization process. This commit resolves this issue by replacing the addition operations with saturated addition operations in 32x32 block quantization. Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70	2013-09-05 12:49:12 -07:00
Jim Bankoski	9fc3d32a50	Merge "faster accounting of inc_mv"	2013-09-05 12:38:56 -07:00
Paul Wilkins	e5deed06c0	Merge "Attempt to fix speed 4"	2013-09-04 17:19:22 -07:00
Jim Bankoski	bb2313db28	Merge "make vp9 postproc a config option"	2013-09-04 10:35:26 -07:00
Yunqing Wang	9fd2767200	Merge "Use correct bit cost while static-thresh is on"	2013-09-04 10:26:37 -07:00
Jim Bankoski	79401542f7	make vp9 postproc a config option Vp9 postproc is disabled for now as its not been shown to help and may be merged with vp8. Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057	2013-09-04 10:02:08 -07:00
Jim Bankoski	532179e845	faster accounting of inc_mv Moves counting of mv branches to where we have a new mv, instead of after the whole frame is summed. Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd	2013-09-04 09:47:57 -07:00

... 3 4 5 6 7 ...

2054 Commits