generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	190c2b4591	Using stride (# of elements) instead of pitch (bytes) in fdct4x4. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: I0ba3c52513a5fdd194f1e7e2901092671398985b	2013-10-21 15:27:35 -07:00
Dmitry Kovalev	f6d870f7ae	Merge "Inlining set_partition_seg_context function."	2013-10-21 14:43:37 -07:00
Ivan Maltz	663916cea7	SVC improvements These changes were originally made in the Stratacaster team-review repository commit e114bffcd82ad74c3696ec58e13c0ac895d6c82d Author: Charles 'Buck' Krasic <ckrasic@google.com> Date: Mon Oct 14 16:52:13 2013 -0700 Make dummy frame handling a bit more explicit, fixing bug with single layer encodes. Squashed commit of the following: commit 1ebbfd976c0fadb02bf1ea562a2d0e3f0206daad Merge: ac468dd 54e88b7 Author: Ivan Maltz <ivanmaltz@google.com> Date: Fri Oct 11 17:29:58 2013 -0700 Move SVC code from vp9_spatial_scalable_encoder to libvpx module accessible from ffmpeg commit 54e88b78b160becc9569fc3c6cb6b0a8c95dc357 Author: Ivan Maltz <ivanmaltz@google.com> Date: Tue Oct 8 09:08:40 2013 -0700 common svc encoding code for sample app and ffmpeg added svc_encodeframe.c, svc_context.h, svc_test.cc vp9_spatial_scalable_encoder uses vpx_svc_encode commit 5616ec8e2e3d3e8d277333d8a9242f6c70151162 Merge: 4528014 e29137d Author: Ivan Maltz <ivanmaltz@google.com> Date: Tue Oct 8 08:47:58 2013 -0700 Merge branch 'master' into stratacaster commit 45280148450b1f3d61e390df8aadedf85cd5bce1 Merge: bb2b675 1ab60f7 Author: Sujeevan Rajayogam <sujee@google.com> Date: Fri Oct 4 10:22:31 2013 -0700 Merge branch 'master' into stratacaster commit bb2b675e595dc9bfc8551e963edf56800c3aea61 Author: Sujeevan Rajayogam <sujee@google.com> Date: Wed Oct 2 12:37:26 2013 -0700 Track individual frame sizes and psnrs instead of averages. commit c6d303b714795c81e7ceb4173967115c9f8ff5b7 Merge: fa87df9 3583087 Author: Sujeevan Rajayogam <sujee@google.com> Date: Fri Sep 27 10:05:35 2013 -0700 Merge branch 'master' into stratacaster commit fa87df94fba923d9f7aeb8ae20c6e15f777e00b5 Merge: bf22d71 3c465af Author: Sujeevan Rajayogam <sujee@google.com> Date: Thu Sep 26 16:10:31 2013 -0700 Merge branch 'master' into stratacaster commit bf22d7144895a82e0c348ac177c8a261b9e2b88e Author: Sujeevan Rajayogam <sujee@google.com> Date: Thu Sep 26 11:10:34 2013 -0700 Parameterized quantizer, 16th scalefactors, more logging, enabled single layer encodes to generate baseline. commit ceffd7e6025b765f9886b5ea0f324248aa37e327 Author: Sujeevan Rajayogam <sujee@google.com> Date: Thu Sep 19 10:04:49 2013 -0700 - Include new mode for 3 layer I frame with 5 total layers. - Refactor svc api. Change-Id: Ie4d775e21e006fa597d884c59488dc999478e9b5	2013-10-21 14:34:37 -07:00
Dmitry Kovalev	1e05c9a7e6	Merge "Cleanup: using cm variable instead of cpi->common."	2013-10-21 14:30:01 -07:00
Dmitry Kovalev	a0be71c703	Inlining set_partition_seg_context function. We used set_partition_seg_context() only before calls to: 1. update_partition_context() 2. partition_plane_context() Moving these functions from vp9_blockd.h to vp9_onyxc_int.h and inlining set_partition_seg_context into them. After that it is not necessary to have {above, left}_seg_context fields in MACROBLOCKD structure, so removing them also. Change-Id: I4723f59e1c8f3788432b7f51185d8d747b3a97f9	2013-10-21 12:02:19 -07:00
Jingning Han	deb10ac6f9	Merge "Make memory alloc in pick_mode_context bsize aware"	2013-10-21 11:45:59 -07:00
Dmitry Kovalev	a698e52926	Cleanup: using cm variable instead of cpi->common. Change-Id: Iab334b5fd51dfa7e7f29963f8bdc62fd7355e56d	2013-10-21 11:10:11 -07:00
Dmitry Kovalev	33a29f3c35	Merge "Moving allow_high_precision_mv from MACROBLOCKD to VP9_COMMON."	2013-10-21 10:55:02 -07:00
Dmitry Kovalev	d1b65c6bda	Moving allow_high_precision_mv from MACROBLOCKD to VP9_COMMON. This value is a global frame-level flag, not a macroblock-level. Change-Id: Ie8c5790a931150741c2167c00c3e3dd2cf26744d	2013-10-21 10:12:14 -07:00
Paul Wilkins	eec3def7c5	Modified no memory rate control. This 2-pass rate control setting allocates bits based on first pass stats to each kf group, gf group and individual frame but does not correct the bits left and allocation after each frame. In other words it recommends a bit allocation for each frame but does not try to correct any over or under spend on a frame over the remainder of the clip. This reduces the accuracy of rate control in terms of hitting an average bitrate but prevents problems that may arise because early frames either use too many or too few bits. This mode is currently more inclined to undershoot than overshoot (particularly at higher data rates). Also minor changes to rate of adaption when recode loop is not enabled. This mode is currently enabled by default for VBR. It gives the following % performance gains. derf +0.467, +1.072 yt 2.962, 2.645 stdhd 1.682, 1.595, yt-hd 2.3, 2.174 Change-Id: I3c84a9bf8884e5b345698ff0e19187f792c2f3a0	2013-10-19 12:40:43 +01:00
Paul Wilkins	a2769bb73d	Reduced delta for kf/gf/arf when at maxq. Delta reduced because of concern about popping on some very hard clips. Also allow some frame recode at speed 2 for kf/gf/arf. Change-Id: Ib47dff42da41aa6eec83b7285fcaaca24abb851e	2013-10-19 12:24:59 +01:00
Dmitry Kovalev	6d2a0da7a7	Removing NUM_ prefix from constant names. Renames for consistency with other constants: NUM_FRAME_TYPES -> FRAME_TYPES NUM_PARTITION_CONTEXTS -> PARTITION_CONTEXTS Change-Id: I3db30acb2868eb0a424237c831087b2e264ec47f	2013-10-18 17:44:19 -07:00
Dmitry Kovalev	18a4bd2510	Using INTER_MODES constant instead of MB_MODE_COUNT - NEARESTMV. Change-Id: Ie5ec392904d03fd5485474b33be8408108e9d3c9	2013-10-18 14:31:56 -07:00
Yaowu Xu	db1045f2c0	Merge "Use lookup table to simplify logic"	2013-10-18 12:55:24 -07:00
Dmitry Kovalev	5cb8cca9eb	Merge "Using stride (# of elements) instead of pitch (bytes) in fdct16x16."	2013-10-18 12:53:09 -07:00
Jingning Han	72033fcff8	Make memory alloc in pick_mode_context bsize aware This commit makes the buffer allocation of zcoeff_blk array in pick_mode_context block size aware. It calculates the number of 4x4 blocks in the partition and assigns the memory space accordingly. This process (and the uninitialization) is done once for each encoding pass. It allows memory copy of smaller buffer when possible. For football at 600kbps, the runtimes improve by about 1%: speed 1, 45961ms -> 45472ms speed 2, 23863ms -> 23598ms Change-Id: Id2ca24906fa89f46fa5fe742ec4b8efc2a61f877	2013-10-18 12:42:44 -07:00
Dmitry Kovalev	e5fa44c869	Using stride (# of elements) instead of pitch (bytes) in fdct8x8. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: Ibc944952a192e6c7b2b6a869ec2894c01da82ed1	2013-10-18 12:20:26 -07:00
Dmitry Kovalev	1aa7fd5aef	Using stride (# of elements) instead of pitch (bytes) in fdct16x16. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: I2d95fdcbba96aaa0ed24a80870cb38f53487a97d	2013-10-18 11:49:33 -07:00
Dmitry Kovalev	a8ffa96e9b	Passing block index explicitly instead of using get_sb_index(). That makes decoder and encoder (only bitstream writing part) a little bit simpler and faster. Moving get_sb_index() function to the encoder. Change-Id: Ie91aaeefd69c84b085948267b33556a7666c6278	2013-10-18 11:02:32 -07:00
Paul Wilkins	eaf2d4cbfc	Merge "Add missing calls to emms in the adaptive quantization code"	2013-10-18 01:59:03 -07:00
Yaowu Xu	30d1ec38a7	Use lookup table to simplify logic In deciding the transform size for a given block in a given TX_MODE. Change-Id: I1467da09853e69cd320695a24c04e19a2f3d04fb	2013-10-17 14:54:16 -07:00
Dmitry Kovalev	08fc2c925f	Merge "Adding allow_hp as an argument to mv search functions."	2013-10-17 14:47:06 -07:00
Dmitry Kovalev	631d216273	Merge "Removing last_kf_gf_q member from VP9Common structure."	2013-10-17 14:46:02 -07:00
Guillaume Martres	ff3aada6cb	Add missing calls to emms in the adaptive quantization code Also avoid using floating-point operations when adaptive quantization is disabled. Change-Id: I54936d7afb661df049cdb3ecd246d04ac2a9d8d3	2013-10-17 14:04:41 -07:00
Dmitry Kovalev	2726f383cd	Adding allow_hp as an argument to mv search functions. Making this change in order to move allow_high_precision_mv field from MACROBLOCKD structure to VP9_COMMON (because it is a frame level flag). Change-Id: I1d006ba36d938e0caf4d40fa051e2e38df9c1108	2013-10-17 14:02:04 -07:00
Dmitry Kovalev	e05412fc23	Using stride (# of elements) instead of pitch (bytes) in fdct32x32. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: Id623c5113262655fa50f7c9d6cec9a91fcb20bb4	2013-10-17 13:02:28 -07:00
Dmitry Kovalev	01993f7d4a	Removing last_kf_gf_q member from VP9Common structure. It looks like we don't actually use this value. Change-Id: If21d52b597337e7755f7ea817824fc2b1e477a14	2013-10-16 18:01:48 -07:00
Guillaume Martres	7fd2561d64	Merge changes I6226456d,I97925178,I766c4b74 * changes: Use a separate MODE_INFO stream for each tile column Get rid of "this_mi", use "mi_8x8[0]" everywhere instead Make the static_segmentation feature work again	2013-10-16 17:05:39 -07:00
Guillaume Martres	5b984b36ca	Use a separate MODE_INFO stream for each tile column This should make parallel tiles decoding easier to implement. Change-Id: I6226456dd11f275fa991e4a7a930549da6675915	2013-10-16 16:24:48 -07:00
Guillaume Martres	acf0d56f0b	Get rid of "this_mi", use "mi_8x8[0]" everywhere instead The only case where they were intentionally pointing to different structures was in mbgraph, and this didn't have the expected behavior because both of these pointers are used interchangeably through the code Change-Id: I979251782f90885fe962305bcc845bc05907f80c	2013-10-16 16:24:03 -07:00
Guillaume Martres	9a03154f46	Make the static_segmentation feature work again Change-Id: I766c4b74db526efa4ff6dd2d95ef3e0beb45b6e5	2013-10-16 16:15:27 -07:00
Dmitry Kovalev	6c891d4303	Merge "Adding get_band_translate() function."	2013-10-16 16:15:18 -07:00
Guillaume Martres	42bcb4a7ad	Merge "Prevent accidental changes to the previous frame mode_infos"	2013-10-16 16:07:05 -07:00
Yaowu Xu	f5377cd74f	Merge "change to use vp9_idct_32x32_add"	2013-10-16 15:53:37 -07:00
Dmitry Kovalev	9deb614a57	Adding get_band_translate() function. Moving code that gets band_translate array from get_scan_and_band() function to get_band_translate() function. Renaming get_scan_and_band() to get_scan(). Change-Id: I43047c205a1ca2a6e24be44db39dc04b7a385008	2013-10-16 15:11:42 -07:00
Dmitry Kovalev	ab829274b1	Inlining and removing fwd_txm16x16 and fwd_txm8x8 pointers. Change-Id: I3528ba1c3fee761918509f9d9dc2d842c69f5a44	2013-10-16 15:00:48 -07:00
Yaowu Xu	aa60db19d2	change to use vp9_idct_32x32_add instead of vp9_idct32x32_1024_add by making use of the eob positions Change-Id: Iafcad20f0c75d8e00536c6a20dda76fe40a973fb	2013-10-16 14:39:14 -07:00
Dmitry Kovalev	501a8c6b91	Merge "Removing print_prob_tree function and vp9_coeff_probs typedef."	2013-10-16 13:13:25 -07:00
Dmitry Kovalev	65583b14e0	Merge "Moving FILTER_BITS constant from vp9_convolve.h to vp9_filter.h."	2013-10-16 13:13:14 -07:00
Marco Paniconi	e078c3d854	Initial 1-pass. Change-Id: I58c5436f5c95f6012fb2891cd2a02f76e4870b6a	2013-10-16 12:04:29 -07:00
Guillaume Martres	e55f60240a	Implement variance-based adaptive quantization This should be similar to what x264 does with --aq-mode 1. It works well with clips like parkjoy and touhou (http://x264.nl/developers/Dark_Shikari/LosslessTouhou.mkv). At low bitrates, the segmentation signaling overhead may negate the benefits of this feature. (PGW) Default changed to feature OFF to allow provisional merge. Change-Id: I938abf9bb487e1d4ad3b0264ea03d9826275c70b	2013-10-16 11:55:13 +01:00
Adrian Grange	12b2c712ca	Merge "Updated encoder to handle intra-only frames"	2013-10-15 17:19:28 -07:00
Jingning Han	9b05f23e05	Merge "Make vp9_zero use cases of consistent format"	2013-10-15 16:49:05 -07:00
Alexander Voronov	d6a59fb12c	Updated encoder to handle intra-only frames Updated the encoder to handle frames that are coded intra-only. Intra-only frames must be non-showable, that is, the "show frame" flag must be set to 0 in the frame header. Tested by forcing the ARF frames to be coded intra-only. Note: The rate control code will need to be modified to account for intra-only frames better than they are currently handled. Change-Id: I6a9dd5337deddcecc599d3a44a7431909ed21079	2013-10-15 16:44:02 -07:00
Jingning Han	355db16734	Merge "Remove unused variable vp9_64x64_zeros"	2013-10-15 16:24:34 -07:00
Jingning Han	bf187d1b2d	Merge "Fix a few indent format issues in buffer defs"	2013-10-15 16:23:50 -07:00
Jingning Han	fd1cd89da6	Merge "Remove unused comment"	2013-10-15 16:23:44 -07:00
Jingning Han	3f52cfa130	Merge "Re-design all-zero-coeff block index buffer use"	2013-10-15 16:23:38 -07:00
Jingning Han	c8e48f4b02	Make vp9_zero use cases of consistent format Remove the semicolon in the definition of vp9_zero macro. Make all the use cases of vp9_zero of consistent format. Change-Id: Ibaf9751e8595872b12766381a93d185a4d90df8f	2013-10-15 16:12:21 -07:00
Jingning Han	9115d84509	Remove unused variable vp9_64x64_zeros Remove the unused variable vp9_64x64_zeros from vp9_encodeframe_. Change-Id: I34bfdcab9a9105440ad05154c1e0516e70258785	2013-10-15 11:53:46 -07:00
Jingning Han	0a66541619	Fix a few indent format issues in buffer defs Change-Id: Iac55891ac9e6f13718c9f822aa099b5ca491832a	2013-10-15 11:51:09 -07:00
Jingning Han	9622271033	Remove unused comment Change-Id: I2d96940fae4c7a16661a43c2bf6907d8b1c1a127	2013-10-15 11:45:38 -07:00
Dmitry Kovalev	a4585285ed	Removing unused 8x4 transform from the encoder. Change-Id: Icbcf68b5b685a56f255ebc3859c9692accdadf9e	2013-10-15 11:27:28 -07:00
Jingning Han	8e3ce1a9e3	Re-design all-zero-coeff block index buffer use Use the zcoeff_blk buffer of PICK_MODE_CONTEXT to store the indexes of all-zero-coeff block of the current best mode. Remove the temporary buffer best_zcoeff_blk defined in the rate-distortion optimization loop. This improves the speed performance by about 0.5% in all speed settings. Change-Id: Ie3e15988ddfa581eafa2e19a8228d3fe4a46095c	2013-10-15 10:54:06 -07:00
Dmitry Kovalev	77cd8db1bf	Moving FILTER_BITS constant from vp9_convolve.h to vp9_filter.h. Change-Id: Idd7bdb0c364d94c5a0d24c87bb8574292e4c840c	2013-10-14 21:15:40 -07:00
Dmitry Kovalev	6965e6f3d5	Removing print_prob_tree function and vp9_coeff_probs typedef. Change-Id: If14265084e9b4c85c75b43e8d33a6fafad468cbc	2013-10-14 21:08:21 -07:00
Jingning Han	a0d8ec7b76	Merge "Move token_cache from cost_coeffs to MACROBLOCK"	2013-10-14 13:05:32 -07:00
Jingning Han	f60a3910c4	Move token_cache from cost_coeffs to MACROBLOCK This commit moves token_cache buffer into macroblock struct, instead of defining as a local variable in cost_coeffs. This avoids repeatedly re-allocating memory space in the rate-distortion optimization loop. The runtime at speed 0 reduces: bus 2000kbps, 161692ms to 159951ms football 600kbps, 229505ms to 225821ms Change-Id: If7da6b0b6d8c5138a16271a33c4548fba33d8840	2013-10-14 10:45:56 -07:00
Dmitry Kovalev	ac468dde46	Consistent names for inverse hybrid transforms (2 of 2). Renames: vp9_iht_add -> vp9_iht4x4_add vp9_iht_add_8x8 -> vp9_iht8x8_add vp9_iht_add_16x16 -> vp9_iht16x16_add Change-Id: I8f1a2913e02d90d41f174f27e4ee2fad0dbd4a21	2013-10-11 15:49:05 -07:00
Dmitry Kovalev	107897cf05	Merge "Consistent names for inverse hybrid transforms (1 of 2)."	2013-10-11 15:33:00 -07:00
Dmitry Kovalev	e765aade0b	Merge "Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB."	2013-10-11 14:15:46 -07:00
Deb Mukherjee	c222b96bfd	Merge "Change in rddiv parameter to make it a power of 2"	2013-10-11 13:53:59 -07:00
Dmitry Kovalev	7ef573914d	Consistent names for inverse hybrid transforms (1 of 2). Renames: vp9_short_iht4x4_add -> vp9_iht4x4_16_add vp9_short_iht8x8_add -> vp9_iht8x8_64_add vp9_short_iht16x16_add_c -> vp9_iht16x16_256_add Change-Id: Ibca7a188fd062b196787ac5efc1ea545e7f166c0	2013-10-11 13:31:32 -07:00
Dmitry Kovalev	1ab7eb1406	Merge "Adding const to the input argument of all 1D transforms."	2013-10-11 13:20:57 -07:00
Yaowu Xu	4c20bff9d2	Merge "Masking intra mode choice adaptively"	2013-10-11 11:25:52 -07:00
Dmitry Kovalev	44195fda71	Adding const to the input argument of all 1D transforms. Also adding static to iadst16_1d and fadst16 functions. Change-Id: I13c7df3b776f0f8efc6e80099bdb0a2f6d29edaf	2013-10-11 11:19:58 -07:00
Dmitry Kovalev	4a0f9478ef	Replacing {VP9_COEF, MODE}_UPDATE_PROB with DIFF_UPDATE_PROB. Values of MODE_UPDATE_PROB and VP9_COEF_UPDATE_PROB are equal, so replacing them with one constant. Inlining appropriate arguments for functions: vp9_cond_prob_diff_update (encoder) vp9_diff_update_prob (decoder) Change-Id: I1255a1cb477743b799b3bfbbcd8de6b32b067338	2013-10-11 10:47:22 -07:00
Dmitry Kovalev	6e21ca7635	Merge "Removing vp9_tree_p typedef."	2013-10-11 10:44:04 -07:00
Deb Mukherjee	d9655e42b8	Change in rddiv parameter to make it a power of 2 Converts the constant rddiv parameter to 128 (from 100) and implements RDCOST with bit-shift rather than multiplication. Other parameters are also adjusted to roughly keep the same balance between Rate and Distortion. There is a slight speed-up of about 0.5-1% (at speed 0) as tested on football_cif. There is a slight change in performance due to small change in the parameters. derfraw300: +0.033% stdhdraw250: +0.102% Change-Id: I70ac69f58fa71c83108f68fe41796cd19d1fc760	2013-10-11 10:43:02 -07:00
Yaowu Xu	8b175679be	Masking intra mode choice adaptively The commit changes to mask available intra prediction modes for test based on prediction block size. With this patch, encoding time at CpuUsed 2 is reduced by 10% to 20% for HD clips, with a compression drop of 0.2% Change-Id: I65f320f1237c0f5ae3a355bf7caf447f55625455	2013-10-11 10:29:53 -07:00
Jingning Han	54e702b5d7	Merge "Restore mode skip feature in sub8x8 rd loop"	2013-10-11 09:21:06 -07:00
Paul Wilkins	704028d435	Experimental rate control change. When the codec in VBR (or cq) mode hits its max q limits and is struggling to hit a target bandwidth, the bit target per frame collapses. In the first instance normal frames cap out at the maximum allowed Q and then the ARF and GFs do the same. This latter behavior is not generally desirable as GFs and ARFs are only effective from a quality and data rate perspective if they have at least some level of -Q delta compared to the surrounding frames. In this patch I define a separate max Q for GFs and ARFs that is derived from but somewhat lower than that defined for normal frames. In effect there is a minimum Q delta that will always be available for GFs and ARFs regardless of the target rate and MAXQ setting. This may of course mean that the absolute lowest rate obtainable for a given clip is somewhat higher. Change-Id: I268868b28401900d0cd87e51e609cd3b784ab54a	2013-10-11 13:40:54 +01:00
Paul Wilkins	8b989f5b23	Disable recode loop. For VBR coding disable the recode loop for speeds > 0. Results pending. Change-Id: I2cd9a87c3fcbe39c05b954798d0671a4ca62c37f	2013-10-11 13:38:52 +01:00
Dmitry Kovalev	98400c1bc4	Removing vp9_tree_p typedef. It is used only two times and it is more clear to use real type instead of typedef. Change-Id: Idc25c16504c3da4d040e0cdb33a2987631bb6a5b	2013-10-10 17:16:20 -07:00
Dmitry Kovalev	2be3b84aed	Merge "Giving consistent names to IDCT 32x32 functions."	2013-10-10 15:31:25 -07:00
Dmitry Kovalev	3309b040c8	Merge "Consistent names for FDCT functions."	2013-10-10 15:29:29 -07:00
Adrian Grange	61c607fd79	Merge "Fix typo in comment message"	2013-10-10 14:05:51 -07:00
Yaowu Xu	e2d6e37a54	Merge "change to avoid out-of-range computation"	2013-10-10 13:38:16 -07:00
Jingning Han	09aca3089f	Merge "Re-design rate-distortion cost tracking buffers"	2013-10-10 12:57:31 -07:00
Guillaume Martres	b364176c08	Prevent accidental changes to the previous frame mode_infos This is needed to fix mbgraph but shouldn't affect anything else Change-Id: I2f515052f62e348cd3794b7ff0c139802225ea95	2013-10-10 12:18:12 -07:00
Jingning Han	f0772dc5b8	Fix typo in comment message Change-Id: Ifef756a3a91423bb9f5411f06fa092027be21ecf	2013-10-10 12:17:10 -07:00
Dmitry Kovalev	fc82dbb434	Consistent names for FDCT functions. Renames: fdct4_1d -> fdct4 fadst4_1d -> fadst4 fdct8_1d -> fdct8 fadst8_1d -> fadst8 fdct16_1d -> fdct16 fadst16_1d -> fadst16 "_1d" suffix is redundant, so removing it. The same will happen with idct in the next change sets. Change-Id: Ibf421cd2f569146c6079269df7a31819c098265e	2013-10-10 11:53:55 -07:00
Dmitry Kovalev	1e766b50e2	Giving consistent names to IDCT 32x32 functions. Renames: vp9_short_idct32x32_add -> vp9_idct32x32_1024_add vp9_short_idct32x32_1_add -> vp9_idct32x32_1_add vp9_idct_add_32x32 -> vp9_idct32x32_add Change-Id: Id85306f5814bac6c47463a6b5901a93082510666	2013-10-10 11:27:39 -07:00
Jingning Han	fc19243ced	Re-design rate-distortion cost tracking buffers This commit re-designs the per transformed block rate-distortion costs tracking buffers. It removes redundant buffer usage, makes the needed context memory allocation per VP9_COMP instance and reuses the same buffer sets inside the rate-distortion optimization search loop, thereby avoiding repeatedly requiring memory space. It reduces speed 0 runtime: bus at 2000 kbps from 166763ms to 158967ms, football at 600 kbps from 246614ms to 234257ms. Both about 5% speed-up. Local tests suggest about 2% to 5% speed-up for speed 1 and 2 settings. This does not change compression performance. Change-Id: I363514c5276b5cf9a38c7251088ffc6ab7f9a4c3	2013-10-10 11:03:44 -07:00
Yaowu Xu	b47cef056e	change to avoid out-of-range computation Change-Id: Id5e31833a0ef40de9f64c2f5674af7083233bf14	2013-10-10 11:01:50 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Dmitry Kovalev	419c3f6fba	Merge "Giving consistent names to IDCT 16x16 functions."	2013-10-10 10:43:14 -07:00
Deb Mukherjee	2b055dfe3f	Merge "Adjustment to mv cost parameters"	2013-10-10 09:08:58 -07:00
Jingning Han	be6ae20510	Merge "Fix intra dist model of skip_encode feature"	2013-10-10 09:00:20 -07:00
Jingning Han	4793324c16	Merge "Allow sub8x8 intra modes test for alt frame coding"	2013-10-10 09:00:08 -07:00
Paul Wilkins	c317fbd6cf	Merge "Disable MODE_TEST_HIT_STATS"	2013-10-10 05:52:06 -07:00
Deb Mukherjee	e4b0fce41c	Adjustment to mv cost parameters Increases these parameters. There is a small efficiency gain. Change-Id: Ie5f0ddb39c907d335e0dafa5eb112365a81f4542 derfraw300: +0.091% stdhdraw250: +0.238%	2013-10-09 23:14:25 -07:00
Jingning Han	80f215198f	Merge "Simplifying and inlining k_cvtlo_epi16 and k_cvthi_epi16"	2013-10-09 16:08:42 -07:00
Jingning Han	013db649fa	Fix intra dist model of skip_encode feature The intra mode distortion adjustment for skip_encode feature was broken in the refactoring cc91851. This commit fixes it and tunes the distortion models used therein. Change-Id: I0d676e82f8e855536a90cf9b3e3fdefafcd886c6	2013-10-09 16:05:50 -07:00
Deb Mukherjee	d6aae4d456	Merge "Clean-ups in rdopt.c"	2013-10-09 12:10:20 -07:00
Deb Mukherjee	eb8b1cd764	Clean-ups in rdopt.c Some minor cleanups in preparation for experimentation with some encode parameters and thresholds Change-Id: I449d66da97eae0a7acdf4aae374e2f9111342056	2013-10-09 11:32:03 -07:00
Jingning Han	03fe08ca30	Deprecate the use of PARTITION_INFO from encoder Use b_mode_info to store the inter prediction mode of sub8x8 block, in replacement of the use of partition_info. Remove redundant buffer update for partition_info. For bus_cif at 2000 kbps, this seems to make speed 0 about 1% faster. Change-Id: Id1b3be45e75a24fb4b42335ac480c23e440978f6	2013-10-09 09:23:52 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Dmitry Kovalev	8d3ef287a2	Merge "Removing redundant vp9_pt_energy_class declarations."	2013-10-08 10:54:48 -07:00
Jim Bankoski	ee6b7c1b6c	Merge "easy to fix cpplint issue in rdopt.c"	2013-10-08 10:28:30 -07:00
Yaowu Xu	e29137df05	Change to allow less rectangular partition check For CpuUsed 1 & 2, this commit allows skipping the rectangular partition check when NONE is better than SPLIT. It also allows such logic on alt ref frame coding rather than using only square partitions for them. The change gains about 0.3% compression on yt and ythd for both CpuUsed 1 & 2, and about 0.6% compression on cif and stdhd for both CpuUsed 1 & 2. Change-Id: I814b653baf89f59acd20e042629a12938a1bd4e5	2013-10-08 08:12:56 -07:00
Deb Mukherjee	9390862702	Merge "Rate control parameter adjustment"	2013-10-07 19:16:53 -07:00
Jim Bankoski	08feefbe7b	easy to fix cpplint issue in rdopt.c Change-Id: Id093816146de0d100f0c6ae2542aaa427dbab2d8	2013-10-07 17:03:29 -07:00
Jim Bankoski	9d4c6fab44	cpplint issue missed in first pass for vp9_bitstream.c Change-Id: Ia725748acbc2a3f825f0d208f26522a0412301fa	2013-10-07 15:54:20 -07:00
Jim Bankoski	9603989c72	Merge "cpplint vp9_variance_sse2.c"	2013-10-07 15:44:50 -07:00
Deb Mukherjee	f43c3199bd	Rate control parameter adjustment Adjusts the bits per mb parameter about 10% smaller. Results at speed 0: fullderfraw: +0.255% fullstdhdraw: +0.262% stdhdraw250: +0.291% Change-Id: I2b7317ac3f61737bc77eb5470aad870cade83fa5	2013-10-07 15:08:40 -07:00
Jim Bankoski	2b491c19b8	Merge "cpplint errors in vp9_onyx_if.h"	2013-10-07 14:47:21 -07:00
Dmitry Kovalev	b096c5a336	Giving consistent names to IDCT 16x16 functions. Renames: vp9_short_idct16x16_add -> vp9_idct16x16_256_add vp9_short_idct16x16_10_add -> vp9_idct16x16_10_add vp9_short_idct16x16_1_add -> vp9_idct16x16_1_add vp9_idct_add_16x16 -> vp9_idct16x16_add Change-Id: Ief8a3904de78deab0f4ede944c4d0339c228cfc3	2013-10-07 14:31:10 -07:00
Jingning Han	c8f481fa3d	Restore mode skip feature in sub8x8 rd loop This commit restores the mode skip feature in the sub8x8 rd loop. Change-Id: I5496ee32053f572b8961b549e9ecd4f1360824de	2013-10-07 14:20:34 -07:00
Dmitry Kovalev	2ae93a776b	Merge "Giving consistent names to IDCT 8x8 functions."	2013-10-07 14:19:50 -07:00
Dmitry Kovalev	23cc1cd8e6	Removing redundant vp9_pt_energy_class declarations. Declaring vp9_pt_energy_class in vp9_entropy.h instead of many external places. Change-Id: I66e8a3fc119a43f88d130d0dae4133c825a047a3	2013-10-07 14:11:01 -07:00
Jim Bankoski	7eb7dd2fed	cpplint errors in vp9_onyx_if.h Slightly bigger change -> broke up encode_frame_to_datarate, lots of line length fixes. Change-Id: I7c53325e954de130f3fe1a6656626efc6705be82	2013-10-07 13:57:20 -07:00
Dmitry Kovalev	272adbbec4	Using inter_mode_offset_function instead of duplicated code. Change-Id: I8de865cd1deca07b5c92c225782f0867367e9a11	2013-10-07 13:18:46 -07:00
Adrian Grange	18a2617126	Merge "cpplint issues resolved vp9_ratectrl.c"	2013-10-07 10:54:17 -07:00
Jim Bankoski	31b7a912d1	cpplint issues resolved vp9_ratectrl.c Change-Id: Iae7674b0c946a5ac01617840b3f62965c654d920	2013-10-07 09:21:29 -07:00
Jim Bankoski	92519a005a	Merge "cpplint problems resolved with vp9_firstpass.c"	2013-10-07 09:16:46 -07:00
Jim Bankoski	ccc5a483f4	Merge "cpplint issues resolved in vp9_mcomp.c"	2013-10-07 09:14:35 -07:00
Paul Wilkins	65f0cc7f4b	Disable MODE_TEST_HIT_STATS This flag is for stats generation and testing and should not be checked in as enabled by default. Change-Id: I4ea57dbcf49790f14777f598ddd3dc37dcc7a6bb	2013-10-07 02:54:19 -07:00
Dmitry Kovalev	c6ad70d5f1	Giving consistent names to IDCT 8x8 functions. Renames: vp9_short_idct8x8_add -> vp9_idct8x8_64_add vp9_short_idct8x8_1_add -> vp9_idct8x8_1_add vp9_short_idct8x8_10_add -> vp9_idct8x8_10_add vp9_idct_add_8x8 -> vp9_idct8x8_add Change-Id: Ifb8d3a45b4c0397aa805b30463f3d14581bf72c1	2013-10-06 00:24:09 -07:00
Dmitry Kovalev	9dba044be2	Merge "Giving consistent names to IDCT/IWHT functions."	2013-10-05 23:44:05 -07:00
Jim Bankoski	bf21ce63ee	encodemb cpplint issues revisited. Change-Id: Id5f25b74e2207bf44b6f6c8ffe548fa30fd78b4d	2013-10-05 17:24:51 -07:00
Jim Bankoski	30dee8adfc	cpplint problems resolved with vp9_firstpass.c Change-Id: Ic7b7014a0d857585bfd4baaea1d5c27ffe355642	2013-10-05 17:10:54 -07:00
Jim Bankoski	c9f3f9ed70	Merge "unused typedef in vp9_variance.h"	2013-10-05 16:49:13 -07:00
Jim Bankoski	7fd13472ae	Merge "cpplint issues with vp9_boolhuff.c resolved"	2013-10-05 16:48:28 -07:00
Jim Bankoski	f59cb3eacc	Merge "added nolint to function that doesn't seem easy to breakup"	2013-10-05 16:47:23 -07:00
Jim Bankoski	4410bbbf88	Merge "cpplint issues in vp9_lookahead.c"	2013-10-05 16:46:11 -07:00
Jim Bankoski	b79b7c354d	cpplint issues resolved in vp9_mcomp.c Change-Id: I2c2f83f4dfa2782fc6b0aa6db3ba2c4e6e423ffa	2013-10-05 16:44:40 -07:00
Jim Bankoski	6a7b1fb754	Merge changes Idbfabe42,I788f1a30 * changes: cpplint issues resolved in vp9_variance_mmx.c cpplint issues in vp9_ssim.c	2013-10-05 16:32:50 -07:00
Jim Bankoski	2dba2eb46a	Merge "cpplint issues in vp9_picklpf.c"	2013-10-05 16:32:00 -07:00
Jingning Han	0d0ed6a29b	Allow sub8x8 intra modes test for alt frame coding This commit allows sub8x8 intra modes test in the rate-distortion loop for hd sequences in speed 1 and 2. For sequence y90n of hd set at 8000 kbps, speed 2 runtime goes from 207s to 210s. For ped_1080p at 3000 kbps, speed 2 runtime goes from 336s to 337s. Both are running with 300 frames. This improves compression performance by 0.24% for stdhd and 0.32% for hd. Change-Id: I173ca38a6411565ae6cfadd184c42b2070c5de1f	2013-10-04 19:13:00 -07:00
Jim Bankoski	0500cf429f	cpplint issues with vp9_boolhuff.c resolved Change-Id: I6990c9ab838323d8770dd1f49a25bf3acc4c05c7	2013-10-04 17:20:58 -07:00
Jim Bankoski	a36045fb3b	Merge "cpplint issues with vp9_temporal_filter.c"	2013-10-04 17:17:02 -07:00
Jim Bankoski	cac3e1588e	cpplint issues in vp9_picklpf.c Change-Id: I62e631ca95fefbb1a993479a5e3926dc81359fe7	2013-10-04 17:08:41 -07:00
Jim Bankoski	eead4bb89e	Merge "lint issue in vp9_psnr.c"	2013-10-04 16:42:30 -07:00
Jim Bankoski	e2d73897d0	Merge "vp9_encodeframe.c cpplint issues resolved"	2013-10-04 16:42:06 -07:00
Jim Bankoski	6e161a26e3	Merge "cpp lint issues resolved in vp9_encodeintra.c"	2013-10-04 16:41:58 -07:00
Jim Bankoski	5f80d2ad33	Merge "cpplint vp9_dct.c issues resolved"	2013-10-04 16:41:46 -07:00
Jim Bankoski	38f6a3cdc7	Merge "cpplint issues vp9_tokenize.c resolved"	2013-10-04 16:41:23 -07:00
Jim Bankoski	d07545b7b8	cpplint issues with vp9_temporal_filter.c Change-Id: I695a990689c79d160227975116125b140875aed1	2013-10-04 15:49:30 -07:00
Yaowu Xu	d129eea9fa	Merge "Further clean up of speed 4"	2013-10-04 14:45:21 -07:00
Jim Bankoski	de5cb8b140	vp9_encodeframe.c cpplint issues resolved Change-Id: Id9d837e062d9c4a94def4b4ed1f49a67c75d3618	2013-10-04 14:37:31 -07:00
Jim Bankoski	02f28bac29	cpp lint issues resolved in vp9_encodeintra.c Change-Id: Ib6a8360d24f44eeaec12c5055568382a105dc235	2013-10-04 14:35:01 -07:00
Jim Bankoski	9c2b3744c9	cpplint issues in vp9_lookahead.c Change-Id: I2a98995f0df77d99dc47bda5e41886f014d8843f	2013-10-04 14:24:19 -07:00
Jim Bankoski	5b4f836148	cpplint issues resolved in vp9_variance_mmx.c Change-Id: Idbfabe427fbeab44210f13fec8b6f63f7a4eb0dd	2013-10-04 14:22:08 -07:00
Jim Bankoski	eb5b7ac27b	added nolint to function that doesn't seem easy to breakup Change-Id: I5489b116aea7c510ea5ebbed3c1445f321b05f3e	2013-10-04 14:17:47 -07:00
Dmitry Kovalev	3a0602578e	Giving consistent names to IDCT/IWHT functions. The idea is to have the following names for each transform size: vp9_idct4x4_add vp9_idct4x4_1_add vp9_idct4x4_10_add vp9_idct4x4_16_add vp9_idct8x8_add vp9_idct8x8_1_add vp9_idct8x8_10_add vp9_idct8x8_64_add etc for 16x16, 32x32 The actual list of renames in this patch: vp9_idct_add_lossless -> vp9_iwht4x4_add vp9_short_iwalsh4x4_add -> vp9_iwht4x4_16_add vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add vp9_idct_add -> vp9_idct4x4_add vp9_short_idct4x4_add -> vp9_idct4x4_16_add vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1	2013-10-04 14:17:06 -07:00
Jim Bankoski	25ecb1f0b3	cpplint vp9_variance_sse2.c Change-Id: Ifce8f5b57a1ea8952e8a67c5b92a127a061899fa	2013-10-04 14:15:06 -07:00
Jim Bankoski	f3e6a35cdb	cpplint issues in vp9_ssim.c Change-Id: I788f1a3004643347ca08d08fc3cb2bb8f0b134d9	2013-10-04 14:08:37 -07:00
Jim Bankoski	424c74e736	cpplint vp9_dct.c issues resolved Change-Id: Ia21653a447040f1b472d21ebd19103b0558c4b16	2013-10-04 13:47:59 -07:00
Jim Bankoski	c6960b6086	cpplint issues vp9_tokenize.c resolved Change-Id: Id4ec0084641d2ad4def95fb05239455fbc25f9b9	2013-10-04 13:42:58 -07:00
Jim Bankoski	660dcfe6a2	Merge "cpplint issues vp9_encodemv.c"	2013-10-04 12:55:46 -07:00
Jim Bankoski	19641c40f9	Merge "cpplint issues vp9_mbgraph"	2013-10-04 12:55:26 -07:00
Guillaume Martres	014a2c17df	Fix first pass for non-square blocks Change-Id: Ic049f0a6ce190f33859118e7b8cfcfe305979102	2013-10-04 12:04:15 -07:00
Dmitry Kovalev	042c475a8f	Merge "Moving all idct/iht functions in one place."	2013-10-04 12:01:42 -07:00
Jim Bankoski	d9215a6616	cpplint issues vp9_mbgraph Change-Id: Iedf9ac460edb31d7c072e2bebd26f2afe8e6089b	2013-10-04 11:22:22 -07:00
Jim Bankoski	19e227561a	cpplint issues vp9_encodemv.c Change-Id: Icda1d2d7cbfb176884fa6c7d9366a2d60e2994e9	2013-10-04 11:19:06 -07:00
Jim Bankoski	916f803175	lint issue in vp9_psnr.c Change-Id: Ifc7ffc02cfedb47230571298622602609a4e8a70	2013-10-04 11:01:49 -07:00
Jingning Han	1ab60f7bfb	Merge "Remove redundant second_ref_frame check in sub8x8"	2013-10-04 09:04:11 -07:00
Paul Wilkins	44e039b4f5	Further clean up of speed 4 Speed 4 still does not give a big gain over speed 3. This just cleans it up a little from the last patch and comments out features that do not seem to be giving much benefit. Change-Id: I5f366e6160e1dbe5dc45cf5eb90cc02712baa1b6	2013-10-04 16:57:24 +01:00
Paul Wilkins	8abd92f12f	Remove mode_skip_start and mask code for sub 8x8 This code serves no purpose in the re-factored sub 8x8 code. Change-Id: I5364986224d1a28b71bcb046ec8557a3d14aaa47	2013-10-04 14:26:17 +01:00
Paul Wilkins	de6ecc5ac3	Selective masking of split modes. Allow selective masking of individual split modes rather than just a single on / off flag. For speed 2 recovers the large speed loss seen for some derf clips in change Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 and a small quality gain. For speed 1 10 % speed increase observed locally on some derf clips for minimal quality change. Change-Id: If86191087b93cbc05351c26c60c7933e2149e485	2013-10-04 14:20:58 +01:00
Paul Wilkins	03dd2818e4	Missing threshold case for disable split. In relation to change: Refactor inter mode rate-distortion search Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 sf->thresh_mult_sub8x8[THR_INTRA] = INT_MAX missing; Change-Id: Ia86b68a5073368a3e2ca124a27b632243b525c8b	2013-10-04 11:54:24 +01:00
Dmitry Kovalev	d975804e9a	Merge "Replacing duplicated code with get_scan_and_band call."	2013-10-03 18:58:40 -07:00
Dmitry Kovalev	8b34437522	Replacing duplicated code with get_scan_and_band call. Change-Id: I2cc3684f416a63dc99b9303109f9850f34a470d5	2013-10-03 17:46:28 -07:00
Jingning Han	2952b7d1fb	Remove redundant second_ref_frame check in sub8x8 This commit removes the redundant second reference frame check in the rate-distortion optimization loop for sub8x8 blocks. Change-Id: I13a57a6f624c4a9bcef02ff2a867fa30d8b44a93	2013-10-03 14:02:12 -07:00
Jingning Han	b9daef91d8	Use vp9_zero in sub8x8 RD optimization loop Change-Id: Ic23a705e48cadaa7151f2bd8536d56636cb973e3	2013-10-03 12:34:25 -07:00
Jingning Han	4093192ec9	Change b_mode_info definition from union to struct This commit defines b_mode_info as a struct type. This will allow us to further remove the use of PARTITION_INFO in the encoding process. Change-Id: I975b0f7d557b5e0f66545a61b472def76b671cce	2013-10-03 12:34:11 -07:00
Jingning Han	793c2d8429	Remove unused variables in inter_mode rd loops Remove redundant variable definition/use in rate-distortion search loop for regular and sub8x8 blocks, respectively. Change-Id: Ic0eb3660bb6851ba2eb8d702ba9fd11595000d01	2013-10-03 12:34:11 -07:00
Jingning Han	a55625873f	Merge "Refactor inter mode rate-distortion search"	2013-10-03 12:19:53 -07:00
Jingning Han	11abab356e	Refactor inter mode rate-distortion search This commit separates the rate-distortion optimization loop of superblocks from that of sub8x8 blocks. This allows better design rate-distortion optimization search loop for each setting. It also removes the use of SPLITMV and I4X4_PRED therein. No performance change in speed 0 settings. For bus@CIF at 2000kbps, the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster). The overall compression performance on derf changed by -0.021%. Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the overall coding efficiency goes up by 1.629% for derf, 1.236% for yt. Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4	2013-10-03 11:36:49 -07:00
Dmitry Kovalev	9250d1529c	Using vp9_zero instead of vpx_memset. Change-Id: I9a0d0e9c3459954aa7b9c68f92cc5d56385ebd18	2013-10-03 10:59:36 -07:00
Paul Wilkins	b03d3da9c1	Merge "Speed setting review."	2013-10-03 09:49:00 -07:00
Paul Wilkins	fa71882e63	Merge "make use last partition consider motion"	2013-10-03 09:48:49 -07:00
Dmitry Kovalev	6cb6987d4d	Merge "BITSTREAM - RESTORING BILINEAR INTERPOLATION FILTER SUPPORT"	2013-10-03 09:34:26 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to ensure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Jim Bankoski	f1d3e5e4d6	make use last partition consider motion This commit causes use last partition to consider whether a 64x64 has motion that might make a new partitioning worth while. Change-Id: I3a57bedef4f3cd961fadbfa96651c206fa36da4a	2013-10-03 10:22:39 +01:00
Paul Wilkins	ece99b3da0	Merge "Improved auto_partition_range."	2013-10-03 02:06:13 -07:00
Dmitry Kovalev	68a3e4a888	BITSTREAM - RESTORING BILINEAR INTERPOLATION FILTER SUPPORT Adding appropriate test vector vp90-2-06-bilinear.webm. Change-Id: Ia3bbf57318e0cc61a1b724fe751e3f9c7e11b337	2013-10-02 18:04:12 -07:00
A.Mahfoodh	5215b83aea	Simplifying and inlining k_cvtlo_epi16 and k_cvthi_epi16 Simplify the k_cvtlo_epi16 and k_cvthi_epi16 to only two instructions. Then inlined them. quoting from intel MMX_App_Compute_16bit_Vector.pdf "The PMADDWD instruction multiplies four pairs of 16-bit numbers and produces partial sums of the results and can do so once per clock (with a three-clock latency)." so I am assuming that there will be three clock overhead after the last _mm_madd_pi16 command. Even with the overhead the number of clocks in general should be smaller. I am not sure though because I could not find information about number of clocks required for instructions in k_cvtlo_epi16 and k_cvthi_epi16. I will run a test and compare the execution time. Change-Id: Ieda4aa338f69ad3dd196ac6e7892da3cf1b47ea7	2013-10-02 20:02:03 -04:00
Dmitry Kovalev	a88a0e88a4	Merge "Moving get_token_alloc function from common to the encoder."	2013-10-02 16:26:00 -07:00
Jim Bankoski	f5bcc372c9	unused typedef in vp9_variance.h Change-Id: I15f79c9de34c723c1dd419b8da96c3ff948c5e03	2013-10-02 15:59:31 -07:00
Dmitry Kovalev	be7eec79be	Moving all idct/iht functions in one place. Moving functions from vp9_idct_blk to vp9_idct because these functions are used from both encoder and decoder. Removing duplicated code from vp9_encodemb.c and reusing existing functions. Change-Id: Ia0a6782f8c4c409efb891651b871dd4bf22d5fe8	2013-10-02 14:13:33 -07:00
Jingning Han	54bc73151b	Deprecate unused mode count variables Remove mode_check_freq and mode_test_hit_counts from VP9_COMP. Change-Id: Iabfd9f841444cd9bf19ac761a9795f140082ce0b	2013-10-02 11:07:14 -07:00
Jim Bankoski	825b7c301d	Merge "vp9_block.h cpplint issues resolved"	2013-10-01 16:14:58 -07:00
Jim Bankoski	691177842c	Merge "cpplint issue in vp9_rdopt.h"	2013-10-01 15:45:35 -07:00
Jim Bankoski	5491a1f33e	vp9_block.h cpplint issues resolved Change-Id: Icc6a76a5be77f3e19918155bab3998e0aa32ccf5	2013-10-01 15:17:39 -07:00
Jim Bankoski	c4627a9ff1	cpplint issues in vp9_onyx_int.h Change-Id: I6c4058aebe834e1a12b7a3fb10484b9ebe60b349	2013-10-01 15:14:39 -07:00
Jim Bankoski	b6e2f9b752	cpplint issue in vp9_rdopt.h Change-Id: I84209d382ca5dfc537ee533cd792d8caa0e25cee	2013-10-01 15:09:32 -07:00
Dmitry Kovalev	0a5e9ee054	Moving get_token_alloc function from common to the encoder. Also renaming mb_row -> mi_row, mb_col -> mi_col arguments and calculate mb_rows/mb_cols values from mi_rows/mi_cols. Change-Id: I6919a279f560648e23bc9a12f507d17c21ffd5d7	2013-10-01 11:54:10 -07:00
Jingning Han	195061feda	Fix rectangular partition check in speed 1 Make encoder skip rectangular partition check in speed 1 and above, when early termination was triggered in partition split. Thanks Guillaume (gmartres@) for catching this issue. This change makes bus_cif at 2000kbps speed 1 runtime goes down from 25612ms to 23438ms (about 9% speed-up), at the expense of -0.235% performance down. Change-Id: I98613fad081a261d30d5fa206f934ca70601c180	2013-09-30 12:14:36 -07:00
Paul Wilkins	d12a502ef9	Merge "Alter Speed 3."	2013-09-30 09:12:28 -07:00
Deb Mukherjee	fad3d07df3	Merge "Some minor changes/cleanups in rate control"	2013-09-30 06:50:56 -07:00
Paul Wilkins	65b93c7e52	Improved auto_partition_range. The code now takes into account temporal and spatial information to determine the partition size range, but the frequency counts have been removed. The net effect is similar in quality but about 10% faster. Change-Id: I39a513fb79cec9177b73b2a7218f0da70963ae95	2013-09-30 11:32:57 +01:00
Paul Wilkins	a76caa7ff4	Alter Speed 3. This patch deletes the variance based speed three partitioning. Speed 3 now uses the same partitioning method as speed 2 but with some stricter conditions. The speed and quality are now somewhere between speeds 2 and 4 whereas before it was worse in both than speed 4. Change-Id: Ia142e7007299d79db3ceee6ca8670540db6f7a41	2013-09-30 11:26:46 +01:00
Dmitry Kovalev	b927620231	Merge "Using is_inter_block and has_second_ref functions."	2013-09-29 12:14:41 -07:00
Dmitry Kovalev	29815ca729	Merge "Moving from int_mv* to MV* (3)."	2013-09-29 12:13:16 -07:00
Dmitry Kovalev	4ab01fb5f7	Merge "Reusing FRAME_CONTEXT struct to simplify the code."	2013-09-29 12:02:26 -07:00
Dmitry Kovalev	b3d3578ee4	Merge "Renaming vp9_short_idct10_8x8_add to vp9_short_idct8x8_10_add."	2013-09-29 12:01:50 -07:00
Dmitry Kovalev	7343681675	Merge "Removing vp9_get_coef_neighbors_handle function."	2013-09-29 12:01:36 -07:00
Dmitry Kovalev	efbacc9f89	Merge "Removing vp9_subpelvar.h from common."	2013-09-29 12:00:46 -07:00

2061 Commits