generic-library/vpx

Author	SHA1	Message	Date
Guillaume Martres	9a03154f46	Make the static_segmentation feature work again Change-Id: I766c4b74db526efa4ff6dd2d95ef3e0beb45b6e5	2013-10-16 16:15:27 -07:00
Guillaume Martres	42bcb4a7ad	Merge "Prevent accidental changes to the previous frame mode_infos"	2013-10-16 16:07:05 -07:00
Dmitry Kovalev	ab829274b1	Inlining and removing fwd_txm16x16 and fwd_txm8x8 pointers. Change-Id: I3528ba1c3fee761918509f9d9dc2d842c69f5a44	2013-10-16 15:00:48 -07:00
Marco Paniconi	e078c3d854	Initial 1-pass. Change-Id: I58c5436f5c95f6012fb2891cd2a02f76e4870b6a	2013-10-16 12:04:29 -07:00
Guillaume Martres	e55f60240a	Implement variance-based adaptive quantization This should be similar to what x264 does with --aq-mode 1. It works well with clips like parkjoy and touhou (http://x264.nl/developers/Dark_Shikari/LosslessTouhou.mkv). At low bitrates, the segmentation signaling overhead may negate the benefits of this feature. (PGW) Default changed to feature OFF to allow provisional merge. Change-Id: I938abf9bb487e1d4ad3b0264ea03d9826275c70b	2013-10-16 11:55:13 +01:00
Adrian Grange	12b2c712ca	Merge "Updated encoder to handle intra-only frames"	2013-10-15 17:19:28 -07:00
Jingning Han	9b05f23e05	Merge "Make vp9_zero use cases of consistent format"	2013-10-15 16:49:05 -07:00
Alexander Voronov	d6a59fb12c	Updated encoder to handle intra-only frames Updated the encoder to handle frames that are coded intra-only. Intra-only frames must be non-showable, that is, the "show frame" flag must be set to 0 in the frame header. Tested by forcing the ARF frames to be coded intra- only. Note: The rate control code will need to be modified to account for intra-only frames better than they are currently handled. Change-Id: I6a9dd5337deddcecc599d3a44a7431909ed21079	2013-10-15 16:44:02 -07:00
Jingning Han	c8e48f4b02	Make vp9_zero use cases of consistent format Remove the semicolon in the definition of vp9_zero macro. Make all the use cases of vp9_zero of consistent format. Change-Id: Ibaf9751e8595872b12766381a93d185a4d90df8f	2013-10-15 16:12:21 -07:00
Dmitry Kovalev	a4585285ed	Removing unused 8x4 transform from the encoder. Change-Id: Icbcf68b5b685a56f255ebc3859c9692accdadf9e	2013-10-15 11:27:28 -07:00
Yaowu Xu	8b175679be	Masking intra mode choice adaptively The commit changes to mask available intra prediction modes for test based on prediction block size. With this patch, encoding time of CpuUsed 2 reduces from 10% to 20% for HD clips with a compression drop of 0.2% Change-Id: I65f320f1237c0f5ae3a355bf7caf447f55625455	2013-10-11 10:29:53 -07:00
Paul Wilkins	704028d435	Experimental rate control change. When the codec in VBR (or cq) mode hits its max q limits and is struggling to hit a target bandwidth, the bit target per frame collapses. In the first instance normal frames cap out at the maximum allowed Q and then the ARF and GFs do the same. This latter behavior is not generally desirable as GFs and ARFs are only effective from a quality and data rate perspective if they have at lease some level of -Q delta compared to the surrounding frames. In this patch I define a separate max Q for GFs and ARFs that is derived from but somewhat lower than that defined for normal frames. In effect there is a minimum Q delta that will always be available for GFs and ARFs regardless of the target rate and MAXQ setting. This may of course mean that the absolute lowest rate obtainable for a given clip is somewhat higher. Change-Id: I268868b28401900d0cd87e51e609cd3b784ab54a	2013-10-11 13:40:54 +01:00
Paul Wilkins	8b989f5b23	Disable recode loop. For VBR coding disable the recode loop for speeds > 0. Results pending. Change-Id: I2cd9a87c3fcbe39c05b954798d0671a4ca62c37f	2013-10-11 13:38:52 +01:00
Guillaume Martres	b364176c08	Prevent accidental changes to the previous frame mode_infos This is needed to fix mbgraph but shouldn't affect anything else Change-Id: I2f515052f62e348cd3794b7ff0c139802225ea95	2013-10-10 12:18:12 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Jingning Han	4793324c16	Merge "Allow sub8x8 intra modes test for alt frame coding"	2013-10-10 09:00:08 -07:00
Jingning Han	03fe08ca30	Deprecate the use of PARTITION_INFO from encoder Use b_mode_info to store the inter prediction mode of sub8x8 block, in replacement of the use of partition_info. Remove redundant buffer update for partition_info. For bus_cif at 2000 kbps, this seem to make speed 0 about 1% faster. Change-Id: Id1b3be45e75a24fb4b42335ac480c23e440978f6	2013-10-09 09:23:52 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Yaowu Xu	e29137df05	Change to allow less rectangular partion check For CpuUsed 1 & 2, this commit allow to skip retangular partition check when NONE is better than SPLIT. It also changed to allow such logic on alt ref frame coding rather than use square partition all them. The change has gain compressio about .3% on yt and ythd for both 1&2, It helped .6% compression on cif and stdhd for both CpuUsed 1&2. Change-Id: I814b653baf89f59acd20e042629a12938a1bd4e5	2013-10-08 08:12:56 -07:00
Jim Bankoski	2b491c19b8	Merge "cpplint errors in vp9_onyx_if.h"	2013-10-07 14:47:21 -07:00
Jim Bankoski	7eb7dd2fed	cpplint errors in vp9_onyx_if.h Slightly bigger change -> broke up encode_frame_to_datarate, lots of line length fixes. Change-Id: I7c53325e954de130f3fe1a6656626efc6705be82	2013-10-07 13:57:20 -07:00
Dmitry Kovalev	9dba044be2	Merge "Giving consistent names to IDCT/IWHT functions."	2013-10-05 23:44:05 -07:00
Jingning Han	0d0ed6a29b	Allow sub8x8 intra modes test for alt frame coding This commit allows sub8x8 intra modes test in the rate-distortion loop for hd sequences in speed 1 and 2. For sequence y90n of hd set at 8000 kbps, speed 2 runtime goes from 207s to 210s. For ped_1080p at 3000 kbps, speed 2 runtim goes from 336s to 337s. Both are running with 300 frames. This improves compression performance by 0.24% for stdhd and 0.32% for hd. Change-Id: I173ca38a6411565ae6cfadd184c42b2070c5de1f	2013-10-04 19:13:00 -07:00
Dmitry Kovalev	3a0602578e	Giving consistent names to IDCT/IWHT functions. The idea is to have the following names for each transform size: vp9_idct4x4_add vp9_idct4x4_1_add vp9_idct4x4_10_add vp9_idct4x4_16_add vp9_idct8x8_add vp9_idct8x8_1_add vp9_idct8x8_10_add vp9_idct8x8_64_add etc for 16x16, 32x32 The actual list of renames in this patch: vp9_idct_add_lossless -> vp9_iwht4x4_add vp9_short_iwalsh4x4_add -> vp9_iwht4x4_16_add vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add vp9_idct_add -> vp9_idct4x4_add vp9_short_idct4x4_add -> vp9_idct4x4_16_add vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1	2013-10-04 14:17:06 -07:00
Paul Wilkins	44e039b4f5	Further clean up of speed 4 Speed 4 still does not give a big gain over speed 3. This just cleans it up a little from the last patch and comments out features that do not seem to be giving much benefit. Change-Id: I5f366e6160e1dbe5dc45cf5eb90cc02712baa1b6	2013-10-04 16:57:24 +01:00
Paul Wilkins	de6ecc5ac3	Selective masking of split modes. Allow selective masking of individual split modes rather than just a single on / off flag. For speed 2 recovers the large speed loss seen for some derf clips in change Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 and a small quality gain. For speed 1 10 % speed increase observed locally on some derf clips for minimal quality change. Change-Id: If86191087b93cbc05351c26c60c7933e2149e485	2013-10-04 14:20:58 +01:00
Paul Wilkins	03dd2818e4	Missing threshold case for disable split. In relation to change: Refactor inter mode rate-distortion search Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 sf->thresh_mult_sub8x8[THR_INTRA] = INT_MAX missing; Change-Id: Ia86b68a5073368a3e2ca124a27b632243b525c8b	2013-10-04 11:54:24 +01:00
Jingning Han	11abab356e	Refactor inter mode rate-distortion search This commit separates the rate-distortion optimization loop of superblocks from that of sub8x8 blocks. This allows better design rate-distortion optimization search loop for each setting. It also removes the use of SPLITMV and I4X4_PRED therein. No performance change in speed 0 settings. For bus@CIF at 2000kbps, the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster). The overall compression performance on derf changed by -0.021%. Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the overall coding efficiency goes up by 1.629% for derf, 1.236% for yt. Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4	2013-10-03 11:36:49 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to insure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Paul Wilkins	ece99b3da0	Merge "Improved auto_partition_range."	2013-10-03 02:06:13 -07:00
Jingning Han	54bc73151b	Deprecate unused mode count variables Remove mode_check_freq and mode_test_hit_counts from VP9_COMP. Change-Id: Iabfd9f841444cd9bf19ac761a9795f140082ce0b	2013-10-02 11:07:14 -07:00
Paul Wilkins	d12a502ef9	Merge "Alter Speed 3."	2013-09-30 09:12:28 -07:00
Paul Wilkins	65b93c7e52	Improved auto_partition_range. The code now takes into account temporal and spatial information to determine the partition size range, but the frequency counts have been removed. The net effect is similar in quality but about 10% faster. Change-Id: I39a513fb79cec9177b73b2a7218f0da70963ae95	2013-09-30 11:32:57 +01:00
Paul Wilkins	a76caa7ff4	Alter Speed 3. This patch deletes the variance based speed three partitioning. Speed 3 now uses the same partitioning method as speed 2 but with some stricter conditions. The speed and quality are now somewhere between speeds 2 and 4 whereas before it was worse in both than speed 4. Change-Id: Ia142e7007299d79db3ceee6ca8670540db6f7a41	2013-09-30 11:26:46 +01:00
Deb Mukherjee	80d582239e	Some minor changes/cleanups in rate control Some small changes to the quantizer mapping functions. Also includes some cleanups. Change-Id: I9dea29b24015f6e6697012a0e4d8983049d8e5c7 Results: derfraw300: +0.106% stdhdraw250: +0.139%	2013-09-27 13:57:42 -07:00
Deb Mukherjee	b7a93578e5	Small tweak in the constant quality parameter Improves results a little. Change-Id: I7bcac02dbb65b43a993445cf557c520197114e5c	2013-09-24 09:09:35 -07:00
Deb Mukherjee	d11221f433	Improves constant qual, constrained qual turned on Adds modeled functions to decide the qp for altref frames in constant q mode similar to other functions in use in bitrate mode. Also turns on the constrained quality mode (end-usage=2) option which was turned off before. Basic testing shows the mode works in principle, to cap bitrate to the target-bitrate specified, while allowing lower bitrate depending on the cq-level specified. The mode will need to be improved over time. Results for constant quality vs bitrate control mode: derfraw300/fullderfraw: +3.0% at constant quality over bitrate control. fullstdhdraw: +4.341% stdhdraw250: +5.361% Change-Id: If5027c9ec66c8e88d33e47062c6cb84a07b1cda9	2013-09-22 23:04:50 -07:00
Paul Wilkins	cb50dc7f33	Minor clean up. Removed some unused code and minor cleanup / reordering. Change-Id: I4083ae56aeb8edfe9b85aa2f42a16aa28d19da94	2013-09-16 13:45:20 +01:00
Paul Wilkins	3b01778450	Adjustment to mode_skip_start. Corrected values relating to modified mode order. Change-Id: I24fccba3af4bc16721d5e7e51888a66305bfa7fe	2013-09-16 13:44:48 +01:00
Jingning Han	e8a967d960	Merge "Adaptive motion search control"	2013-09-13 14:43:23 -07:00
Jingning Han	c4826c5941	Adaptive motion search control This commit enables adaptive constraint on motion search range for smaller partitions, given the motion vectors of collocated larger partition as a candidate initial search point. It makes speed 0 runtime of bus at CIF and 2000 kbps goes from 167s down to 162s (3% speed-up), at 0.01dB performance gains. In the settings of speed 1, this makes the runtime goes from 33687 ms to 32142 ms (4.5% speed-up), at 0.03dB performance gains. Compression performance wise, it gains at speed 1: derf 0.118% yt 0.237% hd 0.203% stdhd 0.438% Change-Id: Ic8b34c67810d9504a9579bef2825d3fa54b69454	2013-09-13 13:58:10 -07:00
Deb Mukherjee	0c3038234d	Merge "Clean up of the search best filter speed feature"	2013-09-13 11:03:59 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Deb Mukherjee	b964646756	Clean up of the search best filter speed feature Removes this speed feature since it is very slow and unlikely to be used in practice. This cleanup removes a bunch of unnecessary complications in the outer encode loop. Change-Id: I3c66ef1ca924fbfad7dadff297c9e7f652d308a1	2013-09-11 15:16:36 -07:00
Deb Mukherjee	69fe840ec4	Changes in speed 2 settings Propose some changes to the speed 2 settings to improve quality. In particular, turns off the adjust_thresholds_by_speed feature which improves results by 6%. Also removes the code for adjust_thresholds_by_speed since it conflicts with the adaptive rd thresh feature. Overall, with this change speed 2 is -15.2% from speed 0 settings, on derf, which is significantly better than -21.6% down before. Change-Id: I6e90a563470979eb0c258ec32d6183ed7ce9a505	2013-09-11 10:54:07 -07:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), eliminating unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Deb Mukherjee	3d22d3ae0c	Merge "Small tweaks on the constant quality mode"	2013-09-10 11:16:47 -07:00
Deb Mukherjee	09830aa0ea	Small tweaks on the constant quality mode Improves results a little. derf is now +1.078% over bitrate control. Change-Id: I4812136f3e67be21d14ec089419976a32a841785	2013-09-10 10:16:19 -07:00
Yunqing Wang	939791a129	Modify encode breakout for static frames Thank Paul for the suggestions. While turning on static-thresh for static-image videos, a big jump on bitrate was seen. In this patch, we detected static frames in the video using first-pass stats. For different cases, disable encode breakout or reduce encode breakout threshold to limit the skipping. More modification need be done to break incorrect partition picking pattern for static frames while skipping happens. Change-Id: Ia25f47041af0f04e229c70a0185e12b0ffa6047f	2013-09-10 09:06:03 -07:00
Paul Wilkins	4f660cc018	Modified mode skip functionality. A previous speed feature skipped modes not used in earlier partitions but this not longer worked as intended following changes to the partition coding order and in conjunction with some other speed features (Especially speed 2 and above). This modified mode skip feature sets a mask after the first X modes have been tested in each partition depending on the reference frame of the current best case. This patch also makes some changes to the order modes are tested to fit better with this skip functionality. Initial testing suggests speed and rd hit count improvements of up to 20% at speed 1. Quality results. (derf -1.9%, std hd +0.23%). Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1	2013-09-10 13:30:10 +01:00
Ivan Maltz	20abe595ec	Merge "API extensions and sample app for spacial scalable encoder"	2013-09-09 16:57:01 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spacial scalable encoder Sample app: vp9_spatial_scalable_encoder vpx_codec_control extensions: VP9E_SET_SVC VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER VP9E_SET_MIN_Q, VP9E_SET_MAX_Q expanded buffer size for vp9_convolve modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS) Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
James Zern	c1913c9cf4	Merge "Revert "New mode_info_context storage""	2013-09-09 14:38:01 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Paul Wilkins	740acd6891	Merge "Enable kf restrictions at speed 4"	2013-09-09 05:39:13 -07:00
Jim Bankoski	e378566060	Merge "New mode_info_context storage"	2013-09-08 07:16:25 -07:00
Paul Wilkins	f15cdc7451	Enable kf restrictions at speed 4 Change-Id: I453409d3be3f5fe118b15affde45cb52184aef20	2013-09-06 11:16:04 -07:00
Deb Mukherjee	e378a89bd6	Support a constant quality mode in VP9 Adds a new end-usage option for constant quality encoding in vpx. This first version implemented for VP9, encodes all regular inter frames using the quality specified in the --cq-level= option, while encoding all key frames and golden/altref frames at a quality better than that. The current performance on derfraw300 is +0.910% up from bitrate control, but achieved without multiple recode loops per frame. The decision for qp for each altref/golden/key frame will be improved in subsequent patches based on better use of stats from the first pass. Further, the qp for regular inter frames may also be varied around the provided cq-level. Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3	2013-09-06 10:30:53 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of a pointer to a MODE_INFO struct and a "in the image" flag. The MODE_INFO structs are now stored as a stream, eliminating unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00
Jim Bankoski	79401542f7	make vp9 postproc a config option Vp9 postproc is disabled for now as its not been shown to help and may be merged with vp8. Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057	2013-09-04 10:02:08 -07:00
Paul Wilkins	1f4bf79d65	Added per pixel inter rd hit count stats Added some code to output normalized rd hit count stats. In effect this approximates to the average number of rd operations/tests per pixel for the sequence. The results are not quite accurate and I have not bothered to account for partial SB64s at frame edges and for key frames However they do give some idea of the number of modes / prediction methods being tested for each pixel across the different partition sizes. This indicates how much scope their is for further gains either by reducing the number of partitions examined or the modes per partition through heuristics. Patch 3 moved place where count incremented so partial rd tests that are aborted with INT_MAX return are also counted. Example numbers for first 50 frames of Akiyo. Speed 0 ~84.4 rd operations / pixel Speed 1 ~28.8 Speed 2 ~11.9 Change-Id: Ib956e787e12f7fa8b12d3a1a2f6cda19a65a6cb8	2013-08-30 00:13:51 +01:00
Deb Mukherjee	b6dbf11ed5	Merge "Adds a speed feature for fast 1-loop forw updates"	2013-08-29 15:54:04 -07:00
Deb Mukherjee	e02dc84c1a	Adds a speed feature for fast 1-loop forw updates Incorporates a speed feature for fast forward updates of coefficients. This feature takes 3 values: 0 - use standard 2-loop version 1 - use a 1-loop version 2 - use a 1-loop version with reduced updates Results: derfraw300 +0.007% (on speed 0) at feature value = 1 -0.160% (on speed 0) at feature value = 2 There is substantial speed up at speeds 2 and above for low resolution sequences where the entropy updates are a big part of the overall computations. Change-Id: Ie96fc50777088a5bd441288bca6111e43d03bcae	2013-08-28 10:56:52 -07:00
Dmitry Kovalev	78e670fcf8	Merge "Renaming D27 to D207."	2013-08-27 10:03:57 -07:00
Paul Wilkins	aa823f8667	Merge "Changes to adaptive inter rd thresholds."	2013-08-26 12:48:11 -07:00
Paul Wilkins	642696b678	Merge "Limit Key frame Intra modes checks."	2013-08-26 12:34:56 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
Dmitry Kovalev	50ee61db4c	Renaming D27 to D207. I've already renamed d27_predictor to d207_predictor but forgot about the corresponding constant. Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857	2013-08-23 17:33:48 -07:00
Yaowu Xu	8e04257bc5	Merge "Added border extension"	2013-08-23 14:43:58 -07:00
Yaowu Xu	656632b776	Added border extension To the source buffer to be encoded as an alt ref frame. This is to fix the problem of using uninitialized memory in encoder. See https://code.google.com/p/webm/issues/detail?id=605 Change-Id: I97618a2fc207e08abcf5301b734aa9e3ad695e2c	2013-08-23 11:31:28 -07:00
Paul Wilkins	aa5b67add0	Changes to adaptive inter rd thresholds. Values now carried over frame to frame. Change to algorithm for decreasing threshold after a hit and to max threshold (now based on speed) Removed some old commented out code relating to VP8 adaptive thresholds. The impact of these changes tested on Akiyo (50 frames) and measured in terms of unit rd hits is as follows: Speed 0 84.36 -> 84.67 Speed 1 29.48 -> 22.22 Speed 2 11.76 -> 8.21 Speed 3 12.32 -> 7.21 Encode speed impact is broadly in line with these. Change-Id: I5b886efee3077a11553fa950d796fd6d00c8cb19	2013-08-23 16:18:45 +01:00
Paul Wilkins	f76f52df61	Limit Key frame Intra modes checks. Most of the focus so far has been on inter frames. At high speed settings the key frame is now taking a high % of the cycles. This patch puts in some masking to reduce the number of INTRA modes searched during key frame coding (as already happens for inter frames) at higher speed settings TODO: Develop this further with either adaptive rd thresholds when choosing which intra modes to consider or some other heuristic. Impact. At high speed settings on some clips the key frame was starting to dominate. In a coding of the first 50 frames of AKIYO at speed 2 limiting the key frame intra modes to DC or TM_PRED resulted in ~30% overall speedup. For Bus the number was lower at ~4-5%. Change-Id: I7bde68aee04995f9d9beb13a1902143112e341e2	2013-08-23 16:10:30 +01:00
James Zern	a5726ac453	vp9/encoder: fix last_frame_seg_map mem leak remove duplicate allocation from vp9_create_compressor, it was added to vp9_alloc_frame_buffers in: `d5bec52` Added resizing & initialization of last frame segment map Change-Id: I996723226a16a62aff8f9a52ac74e0b73cc98fdf	2013-08-22 14:13:04 -07:00
Jingning Han	01a37177d1	Refactor rd_pick_partition for parameter control This commit changes the partition search order of superblocks from {SPLIT, NONE, HORZ, VERT} to {NONE, SPLIT, HORZ, VERT} for consistency with that of sub8x8 partition search. It enable the use of early termination in partition search for all block sizes. For ped_area_1080p 50 frames coded at 4000 kbps, it makes the runtime goes down from 844305ms -> 818003ms (3% speed-up) at speed 0. This will further move towards making the in-search partition types configurable, hence unifying various speed-up approaches. Some speed 1 and 2 features are turned off during the refactoring process, including: disable_split_var_thresh using_small_partition_info Stricter constraints are applied to use_square_partition_only for right/bottom boundary blocks. Will bring back/refine these features subsequently. At this point, it makes derf set at speed 1 about 0.45% higher in compression performance, and 9% down in run-time. Change-Id: I3db9f9d1d1a0d6cbe2e50e49bd9eda1cf705f37c	2013-08-22 12:36:02 -07:00
Deb Mukherjee	8b810c7a78	Fixes on feature disabling split based on variance Adds a couple of minor fixes, which may be absorbed in Jingning's patch. Thanks to Guillaume for pointing these out. Also adjusts the thresholds for speed 1 and 2 to 16 and 32 respectively, to keep quality drops small. Results: -------- derfraw300: threshold = 16, psnr -0.082%, speedup 2-3% threshold = 32, psnr -0.218%, speedup 5-6% stdhdraw250: threshold = 16, psnr -0.031%, speedup 2-3% threshold = 32, psnr -0.273%, speedup 5-6% Change-Id: I4b11ae8296cca6c2a9f644be7e40de7c423b8330	2013-08-22 07:05:44 -07:00
Deb Mukherjee	2ffe64ad5c	Cleanup/enhancements of switchable filter search Cleans up the switchable filter search logic. Also adds a speed feature - a variance threshold - to disable filter search if source variance is lower than this value. Results: derfraw300 threshold = 16, psnr -0.238%, 4-5% speedup (tested on football) threshold = 32, psnr -0.381%, 8-9% speedup (tested on football) threshold = 64, psnr -0.611%, 12-13% speedup (tested on football) threshold = 96, psnr -0.804%, 16-17% speedup (tested on football) Based on these results, the threshold is chosen as 16 for speed 1, 32 for speed 2, 64 for speed 3 and 96 for speed 4. Change-Id: Ib630d39192773b1983d3d349b97973768e170c04	2013-08-20 09:47:04 -07:00
Paul Wilkins	e8923fe492	Changes to auto partition size selection. Changes to code to auto select a partition size range based on data from spatial neighbors. Now looks at the sb_type in each 8x8 block of above and left SB64. The effect on speed 1 is now weaker giving better quality but less speed gain. Now also used in speed 2. Change-Id: Iace33a97d5c3498dd2a9a8a4067351941abcbabc	2013-08-20 14:05:39 +01:00
Adrian Grange	79f4c1b9a4	Fixed typos and formatting Change-Id: I3814984a624bc64147c57efa74fbdda8eda47262	2013-08-16 09:15:26 -07:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place to segmentatation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Deb Mukherjee	24856b6abc	Speed feature to skip split partition based on var Adds a speed feature to disable split partition search based on a given threshold on the source variance. A tighter threshold derived from the threshold provided is used to also disable horizontal and vertical partitions. Results on derfraw300: threshold = 16, psnr = -0.057%, speedup ~1% (football) threshold = 32, psnr = -0.150%, speedup ~4-5% (football) threshold = 64, psnr = -0.570%, speedup ~10-12% (football) Results on stdhdraw250: threshold = 32, psnr = -0.18%, speedup is somewhat more than derf because of a larger number of smoother blocks at higher resolution. Based on these results, a threshold of 32 is chosen for speed 1, and a threshold of 64 is chosen for speeds 2 and above. Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6	2013-08-15 10:01:45 -07:00
Paul Wilkins	5459f68d71	Trivial clean up. Delete unused / commented out variable references. Change-Id: Iaf20c0c3744f89adb296d153b516b5ea41b4f3b4	2013-08-13 13:26:18 +01:00
Jingning Han	3984b41c87	Fix a compile failure in vp9_get_compressed_data The lf struct is now with VP9_COMMON, instead of MACROBLOCKD. Change-Id: Idfdd4f91f78f486078a138322d58bb61e93e1bc9	2013-08-12 11:42:17 -07:00
Dmitry Kovalev	097046ae28	Merge "Removing redundant code and function arguments."	2013-08-11 12:20:58 -07:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Dmitry Kovalev	67fe9d17cb	Removing redundant code and function arguments. Change-Id: Ia5cdda0f755befcd1e64397452c42cb7031ca574	2013-08-09 17:24:40 -07:00
Dmitry Kovalev	816d6c989c	Moving loopfilter struct to VP9_COMMON. Loop filter configuration doesn't belong to macroblock, so moving it from MACROBLOCKD to VP9_COMMON. Also moving the declaration of loopfilter struct from vp9_blockd.h to vp9_loopfilter.h. Change-Id: I4b3e34be9623b47cda35f9b1f9951f8c5b1d5d28	2013-08-09 14:41:51 -07:00
Yaowu Xu	6ec2b85bad	Added lpf level picking using partial frame Change-Id: I599ab1bd22b5f3f10d5962c609952abdef8ff67a	2013-08-09 07:37:08 -07:00
Yaowu Xu	c7c9901845	added a speed feature on lpf level picking Change-Id: Id578f8afdeab3702fc8386969f2d832d8f1b5420	2013-08-09 07:36:32 -07:00
Deb Mukherjee	2158909fc3	Merge "Adds a new subpel motion function"	2013-08-08 12:26:55 -07:00
Deb Mukherjee	1ba91a84ad	Adds a new subpel motion function Adds a new subpel motion estimation function that uses a 2-level tree-structured decision tree to eliminate redundant computations. It searches fewer points than iterative search (which can search the same point multiple times) but has the same quality roughly. This is made the default setting at speeds 0 and 1, while at speed 2 and above only a 1-level search is used. Also includes various cleanups for consistency and redundancy removal. Results: derf: +0.012% psnr stdhd: +0.09% psnr Speedup of about 2-3% Change-Id: Iedde4866f5475586dea0f0ba4cb7428fba24eee9	2013-08-08 11:41:49 -07:00
Jingning Han	6bfcce8c7a	Merge "Use low precision 32x32fdct for encodemb in speed1"	2013-08-07 19:05:14 -07:00
Jingning Han	debb9c68c8	Use low precision 32x32fdct for encodemb in speed1 The low precision 32x32 fdct has all the intermediate steps within 16-bit depth, hence allowing faster SSE2 implementation, at the expense of larger round-trip error. It was used in the rate-distortion optimization search loop only. Using the low precision version, in replace of the high precision one, affects the compression performance by about 0.7% (derf, stdhd) at speed 0. For speed 1, it makes derf set down by only 0.017%. Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b	2013-08-07 15:34:12 -07:00
Dmitry Kovalev	ea2348ca29	Merge "Removing NMS_STATS defines."	2013-08-07 15:28:30 -07:00
Deb Mukherjee	71b43b0ff0	Clean ups of the subpel search functions Removes some unused code and speed features, and organizes the interfaces for fractional mv step functions for use in new speed features to come. In the process a new speed feature - number of iterations per step during the subpel search - is exposed. No change when this parameter is set as the original value of 3. Results: subpel_iters_per_step = 3: baseline subpel_iters_per_step = 2: psnr -0.067%, 1% speedup subpel_iters_per_step = 1: psnr -0.331%, 3-4% speedup Change-Id: I2eba8a21f6461be8caf56af04a5337257a5693a8	2013-08-06 17:23:50 -07:00
Deb Mukherjee	15b5a6a2c7	Flexible support for various pattern searches Adds a few pattern searches to achieve various tradeoffs between motion estimation complexity and performance. The search framework is unified across these searches so that a common pattern search function is used for all. Besides it will be easier to experiment with various patterns or combinations thereof at different scales in the future. The new pattern search is multi-scale and is capable of using different patterns at different scales. The new hex search uses 8 points at the smallest scale and 6 points at other scales. Two other pattern searches - big-diamond and square are also added. Big diamond uses 4 points at the smallest scale and 8 points in diamond shape at the larger scales. Square is very similar conceptually to the default n-step search but is somewhat faster since it keeps only one survivor across all scales. Psnr/speed-up results on derf300: hex: -1.6% psnr%, 6-8% speed-up big-diamond: -0.96% psnr, 4-5% speedup square: -0.93% psnr, 4-5% speedup Change-Id: I02a7ef5193f762601e0994e2c99399a3535a43d2	2013-08-06 11:56:39 -07:00
Deb Mukherjee	33afddadb9	Merge "Add variance based mode/skipping"	2013-08-06 10:19:15 -07:00
Deb Mukherjee	8b3faccb9e	Add variance based mode/skipping Adds a speed feature to skip all intra modes other than DC_PRED if the source variance is small. This feature is made part of speed 1 and up. Results on derf300: psnr -0.07%, speedup about 1-2% Also uses the source variance to fine-tune the early termination criteria when FLAG_EARLY_TERMINATE is on. This feature is made part of speed 2 and up. Results on derf300: psnr -0.52%, speedup about 5-7% Change-Id: I59e38aa836557cfa5405ae706fc64815cbfe4232	2013-08-05 14:14:01 -07:00
Dmitry Kovalev	d007446b3f	Replacing long block size enum values with shorter ones (2). Change-Id: I428c4d42212b757112e3acfe5b81314cfbb5fd6b	2013-08-05 10:51:02 -07:00
Dmitry Kovalev	5edc65d00d	Removing NMS_STATS defines. Change-Id: Iabab0e59042a33456df1d449c0d0f01debc00c7c	2013-08-02 17:10:15 -07:00
Yunqing Wang	0d68080445	Merge "Comment out 2 unused speed features"	2013-08-02 09:58:46 -07:00
Dmitry Kovalev	741537f3ce	Cleanup: replacing xd->seg with seg, and xd->lf with lf. Change-Id: I73b59d7699a8e7e7acd3bf8041cb6c98ce9ba4bf	2013-08-01 15:38:16 -07:00
Yunqing Wang	7965a6ea34	Comment out 2 unused speed features use_min_partition_size and use_max_partition_size are not used currently, and could be added back if needed later. Change-Id: Ib22a9c06b064567a7c1d6d5445567ed77e0d3acc	2013-08-01 11:03:34 -07:00
Adrian Grange	fbd73648dd	Merge "Cleanup typos, remove unnecessary lines, replace switch"	2013-07-30 12:59:46 -07:00
Adrian Grange	b30a06b930	Cleanup typos, remove unnecessary lines, replace switch Removed unnecessary code lines, replaced switch with an if, fixed spelling errors and formatting. Change-Id: Ie48aa4604aa0ed48362ca359d792fb21b2ec1dc6	2013-07-30 12:10:32 -07:00
Yaowu Xu	a15d1f3134	removed duplication Change-Id: Ica23b66f6664e5a5b168499584f0afffbc54794f	2013-07-30 09:09:14 -07:00
Dmitry Kovalev	c09b81719f	Merge "General cleanups."	2013-07-26 13:59:39 -07:00
Paul Wilkins	fe5e2a91bb	Auto min and max partition size experiment. Speed feature experiment to set an upper and lower partition size limit based on what has been seen in spatial neighbors. This seems to gives quite reasonable speed gains in local (10-15%) and when used with speed 0 the losses are small (0.25% derf, 0.35% stdhd). However, for now I am only enabling it on speed 1 as there may be clashes with the existing temporal partition selection in speed 2. Using a tighter min / max around the range derived from the neighbors increases speed further but at the cost of a bigger quality loss. However, I think this spatial method could be combined with data from either the last frame or a variance method (or both) to refine the range of minimum and maximum partition size. I.e. consider the min and max from spatial and temporal neighbors and the variance recommendation. Change-Id: I1b96bf8b84368d6aad0c7aa600fe141b4f07435f	2013-07-26 18:30:49 +01:00
Dmitry Kovalev	7131cb0e3d	General cleanups. Removing unused constants, macros, and function declarations. Using ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving #include from .h to .c. Merging for loops for motion vectors. Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13	2013-07-25 14:13:48 -07:00
Dmitry Kovalev	47d61f008f	Removing vp9_adapt_mode_context function. Moving code from vp9_adapt_mode_context to vp9_adapt_mode_probs. Change-Id: I60829c30b28968cd813551ef3a206dfb98d323c9	2013-07-25 10:48:45 -07:00
Adrian Grange	a183f17d33	Merge "Correct spelling mistakes"	2013-07-24 09:48:57 -07:00
Adrian Grange	bc8b0529db	Correct spelling mistakes Change-Id: Id4138293efeac4503b2e01ce7a6c150a5abeef77	2013-07-24 07:58:26 -07:00
Dmitry Kovalev	1099a436d3	Moving counts from FRAME_CONTEXT to new struct FRAME_COUNTS. Counts are separate from frame context. We have several frame contexts but need only one copy of all counts. Change-Id: I5279b0321cb450bbea7049adaa9275306a7cef7d	2013-07-23 17:02:08 -07:00
Paul Wilkins	cedd24ec61	Merge "Renaming of segment constants."	2013-07-23 08:16:12 -07:00
Scott LaVarnway	2fd20eb37d	Merge "Eliminated prev_mip memsets/memcpys in encoder"	2013-07-23 06:43:52 -07:00
Paul Wilkins	7c134bc0cd	Merge "Reworked the auto_mv_step_size speed feature"	2013-07-23 04:49:55 -07:00
Paul Wilkins	32042af14b	Renaming of segment constants. Renamed: MAX_MB_SEGMENTS to MAX_SEGMENTS MB_SEG_TREE_PROBS to SEG_TREE_PROBS The minimum unit for segmentation in the segment map is now 8x8 so it is misleading to use MB_ as macro-block traditionally refers to a 16x16 region. Change-Id: I0b55a6f0426bb46dd13435fcfa5bae0a30a7fa22	2013-07-23 12:09:04 +01:00
James Zern	76db4d599a	Merge "VP[89]_COMMON: remove golden/altref frame counts"	2013-07-22 12:55:07 -07:00
Paul Wilkins	1d189d6464	Re-order mode search in rd. Mode search order in rd loop changed to better reflect observed hit counts. Also some adjustment of the baseline mode rd thresholds to reflect the order change and observed frequencies. Change-Id: I47a131cc83e11551df8add6d6d8d413d78d3a63c	2013-07-22 17:21:12 +01:00
Paul Wilkins	888375d243	Fix build error. When CONFIG_POSTPROC is set there was a now invalid reference to cm->filter_level. Changed to cpi->mb.e_mbd.lf.filter_level in line with change Iaf5fb71c33719cdfa1b991f671caf071be9ea035 Change-Id: If746e60044903f7ba8d0d346225b3d015226c7d0	2013-07-22 14:01:43 +01:00
Dmitry Kovalev	ee1771ebaa	Moving all loop filter related variables into new struct. Adding loopfilter struct with fields from MACROBLOCKD and VP9Common. Eventually it will be moved to vp9_loopfilter.h for better code structure. Change-Id: Iaf5fb71c33719cdfa1b991f671caf071be9ea035	2013-07-19 16:19:10 -07:00
Deb Mukherjee	302698fb12	Reworked the auto_mv_step_size speed feature This patch modifies the auto_mv_step_size speed feature to use a combination of the maximum magnitude mv from the last inter frame, and the maximum magnitude mv for the two reference mvs with the same reference. For arf frames, the max mav step for the resolution is used. The bounds therefore are slightly tighter. The feature is made a speed 1 feature. Rebased. Results (when this feature is turned on over speed 0): derfraw300: -0.046% psnr, about 5+% speedup (tested on football: goes from 4m30.760s to 4m17.410s). Change-Id: If492797a61b0b4b3e58c0b8f86afb880165fc9f6	2013-07-19 15:12:56 -07:00
James Zern	5f30a0c687	VP[89]_COMMON: remove golden/altref frame counts these are only used in the encoder. frames_since_golden / frames_till_alt_ref_frame -> VP[89]_COMP Change-Id: Ie14a6f46987bced685ddb449b85dc261caba6dfe	2013-07-18 14:09:21 -07:00
Dmitry Kovalev	9f3c0e34a9	Moving Scale2Ration function from vp9_onyx.h to vp9_onyx_if.c. Change-Id: Idfe2a850f72b38f519aea1aac1266d8c3aa813ee	2013-07-18 14:05:06 -07:00
Yunqing Wang	3798db88e1	Remove unnecessary calling of vp9_init_quantizer() vp9_init_quantizer() is called in vp9_create_compressor(), and should not be called in vp9_set_speed_features(). Change-Id: Ic2f1f4b0531b9d46bb841d7e1d8da9812207dad6	2013-07-17 14:59:00 -07:00
Yunqing Wang	10e83b0717	Enable disable_splitmv feature for other speeds Added disable_splitmv feature at other speed levels. For speed 3 or above, always turn it on. Change-Id: Ibb36f0a7ef12a34b4f8d0f9cb6193eab43b34360	2013-07-17 10:25:49 -07:00
Yunqing Wang	df90d58f4f	Speed up motion estimation using small partitions' result(experiment) Current partition checking starts from small sizes, and then goes up to large sizes. This experiment uses the small partitions' motion estimation result, which is already available, to speed up the large partition's motion estimation. We can decide to skip some patition checkings if they are unlikely choices. We could use the motion vector(MV) result as current partition's prediction MV, limit the search range and reference frame. Current result at speed 1: psnr loss: 1.19% for stdhd, 0.287% for derf. speed gain: 14% for sunflower(hd), 11% for akiyo. Further improvement will be done later. Change-Id: I5abfd070e9cace2e91e2a0247d1325df313887ab	2013-07-17 09:11:47 -07:00
Paul Wilkins	d66eab15dd	Merge "Move uv intra mode selection in rd loop."	2013-07-17 05:19:26 -07:00
Paul Wilkins	2ee338ce3b	Move uv intra mode selection in rd loop. Use an estimate based on DC_PRED for intra uv cost within the rd loop then only do a full uv mode analysis if an intra mode is chosen. Significant speed gains in some cases. Currently only enabled for speed 2 pending speed/quality tests. Change-Id: Ie851a12400d5483bce47ec0e3ccb8516041e91c0	2013-07-17 11:11:21 +01:00
Dmitry Kovalev	9482a0bf10	Cleaning up tile code. Removing tile_rows and tile_columns from VP9Common, removing redundant constants MIN_TILE_WIDTH and MAX_TILE_WIDTH, changing signature of vp9_get_tile_n_bits. Change-Id: I8ff3104a38179b2c6900df965c144c1d6f602267	2013-07-16 14:47:15 -07:00
James Zern	9581eb6e8a	use consistent framerate naming s/frame_rate/framerate/g Change-Id: I6fc3e088e419c5f46e3a9390dd8a2cad2677a2fc	2013-07-16 14:12:47 -07:00
James Zern	3a7c2665d0	Merge "yv12config: remove YUV_TYPE"	2013-07-16 12:16:04 -07:00
Yaowu Xu	5b915ebd92	Change to extend full border only when needed This is a short term optimization till we work out a decoder implementation requiring no frame border extension. Change-Id: I02d15bfde4d926b50a4e58b393d8c4062d1be70f	2013-07-15 20:52:13 -07:00
Jingning Han	faff6ed0fb	Skip duplicate block encoding in the rd loop This speed feature allows the encoder to largely remove the spatial dependency between blocks inside a 64x64 superblock, thereby removing the need to repeatedly encode superblocks per partition type in the rate-distortion optimization loop. A major challenge lies in the intra modes tested in the rate-distortion optimization loop. The subsequent blocks do not have access to the reconstructed boundary pixels without the intermediate coding steps. This was resolved by using the original pixels for intra prediction in the rd loop, followed by an appropriately designed distortion modeling on the quantization parameters. Experiments also suggested that the performance impact is more discernible at lower bit-rate/psnr settings. Hence a quantizer dependent threshold is applied to deactivate skip of block coding. For bus_cif at 2000 kbps, speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB performance loss. speed 1: runtime 65312ms -> 61536ms, (7% speed-up) at 0.04dB performance loss. This operation is currently turned on in settings of speed 1. Change-Id: Ib689741dfff8dd38365d8c1b92860a3e176f56ec	2013-07-15 11:08:58 -07:00
James Zern	4fc6c88e9c	yv12config: remove YUV_TYPE this was never fleshed out in the context of VP8, for which it was added. for VP9 it has no meaning. Change-Id: Iba2ecc026d9e947067b96690245d337e51e26eff	2013-07-12 15:25:48 -07:00
Paul Wilkins	b8ddc9f0d3	Merge "Speed 2 feature adjustment."	2013-07-12 02:14:01 -07:00
Dmitry Kovalev	c4ad3273c7	Moving segmentation related vars into separate struct. Adding segmentation struct to vp9_seg_common.h. Struct members are from macroblockd and VP9Common structs. Moving segmentation related constants and enums to vp9_seg_common.h. Change-Id: I23fabc33f11a359249f5f80d161daf569d02ec03	2013-07-11 11:57:57 -07:00
Scott LaVarnway	f2a6bcfb18	Eliminated prev_mip memsets/memcpys in encoder This patch is in experimental but was not merged into master. This patch swaps ptrs instead of copying and uses the last show_frame flag instead of setting the entire buffer to zero. Change-Id: Ia0950466c8ba301a2a5bf917ff3d07bc1a2c2311	2013-07-11 10:47:28 -04:00
Paul Wilkins	5290eeab88	Speed 2 feature adjustment. With sf->auto_mv_step_size on it is questionable whether sf->reduce_first_step_size is worthwhile. At speed 2 it was not having a big impact. Even at speed 2 sf->optimize_coefficients = 0 is not having a big speed imapct so for now I have moved it down into a higher speed setting. Change-Id: I8a54de76d486ad37aabce76474889da2768b14c1	2013-07-11 13:59:12 +01:00
Deb Mukherjee	53ff43adc3	Prunes out full-rd computation based on modeled rd Adds a speed feature to eliminate full-rd computation if the modeled rd or rd based on a different parameter in the same mode is already a lot larger than the best rd yet. Specifically, only search the sharp and smooth filters if the modeled rd cost based on the regular filter is within a certain factor of the best rd cost so far. Also, skip full-rd computation of non splitmv inter modes if the modeled rd cost based on pred error is within the same factor of the best rd cost so far. Also adds some enhancements in the rd search for splitmv mode to speed things up by early breakouts. Negligible impact on performance. Resuts on derfraw300: psnr: -0.013% with the splitmv enhancements, -0.24% with the rd breakout feature on. speedup: 6% with splitmv enhancements, 20% with also residual breakout (tested on football sequence at 600 Kbps) Change-Id: I37abc308ea9f110c1679ce649b6a7e73ab1ad5fc	2013-07-10 13:49:49 -07:00
Yaowu Xu	bed27a960a	Add a feature to reduce chrome intra mode search Change-Id: I721ebdeef2b53ce3e5c3eba3f7462ae2103c95a8	2013-07-10 08:59:18 -07:00
Ronald S. Bultje	ed995afba1	Make frame-wide filter-type decision fully RD-based. Overall, on all test sets, this gains about +0.2% on all metrics. City is a clip where this really hurts (-1.0% on all metrics), I'm not quite sure why yet. Maybe interesting to look into in the future. Change-Id: I6f0eecb20e72f0194633270d30bf00d76d9eae78	2013-07-08 16:22:37 -07:00
Deb Mukherjee	d9b62160a0	Implements several heuristics to prune mode search Skips mode searches for intra and compound inter modes depending on the best mode so far and the reference frames. The various heuristics to be used are selected by bits from a flag. The previous direction based intra mode search pruning is also absorbed in this framework. Specifically the flags and their impact are: 1) FLAG_SKIP_INTRA_BESTINTER (skip intra mode search for oblique directional modes and TM_PRED if the best so far is an inter mode) derfraw300: -0.15%, 10% speedup 2) FLAG_SKIP_INTRA_DIRMISMATCH (skip D27, D63, D117 and D153 mode search if the best so far is not one of the closest hor/vert/diagonal directions. derfraw300: -0.05%, about 9% speedup 3) FLAG_SKIP_COMP_BESTINTRA (skip compound prediction mode search if the best so far is an intra mode) derfraw300: -0.06%, about 7-8% speedup 4) FLAG_SKIP_COMP_REFMISMATCH (skip compound prediction search if the best single ref inter mode does not have the same ref as one of the two references being tested in the compound mode) derfraw300: -0.56%, about 10% speedup Change-Id: I1a736cd29b36325489e7af9f32698d6394b2c495	2013-07-08 12:17:12 -07:00
Paul Wilkins	ef0ca2deaa	Merge "Fix to comp_inter_joint_search_thresh feature."	2013-07-04 03:27:00 -07:00
Dmitry Kovalev	430bd0c94a	Merge "Replacing 64 / MI_SIZE with MI_BLOCK_SIZE."	2013-07-03 14:16:02 -07:00
Dmitry Kovalev	5a21de8418	Replacing 64 / MI_SIZE with MI_BLOCK_SIZE. Change-Id: I32276552b3ea6dc1dce8e298be114cfe1019b31c	2013-07-03 10:54:50 -07:00
Paul Wilkins	f58b44ad62	Fix to comp_inter_joint_search_thresh feature. When this is 0 (BLOCK_SIZE_AB4X4) we want to do the inter joint search for all sizes. Change-Id: Id40cd6fe7790e7e1165352b9cef5e12fa8c0bc88	2013-07-03 16:58:34 +01:00
Paul Wilkins	72c5778ec5	Added two new skip experiments. sf->unused_mode_skip_lvl. Tests modes as normal for all sizes at or below the given level. At larger sizes it skips all modes that were not chosen at any smaller size. Hence setting BLOCK_SIZE_SB64X64 is in effect off. Setting BLOCK_SIZE_AB4X4 will only consider modes that were chosen for one or more 4x4 blocks at larger sizes. sf->reference_masking. Do a test encode of the NONE partition at one size and create a reference frame mask based on the best rd choice. In the full search only allow this reference frame. Currently it is testing 64x64 and repeats this in the full search. This does not work well with Jim's Partition code just now and is disabled by default. Change-Id: I8f8c52d2ef4a0c08100150b0ea4155d1aaab93dd	2013-07-03 16:56:06 +01:00
Paul Wilkins	b0a2871c35	Merge "Adjust Speed 0 settings."	2013-07-03 02:47:18 -07:00
Yaowu Xu	0d7b7c09cb	Added a speed feature use_square_partition_only This commit adds a speed feature where only squared partition are evaluated in partition picking. Enable this feature in cpu-used 2 reduces encoding time by ~30%. loss of compression: -0.9% on cif set -1.23% on stdhd Change-Id: Ia6fad11210f0b78365abb889f9245604513be5b9	2013-07-02 16:40:15 -07:00
Deb Mukherjee	37501d687c	Speed feature to binary search dir intramodes This speed feature will skip searching the directional intra prediction modes D63, D117, D27, D153 if the best intra mode so far is not one of the diagonal, horizontal or vertical directions closest to the respective directions being tested. In other words, this implements a sort of binary search in the angular domain. Speedup: about 9-10% Results: -0.05% only on derfraw300. Change-Id: I413584c41f2a3e8dabfbdeb40718c8fc4b1d63a2	2013-07-02 14:07:19 -07:00
Deb Mukherjee	8d3d2b76f3	Tx size selection enhancements (1) Refines the modeling function and uses that to add some speed features. Specifically, intead of using a flag use_largest_txfm as a speed feature, an enum tx_size_search_method is used, of which two of the types are USE_FULL_RD and USE_LARGESTALL. Two other new types are added: USE_LARGESTINTRA (use largest only for intra) USE_LARGESTINTRA_MODELINTER (use largest for intra, and model for inter) (2) Another change is that the framework for deciding transform type is simplified to use a heuristic count based method rather than an rd based method using txfm_cache. In practice the new method is found to work just as well - with derf only -0.01 down. The new method is more compatible with the new framework where certain rd costs are based on full rd and certain others are based on modeled rd or are not computed. In this patch the existing rd based method is still kept for use in the USE_FULL_RD mode. In the other modes, the count based method is used. However the recommendation is to remove it eventually since the benefit is limited, and will remove a lot of complications in the code (3) Finally a bug is fixed with the existing use_largest_txfm speed feature that causes mismatches when the lossless mode and 4x4 WH transform is forced. Results on derf: USE_FULL_RD: +0.03% (due to change in the tables), 0% encode time reduction USE_LARGESTINTRA: -0.21%, 15% encode time reduction (this one is a pretty good compromise) USE_LARGESTINTRA_MODELINTER: -0.98%, 22% encode time reduction (currently the benefit of modeling is limited for txfm size selection, but keeping this enum as a placeholder) . USE_LARGESTALL: -1.05%, 27% encode-time reduction (same as existing use_largest_txfm speed feature). Change-Id: I4d60a5f9ce78fbc90cddf2f97ed91d8bc0d4f936	2013-07-02 13:54:00 -07:00
Yunqing Wang	b12e060b55	Add speed feature to disable splitmv Added a speed feature in speed 1 to disable splitmv for HD (>=720) clips. Test result on stdhd set: 0.3% psnr loss and 0.07% ssim loss. Encoding speedup is 36%. (For reference: The test result on derf set showed 2% psnr loss and 1.6% ssim loss. Encoding speedup is 34%. SPLITMV should be enabled for small resolution videos.) Change-Id: I54f72b94f506c6d404b47c42e71acaa5374d6ee6	2013-07-02 10:02:34 -07:00
Paul Wilkins	1319d9c077	Adjust Speed 0 settings. Remove the use of sf->comp_inter_joint_search_thresh from the baseline speed 0. Approx +0.4% on derf. Change-Id: Icc14db98909830f40e5ac66130d40e78d2e55c71	2013-07-02 15:42:14 +01:00
Paul Wilkins	b7cd01ed73	Revert "New motion threshold factor - speed feature." This reverts commit `1377278180`. Also fixes a spelling mistake. Change-Id: I5be8aa4d8d3c0323d4a6f41968a7b2c048949c3f	2013-07-02 15:06:40 +01:00
Yaowu Xu	9e408e3504	fix the mismatch again in cpu_used 2 Change-Id: Icc4f70f0b0f91c9e7d5d00eedd67841afe2f2679	2013-07-01 19:13:18 -07:00
Jim Bankoski	d4158283e7	use partitioning from last frame This cl converts use partition from last frame to do the following: if part is none,horz, vert -> try split if part != none and one of the children is not split - try none Change-Id: I5b6c659e35f3ac9f11c051b92ba98af6d7e8aa87 Signed-off-by: Jim Bankoski <jimbankoski@google.com>	2013-07-01 18:18:50 -07:00
Paul Wilkins	1377278180	New motion threshold factor - speed feature. Added a speed feature that focuses only on thresholds for new motion modes. Moved sf->comp_inter_joint_search_thresh into speed 1. This has ~+0.4% impact on quality at speed 0 as our quality reference baseline. Slight adjustment to baseline thresholds. Change-Id: I7ebf104f1fe29af77ed4837b2e84be065621bbe5	2013-07-01 12:11:21 +01:00
Dmitry Kovalev	59070f6e3c	Merge "Removing CONFIG_DEBUG checks on assertions."	2013-06-28 14:03:28 -07:00
Dmitry Kovalev	8e6ce6bb9e	Removing CONFIG_DEBUG checks on assertions. Adding CHECK_MEM_ERROR macro to vp9_common.h and removing two duplicated ones from vp9_onyx_int.h and vp9_onyxd_int.h. Change-Id: I916afec61b3019f18193135dac7c35ed0f89b8b6	2013-06-28 10:36:20 -07:00
Yaowu Xu	1374a06bd8	Optimize partition search order This commit change the partition search order to allow checking of rectangular partition to be done after square partitions. It also added a speed feature to skip rectangular partition check when NONE is better than SPLIT in RD sense. This feature roughly speed up encoder by 1.5X with loss on compression -0.91% on cif set -0.56% on stdhd set Change-Id: I0d2d06993041aa9ea9073fcc39c54f73a127dfa4	2013-06-28 07:13:54 -07:00
Paul Wilkins	9f3ab83486	Auto adapt step size feature. Also tweaks to other features and experiments with what is on and off at different speed settings. Change-Id: I3e1d0be0d195216bf17c2ac5df67f34ce0b306b2	2013-06-26 19:48:39 +01:00
Paul Wilkins	e606cac046	Change meaning of cpi->sf.first_step and rename. Renamed cpi->sf.first_step to cpi->sf.reduce_first_step_size and changed its meaning such that it is a delta applied to reduce the default first step size (>> x) in the motion search rather than an absolute value. The default first step size is already changed according to the image dimensions (smaller for smaller images). cpi->sf.reduce_first_step_size now applies a further correction from the default. Change-Id: Ia94e08bc24c67b604831f980909af7e982fcd16d	2013-06-26 17:04:06 +01:00
John Koleszar	7bbb0633cd	Merge "Move vp9_full_to_model_counts to encoder"	2013-06-25 22:44:16 -07:00
Ronald S. Bultje	450c7b57a8	Only do metrics on cropped (visible) area of picture. The part where we align it by 8 or 16 is an implementation detail that shouldn't matter to the outside world. Change-Id: I9edd6f08b51b31c839c0ea91f767640bccb08d53	2013-06-25 12:57:28 -07:00
Ronald S. Bultje	c24d922396	Add averaging-SAD functions for 8-point comp-inter motion search. Makes first 50 frames of bus @ 1500kbps encode from 3min22.7 to 3min18.2, i.e. 2.3% faster. In addition, use the sub_pixel_avg functions to calc the variance of the averaging predictor. This is slightly suboptimal because the function is subpixel-position-aware, but it will (at least for the SSE2 version) not actually use a bilinear filter for a full-pixel position, thus leading to approximately the same performance compared to if we implemented an actual average-aware full-pixel variance function. That gains another 0.3 seconds (i.e. encode time goes to 3min17.4), thus leading to a total gain of 2.7%. Change-Id: I3f059d2b04243921868cfed2568d4fa65d7b5acd	2013-06-25 12:57:28 -07:00
Yaowu Xu	e371cd73a3	change to enable use_largest_txform feature for all regular inter frames at speed 1 Change-Id: I0a8b301273ecf2b8730ab1f6b7a05f89f4d498e0	2013-06-24 16:43:26 -07:00
John Koleszar	08b1798ae7	Move vp9_full_to_model_counts to encoder This function is not called from the decoder, so it doesn't need to be in common/. Change-Id: I6977dd462a25b4ff39c9c7e1b0b5b16aa58ee733	2013-06-24 15:46:15 -07:00
Yaowu Xu	45e25a7814	Get some speed back for cpuused 1 and remove unused code. Change-Id: If380440c4450294b5450b7a9eeb94a376846ec01	2013-06-20 19:05:18 -07:00
Dmitry Kovalev	8283d893eb	Merge "Renaming 'nmv' to 'mv' for several functions."	2013-06-20 10:17:12 -07:00
Jim Bankoski	9f2a1ae23e	adds force partitioning greater than or less than block size adds a new speed feature to force partitioning to be greater than or less than a certain size Change-Id: I8c048eeeef93700ae822eccf98f8751a45b2e7d0	2013-06-20 09:51:42 -07:00
Jim Bankoski	18bdf708e7	adds a set partitioning to speed features this feature lets you set a partitioning size to be used by the entire frame. Change-Id: I208a4c8c701375cbb054418266f677768b6f8f06	2013-06-20 09:50:44 -07:00
Jim Bankoski	476d73d294	partition by variance using var from last frame This uses variance to split partition. Variance is calculated using nearest mv, always from last ref frame. Change-Id: Idd015b4a9aa3bc82591759eac239680c07496896	2013-06-20 09:48:22 -07:00
Jim Bankoski	1f94b97694	convert all speed things to speed features Change-Id: Ie24489a4d39f3e53e816eeebf75a1c9c7d94515a	2013-06-20 09:42:44 -07:00
Jim Bankoski	0fad6a9d99	fix to set up new speed feature This uses the speed feature functionality for code. Change-Id: I9cd16c0c5f98520ae27ebba81aa2c178546587f8	2013-06-20 09:35:02 -07:00
Dmitry Kovalev	87e1fa7627	Renaming 'nmv' to 'mv' for several functions. Change-Id: I183a38997a9d01e4a1b869e92509f6915216fa09	2013-06-18 18:28:10 -07:00
John Koleszar	ad3b12f857	Merge "Fix chroma output when scaling"	2013-06-12 12:39:10 -07:00
John Koleszar	01016ff9a6	Fix chroma output when scaling The encode-side scaling was not indexing through the image correctly for the chroma planes, causing a green checkerboard-like output in the unit test. Change-Id: I9abbd73615404cd6699588be3e64dcf59005bc14	2013-06-12 10:11:53 -07:00
Deb Mukherjee	51a7c7631d	Merge "New probs for filters/tx_size and a few others" into experimental	2013-06-10 16:39:43 -07:00
Deb Mukherjee	a43ff15399	New probs for filters/tx_size and a few others * New probs for subpel filters/tx_count * Makes a change to not reset to defaults for the tx_size probs if an intermediate frame reverts to using a fixed tx_size. * A few updates to the parameters for backward adaptation for mode/mv * some cosmetic cleanups derf300: +0.06% Change-Id: I22994d659bc31ca7a4fc8820fde24001e64a2920	2013-06-10 16:38:47 -07:00
John Koleszar	0fcb625e35	Remove remnants of VP8 profiles/versions Remove the bilinear filter mode, and the no-loopfilter mode, and the related vp9_setup_version() function. Change-Id: I32311367812faf37863131df3af37d63d03973d7	2013-06-10 15:55:03 -07:00
Ronald S. Bultje	20760254f6	Merge "Align frame size to 8 instead of 16." into experimental	2013-06-08 17:39:41 -07:00
Yaowu Xu	b7da6d0c5a	Merge "Handle partition type coding of boundary blocks" into experimental	2013-06-07 18:16:16 -07:00
Ronald S. Bultje	71701f3d40	Align frame size to 8 instead of 16. Change-Id: Ic606ef1b31e49963a779455a1e010a9ebb0f3f1f	2013-06-07 17:20:50 -07:00
Deb Mukherjee	21401942b0	Coding tx-size selection by use of spatial context Adds coding of transform size within a frame by use of context of transform sizes selected in left and above blocks. Also incorporates code for generating stats. TODO: generate and incorporate new default stats Change-Id: I6a7af099f6ad61d448521d9a51167aedaf638ed6	2013-06-07 16:07:58 -07:00
Deb Mukherjee	869a39ba60	Cleans up mbskip encoding Refactors mbskip coding to be compatible with coding of the rest of the symbols. Adds forward/backward adaptation and removes a lot of the legacy code. Results: fast50: +1.6% derfraw300: +0.317% Change-Id: I395a2976d15af044d3b8ded5acfa45f6f065f980	2013-06-07 16:00:26 -07:00
Jingning Han	78b8190cc7	Handle partition type coding of boundary blocks The partition types of blocks sitting on the frame boundary are constrained by the block size and the position of each sub-block relative to the frame. Hence we use truncated probability models to handle the coding of such information. 100 frames run: yt 0.138% Change-Id: I85d9b45665c15280069c0234ea6f778af586d87d	2013-06-07 14:19:40 -07:00
Ronald S. Bultje	d5c2d2dc94	Fix line that disables the line above it. Change-Id: I19d5cb60a00a001f6e5b3d90ce2db6e49d6209ad	2013-06-07 13:57:28 -07:00
Paul Wilkins	340c7a48e6	Change to segment ref frame feature. Simplify feature to only support a single reference frame instead of a mask. Change-Id: I5dd3a98c7a224aafb35708850ab82e2f220e68fb	2013-06-07 21:42:22 +01:00
Yaowu Xu	0bb6da3668	Merge "Remove two un-used entries in mode_lf_delta[]" into experimental	2013-06-07 10:10:45 -07:00
Yaowu Xu	b097a3ba82	Remove two un-used entries in mode_lf_delta[] With the removal of i4X4 and SPLIT_MV modes, the two entries for the modes are no longer used. This patch remove the coding of the deltas. Change-Id: Iea4eb500404ebe9706159380a03b8eca542fb4c3	2013-06-07 09:24:09 -07:00
Deb Mukherjee	78fbaf4d84	Merge "Coding updates for tx-size selection" into experimental	2013-06-07 09:19:36 -07:00
Deb Mukherjee	3ee1a21a42	Coding updates for tx-size selection Changes to the coding of transform sizes, along with forward and backward probability updates. Results: derf300: +0.241% Context based coding of transform sizes will be in a separate patch. Change-Id: I97241d60a926f014fee2de21fa4446ca56495756	2013-06-07 08:54:00 -07:00
Paul Wilkins	576c2bb021	Fix bug in segment skip. Wrong max data size (skip has no data) and use of vp9_get_segdata() when it should be vp9_segfeature_active(). Change-Id: I1eb97d33df6e2a42cc589049f704266fe3639902	2013-06-07 13:27:08 +01:00
Ronald S. Bultje	6ef805eb9d	Change ref frame coding. Code intra/inter, then comp/single, then the ref frame selection. Use contextualization for all steps. Don't code two past frames in comp pred mode. Change-Id: I4639a78cd5cccb283023265dbcc07898c3e7cf95	2013-06-06 17:28:09 -07:00
Paul Wilkins	c3316c2bc5	Rd thresholds change with block size. Added structures to support independent rd thresholds for different block sizes (and set experimental block size correction factors). Added structure to to allow dynamic adaptation of thresholds per mode and per block size basis depending on how often the mode/block size combination is seen (currently fixed factor). Removed some unused variables. TODO - Adaptation of thresholds based on how often each mode chosen. - The baseline mode values could also be adjusted based on the block size (e.g. for a particular intra mode use a low threshold for 4x4 prediction blocks but a relatively high value for 64x64. Change-Id: Iddee65ff3324ee309815ae7c1c5a8584720e7568	2013-06-06 15:45:53 +01:00
Paul Wilkins	c880e02f97	Turn off compound inter search refinement for good quality. Turn this feature off for some modes in "good" quality. Change-Id: I3f262d62cca8f01736b977af1465291e8be29f0a	2013-06-06 15:44:25 +01:00
Deb Mukherjee	30226a658f	Cosmetic renaming VP9_MVREFS to VP9_INTER_MODES NO bitstream change Change-Id: I79f6146dac5fdd157051b6f8dc611c0b7b5e5f7f	2013-06-05 11:24:01 -07:00
Deb Mukherjee	83885235a7	Clean-ups on switchable interpolation and mv_ref Adds backward adaptation and differential forward updates of switchable interpolation filter probabilities. Also adds some cosmetic cleanups and minor fixes on mv_ref probabilities. derfraw300: +0.353% (with most coming from switchable interp changes) Change-Id: Ie2718be73528c945fd0d80cfd63ca2d9cb3032de	2013-06-05 10:11:52 -07:00
Ronald S. Bultje	a288cb3b10	Merge "Merge all various transform size data trackers into single variables." into experimental	2013-05-31 09:59:24 -07:00
Scott LaVarnway	1e025dbfd1	Merge "Moved use_prev_in_find_mv_refs check to frame level" into experimental	2013-05-31 09:35:51 -07:00
Ronald S. Bultje	e9d68a5e36	Merge all various transform size data trackers into single variables. Change-Id: I2dfc569106b29fbe4da20585a0e85e5e9ea6a4db	2013-05-31 09:18:59 -07:00
Jim Bankoski	9e176494c2	put back in lost speedups speed >1 can be spead up by turning these on - lost in a prior commit Change-Id: Iaef85e10ecfeec3aea5ab0e691edf02bb7f5190d	2013-05-31 06:47:40 -07:00
Paul Wilkins	aaf61dfbca	Merge "Patch to remove implicit segmentation." into experimental	2013-05-31 02:56:20 -07:00
Ronald S. Bultje	310bc1030a	Merge "Merge VP9_YMODES, VP9_UV_MODES, INTRA_MODE_COUNT and cousins." into experimental	2013-05-30 20:58:19 -07:00
Ronald S. Bultje	6ea6f4d253	Merge "Remove one (unused) entry from mvref tables." into experimental	2013-05-30 20:58:13 -07:00
Jim Bankoski	ced21bd6a6	Creates a new speed 1: This speed 1 - uses variance threshold stolen from static-thresh to determine split. Any superblock with greater than the variance set by static thresh * quantizer index squared is split. In addition transform size is set to largest size less than or equal to partition size, sub pixel filter is set to normal, and only 12 modes are used at all. Change-Id: If7a2858ee70f96d1eb989c04fd87a332b147abef	2013-05-30 19:53:00 -07:00
Ronald S. Bultje	a433abbcad	Merge VP9_YMODES, VP9_UV_MODES, INTRA_MODE_COUNT and cousins. These are now merged in a new define called VP9_INTRA_MODES. Change-Id: I0890f895756a7395d84c92f98f43e43f4cf9050d	2013-05-30 17:21:06 -07:00
Ronald S. Bultje	580d29bdbb	Remove one (unused) entry from mvref tables. Change-Id: Ieb4669ae564bec9f3051485ecdf186cb4e00decb	2013-05-30 17:21:06 -07:00
Ronald S. Bultje	f5827699bf	Merge "Merge all intra mode coding trees into a single one." into experimental	2013-05-30 11:27:51 -07:00
Jingning Han	5e97862a71	Merge "Enable iterative motion search for 4x4 inter pred" into experimental	2013-05-30 11:02:10 -07:00
Adrian Grange	6f361f5841	Merge "Add intra_only and reset_frame_context flags" into experimental	2013-05-30 10:56:25 -07:00
Ronald S. Bultje	98c192ae83	Merge all intra mode coding trees into a single one. Also merge all counters. This removes a few unused probability updates from the bitstream. Change-Id: I20f58853e9dac84d8c0d9703ae012c55917516eb	2013-05-30 09:58:53 -07:00
Deb Mukherjee	c98bfcfbbb	Merge "Balancing coef-tree to reduce bool decodes" into experimental	2013-05-30 08:10:47 -07:00
Paul Wilkins	1b103f250f	Patch to remove implicit segmentation. This patch removes the implicit segmentation experiment from the code base as the benefits were still unproven as of the bitstream deadline. Change-Id: I273b99d8d621d1853eac4182f97982cb5957247e	2013-05-30 11:06:29 +01:00
Jingning Han	87626a8f6e	Enable iterative motion search for 4x4 inter pred This commit enables iterative motion search for 4x4/4x8/8x4 block size compound inter-inter prediction. WIP: borg run testing Change-Id: I2b318db4a03cdca5a8002b3fa6c0fa89b129288b	2013-05-30 10:49:35 +01:00
Adrian Grange	9e5bb9598c	Add intra_only and reset_frame_context flags Added two flags to the frame header: intra_only: Signals that the frame is encoded using only INTRA coding modes. reset_frame_context: Indicates that the coding context specified in the frame header should be reset to default values before the frame is encoded/decoded. Change-Id: I182d46f1f84fb67a13c46ad767f246a38d7861a2	2013-05-29 17:16:00 -07:00
Deb Mukherjee	407eb03ad7	Merge "Build fix when ENTROPY_STATS is defined" into experimental	2013-05-29 17:01:43 -07:00
Deb Mukherjee	b8b3f1a46d	Balancing coef-tree to reduce bool decodes This patch changes the coefficient tree to move the EOB to below the ZERO node in order to save number of bool decodes. The advantages of moving EOB one step down as opposed to two steps down in the other parallel patch are: 1. The coef modeling based on the One-node becomes independent of the tree structure above it, and 2. Fewer conext/counter increases are needed. The drawback is that the potential savings in bool decodes will be less, but assuming that 0s are much more predominant than 1's the potential savings is still likely to be substantial. Results on derf300: -0.237% Change-Id: Ie784be13dc98291306b338e8228703a4c2ea2242	2013-05-29 16:25:52 -07:00
Scott LaVarnway	353642bc53	Moved use_prev_in_find_mv_refs check to frame level This patch checks at the frame level to see if the previous mode info context can be used. This patch eliminates the flag check that was done for every mode and removes another check that was done prior to every vp9_find_mv_refs(). Change-Id: I9da5e18b7e7e28f8b1f90d527cad087073df2d73	2013-05-29 16:42:23 -04:00
Dmitry Kovalev	18c83b3714	Compressed/uncompressed frame header changes. Adding API to read/write uncompressed frame header bits (it is not final yet). Separate functions to read/write uncompressed header. Moving clr_type, error_resilient_mode, refresh_frame_context, frame_parallel_decoding_mode, frame_context_idx from compressed partition to uncompressed frame header. Change-Id: Id3ed8a387980c652ae147549412f4ec24a0a5bd0	2013-05-28 18:07:54 -07:00
Deb Mukherjee	a09707b7be	Build fix when ENTROPY_STATS is defined Fixes a build issue due to removal of VP9_KF_BINTRAMODES macro, when ENTROPY_STATS is on. Change-Id: I75c61702bf626376c942ab49ab887714b43284f0	2013-05-28 17:07:27 -07:00
Paul Wilkins	245a11553a	Merge "Remove loop dering experiment." into experimental	2013-05-28 05:34:14 -07:00
Dmitry Kovalev	1a24011469	Revert "Adding API to read/write uncompressed frame header bits." because of bitstream mismatches. This reverts commit `df037b615f` Change-Id: I1a529f2590df7bc912f5035d22311268933e3dd6	2013-05-28 02:24:52 -07:00
Ronald S. Bultje	01b7f72aab	Fix coding statistics compilation. Change-Id: I21e7c4ef6bc80f4b9281fc94c88fb710b1595c23	2013-05-27 08:29:07 -07:00
Ronald S. Bultje	5cac66078e	Remove splitmv. Also do per-partition motion vector referencing in <sb8x8 partitions, and adjust mvref finding for sub8x8 partitions. Change-Id: Id3ed1ed4d2a8910d11d327db6cc63b8eb79f941f	2013-05-26 14:40:49 -07:00
Paul Wilkins	845bc13ba9	Remove loop dering experiment. Change-Id: I1a979bf74c286b157c31bab6bdcba0494acb4918	2013-05-25 10:09:23 +01:00
Dmitry Kovalev	0b2b81249b	Merge "Adding API to read/write uncompressed frame header bits." into experimental	2013-05-24 13:43:19 -07:00
Jingning Han	d093027791	Fix transform size coding mismatch This commit fixes a transform size enc/dec mismatch issue in the key frame coding. Change-Id: I0c4f40464a367b33dd91ace84506650b1aec2873	2013-05-24 11:30:58 -07:00
Jingning Han	7ac5ac52f9	Merge 4x4 block level partition into codebase Move 4x4/4x8/8x4 partition coding out of experimental list. This commit fixed the unit test failure issues. It also resolved the merge conflicts between 4x4 block level partition and iterative motion search for comp_inter_inter. Change-Id: I898671f0631f5ddc4f5cc68d4c62ead7de9c5a58	2013-05-23 11:58:50 +01:00
Yaowu Xu	8ba92a0bed	changes intra coding to be based on txfm block This commit changed the encoding and decoding of intra blocks to be based on transform block. In each prediction block, the intra coding iterates thorough each transform block based on raster scan order. This commit also fixed a bug in D135 prediction code. TODO next: The RD mode/txfm_size selection should take this into account when computing RD values. Change-Id: I6d1be2faa4c4948a52e830b6a9a84a6b2b6850f6	2013-05-22 11:53:19 +01:00
Paul Wilkins	0b713f8c18	Merge CONFIG_COMP_INTER_JOINT_SEARCH. Merge this experiment so that it is under a speed feature flag not a configuration flag. Change-Id: I536f7f125a4ff5149bb3a64f791e835c324535fd	2013-05-22 11:23:31 +01:00
John Koleszar	ddf13be8ef	Merge "Initial version of alpha channel support" into experimental	2013-05-21 17:29:51 -07:00
Dmitry Kovalev	df037b615f	Adding API to read/write uncompressed frame header bits. The API is not final yet and can be changed. Actual layout of uncompressed frame part will be finalized later. Right now moving clr_type, error_resilient_mode, refresh_frame_context, frame_parallel_decoding_mode from first compressed partition to uncompressed frame part. Change-Id: I3afc5d4ea92c5a114f4c3d88f96858cccc15b76e	2013-05-21 15:31:32 -07:00
Deb Mukherjee	7a645e4e12	Merging the model coef prob experiment Merges the experiment. Change-Id: I4eb19af6de6df6aa3a96a2e82f231d47ed9b3ae9	2013-05-21 14:44:38 -07:00
Scott LaVarnway	1db6373267	Merge "WIP: 4x4 idct/recon merge" into experimental	2013-05-21 10:45:53 -07:00
Deb Mukherjee	39a90bc8e8	Updating the model coef experiment Cleans up the experiment. Actually uses reduced counts for backward updates, and reduced number of probabilities in the context. No change in bitstream when the experiment is on. Between expt on and off: derfraw300 is down only -0.062% (which is better than when expts were run previously). Change-Id: I55285a049a0c22810bdb42914212ab5a4f8521b5	2013-05-20 12:46:36 -07:00
Scott LaVarnway	ba48a11130	WIP: 4x4 idct/recon merge This patch eliminates the intermediate diff buffer usage by combining the short idct and the add residual into one function. The encoder can use the same code as well. Change-Id: I296604bf73579c45105de0dd1adbcc91bcc53c22	2013-05-20 13:03:17 -04:00
Paul Wilkins	61e2eac61d	Merge "New inter mode context." into experimental	2013-05-17 06:59:09 -07:00
Paul Wilkins	51bc4bf4a0	Remove MODE_STATS flag and code Change-Id: I6c70a8a8a4633399842ac74792003ae5f7859ffa	2013-05-17 12:34:10 +01:00
John Koleszar	679e4abdd5	Initial version of alpha channel support This is a mostly-working implementation of an extra channel in the bitstream. Configure with --enable-alpha to test. Notable TODOs: - Add extra channel to all mismatch tests, PSNR, SSIM, etc - Configurable subsampling - Variable number of planes (currently always uses all 4) - Loop filtering - Per-plane lossless quantizer - ARNR support This implementation just uses the same contents as the Y channel for the A channel, due to lack of content and general pain in playing back 4 channel content. A later patch will use the actual alpha channel passed in from outside the codec. Change-Id: Ibf81f023b1c570bd84b3064e9b4b8ae52e087592	2013-05-16 22:21:09 -07:00
Jingning Han	8e3d0e4d7d	Add building blocks for 4x8/8x4 rd search These building blocks enable rate-distortion optimization search over block sizes of 8x4 and 4x8. Need to convert them into mmx/sse forms. Change-Id: I570ea2d22d14ceec3fe3575128d7dfa172a577de	2013-05-16 10:41:29 -07:00
Paul Wilkins	6ff3eb1647	New inter mode context. This patch creates a new inter mode contest that avoids a dependence on the reconstructed motion vectors from neighboring blocks. This was a change requested by a hardware vendor to improve decode performance. As part of this change I have also made some modifications to stats output code (under a flag) to allow accumulation of inter mode context flags over multiple clips Some further changes will be required to accommodate the deprecation of the split mv mode over the next few days. Performance as stands is around -0.25% on derf and std-hd but up on the YT and YT-HD sets. With further tuning or some adjustment to the context criteria it should be possible to make this change broadly neutral. Change-Id: Ia15cb4470969b9e87332a59c546ae0bd40676f6c	2013-05-16 12:09:19 +01:00
Paul Wilkins	18e07420a2	Merge "Further Implicit Segmentation Changes" into experimental	2013-05-16 03:03:44 -07:00
John Koleszar	418564e7d0	Add vp9_extend_frame_borders Adds a subsampling aware border extension function. This may be reworked soon to support more than 3 planes. Change-Id: I76b81901ad10bb1e678dd4f0d22740ca6c76c43b	2013-05-15 20:58:26 -07:00
Dmitry Kovalev	5c5582242d	Moving the same code to new function vp9_setup_scale_factors. Change-Id: I2408ad22717784a40e23701ccb9d978265440e4f	2013-05-15 16:33:36 -07:00
Dmitry Kovalev	cd16fe9160	Merge "Preparing vp9_deblock and vp9_denoise to alpha support." into experimental	2013-05-15 15:40:52 -07:00
Paul Wilkins	7d7e5b5131	Further Implicit Segmentation Changes Trial use of a combination of reference frame, prediction block size and mv to define segmentation. Change-Id: Ie8946a0446dbad777fdcf7626f89e5af0994db50	2013-05-15 16:00:06 +01:00
Dmitry Kovalev	7bbf716f04	Preparing vp9_deblock and vp9_denoise to alpha support. Change-Id: I299feefa64b93bd62263aea1ff1e41e85faeb6ca	2013-05-14 11:01:57 -07:00
John Koleszar	d31a632dcd	Merge "Revert "Preparing vp9_deblock and vp9_denoise to alpha support."" into experimental	2013-05-14 06:47:44 -07:00
John Koleszar	56efb73be3	Revert "Preparing vp9_deblock and vp9_denoise to alpha support." This reverts commit `a933311131` Change-Id: I2321f88011178381adbcffeda1bcc6a430ab8f1d	2013-05-14 06:46:11 -07:00

... 3 4 5 6 7 ...

624 Commits