generic-library/vpx

Author	SHA1	Message	Date
James Zern	a5726ac453	vp9/encoder: fix last_frame_seg_map mem leak remove duplicate allocation from vp9_create_compressor, it was added to vp9_alloc_frame_buffers in: `d5bec52` Added resizing & initialization of last frame segment map Change-Id: I996723226a16a62aff8f9a52ac74e0b73cc98fdf	2013-08-22 14:13:04 -07:00
Dmitry Kovalev	640dea4d9d	Adding vp9_is_scaled function. Change-Id: Ieb7077ca3586b9491912027eed450a4f6fd38d30	2013-08-22 14:04:59 -07:00
Jingning Han	01a37177d1	Refactor rd_pick_partition for parameter control This commit changes the partition search order of superblocks from {SPLIT, NONE, HORZ, VERT} to {NONE, SPLIT, HORZ, VERT} for consistency with that of sub8x8 partition search. It enable the use of early termination in partition search for all block sizes. For ped_area_1080p 50 frames coded at 4000 kbps, it makes the runtime goes down from 844305ms -> 818003ms (3% speed-up) at speed 0. This will further move towards making the in-search partition types configurable, hence unifying various speed-up approaches. Some speed 1 and 2 features are turned off during the refactoring process, including: disable_split_var_thresh using_small_partition_info Stricter constraints are applied to use_square_partition_only for right/bottom boundary blocks. Will bring back/refine these features subsequently. At this point, it makes derf set at speed 1 about 0.45% higher in compression performance, and 9% down in run-time. Change-Id: I3db9f9d1d1a0d6cbe2e50e49bd9eda1cf705f37c	2013-08-22 12:36:02 -07:00
Deb Mukherjee	8b810c7a78	Fixes on feature disabling split based on variance Adds a couple of minor fixes, which may be absorbed in Jingning's patch. Thanks to Guillaume for pointing these out. Also adjusts the thresholds for speed 1 and 2 to 16 and 32 respectively, to keep quality drops small. Results: -------- derfraw300: threshold = 16, psnr -0.082%, speedup 2-3% threshold = 32, psnr -0.218%, speedup 5-6% stdhdraw250: threshold = 16, psnr -0.031%, speedup 2-3% threshold = 32, psnr -0.273%, speedup 5-6% Change-Id: I4b11ae8296cca6c2a9f644be7e40de7c423b8330	2013-08-22 07:05:44 -07:00
Scott LaVarnway	f39bf458e5	Merge "Initialize mb_skip_coeff before picking modes"	2013-08-22 06:26:04 -07:00
Scott LaVarnway	94bfbaa84e	Initialize mb_skip_coeff before picking modes It appears that the above/left mb_skip_coeff used during the pick modes, is left over from the previously encode frame. This patch initializes the flag to the default value of zero. Change-Id: Ida4684cc99611d6e3e82628db35ed717e28ce550	2013-08-22 08:51:04 -04:00
Dmitry Kovalev	cb05a451c6	Merge "Cleaning up optimize_init_b function."	2013-08-22 01:35:27 -07:00
Dmitry Kovalev	64c0f5c592	Merge "Cleaning up sum_intra_stats function."	2013-08-22 01:34:39 -07:00
Jingning Han	fcb890d751	Merge "Enable zero coeff check in sub8x8 UV rd loop"	2013-08-21 22:07:00 -07:00
Dmitry Kovalev	be60924f29	Cleaning up optimize_init_b function. Change-Id: Ib2c975e1d96deefb7ac4d6b600c8c5388035d111	2013-08-21 16:40:16 -07:00
Dmitry Kovalev	048ccb2849	Cleaning up sum_intra_stats function. Using size_group_lookup table and better variable names. Change-Id: I6e67f2ce091845db43ace7d21b7ae31c6f165aec	2013-08-21 16:25:02 -07:00
Dmitry Kovalev	3286abd82e	Merge "Adding scale factor check."	2013-08-21 14:11:13 -07:00
Dmitry Kovalev	2f1a0a0e2c	Removing PLANE_TYPE argument from cost_coeffs function. We can determine plane_type for another function arguments. Change-Id: I85331877aedb357632ae916a37b5b15f22c0bb1f	2013-08-21 13:02:28 -07:00
Dmitry Kovalev	27a984fbd3	Removing a lot of duplicated code. Adding set_contexts contexts function and call it instead of set_contexts_on_border. Calling txfrm_block_to_raster_xy to get aoff and loff. Change-Id: I41897e344afd2cae1f923f4fdbe63daccf6fe80e	2013-08-21 11:55:12 -07:00
Dmitry Kovalev	a3ae4c87fd	Adding scale factor check. We support only [1/16, 2] scale factors, enforcing this now. Change-Id: I0822eb7cea51720df6814e42d3f35ff340963061	2013-08-21 11:24:47 -07:00
Adrian Grange	ce28d0ca89	Fix typos and minor stylistic cleanup Change-Id: I32e43474e8651ef2eb181d24860a8f118cfea7bf	2013-08-21 08:45:42 -07:00
Dmitry Kovalev	7f814c6bf8	Merge "Passing plane_bsize to foreach_transformed_block_visitor."	2013-08-20 14:25:01 -07:00
Jingning Han	1bf1428654	Enable zero coeff check in sub8x8 UV rd loop Check the minimum rate-distortion cost of regular quantization and all zero coeffs cases in the sub8x8 inter prediction rd loop for luma components. Use this as the cumulative rdcost sent to UV rd estimation. Change-Id: Ia4bc7700437d5e13d7cdad4cf9ae57ab036d3e97	2013-08-20 10:33:42 -07:00
Deb Mukherjee	246381faf2	Merge "Cleanup/enhancements of switchable filter search"	2013-08-20 10:16:51 -07:00
Dmitry Kovalev	5826407f2a	Merge "Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c."	2013-08-20 10:06:22 -07:00
Deb Mukherjee	2ffe64ad5c	Cleanup/enhancements of switchable filter search Cleans up the switchable filter search logic. Also adds a speed feature - a variance threshold - to disable filter search if source variance is lower than this value. Results: derfraw300 threshold = 16, psnr -0.238%, 4-5% speedup (tested on football) threshold = 32, psnr -0.381%, 8-9% speedup (tested on football) threshold = 64, psnr -0.611%, 12-13% speedup (tested on football) threshold = 96, psnr -0.804%, 16-17% speedup (tested on football) Based on these results, the threshold is chosen as 16 for speed 1, 32 for speed 2, 64 for speed 3 and 96 for speed 4. Change-Id: Ib630d39192773b1983d3d349b97973768e170c04	2013-08-20 09:47:04 -07:00
Jingning Han	bb64c9a355	Merge "Enable early termination in uv rd loop"	2013-08-20 09:07:26 -07:00
Paul Wilkins	e8923fe492	Changes to auto partition size selection. Changes to code to auto select a partition size range based on data from spatial neighbors. Now looks at the sb_type in each 8x8 block of above and left SB64. The effect on speed 1 is now weaker giving better quality but less speed gain. Now also used in speed 2. Change-Id: Iace33a97d5c3498dd2a9a8a4067351941abcbabc	2013-08-20 14:05:39 +01:00
Yaowu Xu	c4048dbdd3	Change to limit the mv search range As the pixel values beyond image border are duplicates of pixels on edge, the change limits the mv search range, any mv beyond the limits no longer produce new/different prediction values as entire block with pixels used for subpel interpolation are outside image border. Change-Id: I4c6fdf06e33c1cef1489f5470ce0fb4e5e01fb79	2013-08-19 17:19:36 -07:00
Yaowu Xu	f70330a906	fix a bug when null function pointer is used. For certain partition size, the function poniter may not be intialized at all. The patch prevent the call if the pointer is not set. Change-Id: I78b8c3992b639e8799a16b3c74f0973d07b8b9ac	2013-08-19 17:16:12 -07:00
Dmitry Kovalev	569ca37d09	Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c. Change-Id: Ib8af21f2e7f603c2fb407e5d15a3bba64b545b49	2013-08-19 16:44:10 -07:00
Jingning Han	3275ad701a	Enable early termination in uv rd loop This commit enables early termination in the rate-distortion optimization search loop for chroma components. When the cumulative rd cost is above the current best value, skip the rest per-block transform/quantization/coeff_cost and continue to the next prediction mode. For bus_cif at 2000 kbps, the average run-time goes down from 168546ms -> 164678ms, (2% speed-up) at speed 0 36197ms -> 34465ms, (4% speed-up) at speed 1 Change-Id: I9d3043864126e62bd0166250d66b3170d520b3c0	2013-08-19 16:31:19 -07:00
Dmitry Kovalev	82d4d9a008	Passing plane_bsize to foreach_transformed_block_visitor. Updating all foreach_transformed_block_visitor functions to work with plane block size instead of general block. Removing a lot of duplicated code. Change-Id: I6a9069e27528c611f5a648e1da0c5a5fd17f1bb4	2013-08-19 15:47:24 -07:00
Jingning Han	31c97c2bdf	Merge "Fix potential use of uninitialized value"	2013-08-19 15:15:58 -07:00
Jingning Han	5dc0b309ab	Merge "Fix the returned distortion value in rd_pick_intra"	2013-08-19 14:34:19 -07:00
Dmitry Kovalev	2e3478a593	Using plane_bsize instead of bsize. This change set is intermediate. The next one will remove all repetitive plane_bsize calculations, because it will be passed as argument to foreach_transformed_block_visitor. Change-Id: Ifc12e0b330e017c6851a28746b3a5460b9bf7f0b	2013-08-19 13:20:21 -07:00
Jingning Han	b34ce04378	Fix potential use of uninitialized value Initialize the best mode and tx_size values in the rate-distortion optimization search loop. Change-Id: Ibfb5c0895691f172abcd4265c23aef4cb99fa8af	2013-08-19 11:15:53 -07:00
Jingning Han	f67919ae86	Fix the returned distortion value in rd_pick_intra Return the distortion value in vp9_rd_pick_intra_mode_sb as sum of dist_y and dist_uv. Remove the right shift operation on dist_uv, and make it consistent with that of vp9_rd_pick_inter_mode_sb. Change-Id: I9d564e242d9add38e32595d33b0e0dddb1d55e5b	2013-08-16 21:23:22 -07:00
Dmitry Kovalev	26e5b5e25d	Removing unused or redundant arguments from *_args structures. Redundant dst, pre[2] from build_inter_predictors_args, unused cm from encode_b_args. Change-Id: I2c476cd328c5c0cca4c78ba451ca6ba2a2c37e2d	2013-08-16 12:51:20 -07:00
Dmitry Kovalev	367cb10fcf	Merge "Moving from ss_txfrm_size to tx_size."	2013-08-16 12:46:45 -07:00
Adrian Grange	79f4c1b9a4	Fixed typos and formatting Change-Id: I3814984a624bc64147c57efa74fbdda8eda47262	2013-08-16 09:15:26 -07:00
Dmitry Kovalev	afd9bd3e3c	Moving from ss_txfrm_size to tx_size. Updating foreach_transformed_block_visitor and corresponding functions to accept tx_size instead of ss_txfrm_size. List of functions per file: vp9_decodframe.c decode_block decode_block_intra vp9_detokenize.c decode_block vp9_encodemb.c optimize_block vp9_xform_quant vp9_encode_block_intra vp9_rdopt.c dist_block rate_block block_yrd_txfm vp9_tokenize.c set_entropy_context_b tokenize_b is_skippable Change-Id: I351bf563eb36cf34db71c3f06b9bbc9a61b55b73	2013-08-15 17:03:03 -07:00
Jingning Han	5e80a49307	Merge "Refactor rd loop for chroma components"	2013-08-15 16:02:12 -07:00
Dmitry Kovalev	9451e8d37e	Merge "Converting code from using ss_txfrm_size to tx_size."	2013-08-15 15:21:09 -07:00
Dmitry Kovalev	939b1e4a8c	Merge "Moving segmentation struct from MACROBLOCKD to VP9_COMMON."	2013-08-15 15:14:32 -07:00
Jingning Han	68369ca897	Refactor rd loop for chroma components This commit makes the rate-distortion optimization search of chroma components consistent across all block sizes. It removes redundant codes. Change-Id: I7e76f54d045e8efdd41d84a164c71f55b484471b	2013-08-15 14:54:48 -07:00
Jingning Han	c2ff1882ff	Merge "Remove unused RDCOST_8X8 macro"	2013-08-15 13:48:25 -07:00
Jingning Han	ca983f34f7	Merge "Unify luma and chroma rd-cost estimation"	2013-08-15 13:48:15 -07:00
Dmitry Kovalev	bb3b817c1e	Converting code from using ss_txfrm_size to tx_size. Updated function signatures: txfrm_block_to_raster_block txfrm_block_to_raster_xy extend_for_intra vp9_optimize_b Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a	2013-08-15 11:44:57 -07:00
Dmitry Kovalev	6f4fa44c42	Using { 0 } for initialization instead of memset. Change-Id: I4fad357465022d14bfc7e13b348c6da267587314	2013-08-15 11:37:56 -07:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place to segmentatation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Jingning Han	b0646f9e98	Remove unused RDCOST_8X8 macro Change-Id: I17c7d7eaa60fe69c543403c340f7c1078bfd339f	2013-08-15 10:40:44 -07:00
Dmitry Kovalev	4d73416099	Merge "Quantization code cleanup."	2013-08-15 10:23:01 -07:00
Deb Mukherjee	24856b6abc	Speed feature to skip split partition based on var Adds a speed feature to disable split partition search based on a given threshold on the source variance. A tighter threshold derived from the threshold provided is used to also disable horizontal and vertical partitions. Results on derfraw300: threshold = 16, psnr = -0.057%, speedup ~1% (football) threshold = 32, psnr = -0.150%, speedup ~4-5% (football) threshold = 64, psnr = -0.570%, speedup ~10-12% (football) Results on stdhdraw250: threshold = 32, psnr = -0.18%, speedup is somewhat more than derf because of a larger number of smoother blocks at higher resolution. Based on these results, a threshold of 32 is chosen for speed 1, and a threshold of 64 is chosen for speeds 2 and above. Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6	2013-08-15 10:01:45 -07:00
Jingning Han	ec01f52ffa	Unify luma and chroma rd-cost estimation This commit unifies the rate-distortion cost calculation process of luma and chroma components. It allows early termination to be enabled later in the rd search loop of chroma components, in consistent with luma pixels. Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d	2013-08-15 09:41:33 -07:00
Paul Wilkins	1a3641d91b	Merge "Renaming in MB_MODE_INFO"	2013-08-15 02:12:48 -07:00
Dmitry Kovalev	bb072000e8	foreach_transformed_block_in_plane cleanup, explicit tx_size var. Making foreach_transformed_block_in_plane more clear (it's not finished yet). Using explicit tx_size variable consistently instead of (ss_txfrm_size / 2) or (ss_txfrm_size >> 1) expression. Change-Id: I1b9bba2c0a9f817fca72c88324bbe6004766fb7d	2013-08-14 11:39:31 -07:00
Paul Wilkins	26fead7ecf	Renaming in MB_MODE_INFO The macro block mode info context originally contained an entry for each 16x16 macroblock. In VP9 each entry refers to an 8x8 region not a macro block, so the naming is misleading. This first stage clean up changes the names of 3 entries in the structure to remove the mb_ prefix. TODO clean up the nomenclature more widely in respect of mbmi and bmi. Change-Id: Ia7305c6d0cb805dfe8cdc98dad21338f502e49c6	2013-08-14 12:47:52 +01:00
Paul Wilkins	54979b4350	Merge "Honor min_partition_size properly for non-square splits"	2013-08-14 04:45:18 -07:00
Guillaume Martres	fc50477082	Honor min_partition_size properly for non-square splits Don't do vertical or horizontal splits if subsize < min_partition_size, except for edge blocks where it makes sense. Change-Id: I479aa66ba1838d227b5de8312d46be184a8d6401	2013-08-13 15:24:03 -07:00
Guillaume Martres	ecb78b3e0c	Merge "Trivial clean up."	2013-08-13 12:40:37 -07:00
Jingning Han	7e0f88b6be	Use lookup table to find largest txfm size Refactor choose_largest_txfm_size_ and make it find the largest transform size via lookup table. Change-Id: I685e0396d71111b599d5367ab1b9c934bd5490c8	2013-08-13 10:32:14 -07:00
Jingning Han	dc70fbe42d	Merge "Refactor model based tx search in super_block_yrd"	2013-08-13 08:48:49 -07:00
Paul Wilkins	5459f68d71	Trivial clean up. Delete unused / commented out variable references. Change-Id: Iaf20c0c3744f89adb296d153b516b5ea41b4f3b4	2013-08-13 13:26:18 +01:00
Paul Wilkins	8e35263bed	Merge "Honor min_partition_size properly"	2013-08-13 05:19:51 -07:00
Jingning Han	78136edcdc	SSE2 high precision 32x32 forward DCT Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56	2013-08-12 16:52:53 -07:00
Jingning Han	14cc7b319f	Refactor model based tx search in super_block_yrd Remove unnecessary conditional branches in model-based transform size search. Change-Id: Ic862dc33ed6710a186f6248239dd5f09b5c19981	2013-08-12 16:34:48 -07:00
Dmitry Kovalev	98e3d73e16	Merge "Using MV* instead of int_mv* as argument of vp9_clamp_mv_min_max."	2013-08-12 15:53:25 -07:00
Dmitry Kovalev	9d5885b0ab	Quantization code cleanup. Change-Id: I77b42418b852093f79260cbd880533a0bd86678f	2013-08-12 15:23:47 -07:00
Dmitry Kovalev	c66320b3e4	Merge "Entropy context related cleanups."	2013-08-12 15:18:24 -07:00
Dmitry Kovalev	1aedfc992a	Using MV* instead of int_mv* as argument of vp9_clamp_mv_min_max. Change-Id: I3c45916a9059f11b41e9d798e34ffee052969a44	2013-08-12 13:56:04 -07:00
Jingning Han	3984b41c87	Fix a compile failure in vp9_get_compressed_data The lf struct is now with VP9_COMMON, instead of MACROBLOCKD. Change-Id: Idfdd4f91f78f486078a138322d58bb61e93e1bc9	2013-08-12 11:42:17 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Dmitry Kovalev	097046ae28	Merge "Removing redundant code and function arguments."	2013-08-11 12:20:58 -07:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Guillaume Martres	58b07a6f9d	Honor min_partition_size properly It represents the minimum partition size, so don't split if bsize == min_partition_size . Change-Id: Id77c32d6afef7d2ddec0368eaae18fb13227d30e	2013-08-09 17:28:33 -07:00
Dmitry Kovalev	67fe9d17cb	Removing redundant code and function arguments. Change-Id: Ia5cdda0f755befcd1e64397452c42cb7031ca574	2013-08-09 17:24:40 -07:00
Dmitry Kovalev	e7c5ca8983	Merge "Inlining 16 as a stride for BLOCK_OFFSET macro."	2013-08-09 17:22:46 -07:00
James Zern	ef101af8ae	Merge "vp9_rd_pick_inter_mode_sb: fix uninitialized value"	2013-08-09 17:13:32 -07:00
Dmitry Kovalev	f1559bdeaf	Inlining 16 as a stride for BLOCK_OFFSET macro. Change-Id: I7f23d174eb089e5500f268a10db09648634c1b82	2013-08-09 16:40:05 -07:00
James Zern	f295774d43	vp9_rd_pick_inter_mode_sb: fix uninitialized value 'skippable' can remain unset and negatively affect later decisions address one aspect of issue #599 Change-Id: Iffdf0ac2e49ac481c27dc27c87fa546d4167bb28	2013-08-09 16:26:22 -07:00
Dmitry Kovalev	cd0629fe68	Merge "Removing plane_block_{width, height}_log2by4 functions."	2013-08-09 15:26:51 -07:00
Dmitry Kovalev	816d6c989c	Moving loopfilter struct to VP9_COMMON. Loop filter configuration doesn't belong to macroblock, so moving it from MACROBLOCKD to VP9_COMMON. Also moving the declaration of loopfilter struct from vp9_blockd.h to vp9_loopfilter.h. Change-Id: I4b3e34be9623b47cda35f9b1f9951f8c5b1d5d28	2013-08-09 14:41:51 -07:00
Scott LaVarnway	41251ae558	Bug fix: call set_offsets before rd_auto_partition_range The set_offsets call is necessary inorder to set the mode_info_context ptr correctly. Change-Id: I644910cc5bacc50ee9cd78458843274ad8ee636d	2013-08-09 14:09:49 -04:00
Yaowu Xu	6ec2b85bad	Added lpf level picking using partial frame Change-Id: I599ab1bd22b5f3f10d5962c609952abdef8ff67a	2013-08-09 07:37:08 -07:00
Yaowu Xu	6a7a4ba753	renamed vp8_yv12_copy_y to vpx_yv12_copy_y Becuase the routine is used by both vp8 and vp9 Change-Id: I2d35b287b5bc2394865d931a27da61f4ce7edeeb	2013-08-09 07:37:08 -07:00
Yaowu Xu	c7c9901845	added a speed feature on lpf level picking Change-Id: Id578f8afdeab3702fc8386969f2d832d8f1b5420	2013-08-09 07:36:32 -07:00
Dmitry Kovalev	6a8ec3eac2	General code cleanup. Removing redundant parenthesis and curly braces. Combining declarations with initializations. Adding useful intermediate variables instead of recalculating expressions every time. Change-Id: I00106f404afd60bfc189905b0fded881684f941a	2013-08-08 21:12:34 -07:00
Deb Mukherjee	2158909fc3	Merge "Adds a new subpel motion function"	2013-08-08 12:26:55 -07:00
Deb Mukherjee	1ba91a84ad	Adds a new subpel motion function Adds a new subpel motion estimation function that uses a 2-level tree-structured decision tree to eliminate redundant computations. It searches fewer points than iterative search (which can search the same point multiple times) but has the same quality roughly. This is made the default setting at speeds 0 and 1, while at speed 2 and above only a 1-level search is used. Also includes various cleanups for consistency and redundancy removal. Results: derf: +0.012% psnr stdhd: +0.09% psnr Speedup of about 2-3% Change-Id: Iedde4866f5475586dea0f0ba4cb7428fba24eee9	2013-08-08 11:41:49 -07:00
Adrian Grange	83ee80c045	Moved fast motion search level decision to function Moving this block of code into a function makes the code easier to read and change. Change-Id: If4ede570cce1eab1982b188c4d3e4fd3d4db236e	2013-08-08 11:01:44 -07:00
Adrian Grange	aae6a4c895	Simplify & fix potential bug in rd_pick_partition Different partitionings were not being evaluated against best_rd and there were unnecessary calls to RDCOST. This could have resulted in a non-optimal partioning being selected. I simplified the variables used to track the rate, distortion and RD values throughout the function. Change-Id: Ifa7085ee80d824e86791432a5bc6d8fea5a3e313	2013-08-08 09:55:45 -07:00
Jingning Han	6bfcce8c7a	Merge "Use low precision 32x32fdct for encodemb in speed1"	2013-08-07 19:05:14 -07:00
Dmitry Kovalev	61c33d0ad5	Removing plane_block_{width, height}_log2by4 functions. Change-Id: I040b82b8e32aee272d10cbb021c7ba1c76343d7a	2013-08-07 17:06:33 -07:00
Dmitry Kovalev	1492698ed3	Merge "Adding ss_size_lookup table."	2013-08-07 16:08:24 -07:00
Jingning Han	debb9c68c8	Use low precision 32x32fdct for encodemb in speed1 The low precision 32x32 fdct has all the intermediate steps within 16-bit depth, hence allowing faster SSE2 implementation, at the expense of larger round-trip error. It was used in the rate-distortion optimization search loop only. Using the low precision version, in replace of the high precision one, affects the compression performance by about 0.7% (derf, stdhd) at speed 0. For speed 1, it makes derf set down by only 0.017%. Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b	2013-08-07 15:34:12 -07:00
Dmitry Kovalev	8db2675b97	Adding ss_size_lookup table. Removing the old one bsize_from_dim_lookup. Now we have a way to determine block size for plane using its subsampling values (ss_size_lookup). And then we can find the number of pixels in the block (num_pels_log2_lookup). Change-Id: I6fc981da2ae093de81741d3d78eaefed11015db9	2013-08-07 15:33:17 -07:00
Dmitry Kovalev	ea2348ca29	Merge "Removing NMS_STATS defines."	2013-08-07 15:28:30 -07:00
Deb Mukherjee	296931c817	Merge "Clean ups of the subpel search functions"	2013-08-06 17:28:48 -07:00
Deb Mukherjee	71b43b0ff0	Clean ups of the subpel search functions Removes some unused code and speed features, and organizes the interfaces for fractional mv step functions for use in new speed features to come. In the process a new speed feature - number of iterations per step during the subpel search - is exposed. No change when this parameter is set as the original value of 3. Results: subpel_iters_per_step = 3: baseline subpel_iters_per_step = 2: psnr -0.067%, 1% speedup subpel_iters_per_step = 1: psnr -0.331%, 3-4% speedup Change-Id: I2eba8a21f6461be8caf56af04a5337257a5693a8	2013-08-06 17:23:50 -07:00
Jingning Han	2c091f9768	Merge "Place holder for high-precision 32x32 fdct"	2013-08-06 14:47:30 -07:00
Jim Bankoski	5b307886fb	variance x86inc guards also fixed bug in sad calcs Change-Id: I6571fcbe37556c16ae32be66dc0fd879852aac1d	2013-08-06 14:17:13 -07:00
Deb Mukherjee	fac7c8c9f9	Merge "Flexible support for various pattern searches"	2013-08-06 14:03:27 -07:00
Dmitry Kovalev	8725ca2ed2	Merge "Inlining vp9_get_pred_probs_switchable_interp function."	2013-08-06 11:57:45 -07:00
Deb Mukherjee	15b5a6a2c7	Flexible support for various pattern searches Adds a few pattern searches to achieve various tradeoffs between motion estimation complexity and performance. The search framework is unified across these searches so that a common pattern search function is used for all. Besides it will be easier to experiment with various patterns or combinations thereof at different scales in the future. The new pattern search is multi-scale and is capable of using different patterns at different scales. The new hex search uses 8 points at the smallest scale and 6 points at other scales. Two other pattern searches - big-diamond and square are also added. Big diamond uses 4 points at the smallest scale and 8 points in diamond shape at the larger scales. Square is very similar conceptually to the default n-step search but is somewhat faster since it keeps only one survivor across all scales. Psnr/speed-up results on derf300: hex: -1.6% psnr%, 6-8% speed-up big-diamond: -0.96% psnr, 4-5% speedup square: -0.93% psnr, 4-5% speedup Change-Id: I02a7ef5193f762601e0994e2c99399a3535a43d2	2013-08-06 11:56:39 -07:00
Jingning Han	28566a6cd5	Place holder for high-precision 32x32 fdct Resolve compile warnings on re-define FDCT32x32_2D template. Change-Id: Idb3a54ef8d2710ce7245b726379a0e5c875f5cad	2013-08-06 11:44:08 -07:00
Dmitry Kovalev	0c80065694	Inlining vp9_get_pred_probs_switchable_interp function. There was no benefit having this function. For example, inside read_switchable_filter_type switchable filter context was calculated twice. Change-Id: I79cd5bf95cbc0f6d8bf91a2e32289e01b18dcff1	2013-08-06 11:04:31 -07:00
Jingning Han	7d61f8fe53	Merge "Move fdct32x32 SSE2 implementation in separate file."	2013-08-06 10:46:41 -07:00
Dmitry Kovalev	3e51acafec	Merge "Finally removing all old block size constants."	2013-08-06 10:30:37 -07:00
Dmitry Kovalev	4a692e4168	Merge "Changing the order switchable filter enum constants."	2013-08-06 10:30:26 -07:00
Dmitry Kovalev	25b7dc08cd	Merge "Removing unused functions."	2013-08-06 10:29:57 -07:00
Deb Mukherjee	33afddadb9	Merge "Add variance based mode/skipping"	2013-08-06 10:19:15 -07:00
Christian Duvivier	3d98205fce	Move fdct32x32 SSE2 implementation in separate file. This is in preparation for the SSE2 version of the high-precision 32x32 forward DCT which will share a lot of code with the existing low precision version used for rate-distortion search. Change-Id: I7084b6bdfb480b1fabb8493fb14e3f7fcc7888c0	2013-08-06 10:17:11 -07:00
Dmitry Kovalev	b9c7d04e95	Finally removing all old block size constants. Change-Id: I3aae21e88b876d53ecc955260479980ffe04ad8d	2013-08-05 15:23:49 -07:00
Deb Mukherjee	8b3faccb9e	Add variance based mode/skipping Adds a speed feature to skip all intra modes other than DC_PRED if the source variance is small. This feature is made part of speed 1 and up. Results on derf300: psnr -0.07%, speedup about 1-2% Also uses the source variance to fine-tune the early termination criteria when FLAG_EARLY_TERMINATE is on. This feature is made part of speed 2 and up. Results on derf300: psnr -0.52%, speedup about 5-7% Change-Id: I59e38aa836557cfa5405ae706fc64815cbfe4232	2013-08-05 14:14:01 -07:00
Jim Bankoski	9f988a2edf	Merge "cleanups after bw bh code"	2013-08-05 14:02:02 -07:00
Dmitry Kovalev	3f611555d7	Changing the order switchable filter enum constants. This changeset allows to remove vp9_switchable_interp and vp9_switchable_interp_map arrays and make code much clear. Actually we still have to use these mapping but only inside read_interp_filter_type and write_interp_filter_type functions. Change-Id: I4026c6f8c4acefba6c81421b7bacbaa52cc45f50	2013-08-05 12:26:15 -07:00
Jim Bankoski	5d2cb7ead0	cleanups after bw bh code Cons bw/bh parms that should have been const. Additional formatting. Change-Id: Icd36a5c9dc17dadd7284315ac0d6fef1a565ca16	2013-08-05 12:15:52 -07:00
Dmitry Kovalev	d007446b3f	Replacing long block size enum values with shorter ones (2). Change-Id: I428c4d42212b757112e3acfe5b81314cfbb5fd6b	2013-08-05 10:51:02 -07:00
Dmitry Kovalev	fe2a201eb1	Replacing "txfm" with "tx" in identifiers. Consistent names with TX_SIZE, TX_MODE, and TX_MODE. Change-Id: I79592218bf5a40ace89197a34a06ee7de581ed8d	2013-08-02 17:28:23 -07:00
Dmitry Kovalev	5edc65d00d	Removing NMS_STATS defines. Change-Id: Iabab0e59042a33456df1d449c0d0f01debc00c7c	2013-08-02 17:10:15 -07:00
Dmitry Kovalev	7b50333e8f	Merge "Adding is_inter_block function."	2013-08-02 16:54:32 -07:00
Dmitry Kovalev	fec4ec4edd	Removing unused functions. Removed functions: model_rd_for_sb_y, block_error_sby, get_sb_variance Change-Id: Iec458df180caf6f8eac3605773841a4121dd3a8f	2013-08-02 16:41:09 -07:00
Dmitry Kovalev	603931e291	Merge "Changing function arg type from int_mv* to MV*."	2013-08-02 16:30:06 -07:00
Dmitry Kovalev	a6adc82e78	Merge "Cleanups around allow_high_precision_mv flag."	2013-08-02 16:27:05 -07:00
Dmitry Kovalev	680ec32d18	Adding is_inter_block function. Using it instead of long unclear verbose check "mbmi->ref_frame[0] != INTRA_FRAME". Change-Id: I9c7b4b3797942fa962bf3ba7460fff3084beabe9	2013-08-02 16:25:33 -07:00
Dmitry Kovalev	d4e020c4b1	Merge "Cleaning up set_contexts_on_border function."	2013-08-02 16:22:50 -07:00
Yunqing Wang	d340c114fb	Merge "Add more checking to using_small_partition_info"	2013-08-02 15:55:09 -07:00
Dmitry Kovalev	769bcab3f5	Cleaning up set_contexts_on_border function. Change-Id: I8f21c18b29f54b277fb1c167f278f109d9f3b996	2013-08-02 15:52:26 -07:00
Dmitry Kovalev	25b77e2569	Changing function arg type from int_mv* to MV*. Change-Id: Ic878d31df2ce783a2c9a8c4bc9ed301ec8ffe25e	2013-08-02 15:26:32 -07:00
Adrian Grange	60ff123536	Merge "Fixed typos and added a few explanatory comments"	2013-08-02 11:37:47 -07:00
Adrian Grange	075b11f004	Merge "Changed name of rd_pick_intra4x4mby_modes"	2013-08-02 11:36:46 -07:00
Dmitry Kovalev	86053d3ae2	Cleanups around allow_high_precision_mv flag. Change-Id: Ic07f5f8ffeaedd5b7513b464871f83afc82dcd5c	2013-08-02 11:21:16 -07:00
Dmitry Kovalev	b47153deed	Replacing long block size enum values with shorter ones. Change-Id: I0e9329490828684a4fd46f540d89114cc68e8407	2013-08-02 10:48:27 -07:00
Yunqing Wang	0d68080445	Merge "Comment out 2 unused speed features"	2013-08-02 09:58:46 -07:00
Dmitry Kovalev	741537f3ce	Cleanup: replacing xd->seg with seg, and xd->lf with lf. Change-Id: I73b59d7699a8e7e7acd3bf8041cb6c98ce9ba4bf	2013-08-01 15:38:16 -07:00
Dmitry Kovalev	9f4f001ba5	Merge "Cleanup: removing unused function arguments."	2013-08-01 15:07:12 -07:00
Dmitry Kovalev	ddf02e323a	Merge "Nice looking motion vector clamping functions."	2013-08-01 14:50:14 -07:00
Dmitry Kovalev	ce8dedc353	Cleanup: removing unused function arguments. Change-Id: I27471768980fc631916069f24bc7c482a5c9ca17	2013-08-01 13:41:38 -07:00
Dmitry Kovalev	b621e2d72e	Nice looking motion vector clamping functions. Removing assign_and_clamp_mv function, making implementation of clamp_mv and clamp_mv2 more clear and consistent. Change-Id: Iecd08e1c1bf0379f8314ebe01811f8253f4ade58	2013-08-01 13:40:26 -07:00
Deb Mukherjee	dbea726daf	Adds a source variance computation function Adds a function to compute source variance for various sb_types to be used for pruning mode and partition searches. [The existing activity measure function is currently specialized for only 16x16 MBs and needs to be updated]. Change-Id: I22a41e6f1430184201487326fdbebb9b47e6fc24	2013-08-01 13:01:54 -07:00
Yunqing Wang	215b010f4b	Add more checking to using_small_partition_info If the partition is out of partition size range, we don't need to process small partition information. Change-Id: Ice9bfbbdebe1f2ef79271a3aee17de0ed4608376	2013-08-01 11:37:41 -07:00
Yunqing Wang	7965a6ea34	Comment out 2 unused speed features use_min_partition_size and use_max_partition_size are not used currently, and could be added back if needed later. Change-Id: Ib22a9c06b064567a7c1d6d5445567ed77e0d3acc	2013-08-01 11:03:34 -07:00
Dmitry Kovalev	ff4bfa726b	Merge "Adding missing const to vp9_extra_bits array."	2013-08-01 10:19:51 -07:00
Adrian Grange	89e73c63c0	Fixed typos and added a few explanatory comments Change-Id: Ib4e4b41094b54874ee34343dd77c0c131ceed9d2	2013-08-01 09:23:49 -07:00
Adrian Grange	5271d47892	Changed name of rd_pick_intra4x4mby_modes The function name rd_pick_intra4x4mby_modes is confusing, so I changed it to rd_pick_intra_sub_8x8_y_modes to better reflect what the function does. Also added const qualifiers to some of the input parameters and removed camel-case. Change-Id: I23d53d4c7af5d79ed8a471acd59a09bbb47add39	2013-08-01 09:23:49 -07:00
Dmitry Kovalev	5b65246a71	Adding missing const to vp9_extra_bits array. Change-Id: Icd128ab58719e0b9066bdfa66a5d0d427a84d6df	2013-07-31 18:51:18 -07:00
Jingning Han	12f5762756	Remove unnecessary arguments in rd_pick_ref_frame This commit removes redundant arguments passing in the function of rd_pick_reference_frame. This resolves the clang warnings about potential use of uninitialized values. Change-Id: Ic68f949a9f8fcd0a583786b0c75321104ea44739	2013-07-31 17:04:13 -07:00
Dmitry Kovalev	9239e96536	Removing get_mi_{row, col} functions. Passing mi_row and mi_col parameters to functions explicitly. Removing unused xd argument from scale_mv function. Change-Id: Icb4c495ec72d26fb066c14470d3ae0b741fbf18a	2013-07-31 14:06:55 -07:00
Dmitry Kovalev	3be9fd9120	Merge "Removing unused "ishp" arguments."	2013-07-31 12:03:04 -07:00
Dmitry Kovalev	0e0a6f840b	Merge "Consistent update for inter_mode probabilities."	2013-07-31 12:02:35 -07:00
Dmitry Kovalev	500ade243a	Removing unused "ishp" arguments. Using different variable names "allow_hp" and "use_hp" instead of "usehp". Change-Id: I0cd5996ddeb46bd754473b680a993c0aaf8eb879	2013-07-31 11:27:53 -07:00
Jingning Han	ac7bab7575	Merge "Make the use of ref_frame index consistent"	2013-07-31 09:11:37 -07:00
Jingning Han	86c384d398	Make the use of ref_frame index consistent Refactor the frame buffer referencing in choose_partition and make it consistent with other places. This means to prevent potential issues when we extend reference frame buffer. Change-Id: I5ff33ed5f671e1f4cc7049622212769a9b4578d9	2013-07-30 19:49:36 -07:00
Dmitry Kovalev	8701bc11df	Consistent update for inter_mode probabilities. Using inter-mode counts instead of inter-mode-tree branch counts inside FRAME_COUNTS structure. Change-Id: I60dde13af37d06146d7d15543311c1b5044e9e04	2013-07-30 18:06:34 -07:00
Adrian Grange	fbd73648dd	Merge "Cleanup typos, remove unnecessary lines, replace switch"	2013-07-30 12:59:46 -07:00
Adrian Grange	b30a06b930	Cleanup typos, remove unnecessary lines, replace switch Removed unnecessary code lines, replaced switch with an if, fixed spelling errors and formatting. Change-Id: Ie48aa4604aa0ed48362ca359d792fb21b2ec1dc6	2013-07-30 12:10:32 -07:00
Yaowu Xu	88e48444da	Merge "removed duplication"	2013-07-30 09:38:02 -07:00
Yaowu Xu	a15d1f3134	removed duplication Change-Id: Ica23b66f6664e5a5b168499584f0afffbc54794f	2013-07-30 09:09:14 -07:00
Jingning Han	525745b17a	Remove a redundant branching in tokenize_b The tokenize_b function is only called when output flag is on. Hence removing the conditional branch on it therein. Change-Id: Ib709f47f23f39ca05a695faf86fa3377f11f2dd0	2013-07-29 17:08:13 -07:00
Jingning Han	455f2de20b	Tune tokenization/detokenization flow for speed-up This commit optimizes the tokenization and detokenization operational flow for speed-up. It makes the coding process about 0.3% faster at speed 0. Change-Id: I28008df7482874e4b5f237f2d418ff82a249dd56	2013-07-29 16:15:30 -07:00
Jingning Han	b5323ed89a	Skip redundant tokenization in rd loop This commit makes the encoder skip the redundant tokenization process in the rate-distortion optimization search loop, while updating the entropy contexts accordingly. It makes the speed 0 encoding process about 0.5% faster at no performance change. Change-Id: I34a4155a0b5332afeb45c93a51c7f35a294d685c	2013-07-29 16:09:16 -07:00
Jingning Han	5875d7a4a4	Merge "16x16 inverse 2D-DCT with DC only"	2013-07-29 15:29:25 -07:00
Jingning Han	a7c4de22e1	16x16 inverse 2D-DCT with DC only This commit provides special handle on 16x16 inverse 2D-DCT, where only DC coefficient is quantized to be non-zero value. Change-Id: I7bf71be7fa13384fab453dc8742b5b50e77a277c	2013-07-29 14:45:53 -07:00
Dmitry Kovalev	828119d6ab	Renaming txfm to tx for consistency in some places. Change-Id: I2a6a646570e2af66315e7c658d00d99f80c4b127	2013-07-29 14:35:55 -07:00
Dmitry Kovalev	730a34416f	Renaming NB_TXFM_MODES constant to TX_MODES. Change-Id: I10bf06e3a3d5271221ae6a42a36074d01d493039	2013-07-29 13:38:40 -07:00
Dmitry Kovalev	23391ea835	Renaming TX_SIZE_MAX_SB to TX_SIZES. Change-Id: I6aa4191935aa93461a07c41b59fdae1eb5f5f107	2013-07-29 12:25:34 -07:00
Jingning Han	decb1b94de	Merge "Shortcut 8x8/16x16 inverse 2D-DCT"	2013-07-29 11:04:07 -07:00
Ronald S. Bultje	118ccdcd30	Inverse dimension order in token_cost array. This allows us to increment the position at the band-level only as we go from one band to the next; more importantly, that allows us to use an add instead of multiply instruction, and omit the instruction altogether if the band doesn't change from one coef to the next, thus being slightly faster (probably more noticeable on systems where a multiply is expensive, like arm). Change-Id: I4343fe35b9f9a47fa00b217bdcbf5f91ff96c381	2013-07-26 17:30:04 -07:00
Ronald S. Bultje	dcacce6dd9	Merge "Save pixels instead of coefficients in intra4x4 RD loop."	2013-07-26 17:20:58 -07:00
Ronald S. Bultje	d30c8f41ef	Merge "Add best_rd breakout in intra4x4 RD loop."	2013-07-26 17:20:51 -07:00
Jingning Han	38fa487164	Shortcut 8x8/16x16 inverse 2D-DCT This commit brought back the shortcut implementation of 8x8/16x16 inverse 2D-DCT. When the eob <= 10, it skips the inverse transform operations on row 4:7/4:15 in the first round. For bus_cif at 1000 kbps, this provides about 2% speed-up at speed 0. Change-Id: I453e2d72956467d75be4ad8c04b4482ab889d572	2013-07-26 17:19:14 -07:00
Jingning Han	b9c3dd481a	Merge "Special handle on DC only inverse 8x8 2D-DCT"	2013-07-26 16:04:14 -07:00
Jingning Han	325e0aa650	Special handle on DC only inverse 8x8 2D-DCT This commit enables a special handle for the 8x8 inverse 2D-DCT, where only DC coefficient is quantized to be non-zero. For bus_cif at 2000 kbps, it provides about 1% speed-up at speed 0. Change-Id: I2523222359eec26b144cf8fd4c63a4ad63b1b011	2013-07-26 14:16:51 -07:00
Dmitry Kovalev	c09b81719f	Merge "General cleanups."	2013-07-26 13:59:39 -07:00
Yaowu Xu	4f75a1f4ed	Merge "Auto min and max partition size experiment."	2013-07-26 12:10:27 -07:00
Paul Wilkins	fe5e2a91bb	Auto min and max partition size experiment. Speed feature experiment to set an upper and lower partition size limit based on what has been seen in spatial neighbors. This seems to gives quite reasonable speed gains in local (10-15%) and when used with speed 0 the losses are small (0.25% derf, 0.35% stdhd). However, for now I am only enabling it on speed 1 as there may be clashes with the existing temporal partition selection in speed 2. Using a tighter min / max around the range derived from the neighbors increases speed further but at the cost of a bigger quality loss. However, I think this spatial method could be combined with data from either the last frame or a variance method (or both) to refine the range of minimum and maximum partition size. I.e. consider the min and max from spatial and temporal neighbors and the variance recommendation. Change-Id: I1b96bf8b84368d6aad0c7aa600fe141b4f07435f	2013-07-26 18:30:49 +01:00
Yunqing Wang	52256cdbca	Modify static threshold calculation Used 3 * standard_deviation in internal threshold calculation instead of fit curve. This actually approached the algorithm better. For comparison, similar tests were done: The overall psnr loss is less than before. 1. derf set: when static-thresh = 1, psnr loss is 0.329%; when static-thresh = 500, psnr loss is 0.970%; 2. stdhd set: when static-thresh = 1, psnr loss is 0.922%; when static-thresh = 500, psnr loss is 1.307%; Similar speedup is achieved. For example, clip bitrate static-thresh psnr time akiyo(cif) 500 0 48.952 5.077s(50f) akiyo 500 500 48.866 4.169s(50f) parkjoy(1080p) 4000 0 30.388 78.20s(30f) parkjoy 4000 500 30.367 70.85s(30f) sunflower(1080p) 4000 0 44.402 74.55s(30f) sunflower 4000 500 44.414 68.69s(30f) Change-Id: Ic78833642ce1911dbbd1cb6c899a2d7e2dfcc1f3	2013-07-25 19:59:33 -07:00
Yunqing Wang	845fd5011c	Merge "Add encoding option --static-thresh"	2013-07-25 14:58:00 -07:00
Yunqing Wang	d36852b702	Add encoding option --static-thresh This option exists in VP8, and it was rewritten in VP9 to support skipping on different partition levels. After prediction is done, we can check if the residuals in the partition block will be all quantized to 0. If this is true, the skip flag is set, and only prediction data are needed in reconstruction. Based on DCT's energy conservation property, the skipping check can be estimated in spatial domain. The prediction error is calculated and compared to a threshold. The threshold is determined by the dequant values, and also adjusted by partition sizes. To be precise, the DC and AC parts for Y, U, and V planes are checked to decide skipping or not. Test showed that 1. derf set: when static-thresh = 1, psnr loss is 0.666%; when static-thresh = 500, psnr loss is 1.162%; 2. stdhd set: when static-thresh = 1, psnr loss is 1.249%; when static-thresh = 500, psnr loss is 1.668%; For different clips, encoding speedup range is between several percentage and 20+% when static-thresh <= 500. For example, clip bitrate static-thresh psnr time akiyo(cif) 500 0 48.923 5.635s(50f) akiyo 500 500 48.863 4.402s(50f) parkjoy(1080p) 4000 0 30.380 77.54s(30f) parkjoy 4000 500 30.384 69.59s(30f) sunflower(1080p) 4000 0 44.461 85.2s(30f) sunflower 4000 500 44.418 78.1s(30f) Higher static-thresh values give larger speedup with larger quality loss. Change-Id: I857031ceb466ff314ab580ac5ec5d18542203c53	2013-07-25 14:28:05 -07:00
Dmitry Kovalev	7131cb0e3d	General cleanups. Removing unused constants, macros, and function declarations. Using ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving #include from .h to .c. Merging for loops for motion vectors. Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13	2013-07-25 14:13:48 -07:00
Dmitry Kovalev	d53fc9ee4e	Merge "Adding lookup table for size group."	2013-07-25 13:57:28 -07:00
Dmitry Kovalev	08fd41ccd7	Adding lookup table for size group. Change-Id: Ia6144d77ebed66e0739b62e4d673e26a95aa9550	2013-07-25 12:58:54 -07:00
Adrian Grange	e862c6f9eb	Merge "Simplify handling of sub-partition motion vectors"	2013-07-25 12:58:38 -07:00
Adrian Grange	6f0f0e4907	Merge "Use local variables rather than structure members"	2013-07-25 12:57:52 -07:00
Dmitry Kovalev	d604914f09	Merge "Removing vp9_adapt_mode_context function."	2013-07-25 12:46:31 -07:00
Jingning Han	d571af76d3	Merge "Make coeff_optimize initialized per-plane"	2013-07-25 12:46:14 -07:00
Yaowu Xu	51a8458822	Merge "fix a bug where flags are not reset"	2013-07-25 12:18:51 -07:00
Adrian Grange	be700e140a	Simplify handling of sub-partition motion vectors Simplified the code that extracts and uses the motion vectors for the 4 sub-partitions in rd_pick_partition. Change-Id: Iaf698ef7ee3aef9edd59015e1ae065dd359b17d9	2013-07-25 11:51:51 -07:00
Jingning Han	2f58faffa4	Make coeff_optimize initialized per-plane This commit makes the initialization of trellis coeff optimization a per-plane operation, thereby eliminating the redundant steps in encode_sby and encode_sbuv. It makes the encoder at speed 0 slightly faster. Change-Id: Iffe9faca6a109dafc0dd69dc7273cbdec19b17cd	2013-07-25 11:44:29 -07:00
Dmitry Kovalev	47d61f008f	Removing vp9_adapt_mode_context function. Moving code from vp9_adapt_mode_context to vp9_adapt_mode_probs. Change-Id: I60829c30b28968cd813551ef3a206dfb98d323c9	2013-07-25 10:48:45 -07:00
Yaowu Xu	3e386aefc2	fix a bug where flags are not reset The feature that uses small partition results as a measure to skip mode evaluation at larger partition requires the flags to be reset. The reset was missing in the code path that calls rd_use_partition(). Change-Id: Ia0a3a0aee1a862b6e2333d596808db7c48033d50	2013-07-25 10:28:38 -07:00
Scott LaVarnway	a0e8b45fee	Merge "pack_inter_mode_mvs cleanup"	2013-07-25 04:47:56 -07:00
Dmitry Kovalev	fcc34796d2	Removing CONFIG_BALANCED_COEFTREE experiment. Change-Id: I61a8b0101eac3ee2e0621d56151b90c269fd4db4	2013-07-24 15:53:42 -07:00
Dmitry Kovalev	9139ee0908	Adding condition inside get_tx_type_{4x4, 8x8, 16x16}. Adding plane type check condition because it was always used outside of get_tx_type_{4x4, 8x8, 16x16}. Change-Id: I02f0bbfee8063474865bd903eb25b54d26e07230	2013-07-24 12:55:45 -07:00
Adrian Grange	4cfd36d8fd	Use local variables rather than structure members Although local copies of the mode member variables (mode, ref_frame) were made, they were not used in all places. Also, made a local copy of the second_ref_frame member. Change-Id: I84d8c822e5cb3d8a02fc3de8a4037ca3fea8bfad	2013-07-24 11:17:44 -07:00
Adrian Grange	a183f17d33	Merge "Correct spelling mistakes"	2013-07-24 09:48:57 -07:00
Ronald S. Bultje	7817d3221f	Save pixels instead of coefficients in intra4x4 RD loop. Prevents doing duplicate IDCTs; encoding of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.0 to 1min3.5, i.e. 0.87% faster overall. Change-Id: I2df39e29ed9d5ea5e7d2704a34940ba622832ddd	2013-07-24 09:03:20 -07:00
Ronald S. Bultje	b72ecbb1b9	Add best_rd breakout in intra4x4 RD loop. Encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min5.4 to 1min4.0, i.e. 2.2% faster overall. Change-Id: I8c32f2aff9a649ce7dd49d910dc5ba16b99c3bc6	2013-07-24 09:02:05 -07:00
Adrian Grange	bc8b0529db	Correct spelling mistakes Change-Id: Id4138293efeac4503b2e01ce7a6c150a5abeef77	2013-07-24 07:58:26 -07:00
Ronald S. Bultje	47336afd8d	Merge "More optimizations for cost_coeffs()."	2013-07-23 21:36:12 -07:00
Jingning Han	666c266623	Merge "Unify the use of encode_b_args/optimize_block_args"	2013-07-23 18:08:50 -07:00
Dmitry Kovalev	1099a436d3	Moving counts from FRAME_CONTEXT to new struct FRAME_COUNTS. Counts are separate from frame context. We have several frame contexts but need only one copy of all counts. Change-Id: I5279b0321cb450bbea7049adaa9275306a7cef7d	2013-07-23 17:02:08 -07:00
Jingning Han	ab77828b36	Unify the use of encode_b_args/optimize_block_args The struct optimize_block_args is defined same as encode_b_args. Remove this redundant definition, and use encode_b_args consistently. Change-Id: I1703aeeb3bacf92e98a34f4355202712110173d9	2013-07-23 16:04:02 -07:00
Dmitry Kovalev	8d13b0d1df	Removing LOW_PRECISION_MV_UPDATE define. Change-Id: I78d16ee758e1fae0200b746f00031f6d9c6d6ce7	2013-07-23 15:41:45 -07:00
Dmitry Kovalev	a9bbabd94b	Merge "Removing vp9_is_interpolating_filter array."	2013-07-23 15:01:19 -07:00
Adrian Grange	719cd35f3a	Merge "Rolled-up several for loops into one"	2013-07-23 15:00:06 -07:00
Adrian Grange	646edbc1b2	Rolled-up several for loops into one Several consecutive for loops executed over the same index range, so I rolled them into one. Change-Id: I5cfcc8c38c738478965768409cca9d09adf224e1	2013-07-23 14:32:21 -07:00
Dmitry Kovalev	db7f5d28b9	Removing vp9_is_interpolating_filter array. All filters are interpolating now, so we don't need this array, all values from this array are evaluated to true. Change-Id: I9af6d8219ae0eb984063cd15e4e2296374ae4961	2013-07-23 14:24:39 -07:00
Dmitry Kovalev	2855d8aea1	Merge "Adding update_tx_counts function."	2013-07-23 13:57:59 -07:00
Jingning Han	e9e2fe8ec3	Make xform_quant operations tx_type independent The xform_quant() module is only used by inter modes, hence removing the redundant switches therein conditioned on tx_type. Change-Id: Ib87ce5b2f2e4cbf3ceb133a1108afa173c933a3f	2013-07-23 12:37:25 -07:00
James Zern	8dede954c7	Merge "vp9: make some static tables const"	2013-07-23 11:37:01 -07:00
Jingning Han	4ef1d35abf	Merge "Skip inverse transform when eob is zero"	2013-07-23 10:31:19 -07:00
Deb Mukherjee	9360fd3dcf	Merge "Diamond search change to accelerate movement"	2013-07-23 10:14:10 -07:00
Jingning Han	0359ad7f9a	Skip inverse transform when eob is zero When all the transform coefficients were quantized to zero, skip the inverse transform operation. For bus_cif at 1000 kbps, the runtime goes from 154967ms -> 149842ms, i.e., about 3% speed-up, at speed 0. Change-Id: Ic0a813fff5e28972d4888ee42d8747846a6c3cc6	2013-07-23 10:06:41 -07:00
Paul Wilkins	cedd24ec61	Merge "Renaming of segment constants."	2013-07-23 08:16:12 -07:00
Scott LaVarnway	7bc294a3fe	pack_inter_mode_mvs cleanup xd->mode_info_context is set to m prior to this call. Change-Id: Ibc442529961750c29ccf0c6cae08cb2b0431415f	2013-07-23 10:08:28 -04:00
Jim Bankoski	256ee00093	Merge "clean up bw, bh"	2013-07-23 06:58:28 -07:00
Jim Bankoski	86a9dec73c	clean up bw, bh many structures use bw and bh and they have different meanings. This cl attempts to start this clean up and remove unneccessary 2 step look up log and then shift operations... also removed partition type multiple operation code in bitstream.c. Change-Id: I7e03e552bdfc0939738e430862e3073d30fdd5db	2013-07-23 06:51:44 -07:00
Scott LaVarnway	2fd20eb37d	Merge "Eliminated prev_mip memsets/memcpys in encoder"	2013-07-23 06:43:52 -07:00
Paul Wilkins	7c134bc0cd	Merge "Reworked the auto_mv_step_size speed feature"	2013-07-23 04:49:55 -07:00
Paul Wilkins	32042af14b	Renaming of segment constants. Renamed: MAX_MB_SEGMENTS to MAX_SEGMENTS MB_SEG_TREE_PROBS to SEG_TREE_PROBS The minimum unit for segmentation in the segment map is now 8x8 so it is misleading to use MB_ as macro-block traditionally refers to a 16x16 region. Change-Id: I0b55a6f0426bb46dd13435fcfa5bae0a30a7fa22	2013-07-23 12:09:04 +01:00
James Zern	3c8cce353f	vp9: make some static tables const Change-Id: I8bcae51271673da8755c66a51aea005dfe6a3739	2013-07-22 19:19:13 -07:00
Ronald S. Bultje	e20fcd9585	More optimizations for cost_coeffs(). 4x4: 163 -> 123 cycles (33% faster) 8x8: 491 -> 399 cycles (23% faster) 16x16: 1889 -> 1763 cycles (7% faster) 32x32: 8311 -> 8180 cycles (1.6% faster) Overall encoding time of first 50 frames of bus (speed 0) @ 1500kbps goes from 1min4.33 to 1min3.00, i.e. 2.11% faster. Change-Id: Ib52d1dbb5649b14de769d3e7a74af67440b5284f	2013-07-22 16:09:09 -07:00
Dmitry Kovalev	b2fc6fa969	Adding update_tx_counts function. Moving common encoder/decoder code to update_tx_counts. Also renaming vp9_get_pred_probs_tx_size to get_tx_probs2 and adding get_tx_probs to call vp9_get_pred_context_tx_size inside read_selected_tx_size only once (twice before). Change-Id: Ia50247f3893de88ef8e9041b0d44be44a40aaa4d	2013-07-22 14:57:43 -07:00
Yaowu Xu	6261d79206	Merge "fix a build error"	2013-07-22 13:02:15 -07:00
James Zern	76db4d599a	Merge "VP[89]_COMMON: remove golden/altref frame counts"	2013-07-22 12:55:07 -07:00
Yaowu Xu	fc186dcad6	fix a build error Change-Id: I3b05687f439ff6a7c426d2c97a6c58c831fa51ac	2013-07-22 12:37:30 -07:00
Jingning Han	416f315e82	Merge "Skip buffer update in sub8x8 rd loop"	2013-07-22 12:08:22 -07:00
Jingning Han	a5a9f5f7f3	Merge "Optimize operation flow in sub8x8 rd loop"	2013-07-22 12:08:15 -07:00
Deb Mukherjee	a1e2d50be9	Diamond search change to accelerate movement Optional change in diamond search to continue in the best move direction until that move turns worse. This is still WIP since the exact way the new method is to be used is under investigation. One option is to make it an option in diamond search and use it only when motion is large. Overall slightly positive on derfraw300 +0.02%, stdhdraw +0.13%, but works a lot better for high motion sequences (ex. football : +1%). Change-Id: If88e01a6021daa0cda934680cdc70be1ee04f798	2013-07-22 11:19:15 -07:00
Paul Wilkins	3798d7a641	Merge "Re-order mode search in rd."	2013-07-22 10:46:04 -07:00
Jingning Han	409e77f2d4	Optimize operation flow in sub8x8 rd loop Stack the rate-distortion statistics in the sub8x8 rd loop. This allows the encoder to skip the forward transform, quantization, and coeff cost estimation, in the sub8x8 rd optimization search, if the motion vector(s) are of integer pixel value, and have been tested in the previous prediction filter type rd loops of the same block. This gives about 2% speed-up for bus_cif at 2000 kpbs, for speed 0. Its efficacy depends how frequently the motion search will select an integer motion vector. Change-Id: Iee15d4283ad4adea05522c1d40b198b127e6dd97	2013-07-22 10:40:33 -07:00
Paul Wilkins	1d189d6464	Re-order mode search in rd. Mode search order in rd loop changed to better reflect observed hit counts. Also some adjustment of the baseline mode rd thresholds to reflect the order change and observed frequencies. Change-Id: I47a131cc83e11551df8add6d6d8d413d78d3a63c	2013-07-22 17:21:12 +01:00
Jim Bankoski	9ad604c6fb	Merge "fix left over overflow"	2013-07-22 08:51:26 -07:00
Jim Bankoski	2ac8b50cd8	fix left over overflow This cl fixes issues rbultje brought up. that I somehow neglected when I submitted yaowu's patch. Change-Id: I07ad18796317822510b96e951c88d29f194a3c2e	2013-07-22 06:39:39 -07:00
Paul Wilkins	888375d243	Fix build error. When CONFIG_POSTPROC is set there was a now invalid reference to cm->filter_level. Changed to cpi->mb.e_mbd.lf.filter_level in line with change Iaf5fb71c33719cdfa1b991f671caf071be9ea035 Change-Id: If746e60044903f7ba8d0d346225b3d015226c7d0	2013-07-22 14:01:43 +01:00
Dmitry Kovalev	ee1fe2f750	Merge "Removing pre probabilities from FRAME_CONTEXT."	2013-07-20 22:50:32 -07:00
Dmitry Kovalev	8962d975b2	Merge "Moving all loop filter related variables into new struct."	2013-07-20 22:45:24 -07:00
Dmitry Kovalev	39342db138	Merge "Consistent names for inter mode probabilities and encodings."	2013-07-20 22:40:51 -07:00
Dmitry Kovalev	f66821afbb	Merge "Removing frame_type field from MACROBLOCKD struct."	2013-07-20 22:40:06 -07:00
Dmitry Kovalev	2b089f149a	Merge "Removing unused static arrays from vp9_reatectrl.c."	2013-07-20 22:39:33 -07:00
Jingning Han	c725502bf3	Skip buffer update in sub8x8 rd loop This commit allows the encoder to skip a few buffer update steps in rd_pick_best_mbsegmentation, when early breakout has been triggered in the rd_check_segment_txsize. It provides about 1% speed-up for bus_cif at 2000 kbps, in the settings of speed 0. Change-Id: Ica034f10a24dec572b397d8389a2b81020ebc0b9	2013-07-20 21:38:12 -07:00
Yaowu Xu	ea284d6281	added checks to prevent rate/distortion overflow At speed 2, due to the threshold scheme used, it is possible the rate and distortion assigned with INT_MAX value. The patch added checking to prevent the INT_MAX value is used in further calculation of RD scores. The patch also changed the assertion in rd_use_partition() to be mirror similar assertion in rd_pick_partition(). Change-Id: Idb52c543cc1e10abdf6e6a5d6e9cb535a42214dc	2013-07-19 17:52:50 -07:00
Dmitry Kovalev	7e703de729	Removing pre probabilities from FRAME_CONTEXT. Using cm->frame_contexts[cm->frame_context_idx] as source of previous probabilities. Change-Id: Ie03778acf0e7bebdc3a1f6a51854d4a0712f24a1	2013-07-19 17:33:10 -07:00
Dmitry Kovalev	ee1771ebaa	Moving all loop filter related variables into new struct. Adding loopfilter struct with fields from MACROBLOCKD and VP9Common. Eventually it will be moved to vp9_loopfilter.h for better code structure. Change-Id: Iaf5fb71c33719cdfa1b991f671caf071be9ea035	2013-07-19 16:19:10 -07:00
Dmitry Kovalev	29f0f79317	Removing unused static arrays from vp9_reatectrl.c. Removed arrays: kf_boost_seperation_adjustment, gf_adjust_table, gf_intra_usage_adjustment, gf_interval_table. Change-Id: I62e400cb6e4d039787615169a3779e31ebf95893	2013-07-19 15:55:09 -07:00
Dmitry Kovalev	c3a56ee583	Merge "Moving Scale2Ration function from vp9_onyx.h to vp9_onyx_if.c."	2013-07-19 15:27:24 -07:00
Deb Mukherjee	302698fb12	Reworked the auto_mv_step_size speed feature This patch modifies the auto_mv_step_size speed feature to use a combination of the maximum magnitude mv from the last inter frame, and the maximum magnitude mv for the two reference mvs with the same reference. For arf frames, the max mav step for the resolution is used. The bounds therefore are slightly tighter. The feature is made a speed 1 feature. Rebased. Results (when this feature is turned on over speed 0): derfraw300: -0.046% psnr, about 5+% speedup (tested on football: goes from 4m30.760s to 4m17.410s). Change-Id: If492797a61b0b4b3e58c0b8f86afb880165fc9f6	2013-07-19 15:12:56 -07:00
Dmitry Kovalev	e71a4a77bb	Merge "Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE)."	2013-07-19 12:14:32 -07:00
Dmitry Kovalev	97e96bc4e9	Removing frame_type field from MACROBLOCKD struct. Change-Id: Ia4e83913251c1cdc7aa2abd64bf01ecb1a962119	2013-07-19 11:55:36 -07:00
Dmitry Kovalev	c0eb57406c	Renaming TXFM_MODE to TX_MODE (like TX_SIZE, TX_TYPE). Moving TX_MODE enum to vp9_enums.h. Renaming txfm_mode variables to tx_mode. Change-Id: I459d1af6dd928ce7fccdf8ce30b6f1ca057bef92	2013-07-19 11:37:13 -07:00
Dmitry Kovalev	afe43d4089	Removing redundant VP9_COMMON* from function signatures. Functions: vp9_get_pred_context_switchable_interp, vp9_get_pred_context_intra_inter, vp9_get_pred_context_single_ref_p1, vp9_get_pred_context_single_ref_p2. Change-Id: I3d6fb8aee23c9062270768e1e6da416dd9bb8f96	2013-07-19 11:20:49 -07:00
Dmitry Kovalev	bc7acb134b	Consistent names for inter mode probabilities and encodings. Renaming vp9_sb_mv_ref_tree to vp9_inter_mode_tree, and vp9_sb_mv_ref_encoding_array to vp9_inter_mode_encodings. Change-Id: I0e91fbf81350d3ec5a2599064c74089b5d06133a	2013-07-19 10:40:04 -07:00
Paul Wilkins	f3ed9f5523	Alignment of THR_MODES to vp9_mode_order[] Change-Id: I4032dd0442043543954dcb3724df974b7cc7e515	2013-07-19 11:33:39 +01:00

... 3 4 5 6 7 ...

1752 Commits