generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	166dc85bed	Temporarily disable SSSE3 quant_32x32 Make the current head working properly, while working on fixing an issue in the SSSE3 implementation of 32x32 quantization. Change-Id: Ic029da3fd7f1f5e58bc641341cbd226ec49a16bc	2013-08-26 10:45:59 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
James Zern	2c6ba737f8	Merge "vp9: remove unnecessary wait w/threaded loopfilter"	2013-08-23 18:52:10 -07:00
Dmitry Kovalev	50ee61db4c	Renaming D27 to D207. I've already renamed d27_predictor to d207_predictor but forgot about the corresponding constant. Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857	2013-08-23 17:33:48 -07:00
Dmitry Kovalev	480dd8ffbe	Using existing functions instead of raw expressions. Change-Id: Ifa50b04bac1a6ff2abef989073cbf1f37a89eb50	2013-08-23 17:26:53 -07:00
Dmitry Kovalev	e6c435b506	Merge "Cleanup in mvref_common.{h, c}."	2013-08-23 17:09:49 -07:00
Dmitry Kovalev	7194da2167	Merge "Fixing display size setting problem."	2013-08-23 17:08:51 -07:00
Yaowu Xu	13930cf569	Limit mv range to be based on partition size Previous change `c4048dbd` limits the mv search range assuming max block size of 64x64, this commit change the search range using actual block size instead. Change-Id: Ibe07ab02b62bf64bd9f8675d2b997af20a2c7e11	2013-08-23 15:43:57 -07:00
Dmitry Kovalev	cd2cc27af1	Removing redundant calls to clamp_mv2. We could avoid calling clamp_mv2 because it has been already called inside vp9_find_best_ref_mvs function. Change-Id: I08edeaf3e11e98c19e67b9711b2523ca5fb1416e	2013-08-23 15:18:35 -07:00
Yaowu Xu	8e04257bc5	Merge "Added border extension"	2013-08-23 14:43:58 -07:00
Adrian Grange	78debf246b	Merge "Fix bug in convolution functions (filter selection)"	2013-08-23 13:41:47 -07:00
Dmitry Kovalev	fb481913f0	Merge "Removing useless calls to setup_{pre, dst}_planes."	2013-08-23 13:37:32 -07:00
Dmitry Kovalev	11e3ac62a5	Fixing display size setting problem. Fix of https://code.google.com/p/webm/issues/detail?id=608. We could have used invalid display size equal to the previous frame size (not to the current frame size). Change-Id: I91b576be5032e47084214052a1990dc51213e2f0	2013-08-23 13:12:46 -07:00
Dmitry Kovalev	21d8e8590b	Cleanup in mvref_common.{h, c}. Making code more compact, adding consts, removing redundant arguments, adding do/while(0) for macros. Change-Id: Ic9ec0bc58cee0910a5450b7fb8cfbf35fa9d0d16	2013-08-23 12:00:30 -07:00
Yaowu Xu	656632b776	Added border extension To the source buffer to be encoded as an alt ref frame. This is to fix the problem of using uninitialized memory in encoder. See https://code.google.com/p/webm/issues/detail?id=605 Change-Id: I97618a2fc207e08abcf5301b734aa9e3ad695e2c	2013-08-23 11:31:28 -07:00
Adrian Grange	3f10831308	Fix bug in convolution functions (filter selection) (In response to Issue 604: https://code.google.com/p/webm/issues/detail?id=604) There were bugs in the convolution code for two cases: 1. Where the filter table was assumed to be aligned to a 256 byte boundary. The offset of the pixel in the source buffer was computed incorrectly. 2. Where no such alignment assumption was made. An incorrect address for the filter table base was used. To fix both problems, I now assume that the filter table is 256-byte aligned and modify the pixel offset calculation to match. A later patch should remove the restriction that the filter table is aligned to a 256-byte boundary. There was also a bug in the ConvolveTest unit test (convolve_test.cc). (Bug & initial fix suggestion submitted by Tero Rintaluoma and Sami Pietilä). Change-Id: I71985551e62846e55e40de9e7e3959d4805baa82	2013-08-23 11:16:08 -07:00
Dmitry Kovalev	1c159c470a	Merge "Checking scale factors on access."	2013-08-23 11:05:17 -07:00
hkuang	b85367a608	Merge "Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling."	2013-08-23 10:08:43 -07:00
Paul Wilkins	aa5b67add0	Changes to adaptive inter rd thresholds. Values now carried over frame to frame. Change to algorithm for decreasing threshold after a hit and to max threshold (now based on speed) Removed some old commented out code relating to VP8 adaptive thresholds. The impact of these changes tested on Akiyo (50 frames) and measured in terms of unit rd hits is as follows: Speed 0 84.36 -> 84.67 Speed 1 29.48 -> 22.22 Speed 2 11.76 -> 8.21 Speed 3 12.32 -> 7.21 Encode speed impact is broadly in line with these. Change-Id: I5b886efee3077a11553fa950d796fd6d00c8cb19	2013-08-23 16:18:45 +01:00
Paul Wilkins	f76f52df61	Limit Key frame Intra modes checks. Most of the focus so far has been on inter frames. At high speed settings the key frame is now taking a high % of the cycles. This patch puts in some masking to reduce the number of INTRA modes searched during key frame coding (as already happens for inter frames) at higher speed settings TODO: Develop this further with either adaptive rd thresholds when choosing which intra modes to consider or some other heuristic. Impact. At high speed settings on some clips the key frame was starting to dominate. In a coding of the first 50 frames of AKIYO at speed 2 limiting the key frame intra modes to DC or TM_PRED resulted in ~30% overall speedup. For Bus the number was lower at ~4-5%. Change-Id: I7bde68aee04995f9d9beb13a1902143112e341e2	2013-08-23 16:10:30 +01:00
Jingning Han	9655c2c7a6	Merge "Fix rectangular partition check flag"	2013-08-22 18:59:18 -07:00
Dmitry Kovalev	33104cdd42	Merge "vp9_encodeframe.c cleanup."	2013-08-22 18:07:35 -07:00
James Zern	711aff9d9d	Merge "vp9/encoder: fix last_frame_seg_map mem leak"	2013-08-22 18:04:03 -07:00
James Zern	d843ac5132	Merge "rename LOG2_* defines to *_LOG2"	2013-08-22 18:02:42 -07:00
Jingning Han	84f3b76e1c	Fix rectangular partition check flag Put rectangular partition check flag change according to the rd costs of NONE and SPLIT partition types under the speed feature. Change-Id: If681e1e078a8d43d86961ea4b748da5cd1b6c331	2013-08-22 17:15:01 -07:00
Dmitry Kovalev	53f6f8ac93	Merge "check_bsize_coverage cleanup."	2013-08-22 16:18:24 -07:00
hkuang	4205d79273	Merge "Add neon optimize vp9_short_idct10_16x16_add."	2013-08-22 15:57:28 -07:00
hkuang	4082bf9d7c	Add neon optimize vp9_short_idct10_16x16_add. vp9_short_idct10_16x16_add is used to handle the block that only have valid data at top left 4x4 block. All the other datas are 0. So we could cut many unnecessary calculations in order to save instructions. Change-Id: I6e30a3fee1ece5af7f258532416d0bfddd1143f0	2013-08-22 15:53:22 -07:00
Dmitry Kovalev	604022d40b	vp9_encodeframe.c cleanup. Removing unused get_sbuv_perpixel_variance function, using has_second_ref/ is_inter_block functions, organizing includes. Change-Id: I016de4af12fbbb8b4ece26a70759b2392651b095	2013-08-22 15:50:51 -07:00
Dmitry Kovalev	335b1d360b	check_bsize_coverage cleanup. Change-Id: Ib7803857b35c00e317c9deb8630e777e25eb278f	2013-08-22 15:45:56 -07:00
Dmitry Kovalev	3c42657207	Checking scale factors on access. It is possible to have invalid scale factors and not access them during decoding. Error is reported if we really try to use invalid scale factors. Change-Id: Ie532d3ea7325ee0c7a6ada08269f804350c80fdf	2013-08-22 15:19:05 -07:00
James Zern	40ae02c247	rename LOG2_* defines to *_LOG2 gets rid of a mix of styles Change-Id: I3591d312157bc6f53a25438bf047765c671fd8a8	2013-08-22 14:45:24 -07:00
Dmitry Kovalev	13eed79c77	Merge "Adding vp9_is_scaled function."	2013-08-22 14:39:55 -07:00
Dmitry Kovalev	09858c239b	Removing useless calls to setup_{pre, dst}_planes. Comment is wrong, we don't initialize any xd pointers. We only initialize xd->planes[i]->dst and xd->planes[i]->pre[], which are actually initialized for every block during the decoding. Change-Id: If152ea872ebef1f83ca70712fa6f8df1b6855f56	2013-08-22 14:39:05 -07:00
James Zern	a5726ac453	vp9/encoder: fix last_frame_seg_map mem leak remove duplicate allocation from vp9_create_compressor, it was added to vp9_alloc_frame_buffers in: `d5bec52` Added resizing & initialization of last frame segment map Change-Id: I996723226a16a62aff8f9a52ac74e0b73cc98fdf	2013-08-22 14:13:04 -07:00
Dmitry Kovalev	640dea4d9d	Adding vp9_is_scaled function. Change-Id: Ieb7077ca3586b9491912027eed450a4f6fd38d30	2013-08-22 14:04:59 -07:00
Jingning Han	8adc20ce35	Merge "Refactor rd_pick_partition for parameter control"	2013-08-22 13:54:48 -07:00
James Zern	da9a6ac9e7	Merge "vp9_peek_si: add bitstream v1 support"	2013-08-22 13:28:00 -07:00
Jingning Han	01a37177d1	Refactor rd_pick_partition for parameter control This commit changes the partition search order of superblocks from {SPLIT, NONE, HORZ, VERT} to {NONE, SPLIT, HORZ, VERT} for consistency with that of sub8x8 partition search. It enable the use of early termination in partition search for all block sizes. For ped_area_1080p 50 frames coded at 4000 kbps, it makes the runtime goes down from 844305ms -> 818003ms (3% speed-up) at speed 0. This will further move towards making the in-search partition types configurable, hence unifying various speed-up approaches. Some speed 1 and 2 features are turned off during the refactoring process, including: disable_split_var_thresh using_small_partition_info Stricter constraints are applied to use_square_partition_only for right/bottom boundary blocks. Will bring back/refine these features subsequently. At this point, it makes derf set at speed 1 about 0.45% higher in compression performance, and 9% down in run-time. Change-Id: I3db9f9d1d1a0d6cbe2e50e49bd9eda1cf705f37c	2013-08-22 12:36:02 -07:00
hkuang	610642c130	Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling. Change-Id: I5ea881a6e419f9e8ed4b3b619406403b4de24134	2013-08-22 11:02:22 -07:00
Deb Mukherjee	8b810c7a78	Fixes on feature disabling split based on variance Adds a couple of minor fixes, which may be absorbed in Jingning's patch. Thanks to Guillaume for pointing these out. Also adjusts the thresholds for speed 1 and 2 to 16 and 32 respectively, to keep quality drops small. Results: -------- derfraw300: threshold = 16, psnr -0.082%, speedup 2-3% threshold = 32, psnr -0.218%, speedup 5-6% stdhdraw250: threshold = 16, psnr -0.031%, speedup 2-3% threshold = 32, psnr -0.273%, speedup 5-6% Change-Id: I4b11ae8296cca6c2a9f644be7e40de7c423b8330	2013-08-22 07:05:44 -07:00
Scott LaVarnway	f39bf458e5	Merge "Initialize mb_skip_coeff before picking modes"	2013-08-22 06:26:04 -07:00
Scott LaVarnway	94bfbaa84e	Initialize mb_skip_coeff before picking modes It appears that the above/left mb_skip_coeff used during the pick modes, is left over from the previously encode frame. This patch initializes the flag to the default value of zero. Change-Id: Ida4684cc99611d6e3e82628db35ed717e28ce550	2013-08-22 08:51:04 -04:00
Dmitry Kovalev	96a1a59d21	Merge "Using has_second_ref function to simplify the code."	2013-08-22 01:39:14 -07:00
Dmitry Kovalev	a33f178491	Merge "Cleaning up foreach_transformed_block_in_plane."	2013-08-22 01:37:21 -07:00
Dmitry Kovalev	359b571448	Merge "Cleaning up reset_skip_context function."	2013-08-22 01:36:25 -07:00
Dmitry Kovalev	596c51087b	Merge "Removing unused foreach_predicted_block function."	2013-08-22 01:35:41 -07:00
Dmitry Kovalev	cb05a451c6	Merge "Cleaning up optimize_init_b function."	2013-08-22 01:35:27 -07:00
Dmitry Kovalev	64c0f5c592	Merge "Cleaning up sum_intra_stats function."	2013-08-22 01:34:39 -07:00
Jingning Han	fcb890d751	Merge "Enable zero coeff check in sub8x8 UV rd loop"	2013-08-21 22:07:00 -07:00
James Zern	85640f1c9d	vp9: remove unnecessary wait w/threaded loopfilter the final macroblock rows are scheduled in the main thread. prior to this change one additional macroblock row would be scheduled in the worker forcing the main thread to wait before finishing. Change-Id: I05f3168e5c629b898fcebb0d77eb6d6a90d6105e	2013-08-21 17:43:44 -07:00
Dmitry Kovalev	4172d7c584	Cleaning up foreach_transformed_block_in_plane. Change-Id: I9f45af3894c57f35cb266c255e2b904295d39c34	2013-08-21 17:16:02 -07:00
James Zern	6167355309	vp9_peek_si: add bitstream v1 support currently protected by CONFIG_NON420 as v1 is still not entirely stable Change-Id: Id1c5081b04a2c47a842822048b8804be67d23a6d	2013-08-21 17:04:10 -07:00
Dmitry Kovalev	be60924f29	Cleaning up optimize_init_b function. Change-Id: Ib2c975e1d96deefb7ac4d6b600c8c5388035d111	2013-08-21 16:40:16 -07:00
Dmitry Kovalev	c43da352ab	Cleaning up reset_skip_context function. Change-Id: Ib3e72671eb8da6f2e9767a6de292ec7c7cde6bc7	2013-08-21 16:31:51 -07:00
Dmitry Kovalev	048ccb2849	Cleaning up sum_intra_stats function. Using size_group_lookup table and better variable names. Change-Id: I6e67f2ce091845db43ace7d21b7ae31c6f165aec	2013-08-21 16:25:02 -07:00
Dmitry Kovalev	3286abd82e	Merge "Adding scale factor check."	2013-08-21 14:11:13 -07:00
Dmitry Kovalev	687891238c	Merge "Removing PLANE_TYPE argument from cost_coeffs function."	2013-08-21 14:10:05 -07:00
Deb Mukherjee	a2f7619860	Merge "Make "good" quality 2-pass vpxenc encoding default"	2013-08-21 13:58:49 -07:00
James Zern	ac12f3926b	Merge "vp9 rtcd: remove non-existent sad functions"	2013-08-21 13:55:59 -07:00
Dmitry Kovalev	2f1a0a0e2c	Removing PLANE_TYPE argument from cost_coeffs function. We can determine plane_type for another function arguments. Change-Id: I85331877aedb357632ae916a37b5b15f22c0bb1f	2013-08-21 13:02:28 -07:00
Deb Mukherjee	0d8723f8d5	Make "good" quality 2-pass vpxenc encoding default Currently, the best quality mode in VP9 is not very well developed, and unnecessarily makes the encode too slow. Hence the command line default is changed to "good" quality. Also, the number of passes default is changed to 2 passes as well, since 1-pass encoding is not very efficient in VP9. Besides, a number of VP9 defaults are set to the currently recommended settings. With these changes, vpxenc run with --codec=vp9 --kf-max-dist=9999 --cpu-used=0 should work about the same as our borg results. Note when the --cpu-used=0 option is dropped there will be a slight difference in the output, because of a difference in the cpu-used value for the first pass. Specifically, the default when unspecified is to use cpu_used=1 for the first pass and cpu_used=0 for the second pass. But when specified, both passes will use the cpu-used value specified. Note that this also changes the default for VP8 as being "good" but other options stay unchanged. Change-Id: Ib23c1a05ae2f36ee076c0e34403efbda518c5066	2013-08-21 12:41:26 -07:00
Dmitry Kovalev	27a984fbd3	Removing a lot of duplicated code. Adding set_contexts contexts function and call it instead of set_contexts_on_border. Calling txfrm_block_to_raster_xy to get aoff and loff. Change-Id: I41897e344afd2cae1f923f4fdbe63daccf6fe80e	2013-08-21 11:55:12 -07:00
Dmitry Kovalev	a3ae4c87fd	Adding scale factor check. We support only [1/16, 2] scale factors, enforcing this now. Change-Id: I0822eb7cea51720df6814e42d3f35ff340963061	2013-08-21 11:24:47 -07:00
Adrian Grange	ce28d0ca89	Fix typos and minor stylistic cleanup Change-Id: I32e43474e8651ef2eb181d24860a8f118cfea7bf	2013-08-21 08:45:42 -07:00
Adrian Grange	5b63963573	Merge "Further correct bug in loopfilter initialization"	2013-08-21 07:17:43 -07:00
James Zern	ae455fabd8	vp9 rtcd: remove non-existent sad functions vp9_sad32x3, vp9_sad3x32 + remove unnecessary sad include from vp9_findnearmv.c Change-Id: Idef2a89cadc3fec64eff82ba9be60ffff50b3468	2013-08-20 18:07:53 -07:00
Dmitry Kovalev	90027be251	Removing unused foreach_predicted_block function. Moving foreach_predicted_block_in_plane function to vp9_reconinter.c because there is only one usage. Change-Id: I9852feae43fc3cf809b817fc541d043bc5496209	2013-08-20 17:20:47 -07:00
Dmitry Kovalev	7f814c6bf8	Merge "Passing plane_bsize to foreach_transformed_block_visitor."	2013-08-20 14:25:01 -07:00
Dmitry Kovalev	27de4fe922	Using has_second_ref function to simplify the code. Updating implementation of vp9_get_pred_context_single_ref_p2 using has_second_ref function to make code easier to read. Change-Id: I5ba642712f59861a48aab974e73aa01640d086fe	2013-08-20 14:09:56 -07:00
hkuang	62a2cd9ed2	Merge "Add neon optimize vp9_short_idct10_8x8_add."	2013-08-20 14:06:57 -07:00
Dmitry Kovalev	381d3b8b7d	Merge "vp9_filter.{h, c} cleanup + adding SUBPEL_TAPS constant."	2013-08-20 13:46:53 -07:00
Dmitry Kovalev	d19ac4b66d	vp9_filter.{h, c} cleanup + adding SUBPEL_TAPS constant. Change-Id: Ib394ea23f464591dad50b5c65c316701378d06d7	2013-08-20 12:29:57 -07:00
hkuang	37cda6dc4c	Add neon optimize vp9_short_idct10_8x8_add. vp9_short_idct10_8x8_add is used to handle the block that only have valid data at top left 4x4 block. All the other datas are 0. So we could cut several unnecessary calculations in order to save instructions. Change-Id: I34fda95e29082b789aded97c2df193991c2d9195	2013-08-20 11:51:07 -07:00
Jingning Han	1bf1428654	Enable zero coeff check in sub8x8 UV rd loop Check the minimum rate-distortion cost of regular quantization and all zero coeffs cases in the sub8x8 inter prediction rd loop for luma components. Use this as the cumulative rdcost sent to UV rd estimation. Change-Id: Ia4bc7700437d5e13d7cdad4cf9ae57ab036d3e97	2013-08-20 10:33:42 -07:00
Deb Mukherjee	246381faf2	Merge "Cleanup/enhancements of switchable filter search"	2013-08-20 10:16:51 -07:00
Dmitry Kovalev	5826407f2a	Merge "Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c."	2013-08-20 10:06:22 -07:00
Dmitry Kovalev	5baf510f74	Merge "Adding has_second_ref function."	2013-08-20 10:06:14 -07:00
Dmitry Kovalev	039b0c4c9e	Merge "Adding VP9_FILTER_BITS constant."	2013-08-20 10:05:09 -07:00
Deb Mukherjee	2ffe64ad5c	Cleanup/enhancements of switchable filter search Cleans up the switchable filter search logic. Also adds a speed feature - a variance threshold - to disable filter search if source variance is lower than this value. Results: derfraw300 threshold = 16, psnr -0.238%, 4-5% speedup (tested on football) threshold = 32, psnr -0.381%, 8-9% speedup (tested on football) threshold = 64, psnr -0.611%, 12-13% speedup (tested on football) threshold = 96, psnr -0.804%, 16-17% speedup (tested on football) Based on these results, the threshold is chosen as 16 for speed 1, 32 for speed 2, 64 for speed 3 and 96 for speed 4. Change-Id: Ib630d39192773b1983d3d349b97973768e170c04	2013-08-20 09:47:04 -07:00
Jingning Han	bb64c9a355	Merge "Enable early termination in uv rd loop"	2013-08-20 09:07:26 -07:00
Jim Bankoski	be5dc2321b	Merge "fix the mv_ref_idx issue"	2013-08-20 09:00:57 -07:00
Jim Bankoski	f167433d9c	fix the mv_ref_idx issue The following issue was reported : https://code.google.com/p/webm/issues/detail?id=601&q=jimbankoski&sort=-id&colspec=ID%20Pri%20mstone%20ReleaseBlock%20Type%20Component%20Status%20Owner%20Summary This code makes the choice and code cleaner and removes any question about whether the border needs to be checked. Change-Id: Ia7aecfb3168e340618805bd318499176c2989597	2013-08-20 08:14:52 -07:00
Paul Wilkins	e8923fe492	Changes to auto partition size selection. Changes to code to auto select a partition size range based on data from spatial neighbors. Now looks at the sb_type in each 8x8 block of above and left SB64. The effect on speed 1 is now weaker giving better quality but less speed gain. Now also used in speed 2. Change-Id: Iace33a97d5c3498dd2a9a8a4067351941abcbabc	2013-08-20 14:05:39 +01:00
Dmitry Kovalev	2612b99cc7	Adding VP9_FILTER_BITS constant. Removing VP9_FILTER_WEIGHT, VP9_FILTER_SHIFT, BLOCK_WIDTH_HEIGHT constants. Using ROUND_POWER_OF_TWO for rounding. Change-Id: I2e8d6858dcd600a87096138209731137d7decc24	2013-08-20 00:42:25 -07:00
Dmitry Kovalev	d8286dd56d	Adding has_second_ref function. Updating implementation of vp9_get_pred_context_single_ref_p1 using has_second_ref function to make code easier to read. Change-Id: Ie8f60403a7195117ceb2c6c43176ca9a9e70b909	2013-08-19 18:39:34 -07:00
Yaowu Xu	c4048dbdd3	Change to limit the mv search range As the pixel values beyond image border are duplicates of pixels on edge, the change limits the mv search range, any mv beyond the limits no longer produce new/different prediction values as entire block with pixels used for subpel interpolation are outside image border. Change-Id: I4c6fdf06e33c1cef1489f5470ce0fb4e5e01fb79	2013-08-19 17:19:36 -07:00
Yaowu Xu	f70330a906	fix a bug when null function pointer is used. For certain partition size, the function poniter may not be intialized at all. The patch prevent the call if the pointer is not set. Change-Id: I78b8c3992b639e8799a16b3c74f0973d07b8b9ac	2013-08-19 17:16:12 -07:00
Dmitry Kovalev	569ca37d09	Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c. Change-Id: Ib8af21f2e7f603c2fb407e5d15a3bba64b545b49	2013-08-19 16:44:10 -07:00
Jingning Han	3275ad701a	Enable early termination in uv rd loop This commit enables early termination in the rate-distortion optimization search loop for chroma components. When the cumulative rd cost is above the current best value, skip the rest per-block transform/quantization/coeff_cost and continue to the next prediction mode. For bus_cif at 2000 kbps, the average run-time goes down from 168546ms -> 164678ms, (2% speed-up) at speed 0 36197ms -> 34465ms, (4% speed-up) at speed 1 Change-Id: I9d3043864126e62bd0166250d66b3170d520b3c0	2013-08-19 16:31:19 -07:00
Dmitry Kovalev	82d4d9a008	Passing plane_bsize to foreach_transformed_block_visitor. Updating all foreach_transformed_block_visitor functions to work with plane block size instead of general block. Removing a lot of duplicated code. Change-Id: I6a9069e27528c611f5a648e1da0c5a5fd17f1bb4	2013-08-19 15:47:24 -07:00
Jingning Han	31c97c2bdf	Merge "Fix potential use of uninitialized value"	2013-08-19 15:15:58 -07:00
Jingning Han	5dc0b309ab	Merge "Fix the returned distortion value in rd_pick_intra"	2013-08-19 14:34:19 -07:00
Dmitry Kovalev	2e3478a593	Using plane_bsize instead of bsize. This change set is intermediate. The next one will remove all repetitive plane_bsize calculations, because it will be passed as argument to foreach_transformed_block_visitor. Change-Id: Ifc12e0b330e017c6851a28746b3a5460b9bf7f0b	2013-08-19 13:20:21 -07:00
Adrian Grange	5a1a269f67	Further correct bug in loopfilter initialization The intent was to initialize the deltas for the segment to the computed value, irrespective of mode and reference frame if (mode_ref_delta_enabled == 0). (In response to bug posted by Manjit Hota to codec-devel and webm-discuss lists) Change-Id: I10435cb63d0f88359bb4c14f22181878a1988e72	2013-08-19 11:58:52 -07:00
Jingning Han	b34ce04378	Fix potential use of uninitialized value Initialize the best mode and tx_size values in the rate-distortion optimization search loop. Change-Id: Ibfb5c0895691f172abcd4265c23aef4cb99fa8af	2013-08-19 11:15:53 -07:00
Jingning Han	f67919ae86	Fix the returned distortion value in rd_pick_intra Return the distortion value in vp9_rd_pick_intra_mode_sb as sum of dist_y and dist_uv. Remove the right shift operation on dist_uv, and make it consistent with that of vp9_rd_pick_inter_mode_sb. Change-Id: I9d564e242d9add38e32595d33b0e0dddb1d55e5b	2013-08-16 21:23:22 -07:00
Dmitry Kovalev	26e5b5e25d	Removing unused or redundant arguments from *_args structures. Redundant dst, pre[2] from build_inter_predictors_args, unused cm from encode_b_args. Change-Id: I2c476cd328c5c0cca4c78ba451ca6ba2a2c37e2d	2013-08-16 12:51:20 -07:00
Dmitry Kovalev	367cb10fcf	Merge "Moving from ss_txfrm_size to tx_size."	2013-08-16 12:46:45 -07:00
Dmitry Kovalev	1462433370	Merge "Renaming d27 predictor to d207."	2013-08-16 12:07:24 -07:00
Johann	d514b778c4	Merge "Reduce the instructions of idct8x8. Also add the saving and restoring of D registers."	2013-08-16 11:30:21 -07:00
Johann	65aa89af1a	Merge "Reduce instructions of idct4x4."	2013-08-16 11:28:35 -07:00
Frank Galligan	bdc785e976	Merge "vp9: neon: optimise vp9_wide_mbfilter_neon"	2013-08-16 11:16:48 -07:00
hkuang	df0715204c	Reduce instructions of idct4x4. Change-Id: Ia26a2526804e7e2f656b0051618a615fca8fc79d	2013-08-16 10:54:56 -07:00
hkuang	60ecd60c9a	Reduce the instructions of idct8x8. Also add the saving and restoring of D registers. Change-Id: Id3630c90fcb160ef939fef55411342608af5f990	2013-08-16 10:32:12 -07:00
Johann	bba68342ce	Merge "vp9: neon: use aligned stores in convolve functions"	2013-08-16 10:29:59 -07:00
Adrian Grange	79f4c1b9a4	Fixed typos and formatting Change-Id: I3814984a624bc64147c57efa74fbdda8eda47262	2013-08-16 09:15:26 -07:00
Adrian Grange	3e340880a8	Merge "Added resizing & initialization of last frame segment map"	2013-08-16 09:07:36 -07:00
Mans Rullgard	4fa93bcef4	vp9: neon: use aligned stores in convolve functions The destination is block-aligned so it is safe to use aligned stores. Change-Id: I38261e4fa40bc60e6472edffece59e372908da7e	2013-08-16 14:25:08 +01:00
Dmitry Kovalev	afd9bd3e3c	Moving from ss_txfrm_size to tx_size. Updating foreach_transformed_block_visitor and corresponding functions to accept tx_size instead of ss_txfrm_size. List of functions per file: vp9_decodframe.c decode_block decode_block_intra vp9_detokenize.c decode_block vp9_encodemb.c optimize_block vp9_xform_quant vp9_encode_block_intra vp9_rdopt.c dist_block rate_block block_yrd_txfm vp9_tokenize.c set_entropy_context_b tokenize_b is_skippable Change-Id: I351bf563eb36cf34db71c3f06b9bbc9a61b55b73	2013-08-15 17:03:03 -07:00
Jingning Han	5e80a49307	Merge "Refactor rd loop for chroma components"	2013-08-15 16:02:12 -07:00
Adrian Grange	d5bec522da	Added resizing & initialization of last frame segment map When the frame size changes the last frame segment map must be resized to match and initialized to 0. Change-Id: Idc10de109f55dbe9af3a6caae355a2974712243d	2013-08-15 15:35:21 -07:00
Dmitry Kovalev	9451e8d37e	Merge "Converting code from using ss_txfrm_size to tx_size."	2013-08-15 15:21:09 -07:00
Dmitry Kovalev	939b1e4a8c	Merge "Moving segmentation struct from MACROBLOCKD to VP9_COMMON."	2013-08-15 15:14:32 -07:00
Johann	a9aa7d07d0	Merge "vp9: neon: add vp9_convolve_avg_neon"	2013-08-15 14:55:15 -07:00
Johann	63e140eaa7	Merge "vp9: neon: add vp9_convolve_copy_neon"	2013-08-15 14:55:08 -07:00
Jingning Han	68369ca897	Refactor rd loop for chroma components This commit makes the rate-distortion optimization search of chroma components consistent across all block sizes. It removes redundant codes. Change-Id: I7e76f54d045e8efdd41d84a164c71f55b484471b	2013-08-15 14:54:48 -07:00
Jingning Han	c2ff1882ff	Merge "Remove unused RDCOST_8X8 macro"	2013-08-15 13:48:25 -07:00
Jingning Han	ca983f34f7	Merge "Unify luma and chroma rd-cost estimation"	2013-08-15 13:48:15 -07:00
Dmitry Kovalev	bb3b817c1e	Converting code from using ss_txfrm_size to tx_size. Updated function signatures: txfrm_block_to_raster_block txfrm_block_to_raster_xy extend_for_intra vp9_optimize_b Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a	2013-08-15 11:44:57 -07:00
Dmitry Kovalev	6f4fa44c42	Using { 0 } for initialization instead of memset. Change-Id: I4fad357465022d14bfc7e13b348c6da267587314	2013-08-15 11:37:56 -07:00
Dmitry Kovalev	81d7bd50f5	Renaming d27 predictor to d207. 27 degrees intra predictor is actually 207 degrees, so renaming it. Change-Id: Ife96a910437eb80ccdc0b7a5b7a62c77542ae5be	2013-08-15 11:09:49 -07:00
Mans Rullgard	67e53716e0	vp9: neon: optimise vp9_wide_mbfilter_neon Break up long dependency chains to improve instruction scheduling. Change-Id: I0e0cb66943df24af920767bb4167b25c38af9630	2013-08-15 19:07:22 +01:00
James Zern	89a1fcf884	Merge "vp9_dx_iface: check for NULL/0-size input"	2013-08-15 10:59:22 -07:00
Dmitry Kovalev	b7616e387e	Moving segmentation struct from MACROBLOCKD to VP9_COMMON. VP9_COMMON is the right place to segmentatation struct because it has global segmentation parameters, not something specific to macroblock processing. Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708	2013-08-15 10:47:48 -07:00
Jingning Han	b0646f9e98	Remove unused RDCOST_8X8 macro Change-Id: I17c7d7eaa60fe69c543403c340f7c1078bfd339f	2013-08-15 10:40:44 -07:00
Dmitry Kovalev	4d73416099	Merge "Quantization code cleanup."	2013-08-15 10:23:01 -07:00
Deb Mukherjee	24856b6abc	Speed feature to skip split partition based on var Adds a speed feature to disable split partition search based on a given threshold on the source variance. A tighter threshold derived from the threshold provided is used to also disable horizontal and vertical partitions. Results on derfraw300: threshold = 16, psnr = -0.057%, speedup ~1% (football) threshold = 32, psnr = -0.150%, speedup ~4-5% (football) threshold = 64, psnr = -0.570%, speedup ~10-12% (football) Results on stdhdraw250: threshold = 32, psnr = -0.18%, speedup is somewhat more than derf because of a larger number of smoother blocks at higher resolution. Based on these results, a threshold of 32 is chosen for speed 1, and a threshold of 64 is chosen for speeds 2 and above. Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6	2013-08-15 10:01:45 -07:00
Jingning Han	ec01f52ffa	Unify luma and chroma rd-cost estimation This commit unifies the rate-distortion cost calculation process of luma and chroma components. It allows early termination to be enabled later in the rd search loop of chroma components, in consistent with luma pixels. Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d	2013-08-15 09:41:33 -07:00
Paul Wilkins	1a3641d91b	Merge "Renaming in MB_MODE_INFO"	2013-08-15 02:12:48 -07:00
James Zern	20395189cd	vp9_dx_iface: check for NULL/0-size input avoids a crash caused by issue #585 Change-Id: I301595ee0227699b0da6f0dad6d870dd546e94ef	2013-08-14 18:35:22 -07:00
hkuang	39f42c8713	Merge "Add neon optimize vp9_short_idct16x16_add."	2013-08-14 14:16:20 -07:00
hkuang	cf6beea661	Add neon optimize vp9_short_idct16x16_add. Change-Id: I27134b9a5cace2bdad53534562c91d829b48838d	2013-08-14 13:52:16 -07:00
Dmitry Kovalev	bb072000e8	foreach_transformed_block_in_plane cleanup, explicit tx_size var. Making foreach_transformed_block_in_plane more clear (it's not finished yet). Using explicit tx_size variable consistently instead of (ss_txfrm_size / 2) or (ss_txfrm_size >> 1) expression. Change-Id: I1b9bba2c0a9f817fca72c88324bbe6004766fb7d	2013-08-14 11:39:31 -07:00
Dmitry Kovalev	f2c073efaa	Adding const to arguments of intra prediction functions. Adding const to above and left pointers. Cleanup. Change-Id: I51e195fa2e2923048043fe68b4e38a47ee82cda1	2013-08-14 10:35:56 -07:00
Mans Rullgard	0f1deccf86	vp9: neon: add vp9_convolve_avg_neon Change-Id: I33cff9ac4f2234558f6f87729f9b2e88a33fbf58	2013-08-14 16:27:55 +01:00
Mans Rullgard	635ba269be	vp9: neon: add vp9_convolve_copy_neon Change-Id: I15adbbda15d1842e9f15f21878a5ffbb75c3c0c9	2013-08-14 16:27:55 +01:00
Paul Wilkins	26fead7ecf	Renaming in MB_MODE_INFO The macro block mode info context originally contained an entry for each 16x16 macroblock. In VP9 each entry refers to an 8x8 region not a macro block, so the naming is misleading. This first stage clean up changes the names of 3 entries in the structure to remove the mb_ prefix. TODO clean up the nomenclature more widely in respect of mbmi and bmi. Change-Id: Ia7305c6d0cb805dfe8cdc98dad21338f502e49c6	2013-08-14 12:47:52 +01:00
Paul Wilkins	54979b4350	Merge "Honor min_partition_size properly for non-square splits"	2013-08-14 04:45:18 -07:00
Guillaume Martres	fc50477082	Honor min_partition_size properly for non-square splits Don't do vertical or horizontal splits if subsize < min_partition_size, except for edge blocks where it makes sense. Change-Id: I479aa66ba1838d227b5de8312d46be184a8d6401	2013-08-13 15:24:03 -07:00
Dmitry Kovalev	bcc8e9d9c6	Merge "Little cleanup inside decode_tile() function."	2013-08-13 14:43:10 -07:00
Guillaume Martres	ecb78b3e0c	Merge "Trivial clean up."	2013-08-13 12:40:37 -07:00
Jingning Han	7e0f88b6be	Use lookup table to find largest txfm size Refactor choose_largest_txfm_size_ and make it find the largest transform size via lookup table. Change-Id: I685e0396d71111b599d5367ab1b9c934bd5490c8	2013-08-13 10:32:14 -07:00
Dmitry Kovalev	8105ce6dce	Merge "Using is_inter_block() instead of repetitive code."	2013-08-13 10:00:01 -07:00
Jingning Han	dc70fbe42d	Merge "Refactor model based tx search in super_block_yrd"	2013-08-13 08:48:49 -07:00
Paul Wilkins	5459f68d71	Trivial clean up. Delete unused / commented out variable references. Change-Id: Iaf20c0c3744f89adb296d153b516b5ea41b4f3b4	2013-08-13 13:26:18 +01:00
Paul Wilkins	8e35263bed	Merge "Honor min_partition_size properly"	2013-08-13 05:19:51 -07:00
Jingning Han	39fe235032	Merge "SSE2 high precision 32x32 forward DCT"	2013-08-12 23:03:47 -07:00
Dmitry Kovalev	2c7ae8c29a	Little cleanup inside decode_tile() function. Change-Id: I3ed4beb59371fe21ca3e82253aa98e0cbd5e0630	2013-08-12 18:28:13 -07:00
Johann	4417c04531	Merge "vp9: neon: optimise convolve8_vert functions"	2013-08-12 17:54:47 -07:00
Johann	4cabbca4ce	Merge "vp9: neon: optimise convolve8_horiz functions"	2013-08-12 17:54:42 -07:00
Dmitry Kovalev	32006aadd8	Using is_inter_block() instead of repetitive code. Change-Id: If0b04c476c34fb8c102c9f750d7fe5669a86a532	2013-08-12 17:42:14 -07:00
Jingning Han	78136edcdc	SSE2 high precision 32x32 forward DCT Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56	2013-08-12 16:52:53 -07:00
Jingning Han	14cc7b319f	Refactor model based tx search in super_block_yrd Remove unnecessary conditional branches in model-based transform size search. Change-Id: Ic862dc33ed6710a186f6248239dd5f09b5c19981	2013-08-12 16:34:48 -07:00
Dmitry Kovalev	b89eef8f82	Merge "Simplifying vp9_mvref_common.c."	2013-08-12 16:24:22 -07:00
Dmitry Kovalev	b214cd0dab	Merge "Removing foreach_predicted_block_uv function."	2013-08-12 15:54:01 -07:00
Dmitry Kovalev	98e3d73e16	Merge "Using MV* instead of int_mv* as argument of vp9_clamp_mv_min_max."	2013-08-12 15:53:25 -07:00
Dmitry Kovalev	1a5e6ffb02	Simplifying vp9_mvref_common.c. Change-Id: I272df2e33fa05310466acf06c179728514dd7494	2013-08-12 15:52:08 -07:00
Dmitry Kovalev	9d5885b0ab	Quantization code cleanup. Change-Id: I77b42418b852093f79260cbd880533a0bd86678f	2013-08-12 15:23:47 -07:00
Dmitry Kovalev	c66320b3e4	Merge "Entropy context related cleanups."	2013-08-12 15:18:24 -07:00
Dmitry Kovalev	bd1bc1d303	Merge "Making scaling code more clear."	2013-08-12 15:17:26 -07:00
Dmitry Kovalev	9a31d05e24	Removing unused convolve_avg_c function + cleanup. Change-Id: Id2b126c6456627c25e4041a82e304d0151d951ba	2013-08-12 14:28:00 -07:00
Dmitry Kovalev	1aedfc992a	Using MV* instead of int_mv* as argument of vp9_clamp_mv_min_max. Change-Id: I3c45916a9059f11b41e9d798e34ffee052969a44	2013-08-12 13:56:04 -07:00
Dmitry Kovalev	76d166e413	Removing foreach_predicted_block_uv function. Adding function build_inter_predictors_for_planes to build inter predictors for specified planes. This function allows to remove condition "#if CONFIG_ALPHA" and use MAX_MB_PLANE for general case. Renaming 'which_mv' local var to 'ref', and 'weight' argument to 'ref'. Change-Id: I1a97160c9263006929d38953f266bc68e9c56c7d	2013-08-12 13:54:13 -07:00
Dmitry Kovalev	a72e269318	Making scaling code more clear. Reusing existing functions, using constants instead of magic numbers. Change-Id: Idc689ffba52c9a8b203fcf26bd67110ecb5635f9	2013-08-12 13:30:26 -07:00
Jingning Han	3984b41c87	Fix a compile failure in vp9_get_compressed_data The lf struct is now with VP9_COMMON, instead of MACROBLOCKD. Change-Id: Idfdd4f91f78f486078a138322d58bb61e93e1bc9	2013-08-12 11:42:17 -07:00
Dmitry Kovalev	8b0e6035a2	Entropy context related cleanups. Adding set_skip_context() function used from both encoder and decoder. Change-Id: Ia22cfad3211a00a63eb294f64f857b78f4aa9b85	2013-08-12 11:24:24 -07:00
Mans Rullgard	ad7021dd6c	vp9: neon: optimise convolve8_vert functions Invert loops to operate vertically in the inner loop. This allows removing redundant loads. Also add preloading of data. Change-Id: I4fa85c0ab1735bcb1dd6ea58937efac949172bdc	2013-08-12 15:37:48 +01:00
Dmitry Kovalev	097046ae28	Merge "Removing redundant code and function arguments."	2013-08-11 12:20:58 -07:00
Mans Rullgard	b84dc949c8	vp9: neon: optimise convolve8_horiz functions Each iteration of the horizontal loop reuses 7 of the 11 source values. Loading only the 4 new values saves some time. Also add preload for source data. Overall 4% faster on Chromebook. Change-Id: I8f69e749f2b7f79e9734620dcee51dbfcd716b44	2013-08-11 16:21:55 +01:00
Dmitry Kovalev	3c43ec206c	Renaming BLOCK_SIZE_TYPES constant to BLOCK_SIZES. There will be another change set to rename BLOCK_SIZE_TYPE enum to BLOCK_SIZE. Change-Id: I8d1dfc873d6186fa5e554262f5169e929978085e	2013-08-09 17:47:32 -07:00
Guillaume Martres	58b07a6f9d	Honor min_partition_size properly It represents the minimum partition size, so don't split if bsize == min_partition_size . Change-Id: Id77c32d6afef7d2ddec0368eaae18fb13227d30e	2013-08-09 17:28:33 -07:00
Dmitry Kovalev	67fe9d17cb	Removing redundant code and function arguments. Change-Id: Ia5cdda0f755befcd1e64397452c42cb7031ca574	2013-08-09 17:24:40 -07:00
Dmitry Kovalev	e7c5ca8983	Merge "Inlining 16 as a stride for BLOCK_OFFSET macro."	2013-08-09 17:22:46 -07:00
James Zern	ef101af8ae	Merge "vp9_rd_pick_inter_mode_sb: fix uninitialized value"	2013-08-09 17:13:32 -07:00
Dmitry Kovalev	f1559bdeaf	Inlining 16 as a stride for BLOCK_OFFSET macro. Change-Id: I7f23d174eb089e5500f268a10db09648634c1b82	2013-08-09 16:40:05 -07:00
James Zern	f295774d43	vp9_rd_pick_inter_mode_sb: fix uninitialized value 'skippable' can remain unset and negatively affect later decisions address one aspect of issue #599 Change-Id: Iffdf0ac2e49ac481c27dc27c87fa546d4167bb28	2013-08-09 16:26:22 -07:00
Dmitry Kovalev	125146034e	Merge "Using MV struct instead of int[2] array."	2013-08-09 15:33:08 -07:00
Dmitry Kovalev	cd0629fe68	Merge "Removing plane_block_{width, height}_log2by4 functions."	2013-08-09 15:26:51 -07:00
Dmitry Kovalev	ff7df102d9	Merge "Moving loopfilter struct to VP9_COMMON."	2013-08-09 15:23:00 -07:00
Dmitry Kovalev	816d6c989c	Moving loopfilter struct to VP9_COMMON. Loop filter configuration doesn't belong to macroblock, so moving it from MACROBLOCKD to VP9_COMMON. Also moving the declaration of loopfilter struct from vp9_blockd.h to vp9_loopfilter.h. Change-Id: I4b3e34be9623b47cda35f9b1f9951f8c5b1d5d28	2013-08-09 14:41:51 -07:00
Dmitry Kovalev	8ffe85ad00	Moving scale_factors and related code to separate files. Change-Id: I531829e5aee2a4a7a112d528ecccbddf052d0e74	2013-08-09 14:07:09 -07:00
Scott LaVarnway	ace93a175d	Merge "Bug fix: call set_offsets before rd_auto_partition_range"	2013-08-09 12:30:52 -07:00
Dmitry Kovalev	fa0cd61087	Merge "Using buf_2d struct instead of separate buffer and stride vars."	2013-08-09 11:50:58 -07:00
Scott LaVarnway	41251ae558	Bug fix: call set_offsets before rd_auto_partition_range The set_offsets call is necessary inorder to set the mode_info_context ptr correctly. Change-Id: I644910cc5bacc50ee9cd78458843274ad8ee636d	2013-08-09 14:09:49 -04:00
Adrian Grange	0eef1acbef	Merge "Correct bug in loopfilter initialization"	2013-08-09 09:51:58 -07:00
Adrian Grange	12eb2d0267	Correct bug in loopfilter initialization The memset sets 16 bytes rather than the correct size of the final array dimension (MAX_MODE_LF_DELTAS). (In response to bug posted by Manjit Hota to codec-devel and webm-discuss lists) Change-Id: I8980f5aa71ddc9d7ef57c5b4700bc28ddf8651c7	2013-08-09 09:21:15 -07:00
Yaowu Xu	6ec2b85bad	Added lpf level picking using partial frame Change-Id: I599ab1bd22b5f3f10d5962c609952abdef8ff67a	2013-08-09 07:37:08 -07:00
Yaowu Xu	6a7a4ba753	renamed vp8_yv12_copy_y to vpx_yv12_copy_y Becuase the routine is used by both vp8 and vp9 Change-Id: I2d35b287b5bc2394865d931a27da61f4ce7edeeb	2013-08-09 07:37:08 -07:00
Yaowu Xu	c7c9901845	added a speed feature on lpf level picking Change-Id: Id578f8afdeab3702fc8386969f2d832d8f1b5420	2013-08-09 07:36:32 -07:00
Dmitry Kovalev	6fd2407035	Using buf_2d struct instead of separate buffer and stride vars. Change-Id: Id5cc3566cc16d1e3030ddb4d1c58459320321dca	2013-08-08 21:25:48 -07:00
Dmitry Kovalev	6a8ec3eac2	General code cleanup. Removing redundant parenthesis and curly braces. Combining declarations with initializations. Adding useful intermediate variables instead of recalculating expressions every time. Change-Id: I00106f404afd60bfc189905b0fded881684f941a	2013-08-08 21:12:34 -07:00
Dmitry Kovalev	ee40e1a637	Merge "Cleanup inside vp9_reconinter.c."	2013-08-08 14:59:38 -07:00
Deb Mukherjee	2158909fc3	Merge "Adds a new subpel motion function"	2013-08-08 12:26:55 -07:00
Dmitry Kovalev	9e3bcdd135	Merge "Removing unneeded intermediate entropy_nodes_adapt var."	2013-08-08 12:16:57 -07:00
Dmitry Kovalev	47fad4c2d7	Using MV struct instead of int[2] array. Change-Id: Iab951c555037e36b154f319f351c5e67f9abb931	2013-08-08 12:01:56 -07:00
Dmitry Kovalev	ac008f0030	Removing unneeded intermediate entropy_nodes_adapt var. Change-Id: I541a178d997b4541e0e2d4d5b854e2ed6b113c3a	2013-08-08 11:52:02 -07:00
Deb Mukherjee	1ba91a84ad	Adds a new subpel motion function Adds a new subpel motion estimation function that uses a 2-level tree-structured decision tree to eliminate redundant computations. It searches fewer points than iterative search (which can search the same point multiple times) but has the same quality roughly. This is made the default setting at speeds 0 and 1, while at speed 2 and above only a 1-level search is used. Also includes various cleanups for consistency and redundancy removal. Results: derf: +0.012% psnr stdhd: +0.09% psnr Speedup of about 2-3% Change-Id: Iedde4866f5475586dea0f0ba4cb7428fba24eee9	2013-08-08 11:41:49 -07:00
Adrian Grange	83ee80c045	Moved fast motion search level decision to function Moving this block of code into a function makes the code easier to read and change. Change-Id: If4ede570cce1eab1982b188c4d3e4fd3d4db236e	2013-08-08 11:01:44 -07:00
Adrian Grange	aae6a4c895	Simplify & fix potential bug in rd_pick_partition Different partitionings were not being evaluated against best_rd and there were unnecessary calls to RDCOST. This could have resulted in a non-optimal partioning being selected. I simplified the variables used to track the rate, distortion and RD values throughout the function. Change-Id: Ifa7085ee80d824e86791432a5bc6d8fea5a3e313	2013-08-08 09:55:45 -07:00
Jingning Han	6bfcce8c7a	Merge "Use low precision 32x32fdct for encodemb in speed1"	2013-08-07 19:05:14 -07:00
Dmitry Kovalev	61c33d0ad5	Removing plane_block_{width, height}_log2by4 functions. Change-Id: I040b82b8e32aee272d10cbb021c7ba1c76343d7a	2013-08-07 17:06:33 -07:00
Dmitry Kovalev	a766d8918e	Cleanup inside vp9_reconinter.c. Using block width and block height instead of their logarithms. Using SUBPEL_BITS and SUBPEL_SHIFTS constants instead of magic numbers. Change-Id: I4e10e93c907c8a5e1cb27dfe74d1fcdcc4995448	2013-08-07 17:02:28 -07:00
Dmitry Kovalev	82d7c6fb3c	Merge "Using only one scale function in scale_factors struct."	2013-08-07 16:32:09 -07:00
Dmitry Kovalev	1492698ed3	Merge "Adding ss_size_lookup table."	2013-08-07 16:08:24 -07:00
Jingning Han	debb9c68c8	Use low precision 32x32fdct for encodemb in speed1 The low precision 32x32 fdct has all the intermediate steps within 16-bit depth, hence allowing faster SSE2 implementation, at the expense of larger round-trip error. It was used in the rate-distortion optimization search loop only. Using the low precision version, in replace of the high precision one, affects the compression performance by about 0.7% (derf, stdhd) at speed 0. For speed 1, it makes derf set down by only 0.017%. Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b	2013-08-07 15:34:12 -07:00
Dmitry Kovalev	8db2675b97	Adding ss_size_lookup table. Removing the old one bsize_from_dim_lookup. Now we have a way to determine block size for plane using its subsampling values (ss_size_lookup). And then we can find the number of pixels in the block (num_pels_log2_lookup). Change-Id: I6fc981da2ae093de81741d3d78eaefed11015db9	2013-08-07 15:33:17 -07:00
Dmitry Kovalev	ea2348ca29	Merge "Removing NMS_STATS defines."	2013-08-07 15:28:30 -07:00
Christian Duvivier	78182538d6	Neon version of vp9_short_idct4x4_add. Change-Id: Idec4cae0cb9b3a29835fd2750d354c1393d47aa4	2013-08-06 18:41:27 -07:00
Deb Mukherjee	296931c817	Merge "Clean ups of the subpel search functions"	2013-08-06 17:28:48 -07:00
Deb Mukherjee	71b43b0ff0	Clean ups of the subpel search functions Removes some unused code and speed features, and organizes the interfaces for fractional mv step functions for use in new speed features to come. In the process a new speed feature - number of iterations per step during the subpel search - is exposed. No change when this parameter is set as the original value of 3. Results: subpel_iters_per_step = 3: baseline subpel_iters_per_step = 2: psnr -0.067%, 1% speedup subpel_iters_per_step = 1: psnr -0.331%, 3-4% speedup Change-Id: I2eba8a21f6461be8caf56af04a5337257a5693a8	2013-08-06 17:23:50 -07:00
Dmitry Kovalev	63ec0587c1	Merge "Motion vector code cleanup."	2013-08-06 16:00:01 -07:00
Dmitry Kovalev	1c552e79bd	Using only one scale function in scale_factors struct. Functions scale_mv_q4 and scale_mv_q3_to_q4 were almost identical except q3->q4 conversion in scale_mv_q3_to_q4. Now q3->q4 conversion happens directly in vp9_build_inter_predictor. Also adding useful constants: SUBPEL_BITS and SUBPEL_MASK. Change-Id: Ia0a6ad2ac07c45fdf95a5139ece6286c035e9639	2013-08-06 15:43:56 -07:00
Jingning Han	2c091f9768	Merge "Place holder for high-precision 32x32 fdct"	2013-08-06 14:47:30 -07:00
Jim Bankoski	5b307886fb	variance x86inc guards also fixed bug in sad calcs Change-Id: I6571fcbe37556c16ae32be66dc0fd879852aac1d	2013-08-06 14:17:13 -07:00
Jim Bankoski	6eb1254b88	sse3 intrapred x86inc protected Change-Id: I4a3c83119cdf8a205920034c8019d855d5504605	2013-08-06 14:17:13 -07:00
Deb Mukherjee	fac7c8c9f9	Merge "Flexible support for various pattern searches"	2013-08-06 14:03:27 -07:00
Jim Bankoski	c9126e0b30	sad + miscellaneous updates Enable use_x86inc as a commandline option. Fix Bug with sse2 when x86inc is disabled. Adds Sad asm protection to x86inc protection Change-Id: Iee0f9dd235ea10e8ace512eb362ba9bebe8c9df6	2013-08-06 12:16:04 -07:00
Dmitry Kovalev	8725ca2ed2	Merge "Inlining vp9_get_pred_probs_switchable_interp function."	2013-08-06 11:57:45 -07:00
Deb Mukherjee	15b5a6a2c7	Flexible support for various pattern searches Adds a few pattern searches to achieve various tradeoffs between motion estimation complexity and performance. The search framework is unified across these searches so that a common pattern search function is used for all. Besides it will be easier to experiment with various patterns or combinations thereof at different scales in the future. The new pattern search is multi-scale and is capable of using different patterns at different scales. The new hex search uses 8 points at the smallest scale and 6 points at other scales. Two other pattern searches - big-diamond and square are also added. Big diamond uses 4 points at the smallest scale and 8 points in diamond shape at the larger scales. Square is very similar conceptually to the default n-step search but is somewhat faster since it keeps only one survivor across all scales. Psnr/speed-up results on derf300: hex: -1.6% psnr%, 6-8% speed-up big-diamond: -0.96% psnr, 4-5% speedup square: -0.93% psnr, 4-5% speedup Change-Id: I02a7ef5193f762601e0994e2c99399a3535a43d2	2013-08-06 11:56:39 -07:00
Jingning Han	28566a6cd5	Place holder for high-precision 32x32 fdct Resolve compile warnings on re-define FDCT32x32_2D template. Change-Id: Idb3a54ef8d2710ce7245b726379a0e5c875f5cad	2013-08-06 11:44:08 -07:00
Dmitry Kovalev	0c80065694	Inlining vp9_get_pred_probs_switchable_interp function. There was no benefit having this function. For example, inside read_switchable_filter_type switchable filter context was calculated twice. Change-Id: I79cd5bf95cbc0f6d8bf91a2e32289e01b18dcff1	2013-08-06 11:04:31 -07:00
Jingning Han	7d61f8fe53	Merge "Move fdct32x32 SSE2 implementation in separate file."	2013-08-06 10:46:41 -07:00
Jim Bankoski	efc94102f0	Merge "intrapred x86inc guards"	2013-08-06 10:39:19 -07:00
Dmitry Kovalev	a39abe2627	Motion vector code cleanup. Converting arguments of two functions (clamp_mv_ref, lower_mv_precision) from int_mv* to MV*. Rewriting is_inside function to make it much shorter. Change-Id: Ie4c4cf3eccd46707c7df099ec21fb1b61c72fc7a	2013-08-06 10:31:11 -07:00
Dmitry Kovalev	3e51acafec	Merge "Finally removing all old block size constants."	2013-08-06 10:30:37 -07:00
Dmitry Kovalev	4a692e4168	Merge "Changing the order switchable filter enum constants."	2013-08-06 10:30:26 -07:00
Dmitry Kovalev	25b7dc08cd	Merge "Removing unused functions."	2013-08-06 10:29:57 -07:00
Deb Mukherjee	33afddadb9	Merge "Add variance based mode/skipping"	2013-08-06 10:19:15 -07:00
Christian Duvivier	3d98205fce	Move fdct32x32 SSE2 implementation in separate file. This is in preparation for the SSE2 version of the high-precision 32x32 forward DCT which will share a lot of code with the existing low precision version used for rate-distortion search. Change-Id: I7084b6bdfb480b1fabb8493fb14e3f7fcc7888c0	2013-08-06 10:17:11 -07:00
Jim Bankoski	25ec1375c9	intrapred x86inc guards Change-Id: If0399d8e11f4ebe75a5c91abb8d6a52a7709065b	2013-08-06 09:39:30 -07:00
Jim Bankoski	62c6aa884d	block error / x86inc mods Change-Id: Icb607745634e10b9bac5019d06661ece09fcdb40	2013-08-06 06:23:38 -07:00
Jim Bankoski	a93b115cd6	reworked config for use_x86_inc Support enabling it or disabling it. Moved read out to configure.sh so that its done once instead of in make and in config. Change-Id: I73a9190cf31de9f03e8a577f478fa522f8c01c8b	2013-08-05 17:35:25 -07:00
James Zern	d115cd8b12	Merge changes I082959ab,Ib6932640 * changes: vp9/decoder: threaded row-based loop filter vp9/decoder: add thread worker	2013-08-05 16:07:09 -07:00
Dmitry Kovalev	b9c7d04e95	Finally removing all old block size constants. Change-Id: I3aae21e88b876d53ecc955260479980ffe04ad8d	2013-08-05 15:23:49 -07:00
Jim Bankoski	f4837579d1	fixed script problem with config_force_x86_inc Change-Id: I226e5094d216b09dc47fa5511a66e2d314608000	2013-08-05 14:48:20 -07:00
Jim Bankoski	a5a7322459	Merge "Begin to restrict x86inc.asm usage"	2013-08-05 14:17:49 -07:00
Deb Mukherjee	8b3faccb9e	Add variance based mode/skipping Adds a speed feature to skip all intra modes other than DC_PRED if the source variance is small. This feature is made part of speed 1 and up. Results on derf300: psnr -0.07%, speedup about 1-2% Also uses the source variance to fine-tune the early termination criteria when FLAG_EARLY_TERMINATE is on. This feature is made part of speed 2 and up. Results on derf300: psnr -0.52%, speedup about 5-7% Change-Id: I59e38aa836557cfa5405ae706fc64815cbfe4232	2013-08-05 14:14:01 -07:00
Jim Bankoski	9f988a2edf	Merge "cleanups after bw bh code"	2013-08-05 14:02:02 -07:00
James Zern	a0ffa2794b	vp9/decoder: threaded row-based loop filter Currently the only threaded option for vp9 decode. Enabled when the decoder config thread count is > 1. Change-Id: I082959abac9e31aa4a38ed9fd68b94680e57f4df	2013-08-05 13:22:04 -07:00
James Zern	183b77d5ab	vp9/decoder: add thread worker vp9/decoder/vp9_thread.[hc] Original source: http://git.chromium.org/webm/libwebp.git 100644 blob b1615d0fb8d311666b2fa4561076c62d72c2e3ff src/utils/thread.c 100644 blob 13a61a4c84194c3374080cbf03d881d3cd6af40d src/utils/thread.h Local modifications: - s/WebP/VP9/g - camelcase functions -> lower with _'s Change-Id: Ib6932640ee34f8b4782c6fbd15864a59d5d4c5fe	2013-08-05 13:21:13 -07:00
Dmitry Kovalev	3f611555d7	Changing the order switchable filter enum constants. This changeset allows to remove vp9_switchable_interp and vp9_switchable_interp_map arrays and make code much clear. Actually we still have to use these mapping but only inside read_interp_filter_type and write_interp_filter_type functions. Change-Id: I4026c6f8c4acefba6c81421b7bacbaa52cc45f50	2013-08-05 12:26:15 -07:00
Jim Bankoski	5d2cb7ead0	cleanups after bw bh code Cons bw/bh parms that should have been const. Additional formatting. Change-Id: Icd36a5c9dc17dadd7284315ac0d6fef1a565ca16	2013-08-05 12:15:52 -07:00
Jim Bankoski	c3809f3de5	Begin to restrict x86inc.asm usage Chromium does not support 32bit builds for Mac which use x86inc.asm. Make the files which include it work if 64bit or not PIC enabled starting with vp9_copy_sse2.asm Consolidate these targets in vp9_rtcd_defs.sh Change-Id: If18f0b957a611efd085a3ee7d245cf1eb91e8248	2013-08-05 12:07:30 -07:00
Dmitry Kovalev	d007446b3f	Replacing long block size enum values with shorter ones (2). Change-Id: I428c4d42212b757112e3acfe5b81314cfbb5fd6b	2013-08-05 10:51:02 -07:00
Dmitry Kovalev	319867d71c	Merge "Cleaning up vp9_build_inter_predictor function."	2013-08-05 01:52:11 -07:00
Dmitry Kovalev	78671e2eff	Merge "Replacing "txfm" with "tx" in identifiers."	2013-08-04 02:52:22 -07:00
Jim Bankoski	f703f98757	reworked find_mv_ref This is an attempt at rewriting vp9_find_mv_refs_idx. I believe that it gains about 1-2% decode speed Change-Id: Ia5359c94ce9bb43b32652890e605e9a385485c1b	2013-08-03 20:25:55 -07:00
Dmitry Kovalev	fe2a201eb1	Replacing "txfm" with "tx" in identifiers. Consistent names with TX_SIZE, TX_MODE, and TX_MODE. Change-Id: I79592218bf5a40ace89197a34a06ee7de581ed8d	2013-08-02 17:28:23 -07:00
Dmitry Kovalev	5edc65d00d	Removing NMS_STATS defines. Change-Id: Iabab0e59042a33456df1d449c0d0f01debc00c7c	2013-08-02 17:10:15 -07:00

... 3 4 5 6 7 ...

2722 Commits