generic-library/vpx

Author	SHA1	Message	Date
Tero Rintaluoma	e326cecf18	Fix intermediate height in convolve_c - Intermediate height was not correct i.e. when block size is 4 and y_step_q4 is 6. In this case intermediate height was (4*6) >> 4 = 1 and vertical interpolation needs two source pixels plus 7 extra pixels for taps. - Also if the current output block is 16x16 and we are using 4x upscaling we need only 12 rows after horizontal filtering instead of 16. Patch Set 2: Intermediate_height updated after CL 66723 "Fix bug in convolution functions (filter selection)" Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589	2013-08-30 10:31:21 +03:00
hkuang	ba8fc71979	Merge "Add neon optimize vp9_short_idct4x4_1_add."	2013-08-26 16:26:38 -07:00
hkuang	69384f4fad	Add neon optimize vp9_short_idct4x4_1_add. Change-Id: I6ecb5c4a1a472feb8e84e9f3352b536d5e28a4a5	2013-08-26 15:55:16 -07:00
Yaowu Xu	4505e8accb	Merge "Fix the reading of too many input pixels"	2013-08-26 14:01:50 -07:00
Paul Wilkins	aa823f8667	Merge "Changes to adaptive inter rd thresholds."	2013-08-26 12:48:11 -07:00
Yaowu Xu	6c5433c836	Fix the reading of too many input pixels in VP9_get4x4var_mmx Change-Id: I4b4a8f45f25ebdfad281f169cc87aba5e2d6f227	2013-08-26 12:35:27 -07:00
Paul Wilkins	642696b678	Merge "Limit Key frame Intra modes checks."	2013-08-26 12:34:56 -07:00
Jingning Han	4681197a58	Merge "Temporarily disable SSSE3 quant_32x32"	2013-08-26 11:19:53 -07:00
Dmitry Kovalev	5eed6e2224	Merge "Removing redundant calls to clamp_mv2."	2013-08-26 10:48:37 -07:00
Jingning Han	166dc85bed	Temporarily disable SSSE3 quant_32x32 Make the current head working properly, while working on fixing an issue in the SSSE3 implementation of 32x32 quantization. Change-Id: Ic029da3fd7f1f5e58bc641341cbd226ec49a16bc	2013-08-26 10:45:59 -07:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
James Zern	2c6ba737f8	Merge "vp9: remove unnecessary wait w/threaded loopfilter"	2013-08-23 18:52:10 -07:00
Dmitry Kovalev	e6c435b506	Merge "Cleanup in mvref_common.{h, c}."	2013-08-23 17:09:49 -07:00
Dmitry Kovalev	7194da2167	Merge "Fixing display size setting problem."	2013-08-23 17:08:51 -07:00
Yaowu Xu	13930cf569	Limit mv range to be based on partition size Previous change `c4048dbd` limits the mv search range assuming max block size of 64x64, this commit change the search range using actual block size instead. Change-Id: Ibe07ab02b62bf64bd9f8675d2b997af20a2c7e11	2013-08-23 15:43:57 -07:00
Dmitry Kovalev	cd2cc27af1	Removing redundant calls to clamp_mv2. We could avoid calling clamp_mv2 because it has been already called inside vp9_find_best_ref_mvs function. Change-Id: I08edeaf3e11e98c19e67b9711b2523ca5fb1416e	2013-08-23 15:18:35 -07:00
Yaowu Xu	8e04257bc5	Merge "Added border extension"	2013-08-23 14:43:58 -07:00
Adrian Grange	78debf246b	Merge "Fix bug in convolution functions (filter selection)"	2013-08-23 13:41:47 -07:00
Dmitry Kovalev	fb481913f0	Merge "Removing useless calls to setup_{pre, dst}_planes."	2013-08-23 13:37:32 -07:00
Dmitry Kovalev	11e3ac62a5	Fixing display size setting problem. Fix of https://code.google.com/p/webm/issues/detail?id=608. We could have used invalid display size equal to the previous frame size (not to the current frame size). Change-Id: I91b576be5032e47084214052a1990dc51213e2f0	2013-08-23 13:12:46 -07:00
Dmitry Kovalev	21d8e8590b	Cleanup in mvref_common.{h, c}. Making code more compact, adding consts, removing redundant arguments, adding do/while(0) for macros. Change-Id: Ic9ec0bc58cee0910a5450b7fb8cfbf35fa9d0d16	2013-08-23 12:00:30 -07:00
Yaowu Xu	656632b776	Added border extension To the source buffer to be encoded as an alt ref frame. This is to fix the problem of using uninitialized memory in encoder. See https://code.google.com/p/webm/issues/detail?id=605 Change-Id: I97618a2fc207e08abcf5301b734aa9e3ad695e2c	2013-08-23 11:31:28 -07:00
Adrian Grange	3f10831308	Fix bug in convolution functions (filter selection) (In response to Issue 604: https://code.google.com/p/webm/issues/detail?id=604) There were bugs in the convolution code for two cases: 1. Where the filter table was assumed to be aligned to a 256 byte boundary. The offset of the pixel in the source buffer was computed incorrectly. 2. Where no such alignment assumption was made. An incorrect address for the filter table base was used. To fix both problems, I now assume that the filter table is 256-byte aligned and modify the pixel offset calculation to match. A later patch should remove the restriction that the filter table is aligned to a 256-byte boundary. There was also a bug in the ConvolveTest unit test (convolve_test.cc). (Bug & initial fix suggestion submitted by Tero Rintaluoma and Sami Pietilä). Change-Id: I71985551e62846e55e40de9e7e3959d4805baa82	2013-08-23 11:16:08 -07:00
Dmitry Kovalev	1c159c470a	Merge "Checking scale factors on access."	2013-08-23 11:05:17 -07:00
hkuang	b85367a608	Merge "Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling."	2013-08-23 10:08:43 -07:00
Paul Wilkins	aa5b67add0	Changes to adaptive inter rd thresholds. Values now carried over frame to frame. Change to algorithm for decreasing threshold after a hit and to max threshold (now based on speed) Removed some old commented out code relating to VP8 adaptive thresholds. The impact of these changes tested on Akiyo (50 frames) and measured in terms of unit rd hits is as follows: Speed 0 84.36 -> 84.67 Speed 1 29.48 -> 22.22 Speed 2 11.76 -> 8.21 Speed 3 12.32 -> 7.21 Encode speed impact is broadly in line with these. Change-Id: I5b886efee3077a11553fa950d796fd6d00c8cb19	2013-08-23 16:18:45 +01:00
Paul Wilkins	f76f52df61	Limit Key frame Intra modes checks. Most of the focus so far has been on inter frames. At high speed settings the key frame is now taking a high % of the cycles. This patch puts in some masking to reduce the number of INTRA modes searched during key frame coding (as already happens for inter frames) at higher speed settings TODO: Develop this further with either adaptive rd thresholds when choosing which intra modes to consider or some other heuristic. Impact. At high speed settings on some clips the key frame was starting to dominate. In a coding of the first 50 frames of AKIYO at speed 2 limiting the key frame intra modes to DC or TM_PRED resulted in ~30% overall speedup. For Bus the number was lower at ~4-5%. Change-Id: I7bde68aee04995f9d9beb13a1902143112e341e2	2013-08-23 16:10:30 +01:00
Jingning Han	9655c2c7a6	Merge "Fix rectangular partition check flag"	2013-08-22 18:59:18 -07:00
Dmitry Kovalev	33104cdd42	Merge "vp9_encodeframe.c cleanup."	2013-08-22 18:07:35 -07:00
James Zern	711aff9d9d	Merge "vp9/encoder: fix last_frame_seg_map mem leak"	2013-08-22 18:04:03 -07:00
James Zern	d843ac5132	Merge "rename LOG2_* defines to *_LOG2"	2013-08-22 18:02:42 -07:00
Jingning Han	84f3b76e1c	Fix rectangular partition check flag Put rectangular partition check flag change according to the rd costs of NONE and SPLIT partition types under the speed feature. Change-Id: If681e1e078a8d43d86961ea4b748da5cd1b6c331	2013-08-22 17:15:01 -07:00
Dmitry Kovalev	53f6f8ac93	Merge "check_bsize_coverage cleanup."	2013-08-22 16:18:24 -07:00
hkuang	4205d79273	Merge "Add neon optimize vp9_short_idct10_16x16_add."	2013-08-22 15:57:28 -07:00
hkuang	4082bf9d7c	Add neon optimize vp9_short_idct10_16x16_add. vp9_short_idct10_16x16_add is used to handle the block that only have valid data at top left 4x4 block. All the other datas are 0. So we could cut many unnecessary calculations in order to save instructions. Change-Id: I6e30a3fee1ece5af7f258532416d0bfddd1143f0	2013-08-22 15:53:22 -07:00
Dmitry Kovalev	604022d40b	vp9_encodeframe.c cleanup. Removing unused get_sbuv_perpixel_variance function, using has_second_ref/ is_inter_block functions, organizing includes. Change-Id: I016de4af12fbbb8b4ece26a70759b2392651b095	2013-08-22 15:50:51 -07:00
Dmitry Kovalev	335b1d360b	check_bsize_coverage cleanup. Change-Id: Ib7803857b35c00e317c9deb8630e777e25eb278f	2013-08-22 15:45:56 -07:00
Dmitry Kovalev	3c42657207	Checking scale factors on access. It is possible to have invalid scale factors and not access them during decoding. Error is reported if we really try to use invalid scale factors. Change-Id: Ie532d3ea7325ee0c7a6ada08269f804350c80fdf	2013-08-22 15:19:05 -07:00
James Zern	40ae02c247	rename LOG2_* defines to *_LOG2 gets rid of a mix of styles Change-Id: I3591d312157bc6f53a25438bf047765c671fd8a8	2013-08-22 14:45:24 -07:00
Dmitry Kovalev	13eed79c77	Merge "Adding vp9_is_scaled function."	2013-08-22 14:39:55 -07:00
Dmitry Kovalev	09858c239b	Removing useless calls to setup_{pre, dst}_planes. Comment is wrong, we don't initialize any xd pointers. We only initialize xd->planes[i]->dst and xd->planes[i]->pre[], which are actually initialized for every block during the decoding. Change-Id: If152ea872ebef1f83ca70712fa6f8df1b6855f56	2013-08-22 14:39:05 -07:00
James Zern	a5726ac453	vp9/encoder: fix last_frame_seg_map mem leak remove duplicate allocation from vp9_create_compressor, it was added to vp9_alloc_frame_buffers in: `d5bec52` Added resizing & initialization of last frame segment map Change-Id: I996723226a16a62aff8f9a52ac74e0b73cc98fdf	2013-08-22 14:13:04 -07:00
Dmitry Kovalev	640dea4d9d	Adding vp9_is_scaled function. Change-Id: Ieb7077ca3586b9491912027eed450a4f6fd38d30	2013-08-22 14:04:59 -07:00
Jingning Han	8adc20ce35	Merge "Refactor rd_pick_partition for parameter control"	2013-08-22 13:54:48 -07:00
James Zern	da9a6ac9e7	Merge "vp9_peek_si: add bitstream v1 support"	2013-08-22 13:28:00 -07:00
Jingning Han	01a37177d1	Refactor rd_pick_partition for parameter control This commit changes the partition search order of superblocks from {SPLIT, NONE, HORZ, VERT} to {NONE, SPLIT, HORZ, VERT} for consistency with that of sub8x8 partition search. It enable the use of early termination in partition search for all block sizes. For ped_area_1080p 50 frames coded at 4000 kbps, it makes the runtime goes down from 844305ms -> 818003ms (3% speed-up) at speed 0. This will further move towards making the in-search partition types configurable, hence unifying various speed-up approaches. Some speed 1 and 2 features are turned off during the refactoring process, including: disable_split_var_thresh using_small_partition_info Stricter constraints are applied to use_square_partition_only for right/bottom boundary blocks. Will bring back/refine these features subsequently. At this point, it makes derf set at speed 1 about 0.45% higher in compression performance, and 9% down in run-time. Change-Id: I3db9f9d1d1a0d6cbe2e50e49bd9eda1cf705f37c	2013-08-22 12:36:02 -07:00
hkuang	610642c130	Optimise idct4x4: rearrange the instructions a bit to improve instruction scheduling. Change-Id: I5ea881a6e419f9e8ed4b3b619406403b4de24134	2013-08-22 11:02:22 -07:00
Deb Mukherjee	8b810c7a78	Fixes on feature disabling split based on variance Adds a couple of minor fixes, which may be absorbed in Jingning's patch. Thanks to Guillaume for pointing these out. Also adjusts the thresholds for speed 1 and 2 to 16 and 32 respectively, to keep quality drops small. Results: -------- derfraw300: threshold = 16, psnr -0.082%, speedup 2-3% threshold = 32, psnr -0.218%, speedup 5-6% stdhdraw250: threshold = 16, psnr -0.031%, speedup 2-3% threshold = 32, psnr -0.273%, speedup 5-6% Change-Id: I4b11ae8296cca6c2a9f644be7e40de7c423b8330	2013-08-22 07:05:44 -07:00
Scott LaVarnway	f39bf458e5	Merge "Initialize mb_skip_coeff before picking modes"	2013-08-22 06:26:04 -07:00
Scott LaVarnway	94bfbaa84e	Initialize mb_skip_coeff before picking modes It appears that the above/left mb_skip_coeff used during the pick modes, is left over from the previously encode frame. This patch initializes the flag to the default value of zero. Change-Id: Ida4684cc99611d6e3e82628db35ed717e28ce550	2013-08-22 08:51:04 -04:00

1 2 3 4 5 ...

2529 Commits