generic-library/vpx

Author	SHA1	Message	Date
Scott LaVarnway	516ea8460b	Use the fast quantizer for inter mode selection Use the fast quantizer for inter mode selection and the regular quantizer for the rest of the encode for good quality, speed 1. Both performance and quality were improved. The quality gains will make up for the quality loss mentioned in I9dc089007ca08129fb6c11fe7692777ebb8647b0. Change-Id: Ia90bc9cf326a7c65d60d31fa32f6465ab6984d21	2010-12-28 14:51:46 -05:00
Yunqing Wang	bf53ec492d	Adjust MV borders for SPLITMV mode Add limits to avoid MV going out of range. Change-Id: I8a5deb40bf393488d29f694b5a56804d578e68b5	2010-12-28 13:23:07 -05:00
Yunqing Wang	e463b95b4e	Merge "Modify motion estimation for SPLITMV mode"	2010-12-28 08:12:26 -08:00
Yunqing Wang	a5a8d92976	Modify motion estimation for SPLITMV mode 1. Search for block8x16/block16x8 uses block8x8's search results. 2. Check block4x4 only if block8x8 is chosen. (This hurts quality, which will be improved in another check-in.) 3. In block4x4 search, the previous block's result is used as MV predictor for next block. This change improves performance. Change-Id: I9dc089007ca08129fb6c11fe7692777ebb8647b0	2010-12-28 10:34:42 -05:00
John Koleszar	2a68727af9	Merge remote branch 'origin/master' into experimental Change-Id: I238df40ea8e0f34b85a38525605f7c91905f650a	2010-12-27 00:05:06 -05:00
Yaowu Xu	0f5264b584	adjusted sad_per_bit to correlate with quantizer Re-calibrated sad_per_bit16 and sad_per_bit4 tables to linearly correlated to quantizer values, these two variables are used in motion search for costing motion vectors. This change has an small positive effect on compression. Change-Id: Ic9b5ea6fb8d5078ef663ba4899db019cc51f4166	2010-12-23 22:59:38 -08:00
John Koleszar	7c8ae69ad8	Merge remote branch 'origin/master' into experimental Change-Id: I05d5b211674cb4560d3a54dcdfa853f8d84599e6	2010-12-24 00:05:06 -05:00
Johann	20b855c33e	improve integer version of filter the lookup table is based on floating point calculations (see source) by moving the *3 before the downshift and adding the rounding bit, the delta (LUT - integer) goes from: ______________________________________ __ 1__ 1______________________________ __ 1__ 1______________________________ ____ 1______ 1________________________ ____ 1 2__ 2 1________________________ ______ 1 1 2__ 2__ 2__ 2 1 1__________ ________ 1 1 2 2__ 1 2 3 1 2__ 2__ 2__ to: __-1__-1______________________________ ______________________________________ ____-1______-1________________________ ______________________________________ ________-1______________-1____________ ______________________________________ it's important to be able to use the integer version because the LUT more or less precludes SIMD optimizations Change-Id: I45a81127dc7b72a06fba951649135d9d918386c0	2010-12-22 11:33:59 -05:00
Johann	4b6219cb33	temporal filter naming changes be more consistant with the naming pattern, especially wrt rtcd Change-Id: I3df50686a09f1dab0a9620b5adbb8a1577b40f2f	2010-12-22 11:32:15 -05:00
Johann	092b5bef37	abstract apply_temporal_filter allow for optimized versions of apply_temporal_filter (now vp8_apply_temporal_filter_c) the function was previously declared as static and appears to have been inlined. with this change, that's no longer possible. performance takes a small hit. the declaration for vp8_cx_temp_filter_c was moved to onyx_if.c because of a circular dependency. for rtcd, temporal_filter.h holds the definition for the rtcd table, so it needs to be included by onyx_int.h. however, onyx_int.h holds the definition for VP8_COMP which is needed for the function prototype. blah. Change-Id: I499c055fdc652ac4659c21c5a55fe10ceb7e95e3	2010-12-22 11:31:54 -05:00
John Koleszar	0b710c8d1a	Merge remote branch 'origin/master' into experimental Conflicts: vp8/vp8_cx_iface.c Change-Id: I76f302448f11b28772efd4b5643f86a7cc69a8c2	2010-12-21 07:54:10 -05:00
John Koleszar	eee331e7f3	Merge remote branch 'origin/master' into experimental Change-Id: Iae8b85d2f6ad4d854c43dded8588e054906f7156	2010-12-18 00:05:19 -05:00
John Koleszar	b0da9b399d	Add psnr/ssim tuning option Add a new encoder control, VP8E_SET_TUNING, to allow the application to inform the encoder that the material will benefit from certain tuning. Expose this control as the --tune option to vpxenc. The args helper is expanded to support enumerated arguments by name or value. Two tunings are provided by this patch, PSNR (default) and SSIM. Activity masking is made dependent on setting --tune=ssim, as the current implementation hurts speed (10%) and PSNR (2.7% avg, 10% peak) too much for it to be a default yet. Change-Id: I110d969381c4805347ff5a0ffaf1a14ca1965257	2010-12-17 10:01:05 -05:00
John Koleszar	1d7c6e79d9	Merge remote branch 'origin/master' into experimental Change-Id: Ie8f1f1a949e310ec1362f352d7a220ae4155cbea	2010-12-17 00:05:07 -05:00
Scott LaVarnway	64baa8df2e	Changed segmentation check order In SPLITMV, the 8x8 segment will be checked first. If the 8x8 rd is better than the best, we check the other segments. Otherwise bail. Adjustments to the thresh_mult were necessary to make up for the initial quality loss. The performance improved by 20% (average) for good quality, speed 0 and speed 1, while the overall quality remained the same. Change-Id: I717aef401323c8a254fba3e9777d2a316c774cc3	2010-12-16 17:01:27 -05:00
Scott LaVarnway	81cdeb7117	Adjusted breakout RD for SPLITMV vp8_rd_pick_best_mbsegmentation looks at y only. The new breakout does not include the frame cost, the prob_skip_false cost, or the uv rate. Performance improved by a few percent and the quality remained the same. Change-Id: I94ff013998ac51e8ecce7130870f7b6600758e15	2010-12-16 09:38:02 -05:00
John Koleszar	1f5d91d92e	Merge remote branch 'origin/master' into experimental Change-Id: I3ff6a301e89b6d17a66c58801b5acc649f929de8	2010-12-16 00:05:07 -05:00
Yunqing Wang	4fbd0227f5	Merge "Fix a bug in motion search code(2)"	2010-12-15 08:10:34 -08:00
John Koleszar	4fa8d36f76	Merge remote branch 'origin/master' into experimental Conflicts: vp8/common/entropy.c Change-Id: I35fd49cf92a50d09082fe199d3bf21bfca68a94f	2010-12-15 08:08:18 -05:00
Yunqing Wang	08706a3ea7	Fix a bug in motion search code(2) This fix added MV range checks for NEWMV mode as suggested by Jim. To reduce unnecessary MV range checks, I tried Yaowu's suggestion. Update UMV borders in NEWMV mode to also cover MV range check. Also, in this way, every MV that is valid gets checked in diamond search function. Change-Id: I95a89ce0daf6f178c454448f13d4249f19b30f3a	2010-12-14 17:39:25 -05:00
Yaowu Xu	3ac73173a4	Merge "fix a bug that "optimize" flag is not set for sub-threads"	2010-12-14 13:32:04 -08:00
Yunqing Wang	23aa13d92c	Merge "Fix a bug in motion search code"	2010-12-14 13:25:34 -08:00
Yunqing Wang	7fb0f86863	Fix a bug in motion search code The MV's range is 256. Since the new motion search uses a different starting MV than the center ref MV, a MV range checking needs to be done to avoid corruption. Change-Id: I8ae0721d1bd203639e13891e2e54a2e87276f306	2010-12-14 13:59:38 -05:00
Yaowu Xu	64f3d91579	fix a bug that "optimize" flag is not set for sub-threads The flag for quantization optimization was not properly propagated to mb row encoding threads. Change-Id: Ic561599c35acd94cd5698c9b314bccd596ac2deb	2010-12-14 10:12:21 -08:00
Johann	825adc464f	shrink TOKENEXTRA and vp8_extra_bit_struct Per John's previous change, shrink TOKENEXTRA from 20 to 8 bytes original: `b7b1e6fb` reverted: `41f4458a` Also drop unused field from vp8_extra_bit_struct Update ARM ASM to deal with this change. In particular, Extra is signed and needs to be sign-extended when loaded. Change-Id: Ibd0ddc058432bc7bb09222d6ce4ef77e93a30b41	2010-12-14 10:32:50 -05:00
John Koleszar	6a80032280	Merge remote branch 'origin/master' into experimental Change-Id: Ic88e9b2fcf1dcb2852a7205bcda3f181103f5612	2010-12-14 00:05:05 -05:00
John Koleszar	41f4458a03	Revert "Reduce size of TOKENEXTRA struct" This reverts commit `b7b1e6fb55`. Previous fix is incomplete, breaks ARM. Itchy submit finger. Change-Id: I939dc0d3bf4173cf951c1d152338ab6ea2184bb9	2010-12-13 17:12:51 -05:00
John Koleszar	3809d7bbd9	Merge "remove unused temporal preproc code"	2010-12-13 13:57:59 -08:00
John Koleszar	398aa81849	Merge "Reduce size of TOKENEXTRA struct"	2010-12-13 13:57:55 -08:00
John Koleszar	b1aa54ab26	remove unused temporal preproc code This code is unused, as the current preproc implementation uses the same spatial filter that postproc uses. Change-Id: Ia06d5664917d67283f279e2480016bebed602ea7	2010-12-13 16:47:59 -05:00
John Koleszar	b7b1e6fb55	Reduce size of TOKENEXTRA struct Change the size of structure elements to reduce memory utilization. Removed the 'section' member entirely, as it is set but never read. Change-Id: Iad043830392fb4168cb3cd6075fb0eb70c7f691c	2010-12-13 16:37:37 -05:00
John Koleszar	b6905e36d9	Merge remote branch 'origin/master' into experimental Change-Id: Ibbe41ff2356aa8583c728e9ab1b0814958a51752	2010-12-11 00:05:08 -05:00
Yaowu Xu	97a86c5b13	fix a bug in multithreaded encoding with active_map enabled Added the initialization of the pointer to active map. Also added the same logic for cyclic refresh in mbrow encoding threads. Change-Id: Ic48d0849dc706b27fba72d07dcc498075725663d	2010-12-10 10:48:30 -08:00
Fritz Koenig	0ced701487	Merge "vp8 fast quantizer sse2 optimizations for eob."	2010-12-10 09:25:04 -08:00
John Koleszar	0f02b37992	Merge remote branch 'origin/master' into experimental Change-Id: Iada4d917df4af42b16404e1b54b30ba2ca74df39	2010-12-10 00:05:07 -05:00
Fritz Koenig	e0cf330cde	vp8 fast quantizer sse2 optimizations for eob. Changed the end of block computation to use pmaxw. Removed additional pushing and popping of registers that was not needed. Change-Id: I08cb9b424513cd8a2c7ad8cea53b4e2adc66ef98	2010-12-09 15:00:30 -08:00
John Koleszar	cb9698951c	fix uninitialized read in encode breakout Change I3430820 performed an uninitialized read when encode_breakout == 0, since the sum and sse wouldn't be set: if(x->encode_breakout) VARIANCE_INVOKE(..., get16x16var)(..., &sum, &sse); if (cpi->active_map_enabled && x->active_ptr[0] == 0) { ... } else if (sse < x->encode_breakout) Change-Id: I915eb76d1227b4b6d1137a0dedf2c143860098a2	2010-12-09 16:05:26 -05:00
Paul Wilkins	c63fc881e1	Correct q_low and q_high limits for the recode loop Corrected the initial Q range limits for the recode loop to reflect the current allowed range for the frame. In experimental work on constrained quality this bug was causing unnecessary recodes. Change-Id: I7e256fbfa681293b0223fe21ec329933d76c229f	2010-12-09 15:02:04 +00:00
John Koleszar	808f3814fc	Merge remote branch 'origin/master' into experimental Change-Id: I2b70793a97f80039ad23feea164744b1c236ac74	2010-12-09 00:05:07 -05:00
Yaowu Xu	160f3c7e9e	Merge "vp8e - static threshold play"	2010-12-08 13:08:04 -08:00
Yaowu Xu	d88da98614	Merge "vp8e - remove unnecessary variance calc"	2010-12-08 09:19:22 -08:00
John Koleszar	c5795b673d	Merge remote branch 'origin/master' into experimental Change-Id: I76ed5f6c24f3f71bba47679ff09d28e046ec1db9	2010-12-08 00:05:06 -05:00
Jim Bankoski	718c19711a	vp8e - static threshold play Realized no need for new assembly code sum is already calculated. Change-Id: Ie2d94feb4b7c1f77c5359bca29b66228e41638c9	2010-12-07 16:07:23 -05:00
Scott LaVarnway	f661fa1f24	Merge "vp8_rd_pick_best_mbsegmentation code restructure"	2010-12-07 07:53:12 -08:00
Yaowu Xu	062980cc48	Merge "adjust RDMULT for UV plane in quantization RDO"	2010-12-06 22:04:45 -08:00
John Koleszar	727abbb38a	Merge remote branch 'origin/master' into experimental Change-Id: I1baeedb24f321d3e200f00412cc657ab92c43143	2010-12-07 00:05:08 -05:00
Yaowu Xu	7c03a1c308	adjust RDMULT for UV plane in quantization RDO This patch adds a weighting factor on RDMULT for UV blocks. The change has an overall gain about 0.5% based on ssim, between 0.1 and 0.2% by psnr numbers. Change-Id: I97781b077ce3bb7e34241b03268491917e8d1d72	2010-12-06 20:53:59 -08:00
Yunqing Wang	9520f4b3cc	Fix a memory leak problem in encoder Deallocating the buffers before re-allocating them. The fix passed James Berry's test program for memory leak check. Change-Id: I18c3cf665412c0e313a523e3d435106c03ca438d	2010-12-06 17:21:37 -05:00
Scott LaVarnway	2fa5d5a26d	vp8_rd_pick_best_mbsegmentation code restructure Moved the code from the segmentation loop into a function which is now called for each segment. This will allow us to change the segment order checking more easily. Change-Id: I9510d26f0acae5a73043fcca8f1984b121d3e052	2010-12-06 16:42:52 -05:00
Scott LaVarnway	d283d9bb30	Merge "Improve MV prediction accuracy to achieve performance gain"	2010-12-06 09:41:09 -08:00
Patrik Westin	8534071de0	Fix for manual Golden frame frequency When auto_golden wasn't set it forced all frames to be a golden frame. Now the manual configured frequency is adhered to. Change-Id: I360acac9bc487db0d9c4d4da6ee41f70c227c539	2010-12-06 09:53:41 -05:00
John Koleszar	7e9910c69b	Merge remote branch 'origin/master' into experimental Change-Id: I2a47e43cb3ad61620bfef9e8caf578f321487f2c	2010-12-05 00:05:06 -05:00
Paul Wilkins	ccb0348473	Merge "Change to inter_minq table."	2010-12-04 02:06:33 -08:00
Paul Wilkins	cec6a596b5	Change to inter_minq table. The inter_minq table controls the range of quantizers available for a particular frame in two pass relative to a max Q value. The changes reduces the range somewhat. The effect of this was a small increase (0.3% average) in psnr for the test set but it should also help encode speed somewhat for higher quality modes as it will reduce the number of iterations in the recode loop. The change damps the range of quantizers available locally within a section of a clip and should therefore help keep quality more uniform. If there is systematic overshoot or undershoot the range can shift gradually to accommodate. However, there is some increased risk of overshoot or undershoot against the target bit rate in VBR mode and this risk will be more pronounced for short clips. The change damps the range of quantizers available locally within a section of a clip and should therefore help keep quality more uniform. If there is systematic overshoot or undershoot the range can shift gradually to accommodate. However, there is some increased risk of overshoot or undershoot against the target bit rate in VBR mode and this risk will be more pronounced for short clips. Change-Id: I84465567d49ae767c6c73ff2a2aac30c895adb52	2010-12-04 10:04:12 +00:00
John Koleszar	16724b7c93	Merge remote branch 'origin/master' into experimental Change-Id: I11cd10dba54d0f3f96640dadc97199e5733f1888	2010-12-04 00:05:08 -05:00
Yunqing Wang	c3bbb29164	Improve MV prediction accuracy to achieve performance gain Add vp8_mv_pred() to better predict starting MV for NEWMV mode in vp8_rd_pick_inter_mode(). Set different search ranges according to MV prediction accuracy, which improves encoder performance without hurting the quality. Also, as Yaowu suggested, using diamond search result as full search starting point and therefore adjusting(reducing) full search range helps the performance. Change-Id: Ie4a3c8df87e697c1f4f6e2ddb693766bba1b77b6	2010-12-03 15:23:35 -05:00
John Koleszar	5e76dfcc70	Merge 'Add simple version of activity masking.' Merge commit 'refs/changes/79/779/2' of https://review.webmproject.org/p/libvpx Conflicts: vp8/encoder/encodeintra.c vp8/encoder/encodemb.c Change-Id: Id607063fabe92d99eeb3c380e8ca670b01bfb3ef	2010-12-03 13:30:50 -05:00
John Koleszar	ea2a5754b4	Merge remote branch 'origin/master' into experimental Change-Id: If95cb994d898d3f29b28db0d118a1f9c973e88d9	2010-12-02 08:20:43 -05:00
Fritz Koenig	9c8ad79fdc	Set refresh_alt_ref_frame on keyframe encode. On a keyframe alt ref and golden are refreshed. The flag was not being set and so on the frame after a keyframe, motion search would occur on the alt ref frame. This is not necessary because the alt ref frame identical to the last frame in this scenario. Handle corner case where a forward alt-ref frame is put directly after a keyframe. Change-Id: I9be4cf290d694f8cf2f9a31852014b5ccf1504d3	2010-12-01 12:48:22 -08:00
Jim Bankoski	3430820bbe	vp8e - remove unnecessary variance calc only do the variance calculation if necessary ( eg needed for breakout test)	2010-11-27 14:02:59 -05:00
John Koleszar	8416312095	Merge remote branch 'origin/master' into experimental	2010-11-23 00:05:05 -05:00
Paul Wilkins	ad6150f769	Recalibration of bits per MB tables The baseline bits per MB prediction tables have been re calibrated based on the assumption that bits per mb is inversely proportional to the quantizer level. Change-Id: Ibd355c7acac4b8053dda1baf1032fe35f11da7f7	2010-11-22 13:17:35 +00:00
Paul Wilkins	1753f0d208	Merge "Added extra two pass stats gathering."	2010-11-22 04:11:20 -08:00
John Koleszar	b13d1c307e	Merge remote branch 'origin/master' into experimental	2010-11-21 00:05:05 -05:00
Paul Wilkins	70b885a0e8	Added extra two pass stats gathering. Added code to record spend so far against planed budget. Change-Id: I5a3335346fa1771b2b1219df9f6127f9993d2594	2010-11-19 14:12:33 -05:00
Yaowu Xu	39ceef38a7	changed MAX_PSNR to 100 Changing the MAX_PSNR to 100 to allow testing of further experiments on extending quantizer range to near lossless. With an effective quantizer of 1, encoder achieves ~68DB, which is consistent with fdct/idct round trip error. Change-Id: I7b6d0e94a8936968ef42e82e63ebb13999c36832	2010-11-18 09:12:02 -08:00
Pascal Massimino	ed5ab7fa49	remove warning was having: "vp8/encoder/onyx_if.c:5365: warning: comparison of unsigned expression >= 0 is always true"	2010-11-17 16:50:02 -08:00
Scott LaVarnway	9a6740af80	Merge "Removed unnecessary checks."	2010-11-17 11:28:22 -08:00
Scott LaVarnway	f7670acc68	Removed unnecessary checks. macro_block_yrd and vp8_rdcost_mby are not called for SPLITMV. Change-Id: I2224d3c8725df526d48426447482768d543752f1	2010-11-17 14:25:48 -05:00
Paul Wilkins	f874391e02	Replaced recode loop test with a function call Replaced existing code to decide if a frame recode is required with a function call. This is to simplify addition of extra clauses that may be needed for the planned constrained quality mode. Also fixed a bug where by alt ref not considered in the test. Change-Id: I3d40bb21abe3e19f8456761e6849deb171738b60	2010-11-17 15:12:04 +00:00
Fritz Koenig	69ee697fef	Comments for alt ref flags. Clarify what the alt ref flags do when encoding. Change-Id: I71f78e0f42edae633fb91840f29dfbe64362c44c	2010-11-16 15:16:24 -08:00
Fritz Koenig	e180255375	Remove stack shadowing for x86-x64 for SAD functions. x86-64 passes arguments in registers. There is no need to push them to the stack before using them. This fixes `15acc84f10` where ebx was not getting preserved on x86. Change-Id: I1214b5f818a0201f75ab6ad7d5c6f448e09b16c2	2010-11-15 10:56:02 -08:00
Paul Wilkins	f4709d2895	Merge "Bad cost tables used in ARNR filtering."	2010-11-15 09:55:35 -08:00
Paul Wilkins	373f5c3144	Bad cost tables used in ARNR filtering. The use of incorrect mv costing tables in the ARNR sub-pel filtering code led to corruption of the altref buffer in some cases, particularly at low data rates. The average gain from this fix is about 0.3% but there are a few extreme cases where nasty and visible artifacts manifested and for these few data points the improvement is > 10%. PGW and AWG Change-Id: I95cc02b196a433e71d0d2bd2b933fe68ed31e796	2010-11-15 17:47:12 +00:00
Yaowu Xu	73189f21b3	Merge "make rdmult adaptive for intra in quantizer RDO"	2010-11-15 09:22:45 -08:00
Yaowu Xu	ef2f27f10e	make rdmult adaptive for intra in quantizer RDO This intends to correct the tendency that VP8 aggressively favors rate on intra coded frames. Experiments tested different numbers in [0, 1] and found 9/16 overall provided about 2-4% gains for all-intra coded clips based on vpx-ssim metric. The impact on regular encoded clips is much smaller but positive overall. Overall impact on psnr is also positive even though very small. Change-Id: If808553aaaa87fdd44691f9787820ac9856d9f8a	2010-11-11 11:33:35 -08:00
John Koleszar	0a49747b01	quantizer: fix assertion in fast quantizer path The fast quantizer assembly code has not been updated to match the new exact quantizer, which was made the default in commit `6adbe09`. Specifically, they are not aware of the potential for the coefficient to be scaled, which results in the quantized result exceeding the range of the DCT. This patch restores the previous behavior of using the non-shifted coefficients when in the fast quantizer code path, but unfortunately requires rebuilding the tables when switching between the two. Change-Id: I0a33f5b3850335011a06906f49fafed54dda9546	2010-11-11 13:05:20 -05:00
Fritz Koenig	58083cb34d	Revert "Remove stack shadowing for x86-64" This reverts commit `15acc84f10`. Change-Id: Ia640be8cbc134432914849c1750f62575ea084e6	2010-11-11 08:20:02 -08:00
Paul Wilkins	213f7b0907	Merge "Relax rate control for last few frames"	2010-11-11 02:39:20 -08:00
Fritz Koenig	9b1ece2cca	Merge "Remove stack shadowing for x86-64"	2010-11-10 14:36:10 -08:00
Fritz Koenig	5f0e0617ba	FDCT optimizations. Fixed up the fdct for mmx and 8x4 sse2 to match them most recent changes. Change-Id: Ibee2d6c536fe14dcf75cd6eb1c73f4848a56d719	2010-11-10 14:34:02 -08:00
Fritz Koenig	647df00f30	postproc : Re-work posproc calling to allow more flags. Debugging in postproc needs more flags to allow for specific block types to be turned on or off in the visualizations. Must be enabled with --enable-postproc-visualizer during configuration time. Change-Id: Ia74f357ddc3ad4fb8082afd3a64f62384e4fcb2d	2010-11-10 14:14:46 -08:00
Paul Wilkins	513f8e6814	Relax rate control for last few frames VBR rate control can become very noisy for the last few frames. If there are a few bits to spare or a small overshoot then the target rate and hence quantizer may start to fluctuate wildly. This patch prevents further adjustment of the active Q limits for the last few frames. Patch also removes some redundant variables and makes one small bug fix. Change-Id: Ic167831bec79acc9f0d7e4698bcc4bb188840c45	2010-11-10 10:09:45 +00:00
Paul Wilkins	6adbe09058	Tuning for the more exact quantizer. Small changes to the default zero bin and rounding tables. Though the tables are currently the same for the Y1 and Y2 cases I have left them as separate tables in case we want to tune this later. There is now some adjustment of the zbin based on the prediction mode. Previously this was restricted to an adjustment for gf/arf 0,0 MV. The exact quantizer now marginal outperforms and is the default. The overall average gain is about 0.5% Change-Id: I5e4353f3d5326dde4e86823684b236a1e9ea7f47	2010-11-10 09:52:58 +00:00
John Koleszar	f7e187d362	improve average framerate calculation Change Ice204e86 identified a problem with bitrate undershoot due to low precision in the timestamps passed to the library. This patch takes a different approach by calculating the duration of this frame and passing it to the library, rather than using a fixed duration and letting the library average it out with higher precision timestamps. This part of the fix only applies to vpxenc. This patch also attempts to fix the problem for generic applications that may have made the same mistake vpxenc did. Instead of calculating this frame's duration by the difference of this frame's and the last frame's start time, we use the end times instead. This allows the framerate calculation to scavenge "unclaimed" time from the last frame. For instance: start \| end \| calculated duration ======+=======+==================== 0ms 33ms 33ms 33ms 66ms 33ms 66ms 99ms 33ms 100ms 133ms 34ms Change-Id: I92be4b3518e0bd530e97f90e69e75330a4c413fc	2010-11-05 08:42:46 -04:00
Scott LaVarnway	ff4a71f4c2	SSSE3 version of fast quantizer (test clip: tulip) For good quality mode with speed=1, this gave the encoder a small (2 - 3%) performance boost. Change-Id: I8a1d4269465944ac0819986c2f0be4b0a2ee0b35	2010-11-01 16:24:15 -04:00
Scott LaVarnway	dcee88ea37	Finding first label Using tables for the label count and label offset. Change-Id: Iac3d5b292c37341a881be0af282f5cac3b3e01eb	2010-10-29 10:01:04 -04:00
Yunqing Wang	6614563b8f	Save XMM registers in asm functions XMM6/7 are used in these functions, and need to be saved. Change-Id: I3dfaddaf2a69cd4bf8e8735c7064b17bac5a14e5	2010-10-28 16:59:03 -04:00
Yunqing Wang	f57fc7bcc6	Merge "Fix full-search SAD function crash in Visual Studio"	2010-10-28 13:46:35 -07:00
Yunqing Wang	7e3a1e7361	Fix full-search SAD function crash in Visual Studio Unlike GCC, Visual Studio compiler doesn't allocate SAD output array 16-byte aligned, which causes crash in visual studio. Change-Id: Ia755cf5a807f12929bda8db94032bb3c9d0c2362	2010-10-28 15:26:58 -04:00
Timothy B. Terriberry	c4d7e5e67e	Eliminate more warnings. This eliminates a large set of warnings exposed by the Mozilla build system (Use of C++ comments in ISO C90 source, commas at the end of enum lists, a couple incomplete initializers, and signed/unsigned comparisons). It also eliminates many (but not all) of the warnings expose by newer GCC versions and _FORTIFY_SOURCE (e.g., calling fread and fwrite without checking the return values). There are a few spurious warnings left on my system: ../vp8/encoder/encodemb.c:274:9: warning: 'sz' may be used uninitialized in this function gcc seems to be unable to figure out that the value shortcut doesn't change between the two if blocks that test it here. ../vp8/encoder/onyx_if.c:5314:5: warning: comparison of unsigned expression >= 0 is always true ../vp8/encoder/onyx_if.c:5319:5: warning: comparison of unsigned expression >= 0 is always true This is true, so far as it goes, but it's comparing against an enum, and the C standard does not mandate that enums be unsigned, so the checks can't be removed. Change-Id: Iaf689ae3e3d0ddc5ade00faa474debe73b8d3395	2010-10-27 18:08:04 -07:00
Yunqing Wang	71ecb5d7d9	Full search SAD function optimization in SSE4.1 Use mpsadbw, and calculate 8 sad at once. Function list: vp8_sad16x16x8_sse4 vp8_sad16x8x8_sse4 vp8_sad8x16x8_sse4 vp8_sad8x8x8_sse4 vp8_sad4x4x8_sse4 (test clip: tulip) For best quality mode, this gave encoder a 5% performance boost. For good quality mode with speed=1, this gave encoder a 3% performance boost. Change-Id: I083b5a39d39144f88dcbccbef95da6498e490134	2010-10-27 13:36:31 -04:00
John Koleszar	a0ae3682aa	Fix half-pixel variance RTCD functions This patch fixes the system dependent entries for the half-pixel variance functions in both the RTCD and non-RTCD cases: - The generic C versions of these functions are now correct. Before all three cases called the hv code. - Wire up the ARM functions in RTCD mode - Created stubs for x86 to call the optimized subpixel functions with the correct parameters, rather than falling back to C code. Change-Id: I1d937d074d929e0eb93aacb1232cc5e0ad1c6184	2010-10-27 13:00:30 -04:00
John Koleszar	1747207700	Merge "Add half-pixel variance RTCD functions"	2010-10-26 20:05:02 -07:00
John Koleszar	1320e54d95	Merge "make vp8_recon16x16mb{,y} RTCD functions"	2010-10-26 20:02:57 -07:00
John Koleszar	87e17737e9	Merge "make arm hex search the generic implementation"	2010-10-26 20:02:37 -07:00
John Koleszar	9fdd90c9aa	Merge "arm: remove duplicate functions"	2010-10-26 20:01:54 -07:00
John Koleszar	209d82ad72	Add half-pixel variance RTCD functions NEON has optimized 16x16 half-pixel variance functions, but they were not part of the RTCD framework. Add these functions to RTCD, so that other platforms can make use of this optimization in the future and special-case ARM code can be removed. A number of functions were taking two variance functions as parameters. These functions were changed to take a single parameter, a pointer to a struct containing all the variance functions for that block size. This provides additional flexibility for calling additional variance functions (the half-pixel special case, for example) and by initializing the table for all block sizes, we don't have to construct this function pointer table for each macroblock. Change-Id: I78289ff36b2715f9a7aa04d5f6fbe3d23acdc29c	2010-10-26 20:00:56 -07:00
John Koleszar	d6c67f02c9	make vp8_recon16x16mb{,y} RTCD functions ARM NEON has a platform specific version of vp8_recon16x16mb, though it's just a stub to extract the various parameters from the MACROBLOCKD struct and pass them to vp8_recon16x16mb_neon(). Using that function's prototype directly will be a better long term solution, but it's quite an invasive change. Change-Id: I04273149e2ade34749e2d09e7edb0c396e1dd620	2010-10-26 13:23:36 -04:00
John Koleszar	96cf6588de	make arm hex search the generic implementation The ARM version of vp8_hex_search() is a faster implementation of the same algorithm. Since it doesn't use any ARM specific code, it can be made the default implementation. This removes a linking error. Change-Id: I77d10f2c16b2515bff4522c350004e03b7659934	2010-10-26 10:46:31 -04:00

1 2 3 4 5 ...

282 Commits