generic-library/vpx

Author	SHA1	Message	Date
Fritz Koenig	734b1b2041	Revert "Reclasify optimized ssim calculations as SSE2." This reverts commit `01376858cd`	2011-08-22 11:31:12 -07:00
Fritz Koenig	f8e3d23b99	Merge "Reclasify optimized ssim calculations as SSE2."	2011-08-22 09:20:33 -07:00
John Koleszar	efe35fa63f	Merge remote branch 'internal/upstream' into HEAD	2011-08-20 00:05:04 -04:00
Fritz Koenig	01376858cd	Reclasify optimized ssim calculations as SSE2. Calculations were incorrectly classified as either SSE3 or SSSE3. Only using SSE2 instructions. Cleanup function names and make non-RTCD code work as well. Change-Id: I29f5c2ead342b2086a468029c15e2c1d948b5d97	2011-08-19 08:51:27 -07:00
John Koleszar	edec5eb5e7	Merge "Copy less when active map is in use"	2011-08-19 07:31:00 -07:00
Alpha Lam	4e8d35a461	Copy less when active map is in use When active map is specified and the current frame is not a key frame, golden frame nor a altref frame then copy only those active regions. This significantly reduces encoding time by as much as 19% on the test system where realtime encoding is used. This is particularly useful when the frame size is large (e.g. 2560x1600) and there's only a few action macroblocks. Change-Id: If394a813ec2df5a0201745d1348dbde4278f7ad4	2011-08-19 10:29:41 -04:00
John Koleszar	3743fd0cc7	Merge remote branch 'internal/upstream' into HEAD	2011-08-18 00:05:09 -04:00
Paul Wilkins	744f482350	Small boost to every other frame. Instead of a single mid GF boost apply a few extra bits to every other frame. This gives a very small average metrics improvement on both derf and YT sets. Also use min GF interval as min KF interval. Change-Id: Iee238b8cae0ffaed850a5a944ac825cee18da485	2011-08-17 14:14:23 +01:00
Scott LaVarnway	19987dcbfa	Faster vp8_default_coef_probs Copies from a generated table instead of building the default coeff probabilities during runtime. Change-Id: I4d9551ea3a2d7d4a4f7ce9eda006495221a8de50	2011-08-16 16:21:21 -04:00
John Koleszar	f54d561fa8	Merge remote branch 'internal/upstream' into HEAD	2011-08-16 00:05:05 -04:00
John Koleszar	9cc1611588	Merge v0.9.7-p1 release int 'origin/master' Change-Id: I93388d2f8846615ad1e26b975308c5e96b9b1918	2011-08-15 17:10:01 -04:00
Stefan Holmer	99d870a472	Don't set the bmi mode when doing error concealment Since the block will be interpreted as an inter block, the mode will be interpreted as a motion vector, resulting in bad concealment. Change-Id: Ifcc685ae1cc883492bce6dbd61e418d91a89b053	2011-08-15 11:46:04 -04:00
Stefan Holmer	ff35649758	Don't set the bmi mode when doing error concealment Since the block will be interpreted as an inter block, the mode will be interpreted as a motion vector, resulting in bad concealment. Change-Id: Ifcc685ae1cc883492bce6dbd61e418d91a89b053	2011-08-15 09:56:07 +02:00
John Koleszar	8f60186502	Merge remote branch 'internal/upstream' into HEAD	2011-08-13 00:05:06 -04:00
John Koleszar	e96131705a	Revert "Improved 1-pass CBR rate control" This reverts commit `b5ea2fbc2c`. Further testing showed noticable keyframe popping in some cases, reverting this for now to give time for a proper fix. Conflicts: vp8/encoder/onyx_if.c vp8/encoder/ratectrl.c Change-Id: I159f53d1bf0e24c035754ab3ded8ccfd58fd04af	2011-08-12 14:51:36 -04:00
John Koleszar	a4c2211ea3	Propagate macroblock MV to subblocks for error concealment EC expects the subblock MVs to be populated, but `f1d6cc79e4` removed this code. This commit restores it, protected by CONFIG_ERROR_CONCEALMENT. May move this to the EC code more directly in the future. Change-Id: I44f8f985720cb9a1bf222e59143f9e69abf56ad2	2011-08-12 14:49:35 -04:00
Stefan Holmer	a609be5633	Disable error concealment until first key frame is decoded When error concealment is enabled the first key frame must be successfully received before error concealment is activated. Error concealment will be activated when the delta following delta frame is received. Also fixed a couple of bugs related to error tracking in multi-threading. And avoiding decoding corrupt residual when we have multiple non-resilient partitions. Change-Id: I45c4bb296e2f05f57624aef500a874faf431a60d	2011-08-12 14:49:34 -04:00
John Koleszar	cdae03a4eb	Fix potential OOB read with Error Concealment This patch fixes an OOB read when error concealment is enabled and the partition sizes are corrupt. The partition size read from the bitstream was not being validated in EC mode. Change-Id: Ia81dfd4bce1ab29ee78e42320abe52cee8318974	2011-08-12 14:49:34 -04:00
John Koleszar	4645c89889	Merge "Disable error concealment until first key frame is decoded"	2011-08-12 11:45:26 -07:00
John Koleszar	91206793c2	Propagate macroblock MV to subblocks for error concealment EC expects the subblock MVs to be populated, but `f1d6cc79e4` removed this code. This commit restores it, protected by CONFIG_ERROR_CONCEALMENT. May move this to the EC code more directly in the future. Change-Id: I44f8f985720cb9a1bf222e59143f9e69abf56ad2	2011-08-12 11:34:40 -04:00
Stefan Holmer	3e10be93f2	Disable error concealment until first key frame is decoded When error concealment is enabled the first key frame must be successfully received before error concealment is activated. Error concealment will be activated when the delta following delta frame is received. Also fixed a couple of bugs related to error tracking in multi-threading. And avoiding decoding corrupt residual when we have multiple non-resilient partitions. Change-Id: I45c4bb296e2f05f57624aef500a874faf431a60d	2011-08-12 16:10:04 +02:00
John Koleszar	810a06b12c	Fix potential OOB read with Error Concealment This patch fixes an OOB read when error concealment is enabled and the partition sizes are corrupt. The partition size read from the bitstream was not being validated in EC mode. Change-Id: Ia81dfd4bce1ab29ee78e42320abe52cee8318974	2011-08-11 18:07:03 -04:00
John Koleszar	a16cd74ba1	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/decoder/detokenize.c vp8/decoder/onyxd_if.c vp8/vp8_common.mk Change-Id: Ifca1108186a8bc715da86a44021ee2fa5550b5b8	2011-08-11 13:01:45 -04:00
John Koleszar	939f64f68e	Merge remote branch 'origin/master' into experimental Change-Id: I9c479c9b6e72aa78b412d25c00b8075eaca5229d	2011-08-06 00:05:15 -04:00
Yunqing Wang	b84e8f20c3	Merge "Adjust half-pixel only search"	2011-08-05 12:15:32 -07:00
John Koleszar	712762b508	Merge remote branch 'origin/master' into experimental Change-Id: Ic698ea5f5b31a5faf467eb0da4b762f9586df938	2011-08-05 00:05:05 -04:00
John Koleszar	238dae8604	Fix source buffer selection This patch fixes a bug in the interaction between the recode loop and spatial resampling. If the codec was in a spatial resampling state, and a subsequent iteration of the recode loop disables resampling, then the source buffer must be reset to the unscaled source. Change-Id: I4e4cd47b943f6cd26a47449dc7f4255b38e27c77	2011-08-03 16:13:15 -04:00
Yunqing Wang	b9f19f8917	Adjust half-pixel only search Changed motion search in vp8_find_best_half_pixel_step() to be the same as in vp8_find_best_sub_pixel_step(), which checks 5 points instead of 8 points. This only affects real-time mode with cpu-used >=9. Tests showed it gives 2% encoding speedup with a quality loss(psnr) of up to 0.5%. Change-Id: I16049cad1535002346d46cfdfad345bfc3dc5146	2011-08-03 11:51:07 -04:00
Johann	30e5deae5d	update extend frame borders the neon code made several assumptions which were broken by a recent change: https://review.webmproject.org/2676 update the code with new assumptions and guard them with a compile time assert Change-Id: I32a8378030759966068f34618d7b4b1b02e101a0	2011-08-02 19:26:46 -04:00
James Berry	27ee521753	include asm_com/dec_offsets for make dist Change-Id: Ia1ad66066a24c01915cd9e3ff75c7e070cc984c8	2011-08-02 13:42:03 -04:00
John Koleszar	f475f0c1bb	Merge "include the arm header files in make dist" into cayuga	2011-08-02 05:21:10 -07:00
John Koleszar	06c3d5bb9a	Fix building with --disable-postproc Change-Id: I7e6bc28e7974a376da747300744e0dd5dc1d21e9	2011-08-01 17:50:23 -04:00
Johann	3e8c6d3d35	include the arm header files in make dist Change-Id: Ibcf5b4b14153f65ce1b53c3bfba87ad2feb17bbd	2011-08-01 17:20:21 -04:00
John Koleszar	87e570e6be	Merge remote branch 'origin/master' into experimental Change-Id: I473166452c0ed5a4219b5e7d96a91a6641b11b9d	2011-07-30 00:05:09 -04:00
John Koleszar	6f080f9cec	Merge "Convert rc_max_intra_bitrate_pct to control"	2011-07-29 11:57:48 -07:00
John Koleszar	1f71d2e2c8	Correctly track sharpness in vp8cx_pick_filter_level_fast Make sure to update last_sharpness_level from the current sharpness_level whenever it changes. Change-Id: I0258d2f5b11a407abf6176a8d4c4994d925943f0	2011-07-29 12:27:03 -04:00
John Koleszar	1654ae9a2a	Convert rc_max_intra_bitrate_pct to control Since this is the only ABI incompatible change since the last release, convert it to use the control interface instead. The member of the configuration struct is replaced with the VP8E_SET_MAX_INTRA_BITRATE_PCT control. More significant API changes were expected to be forthcoming when this control was first introduced, and while they continue to be expected, it's not worth breaking compatibility for only this change. Change-Id: I799d8dbe24c8bc9c241e0b7743b2b64f81327d59	2011-07-28 09:17:35 -04:00
John Koleszar	728886fae9	Merge remote branch 'origin/master' into experimental Change-Id: Iaca87acc9726b5173d638528684d154538ec01e6	2011-07-28 00:05:12 -04:00
Yunqing Wang	2f2302f8d5	Preload reference area in sub-pixel motion search (real-time mode) This change implemented same idea in change "Preload reference area to an intermediate buffer in sub-pixel motion search." The changes were made to vp8_find_best_sub_pixel_step() and vp8_find_best_half _pixel_step() functions which are called when speed >= 5. Test result (using tulip clip): 1. On Core2 Quad machine(Linux) rt mode, speed (-5 ~ -8), encoding speed gain: 2% ~ 3% rt mode, speed (-9 ~ -11), encoding speed gain: 1% ~ 2% rt mode, speed (-12 ~ -14), no noticeable encoding speed gain 2. On Xeon machine(Linux) Test on speed (-5 ~ -14) didn't show noticeable speed change. Change-Id: I21bec2d6e7fbe541fcc0f4c0366bbdf3e2076aa2	2011-07-27 14:19:10 -04:00
Yunqing Wang	f11613b620	Merge "Fix range checks in motion search"	2011-07-27 09:34:13 -07:00
Yunqing Wang	bde2afbe23	Fix range checks in motion search There were some situations that the start motion vectors were out of range. This fix adjusted range checks to make sure they are checked and clamped. Change-Id: Ife83b7fed0882bba6d1fa559b6e63c054fd5065d	2011-07-27 10:37:33 -04:00
John Koleszar	9fbb1d4350	Merge remote branch 'origin/master' into experimental Change-Id: I1ae82458536ba2f0969e1bea78f41cd16fe96b79	2011-07-27 00:05:06 -04:00
John Koleszar	db8f0d2ca9	Merge "cosmetics: consistently use [u]int64_t"	2011-07-26 12:57:43 -07:00
James Zern	b45065d38b	cosmetics: consistently use [u]int64_t Removes mixed usage of (unsigned) long long and INT64. Fixes Issue #208. Change-Id: I220d3ed5ce4bb1280cd38bb3715f208ce23cf83a	2011-07-26 11:34:36 -07:00
John Koleszar	eccfca5165	Make cat6 probs properly dependent on CONFIG_EXTEND_QRANGE Change-Id: I2ac5d8818acb50f9db38de8cb562f337e51006b2	2011-07-26 10:30:33 -04:00
John Koleszar	62400028e2	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/decoder/detokenize.c vp8/decoder/onyxd_int.h Change-Id: Ib9b516b939358ac8bf694200a8425fdd62c8d149	2011-07-26 10:22:42 -04:00
John Koleszar	3c4a39e71c	Merge remote branch 'origin/master' into experimental Conflicts: vp8/decoder/detokenize.c vp8/decoder/onyxd_int.h Change-Id: Idc301ae630dc1aedeb85674ecfdcf1eb28420f81	2011-07-26 10:04:36 -04:00
Johann	ca7e346669	Merge ""Eliminated TOKENEXTRABITS" broke the windows build."	2011-07-26 06:34:31 -07:00
Scott LaVarnway	a11624497c	"Eliminated TOKENEXTRABITS" broke the windows build. Fixed. Change-Id: I3348e8dbcaee6ace263af413701101d77636e5df	2011-07-26 09:33:16 -04:00
Scott LaVarnway	4894b45ced	Merge "Eliminated TOKENEXTRABITS"	2011-07-25 14:35:58 -07:00
Scott LaVarnway	76eb402668	Eliminated TOKENEXTRABITS Noticed small performance gains, depending on material. Change-Id: I334369f6312bc19aa73481fc3f790ab181e11867	2011-07-25 17:11:24 -04:00
Yunqing Wang	5b0de48ddd	Merge "Use CONFIG_FAST_UNALIGNED consistently in codec"	2011-07-25 12:40:50 -07:00
Yunqing Wang	fe270dd527	Specify size for argument pushed to stack The change fixes building error on Win64. Change-Id: I63d25b26220c4da8a98ca2e36530cbb802468e6b	2011-07-25 11:30:45 -04:00
Yunqing Wang	65dfcf4696	Use CONFIG_FAST_UNALIGNED consistently in codec CONFIG_FAST_UNALIGNED is enabled by default. Disable it if it is not supported by hardware. Change-Id: I7d6905ed79fed918bca074bd62820b0c929d81ab	2011-07-25 10:11:24 -04:00
John Koleszar	664cd5ac91	Merge remote branch 'internal/upstream' into HEAD	2011-07-23 00:05:14 -04:00
John Koleszar	e14ad46efa	Merge remote branch 'origin/master' into experimental Change-Id: I0a24d6762598e5fee30f264de1dcd10331c01eac	2011-07-23 00:05:13 -04:00
Johann	773bcc300d	Merge "fix sharpness bug and clean up"	2011-07-22 09:34:55 -07:00
Johann	a04ed0e8f3	fix sharpness bug and clean up sharpness was not recalculated in vp8cx_pick_filter_level_fast remove last_filter_type. all values are calculated, don't need to update the lfi data when it changes. always use cm->sharpness_level. the extra indirection was annoying. don't track last frame_type or sharpness_level manually. frame type only matters for motion search and sharpness_level is taken care of in frame_init move function declarations to their proper header Change-Id: I7ef037bd4bf8cf5e37d2d36bd03b5e22a2ad91db	2011-07-22 12:33:57 -04:00
Yunqing Wang	829179e888	Merge "Preload reference area to an intermediate buffer in sub-pixel motion search"	2011-07-22 06:56:15 -07:00
Yunqing Wang	20bd1446c0	Preload reference area to an intermediate buffer in sub-pixel motion search In sub-pixel motion search, the search range is small(+/- 3 pixels). Preload whole search area from reference buffer into a 32-byte aligned buffer. Then in search, load reference data from this buffer instead. This keeps data in cache, and reduces the crossing cache- line penalty. For tulip clip, tests on Intel Core2 Quad machine(linux) showed encoder speed improvement: 3.4% at --rt --cpu-used =-4 2.8% at --rt --cpu-used =-3 2.3% at --rt --cpu-used =-2 2.2% at --rt --cpu-used =-1 Test on Atom notebook showed only 1.1% speed improvement(speed=-4). Test on Xeon machine also showed less improvement, since unaligned data access latency is greatly reduced in newer cores. Next, I will apply similar idea to other 2 sub-pixel search functions for encoding speed > 4. Make this change exclusively for x86 platforms. Change-Id: Ia7bb9f56169eac0f01009fe2b2f2ab5b61d2eb2f	2011-07-22 09:28:06 -04:00
John Koleszar	dc9e1b7683	Merge remote branch 'origin/master' into experimental Change-Id: I8b0a76b3232c8cff15c0ca5289e18af6889e5095	2011-07-22 00:05:11 -04:00
John Koleszar	7d44c805cf	Merge remote branch 'internal/upstream' into HEAD	2011-07-22 00:05:06 -04:00
Yaowu Xu	f614661242	Merge "fix more merge issues" into experimental	2011-07-21 16:05:24 +00:00
Yaowu Xu	8c31484ea1	fix more merge issues With this fix, the experimental branch now builds and encodes correctly with the following two configure options respectively: --enable-experimental --enable-t8x8 --enable-experimental Change-Id: I3147c33c503fe713a85fd371e4f1a974805778bf	2011-07-21 09:01:53 -07:00
John Koleszar	2bdda84e37	Merge "Increase chrow row alignment to 16 bytes."	2011-07-21 07:32:39 -07:00
Yunqing Wang	c5fe641179	Merge "Add improvements made in good-quality mode to real-time mode"	2011-07-21 07:27:09 -07:00
John Koleszar	ca60e0c2f9	Merge remote branch 'origin/master' into experimental Change-Id: I9761428209518b7fcbde60e884c06754664c0c36	2011-07-21 00:05:10 -04:00
John Koleszar	a53586d9d1	Merge remote branch 'internal/upstream' into HEAD	2011-07-21 00:05:05 -04:00
Yaowu Xu	1c24eb2b7b	fixed a number of problems caused by auto merges The auto merge process pull and merge commits from public git or master branch. These automerges while worked well most time, but has created a few problems. This commit fixed several issues existed long before the latest 8x8 transform commit. Change-Id: I895ca99713231b1aec521d57db5d9839f74aacfa	2011-07-20 12:45:35 -07:00
Timothy B. Terriberry	7d1b37cdac	Increase chrow row alignment to 16 bytes. This is done by expanding luma row to 32-byte alignment, since there is currently a bunch of code that assumes that uv_stride == y_stride/2 (see, for example, vp8/common/postproc.c, common/reconinter.c, common/arm/neon/recon16x16mb_neon.asm, encoder/temporal_filter.c, and possibly others; I haven't done a full audit). It also uses replaces the hardcoded border of 16 in a number of encoder buffers with VP8BORDERINPIXELS (currently 32), as the chroma rows start at an offset of border/2. Together, these two changes have the nice advantage that simply dumping the frame memory as a contiguous blob produces a valid, if padded, image. Change-Id: Iaf5ea722ae5c82d5daa50f6e2dade9de753f1003	2011-07-20 10:20:31 -07:00
Deb Mukherjee	08f6471890	Add 8x8 transform to experimental branch Please refer to previous commit messages for detailed info: https://on2-git.corp.google.com/g/#change,5940 https://on2-git.corp.google.com/g/#change,6045 Change-Id: I8b16992f2f69c5a808ad40a3e32ef589cce7c59d	2011-07-20 09:49:22 -07:00
Attila Nagy	0afcc76971	encoder: don't set the fragment bit for the last partition Change-Id: Icb4e4f0d7c3074a8507852178be87541a1cb5bac	2011-07-20 14:09:42 +03:00
John Koleszar	6907117175	Merge remote branch 'origin/master' into experimental Change-Id: I956822324c046c254806dd712a2d3be4dcf8564b	2011-07-20 00:05:17 -04:00
John Koleszar	8e464cc4c2	Merge remote branch 'internal/upstream' into HEAD	2011-07-20 00:05:09 -04:00
Scott LaVarnway	b2d9700f53	Merge "Moved vp8_encode_bool into boolhuff.h"	2011-07-19 08:15:14 -07:00
Johann	6afafc313c	remove old armv5 code armv5 dequantizer is not referenced Change-Id: Id1cc617dcee35ebd6a406816ec6aaa26e8bbc8ad	2011-07-19 09:20:38 -04:00
Scott LaVarnway	a25f6a9c88	Moved vp8_encode_bool into boolhuff.h allowing the compiler to inline this function. For real-time encodes, this gave a boost of 1% to 2.5%, depending on the speed setting. Change-Id: I3929d176cca086b4261267b848419d5bcff21c02	2011-07-19 09:17:25 -04:00
John Koleszar	2614b77fcb	Merge remote branch 'origin/master' into experimental Change-Id: Ida9204624fe3fb99fed1b149d1f88159480fdd83	2011-07-19 00:05:11 -04:00
John Koleszar	b3b34b0bc7	Merge remote branch 'internal/upstream' into HEAD	2011-07-19 00:05:05 -04:00
John Koleszar	b5ea2fbc2c	Improved 1-pass CBR rate control This patch attempts to improve the handling of CBR streams with respect to the short term buffering requirements. The "buffer level" is changed to be an average over the rc buffer, rather than a long running average. Overshoot is also tracked over the same interval and the golden frame targets suppressed accordingly to correct for overly aggressive boosting. Testing shows that this is fairly consistently positive in one metric or another -- some clips that show significant decreases in quality have better buffering characteristics, others show improvenents in both. Change-Id: I924c89aa9bdb210271f2e03311e63de3f1f8f920	2011-07-18 11:48:05 -04:00
John Koleszar	8bf2cbce98	Merge remote branch 'origin/master' into experimental Change-Id: Ic623c335cd4991c9d80f675f390e81282b18c137	2011-07-16 00:05:08 -04:00
John Koleszar	dc1c3f9024	Merge remote branch 'internal/upstream' into HEAD	2011-07-16 00:05:05 -04:00
Scott LaVarnway	e68894fa03	Merge "Tokenize MB optimized"	2011-07-15 07:54:14 -07:00
Tero Rintaluoma	4e82f01547	Tokenize MB optimized Optimized C-code of the following functions: - vp8_tokenize_mb - tokenize1st_order_b - tokenize2nd_order_b Gives ~1-5% speed-up for RT encoding on Cortex-A8/A9 depending on encoding parameters. Change-Id: I6be86104a589a06dcbc9ed3318e8bf264ef4176c	2011-07-15 11:26:54 +03:00
John Koleszar	f1fcd74e3e	Merge remote branch 'origin/master' into experimental Change-Id: Icbeb14d64ed3d9337606b591dde4e0669540a10d	2011-07-15 00:05:06 -04:00
John Koleszar	087b338d9e	Merge remote branch 'internal/upstream' into HEAD	2011-07-15 00:05:04 -04:00
James Berry	6b6f367c3d	bug fix vpx_copy_and_extend_frame size issue vpx_copy_and_extend_frame could incorrectly resize uv frames which could result in a crash. Change-Id: Ie96f7078b1e328b3907a06eebeee44ca39a2e898	2011-07-14 15:58:15 -04:00
John Koleszar	04dce631a2	Remove unused speed features min_fs_radius, max_fs_radius, full_freq were set but never read. Change-Id: I82657f4e7f2ba2acc3cbc3faa5ec0de5b9c6ec74	2011-07-14 14:20:25 -04:00
John Koleszar	86edcb0cc7	Merge remote branch 'origin/master' into experimental Change-Id: I3f64e220b78738e5261a9fda3c270d51613f4faa	2011-07-14 00:05:12 -04:00
John Koleszar	6901105e99	Merge remote branch 'internal/upstream' into HEAD	2011-07-14 00:05:04 -04:00
Yunqing Wang	f1f28535c3	Merge "Fix unnecessary casting of B_PREDICTION_MODE (issue 349)"	2011-07-13 13:32:57 -07:00
Yunqing Wang	139577f937	Fix unnecessary casting of B_PREDICTION_MODE (issue 349) Minor fix. Change-Id: Iaf93f6e47e882a33c479e57c7a0d0bf321e291c0	2011-07-13 15:52:07 -04:00
Yunqing Wang	0e9a6ed72a	Add improvements made in good-quality mode to real-time mode Several improvements we made in good-quality mode can be added into real-time mode to speed up encoding in speed 1, 2, and 3 with small quality loss. Tests using tulip clip showed: --rt --cpu-used=-1 (before change) PSNR: 38.028 time: 1m33.195s (after change) PSNR: 38.014 time: 1m20.851s --rt --cpu-used=-2 (before change) PSNR: 37.773 time: 0m57.650s (after change) PSNR: 37.759 time: 0m54.594s --rt --cpu-used=-3 (before change) PSNR: 37.392 time: 0m42.865s (after change) PSNR: 37.375 time: 0m41.949s Change-Id: I76ab2a38d72bc5efc91f6fe20d332c472f6510c9	2011-07-13 14:51:02 -04:00
Fritz Koenig	84c3cd79d1	Merge "Reduce motion vector search on alt-ref frame."	2011-07-13 10:07:30 -07:00
Johann	211694f67e	Merge "update x86 asm for loopfilter"	2011-07-13 04:10:03 -07:00
Johann	8f910594bd	Merge "Update armv6 loopfilter to new interface"	2011-07-13 04:09:55 -07:00
Johann	1a219c22b1	Merge "Update armv7 loopfilter to new interface"	2011-07-13 04:09:42 -07:00
Johann	d9b825cff2	Merge "New loop filter interface"	2011-07-13 04:09:26 -07:00
John Koleszar	ffc4587a47	Merge remote branch 'origin/master' into experimental Change-Id: I9dab62c24d71f71cdc36732ed8ed469bee67d7e1	2011-07-13 00:05:04 -04:00
John Koleszar	791ad1bb37	Merge remote branch 'internal/upstream' into HEAD	2011-07-13 00:05:03 -04:00
Attila Nagy	c231b0175d	Update armv6 loopfilter to new interface Change-Id: I5fe581d797571a7a9432fbd17fc557591d0c1afa	2011-07-12 12:14:51 +03:00
Attila Nagy	283b0e25ac	Update armv7 loopfilter to new interface Change-Id: I65105a9c63832669237e6a6a7fcb4ea3ea683346	2011-07-12 12:12:25 +03:00
Fritz Koenig	ede0b15c9d	Reduce motion vector search on alt-ref frame. Clamp mv search to accomodate subpixel filtering of UV mv. Change-Id: Iab3ed405993ef6bf779ad7cf60863153068fb7d1	2011-07-11 09:05:43 -07:00
John Koleszar	6058c9ce0c	Merge remote branch 'origin/master' into experimental Change-Id: Ica63d16cb39e2d65a3414f0b9f86c8a64112dfa3	2011-07-09 00:05:09 -04:00
John Koleszar	c24479e870	Merge remote branch 'internal/upstream' into HEAD	2011-07-09 00:05:04 -04:00
Yunqing Wang	587ca06da9	Minor change in pick_inter_mode() Scott suggested to move vp8_mv_pred() under "case NEWMV" to save extra checks. Change-Id: I09e69892f34a08dd425a4d81cfcc83674e344a20	2011-07-08 14:08:45 -04:00
Yunqing Wang	e83d36c053	Merge "Adjust full-pixel clamping and motion vector limit calculation"	2011-07-08 08:39:32 -07:00
Yunqing Wang	40991faeae	Adjust full-pixel clamping and motion vector limit calculation Do mvp clamping in full-pixel precision instead of 1/8-pixel precision to avoid error caused by right shifting operation. Also, further fixed the motion vector limit calculation in change: `b748045470` Change-Id: Ied88a4f7ddfb0476eb9f7afc6ceeddbf209fffd7	2011-07-08 11:34:28 -04:00
Johann	01433c5043	update x86 asm for loopfilter Change-Id: I1ed739522db7c00c189851c7095c1b64ef6412ce	2011-07-08 09:23:38 -04:00
John Koleszar	ef7f489dc3	Merge remote branch 'internal/upstream-experimental' into HEAD	2011-07-08 08:57:03 -04:00
Johann	6ae12c415e	Merge "clean up warnings when building arm with rtcd"	2011-07-08 05:16:09 -07:00
Attila Nagy	622958449b	New loop filter interface Separate simple filter with reduced no. of parameters. MB filter level picking based on precalculated table. Level table updated for each frame. Inside and edge limits precalculated and updated just when sharpness changes. HEV threshhold is constant. ARM targets use scalars and others vectors. Change works only with --target=generic-gnu All other targets have to be updated! Change-Id: I6b73aca6b525075b20129a371699b2561bd4d51c	2011-07-08 09:31:41 +03:00
John Koleszar	2c3c54f747	Merge remote branch 'origin/master' into experimental Change-Id: I9cead934ebea85d81aceaaec4674efc74367f984	2011-07-08 00:05:05 -04:00
John Koleszar	973a9c075d	Merge "Set VPX_FRAME_IS_DROPPABLE"	2011-07-07 08:11:05 -07:00
John Koleszar	37de0b8bdf	Set VPX_FRAME_IS_DROPPABLE Allow the encoder to inform the application that the encoded frame will not be used as a reference. Change-Id: I90e41962325ef73d44da03327deb340d6f7f4860	2011-07-07 10:38:45 -04:00
John Koleszar	61d9d595d5	Merge remote branch 'origin/master' into experimental Change-Id: I009c7e3043ad1eb1ce95c69132a4727073b86757	2011-07-02 00:05:12 -04:00
John Koleszar	5380a2215e	Merge remote branch 'internal/upstream' into HEAD	2011-07-02 00:05:10 -04:00
John Koleszar	b4f70084cc	Merge "Properly use GET_GOT/RESTORE_GOT when using GLOBAL()."	2011-07-01 07:14:34 -07:00
John Koleszar	54c3637828	Merge remote branch 'origin/master' into experimental Change-Id: Iaf6e9e14d0cfe5cef3895cfb68524d51139a6d23	2011-07-01 00:05:12 -04:00
John Koleszar	766fce16c4	Merge remote branch 'internal/upstream' into HEAD	2011-07-01 00:05:11 -04:00
Ronald S. Bultje	c8a23ad3f4	Properly use GET_GOT/RESTORE_GOT when using GLOBAL(). This should fix binaries using PIC on x86-32. Also should fix issue 343. Change-Id: I591de3ad68c8a8bb16054bd8f987a75b4e2bad02	2011-06-30 14:04:27 -07:00
Yunqing Wang	ae8aa836d5	Merge "Copy macroblock data to a buffer before encoding it"	2011-06-30 11:14:24 -07:00
Yunqing Wang	80c3bbf657	Merge "Bug fix in motion vector limit calculation"	2011-06-30 09:52:03 -07:00
Yunqing Wang	b748045470	Bug fix in motion vector limit calculation Motion vector limits are calculated using right shifts, which could give wrong results for negative numbers. James Berry's test on one clip showed encoder produced some artifacts. This change fixed that. Change-Id: I035fc02280b10455b7f6eb388f7c2e33b796b018	2011-06-30 11:20:13 -04:00
Johann	3e4a80cc35	Merge "remove incorrect initialization"	2011-06-30 07:59:08 -07:00
John Koleszar	9dfd006017	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/encoder/bitstream.c Change-Id: I44c00f98dcb99eb728ce4f5256aefb135a711a74	2011-06-30 08:46:49 -04:00
John Koleszar	6251e9e5ce	Merge remote branch 'origin/master' into experimental Change-Id: I35c9ca116aecd0d03e762942d9cf1289edb4f23d	2011-06-30 00:05:10 -04:00
Paul Wilkins	eacaabc592	Merge "Change to arf boost calculation."	2011-06-29 10:03:57 -07:00
Paul Wilkins	11694aab66	Change to arf boost calculation. In this commit I have added an experimental function that tests prediction quality either side of a central position to calculate a suggested boost number for an ARF frame. The function is passed an offset from the current position and a number of frames to search forwards and backwards. It returns a forward, backward and compound boost number. The new code can be deactivated using #define NEW_BOOST 0 In its current default state the code searches forwards and backwards from the proposed position of the next alt ref. The the old code used a boost number calculated by scanning forward from the previous GF up to the proposed alt ref frame position. I have also added some code to try and prevent placement of a gf/arf where there is a brief flash. Change-Id: I98af789a5181148659f10dd5dd2ff2d4250cd51c	2011-06-29 18:01:25 +01:00
Johann	fe53107fda	remove incorrect initialization Values were set, then reset. Only set them once. Change-Id: Iaf43c8467129f2f261f04fa9188b603aa46216b5	2011-06-29 11:54:27 -04:00
Johann	6611f66978	clean up warnings when building arm with rtcd Change-Id: I3683cb87e9cb7c36fc22c1d70f0799c7c46a21df	2011-06-29 10:51:41 -04:00
John Koleszar	f3a13cb236	Merge "Use MAX_ENTROPY_TOKENS and ENTROPY_NODES more consistently"	2011-06-29 07:29:59 -07:00
John Koleszar	fe5765a5f3	Merge remote branch 'origin/master' into experimental Change-Id: I68e604e4a731f6703fdec7eff2c2c9b9e36879ea	2011-06-29 00:05:10 -04:00
Johann	dc004e8c17	Merge "Avoid text relocations in ARM vp8 decoder"	2011-06-28 16:34:10 -07:00
Johann	02c30cdeef	Merge "utilize preload in ARMv6 MC/LPF/Copy routines"	2011-06-28 16:33:45 -07:00
John Koleszar	b32da7c3da	Use MAX_ENTROPY_TOKENS and ENTROPY_NODES more consistently There were many instances in the code of vp8_coef_tokens and vp8_coef_tokens-1, which was a preprocessor macro despite the naming convention. Replace these with MAX_ENTROPY_TOKENS and ENTROPY_NODES, respectively. Change-Id: I72c4f6c7634c94e1fa066cd511471e5592c748da	2011-06-28 17:03:55 -04:00
John Koleszar	9bcf07ae4a	Merge "Simplify decode_macroblock."	2011-06-28 12:54:25 -07:00
Gaute Strokkenes	81c0546407	Simplify decode_macroblock. Change-Id: Ieb2f3827ae7896ae594203b702b3e8fa8fb63d37	2011-06-28 17:01:14 +01:00
Stefan Holmer	7296b3f922	New ways of passing encoded data between encoder and decoder. With this commit frames can be received partition-by-partition from the encoder and passed partition-by-partition to the decoder. At the encoder-side this makes it easier to split encoded frames at partition boundaries, useful when packetizing frames. When VPX_CODEC_USE_OUTPUT_PARTITION is enabled, several VPX_CODEC_CX_FRAME_PKT packets will be returned from vpx_codec_get_cx_data(), containing one partition each. The partition_id (starting at 0) specifies the decoding order of the partitions. All partitions but the last has the VPX_FRAME_IS_FRAGMENT flag set. At the decoder this opens up the possibility of decoding partition N even though partition N-1 was lost (given that independent partitioning has been enabled in the encoder) if more info about the missing parts of the stream is available through external signaling. Each partition is passed to the decoder through the vpx_codec_decode() function, with the data pointer pointing to the start of the partition, and with data_sz equal to the size of the partition. Missing partitions can be signaled to the decoder by setting data != NULL and data_sz = 0. When all partitions have been given to the decoder "end of data" should be signaled by calling vpx_codec_decode() with data = NULL and data_sz = 0. The first partition is the first partition according to the VP8 bitstream + the uncompressed data chunk + DCT address offsets if multiple residual partitions are used. Change-Id: I5bc0682b9e4112e0db77904755c694c3c7ac6e74	2011-06-28 11:10:17 -04:00
Stefan Holmer	4cb0ebe5b2	Adding support for independent partitions Adding support in the encoder for generating independent residual partitions by forcing equal probabilities over the prev coef entropy contexts. Change-Id: I402f5c353255f3ca20eae2620af739f6a498cd21	2011-06-28 11:10:17 -04:00
Mike Hommey	e3f850ee05	Avoid text relocations in ARM vp8 decoder The current code stores pointers to coefficient tables and loads them to access the tables contents. As these pointers are stored in the code sections, it means we end up with text relocations. eu-findtextrel will thus complain about code not compiled with -fpic/-fPIC. Since the pointers are stored in the code sections, we can actually cheat and let the assembler generate relative addressing when accessing the coefficient tables, and just load their location with adr. Change-Id: Ib74ae2d3f2bab80b29991355f2dbe6955f38f6ae	2011-06-28 09:11:40 +02:00
John Koleszar	f86e14d8dc	Merge remote branch 'internal/upstream' into HEAD	2011-06-28 00:05:04 -04:00
John Koleszar	d83b68c622	Merge remote branch 'origin/master' into experimental Change-Id: Ia944723797d67abef24312cf928cf6fd64cd9766	2011-06-28 00:05:04 -04:00
John Koleszar	d7c6c9472f	Merge remote branch 'internal/upstream-experimental' into HEAD	2011-06-28 00:05:04 -04:00
John Koleszar	7985e023eb	Merge "fix build issues for experimental branch" into experimental	2011-06-27 11:55:15 -07:00
Fritz Koenig	be99868bd1	Fix after removal of B_MODE_INFO Change Ieb746989: Removed B_MODE_INFO missed this. Change-Id: I32202555581cc2a5d45e729c6650ada4d2df55d3	2011-06-27 09:43:21 -07:00
Johann	8a9a11e8dc	Merge "configuration, support disabling any subset of ARM arch"	2011-06-27 08:55:18 -07:00
John Koleszar	1ec4e27095	Merge remote branch 'origin/master' into experimental Change-Id: I689f4624a53184a72258df575305eb1aa97e61ca	2011-06-27 09:39:56 -04:00
Stefan Holmer	ba0822ba96	Adding support for error concealment in multi-threaded decoding Also includes a couple of error concealment bug fixes: - the segment_id wasn't properly initialized when missing - when interpolating and no neighbors are found, set to zero - clear the qcoef buffer when concealing an MB Change-Id: Id79c876b41d78b559a2241e9cd0fd2cae6198f49	2011-06-27 09:03:06 -04:00
John Koleszar	3ce5adb154	Merge remote branch 'internal/upstream' into HEAD	2011-06-25 00:05:03 -04:00
Adrian Grange	deca8cfc44	Fixed initialization of frame buffer ref counters Only the first frame buffer ref counter was being initialized because the index was fixed at 0 rather than using i. Change-Id: Ib842298be4a5e3607f9e21c2cd4bfbee4054ffc4	2011-06-24 08:43:40 -07:00
Yunqing Wang	0d87098e08	Copy macroblock data to a buffer before encoding it I got this idea from Pascal (Thanks). Before encoding a macroblock, copy it to a 16x16 buffer, and then read source data from there instead. This will help keep the source data in cache, and help with the performance. Change-Id: Id05f4cb601299150511d59dcba0ae62c49b5b757	2011-06-23 13:54:02 -04:00
Yaowu Xu	7793b386a7	fix build issues for experimental branch experimental branch build was broken from some merge artifacts, this commit fixes those issues to enable the experimental branch to build. Change-Id: Ic52b2d2f1d1b80abb7ecaa4c0927bcf887ac0c2a	2011-06-23 09:19:44 -07:00
John Koleszar	7467f6d04a	Merge remote branch 'internal/upstream' into HEAD	2011-06-23 11:55:51 -04:00
John Koleszar	db67dcba6a	Revert "Reduce overshoot in 1 pass rate control" This reverts commit `212f618373`. Further testing shows that the overshoot accumulation/damping is too aggressive on some clips. Allowing the accumulated overshoot to decay and limiting to damping to golden frames shows some promise. But some clips show significant overshoot in the buffer window, so I think this still needs work. Change-Id: Ic02a9ca34f55229f9cc04786f4fab54cdc1a3ef5	2011-06-23 11:52:12 -04:00
John Koleszar	8a7ca2b635	Merge remote branch 'internal/upstream' into HEAD	2011-06-23 00:05:04 -04:00
John Koleszar	4ec081a7de	Merge remote branch 'internal/upstream-experimental' into HEAD	2011-06-23 00:05:04 -04:00
James Berry	2bd90c13a0	get/set reference buffer dimension check added vp8_yv12_copy_frame_ptr() expects same size buffers which was not previously gaurenteed. Using an improperly allocated buffer would cause a crash before. Change-Id: I904982313ce9352474f80de842013dcd89f48685	2011-06-22 13:36:24 -04:00
Johann	786246ebf1	Merge remote branch 'origin/master' into experimental Conflicts: vp8/encoder/rdopt.c Use new constant (110) from `10ed60dc7` Change-Id: Ic7d8a45ccc8deeeb94a0ab1c58d5d052ef3c27e4	2011-06-22 07:45:17 -04:00
Yaowu Xu	76495617e0	Merge "adjusting the calculation of errorperbit"	2011-06-21 09:47:42 -07:00
Scott LaVarnway	55c3963c88	Merge "Improved vp8dx_decode_bool"	2011-06-21 07:45:51 -07:00
Yunqing Wang	109c20299c	Merge "Remove unnecessary bounds checking in motion search"	2011-06-21 07:23:24 -07:00
Attila Nagy	6f23f24afe	configuration, support disabling any subset of ARM arch Useful for leaving out any version specific asm files. Change-Id: I233514410eb9d7ca88d2d2c839673122c507fa99	2011-06-21 10:39:01 +03:00
Yaowu Xu	10ed60dc71	adjusting the calculation of errorperbit RDMULT/RDDIV defines a bit worth of distortion in term of sum squared difference. This has also been used as errorperbit in subpixel motion search, where the distortions computed as variance of the difference. The variance of differences is different from sum squared differences by amount of DC squared. Typically, for inter predicted MBs, this difference averages around 10% between the two distortion, so this patch introduces a 110% constant in deriving errorperbit from RDMULT/RDDIV. Test on CIF set shows small but positive gain on overall PSNR (.03%) and SSIM (.07%), overall impact on average PSNR is 0. Change-Id: I95425f922d037b4d96083064a10c7cdd4948ee62	2011-06-20 16:32:30 -07:00
Scott LaVarnway	67a1f98c2c	Improved vp8dx_decode_bool Relocated the vp8dx_bool_decoder_fill() call, allowing the compiler to produce better assembly code. Tests showed a 1 - 2 % performance boost (x86 using gcc) for the 720p clip used. Change-Id: Ic5a4eefed8777e6eefa007d4f12dfc7e64482732	2011-06-20 14:44:16 -04:00
John Koleszar	ae74199ecf	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/encoder/encodeframe.c vp8/encoder/rdopt.c Change-Id: I6ff3d92aa400bef10f6cc87f9da7ebaf6db8cc88	2011-06-20 09:07:43 -04:00
Taekhyun Kim	458fb8f491	utilize preload in ARMv6 MC/LPF/Copy routines About 9~10% decoding perf improvement on non-Neon ARM cpus Change-Id: I7dc2a026764e84e9c2faf282b4ae113090326837	2011-06-17 14:04:53 -07:00
John Koleszar	deb2e9cf62	Merge remote branch 'internal/upstream' into HEAD Conflicts: vp8/encoder/encodeframe.c vp8/encoder/rdopt.c Change-Id: I183fd3ce9e94617ec888c9f891055b9f1f8ca6c5	2011-06-17 15:36:43 -04:00
Johann	e18d7bc230	Merge remote branch 'origin/master' into experimental Conflicts: vp8/encoder/encodeframe.c vp8/encoder/rdopt.c Change-Id: I8bab720889ac652361abdedfe2cc91a89742cb30	2011-06-17 14:56:27 -04:00
Yunqing Wang	2cd1c2855e	Remove unnecessary bounds checking in motion search The starting points are always within the limits, and bounds checking on these points is not needed. For speed < 5, the encoded result changes a little because different treatment is taken while starting point equals the bounds. Change-Id: I09a402d310f51e305a3519f1601b1d17b05c6152	2011-06-17 14:19:51 -04:00
John Koleszar	a60fc419f5	Merge "Use SSE as BPRED distortion metric consistently"	2011-06-17 09:48:32 -07:00
Ronald S. Bultje	87fd66bb0e	Assign boost to GF bit allocation if past frame had no ARF. Modify the second-pass code to provide a full golden-frame (GF) bit allocation boost if the past GF group (GFG) had no alt-ref frame (ARF), even if the current GFG does contain and ARF. This mostly has no effect on clips, since switching ARFs on/off between GFGs is not very common. Has a positive effect on e.g. cheer (+0.45 SSIM at 600kbps) and football (+0.25 SSIM at 600kbps), particularly at high bitrates. Has a negative effect (-0.04 SSIM at 300kbps) at pamphlet, which appears only marginally related to this patch, and crew (-0.1 SSIM at 700kbps). Change-Id: I2e32899638b59f857e26efeac18a82e0c0b77089	2011-06-16 13:01:27 -04:00
John Koleszar	eb645abeac	Merge "Disable specialcase for last frames if the sequence contains ARFs."	2011-06-16 09:56:05 -07:00
John Koleszar	5223016337	Merge "Remove redundant check for KEY_FRAME in multithreaded decoder"	2011-06-15 10:18:06 -07:00
John Koleszar	61599fb59f	Use SSE as BPRED distortion metric consistently The BPRED mode selection uses SSE as a distortion metric, but the early breakout threshold being used was a variance value. Change-Id: I42d4602fb9b548bf681a36445701fada5e73aff1	2011-06-15 10:53:37 -04:00
John Koleszar	1ade44b352	Merge "fix --disable-runtime-cpu-detect on x86"	2011-06-15 07:09:09 -07:00
Ronald S. Bultje	299193dd1c	Disable specialcase for last frames if the sequence contains ARFs. firstpass.c contains some rate adjustment code that assures that the last few frames in a sequence abide by rate limits. If the second-to- last group of frames contains an alt-ref frame (ARF), the last golden frame (GF) is zero bytes, and we will thus spend a ridiculously high number of bits on regular P-frames trying to hit the target rate. This does slightly enhance the quality of these last few frames, but has no perceptual value (other than hitting the target rate). Disabling this code means we consistently (slightly) undershoot the target rate and consequently do worse on the last few frames of a clip, which is particularly noticeable for small clips. The quality- per-bitrate is generally better, ~0.2% better overall on derf-set, especially on clips such as garden, tennis, foreman at low bitrates. Has a negative effect on hallmonitor at high bitrates. Change-Id: I1d63452fef5fee4a0ad2fb2e9af4c9f2e0d86d23	2011-06-15 09:47:00 -04:00
Attila Nagy	c7e6aabbca	Remove redundant check for KEY_FRAME in multithreaded decoder For Intra blocks is enough to check ref_frame == INTRA_FRAME. Change-Id: I3e2d3064c7642658a9e14011a4627de58878e366	2011-06-15 09:01:27 +03:00
Scott LaVarnway	7be5b6dae4	Merge "Populate bmi for B_PRED only"	2011-06-14 12:04:50 -07:00
Johann	92b0e544f3	fix --disable-runtime-cpu-detect on x86 Change-Id: Ib8e429152c9a8b6032be22b5faac802aa8224caa	2011-06-14 11:31:50 -04:00
Tero Rintaluoma	9909047461	Fix RT only build Moved encode_intra function from firstpass.c to encodeintra.c to prevent linking problem in real-time only build. Also changed name of the function to vp8_encode_intra because it is not a static. Change-Id: Ibf3c6c1de3152567347e5fbef47d1d39564620a5	2011-06-14 13:39:06 +03:00
James Zern	532c30c83e	fix corrupt frame leak If setup_token_decoder reported an internal error the memory allocated there would not be freed in the resulting call to _remove_decompressor. Change-Id: Ib459de222d76b1910d6f449cdcd01663447dbdf6	2011-06-13 17:32:19 -07:00
Scott LaVarnway	223d1b54cf	Populate bmi for B_PRED only Small decode performance gain (~1%) on keyframes. No noticeable gains on encode. Also changed pick_intra4x4mby_modes() to read the above and left block modes for keyframes only. Change-Id: I1f4885252f5b3e9caf04d4e01e643960f910aba5	2011-06-13 17:14:11 -04:00
Scott LaVarnway	e71a010646	Calc ref_frame_cost once per frame instead of every macro block. Change-Id: I2604e94c6b89e3a8457777e21c8c38406d55b165	2011-06-13 09:58:03 -04:00
John Koleszar	f3ba4c6b82	Merge "bug fix mode_info_context not initialized for error-resilient"	2011-06-09 13:39:47 -07:00
Yaowu Xu	361717d2be	remove one set of 16x16 variance funcations call to this set of functions are replaced by var16x16. Change-Id: I5ff1effc6c1358ea06cda1517b88ec28ef551b0d	2011-06-09 11:23:05 -07:00
James Berry	45feea4cf0	bug fix mode_info_context not initialized for error-resilient uninitialized xd->mode_info_context would crash vpxenc for --error-resilient=1. Change-Id: I31849e40281e3d65ab63257cfec5e93398997f0b	2011-06-09 12:46:31 -04:00
John Koleszar	af49c11250	Update keyframe activity in non-RD mode Activity update is no longer dependent on being in RD mode, so update it unconditionally. Change-Id: Ib617a6fc210dfc045455e3e4467d7ee5e3d1fa0e	2011-06-09 12:05:31 -04:00
Johann	79327be6c7	use GCC inline magic Better fix for #326. ICC happens to support the inline magic Change-Id: Ic367eea608c88d89475cb7b05d73500d2a1bc42b	2011-06-08 16:19:37 -04:00
John Koleszar	8767ac3bc7	Merge "vp8_pick_inter_mode: remove best_bmodes"	2011-06-08 10:59:30 -07:00
John Koleszar	9e4df2bcf5	Merge "vp8_pick_intra_mode: correct returned rate"	2011-06-08 10:58:36 -07:00
John Koleszar	254a7483e5	Merge "Move RD intra block mode selection to rdopt.c"	2011-06-08 10:51:50 -07:00
John Koleszar	001bd51ceb	vp8_pick_inter_mode: remove best_bmodes Since BPRED will be tested at most once, and SPLITMV is not enabled, there's nothing to clobber the subblock modes, so there's no need to save and restore them. Change-Id: I7c3615b69190c10bd068a44df5488d6e8b85a364	2011-06-08 13:50:50 -04:00
Scott LaVarnway	dce64343d6	Merge "Removed unused function parameters"	2011-06-08 10:20:28 -07:00
John Koleszar	91907e0bf4	vp8_pick_intra_mode: correct returned rate The returned rate was always the 4x4 rate, instead of the rate matching the selected mode. Change-Id: I51da31f80884f5e37f3bcc77d1047d31e612ded4	2011-06-08 13:19:12 -04:00
Scott LaVarnway	69d8d386ed	Removed unused function parameters Change-Id: Ib641c624faec28ad9eb99e2b5de51ae74bbcb2a2	2011-06-08 13:01:09 -04:00
Yaowu Xu	1fba1e38ea	Adjust errorperbit according to RDMULT in activity masking In activity masking, RDO constant RDMULT is adjusted on a per MB basis adaptive to activity with the MB. errorperbit, which is defined as RDMULT/RDDIV, is a constant used in motion estimation. Previously, in activity masking, errorperbit is not changed even when RDMULT is changed. This commit changed to adjust errorperbit according to the change in RDMULT. Test in cif set showed a very small but consistent gain by all quality metrics (average, overall psnr and ssim) when activity masking is on. Change-Id: I07ded3e852919ab76757691939fe435328273823	2011-06-08 09:45:47 -07:00
Yaowu Xu	5fafa2d524	Merge "Further activity masking changes:"	2011-06-08 09:30:31 -07:00
John Koleszar	96a42aaa2d	Move RD intra block mode selection to rdopt.c This change is analogous to I0b67dae1f8a74902378da7bdf565e39ab832dda7, which made the move for the non-RD path. Change-Id: If63fc1b0cd1eb7f932e710f83ff24d91454f8ed1	2011-06-08 12:05:05 -04:00
John Koleszar	e90d17d240	Move intra block mode selection to pickinter.c This commit moves the intra block mode selection from encodeframe.c to pickinter.c (in the non-RD case). This allowed pick_intra_mbuv_mode and pick_intra4x4mby_modes to be made static, and is a step towards refactoring intra mode selection in the main pickinter loop. Gave a small perf increase (~0.5%). Change-Id: I0b67dae1f8a74902378da7bdf565e39ab832dda7	2011-06-08 11:44:57 -04:00
Paul Wilkins	4e81a68af7	Further activity masking changes: Some further re-structuring of activity masking code. Still has various experimental switches. Supports a metric based on intra encode. Experimental comparison against a fixed activity target rather than a frame average, for altering rd and zbin. Overall the SSIM performance is similar to TT's original code but there is a much smaller PSNR hit of circa 0.5% instead of 3.2% Change-Id: I0fd53b2dfb60620b3f74d7415e0b81c1ac58c39a	2011-06-08 16:03:37 +01:00
Yaowu Xu	7368dd4f8f	Merge "remove redundant functions"	2011-06-07 16:36:37 -07:00
Yaowu Xu	59129afc05	Merge "adjust sad per bit constants"	2011-06-07 12:37:04 -07:00
Yaowu Xu	221e00eaa9	adjust sad per bit constants While investigating the effect of DC values on SAD and SSE in motion estimation, a side finding indicates the two table of constants need be adjusted. The adjustment was done by multiplying old constants by 90% with rounding. Also absorb the 1/2 scaling constant into the two tables. Refer to change Ifa285c3e for background of the 1/2 factor. Cif set test showed a very small gain on all metric. Change-Id: I04333527a823371175dd46cb04a817e5b9a8b752	2011-06-07 12:35:03 -07:00
John Koleszar	5c166470a5	Merge "Reduce overshoot in 1 pass rate control"	2011-06-07 12:30:37 -07:00
Scott LaVarnway	346358a5b7	Merge "Wrapped asserts in critical code with CONFIG_DEBUG"	2011-06-07 06:53:51 -07:00
Scott LaVarnway	afb84bb1cc	Merge "Removed unused function vp8_treed_read_num"	2011-06-07 06:51:24 -07:00
Scott LaVarnway	0e3bcc6f32	Wrapped asserts in critical code with CONFIG_DEBUG Change-Id: I5b0aaca06f2e0f40588cb24fb0642b6865da8970	2011-06-07 09:34:47 -04:00
Scott LaVarnway	1374a4db3b	Removed unused function vp8_treed_read_num Change-Id: Id66e70540ee7345876f099139887c1843093907f	2011-06-07 09:32:51 -04:00
John Koleszar	6c8205d37e	Merge remote branch 'origin/master' into experimental Change-Id: I67cc3b490266f958a1b3a935ec08ee19d7b4f6a0	2011-06-07 00:05:07 -04:00
John Koleszar	d13cfba344	Merge remote branch 'internal/upstream' into HEAD	2011-06-07 00:05:04 -04:00
Yaowu Xu	d4700731ca	remove redundant functions The encoder defined about 4 set of similar functions to calculate sum, variance or sse or a combination of them. This commit removed one set of these functions, get8x8var and get16x16var, where calls to the later function are replaced with var16x16 by using the fact on a 16x16 MB: variance == sse - sum*sum/256 Change-Id: I803eabd1fb3ab177780a40338cbd596dffaed267	2011-06-06 16:44:05 -07:00
Yunqing Wang	03973017a7	Remove hex search's variance calculation while in real-time mode In real-time mode motion search, there is no need to calculate variance. This change improved encoding speed by 1% ~ 2%(speed=-5). Change-Id: I65b874901eb599ac38fe8cf9cad898c14138d431	2011-06-06 19:11:05 -04:00
Johann	04edde2b11	Merge "neon fast quantize block pair"	2011-06-06 13:42:58 -07:00
Johann	da8eb716e8	Merge "adds preload for armv6 encoder asm"	2011-06-06 13:32:13 -07:00
John Koleszar	84f5b14b0e	Merge remote branch 'internal/upstream' into HEAD	2011-06-06 15:51:23 -04:00
John Koleszar	be15a09980	Merge remote branch 'internal/upstream-experimental' into HEAD Conflicts: vp8/encoder/encodeframe.c Change-Id: Ibb5a3894ede08ed401ec6e974a8902d7393c9978	2011-06-06 15:50:48 -04:00
Scott LaVarnway	d1c0ba8f7a	Merge "Removed unnecessary bmi motion vector stores."	2011-06-06 07:57:39 -07:00
John Koleszar	824e9410c6	Merge "Don't allow very short GF groups even when the GF is predicted from an ARF."	2011-06-06 07:02:29 -07:00
John Koleszar	2c308f36fc	Merge remote branch 'origin/master' into experimental Change-Id: I81ac427cbaf3d0865df4acef3e0bfc2e95556c4b	2011-06-04 00:05:13 -04:00
John Koleszar	212f618373	Reduce overshoot in 1 pass rate control This patch attempts to reduce the peak bitrate hit by the encoder when using small buffer windows. Tested on the CIF set over 200-500kbps using these settings: --buf-sz=500 --buf-initial-sz=250 --buf-optimal-sz=250 \ --undershoot-pct=100 Two pass encodes were tested at best quality. One pass encodes were tested only at realtime speed 4: --rt --cpu-used=-4 The peak datarate (over the specified 500ms window) was measured for each encode, and averaged together to get metric for "average peak," computed as SUM(peak)/SUM(target). This patch reduces the average peak datarate as follows: One pass: baseline: 1.29715 this patch: 1.23664 Two pass: baseline: 1.32702 this patch: 1.37824 This change had a positive effect on our quality metrics as well: One pass CBR: Min / Mean / Max (pct) Average PSNR -0.42 / 2.86 / 27.32 Overall PSNR -0.90 / 2.00 / 17.27 SSIM -0.05 / 3.95 / 37.46 Two pass CBR: Min / Mean / Max (pct) Average PSNR -4.47 / 4.35 / 35.99 Overall PSNR -3.40 / 4.18 / 36.46 SSIM -4.56 / 6.98 / 53.67 One pass VBR: Min / Mean / Max (pct) Average PSNR -5.21 / 0.01 / 3.30 Overall PSNR -8.10 / -0.38 / 1.21 SSIM -7.38 / -0.11 / 3.17 (note: most values here were close to the mean, there were a few outliers on files that were very sensitive to golden frame size) Two pass VBR: Min / Mean / Max (pct) Average PSNR 0.00 / 0.00 / 0.00 Overall PSNR 0.00 / 0.00 / 0.00 SSIM 0.00 / 0.00 / 0.00 Neither one pass or two pass CBR mode adheres particularly strictly to the short term buffer constraints, and two pass is less consistent, even in the baseline commit. This should be addressed in a later commit. This likely will hurt the quality numbers, as it will have to reduce the burstiness of golden frames. Aside: My work on this commit makes it clear that we need to make rate control modes "pluggable", where you can easily write a new one or work on one in isolation. Change-Id: I1ea9a48f2beedd59891f1288aabf7064956b4716	2011-06-03 16:38:11 -04:00
Scott LaVarnway	f1d6cc79e4	Removed unnecessary bmi motion vector stores. left_block_mv and above_block_mv will return the MB motion vector for non SPLITMV macro blocks. Change-Id: I58dbd7833b4fdcd44b6b72e98ec732c93c2ce4f4	2011-06-03 13:09:46 -04:00
Scott LaVarnway	8c5b73de2a	Merge "Removed B_MODE_INFO"	2011-06-03 08:32:30 -07:00
Yunqing Wang	e5c236c210	Adjust bounds checking for hex search in real-time mode Currently, hex search couldn't guarantee the motion vector(MV) found is within the limit of maximum MV. Therefore, very large motion vectors resulted from big motion in the video could cause encoding artifacts. This change adjusted hex search bounds checking to make sure the resulted motion vector won't go out of the range. James Berry, thank you for finding the bug. Change-Id: If2c55edd9019e72444ad9b4b8688969eef610c55	2011-06-03 08:53:42 -04:00
John Koleszar	480f025754	Merge remote branch 'origin/master' into experimental Change-Id: I7395011ef6c1783110ebd06305ca3d908d2457bb	2011-06-03 00:05:21 -04:00
John Koleszar	90e84704ae	Merge remote branch 'internal/upstream' into HEAD	2011-06-03 00:05:17 -04:00
Scott LaVarnway	773768ae27	Removed B_MODE_INFO Declared the bmi in BLOCKD as a union instead of B_MODE_INFO. Then removed B_MODE_INFO completely. Change-Id: Ieb7469899e265892c66f7aeac87b7f2bf38e7a67	2011-06-02 13:46:41 -04:00
Ronald S. Bultje	9f002bee53	Don't allow very short GF groups even when the GF is predicted from an ARF. This is basically a slightly modified version of the previous patch, and it has a moderately positive effect (SSIM/PSNR both +0.08% avg on derf-set). Most clips show no change, except waterfall/coastguard, each ~ +0.8% SSIM/PSNR. You can see similar effects in other clips by shortening their length to terminate at a very short last group of frames. Change-Id: I7a70de99ca1f9fe6a8b6ca7a6e30e8a4b64383e4	2011-06-02 09:14:51 -07:00
Yaowu Xu	4ce6928d5b	Merge "further clean up of errorperbit and sadperbit"	2011-06-02 08:58:03 -07:00
John Koleszar	32817d6fbe	Merge remote branch 'origin/master' into experimental Change-Id: I993dbef81ca3d1638e16c4134aa8dc177e57875c	2011-06-02 00:05:13 -04:00
John Koleszar	319404f4f1	Merge remote branch 'internal/upstream' into HEAD	2011-06-02 00:05:07 -04:00
Yaowu Xu	5b2fb32961	further clean up of errorperbit and sadperbit this commit makes the usage errorperbit and sadperbit consistent for encoding modes and passes. Removed all different magic weight factors associated with errorperbit. Now 1/2 is used for both sadperbit16 and sadperbit4, the /2 operation is merged into initializations of the 2 variables. Tests on cif set show .23%, 0.18% and 0.19% gain by avg psnr, overall psnr and ssim respectively. Change-Id: Ifa285c3e065ce0a5a77addfc9f95aabf54ee270d	2011-06-01 14:44:06 -07:00
John Koleszar	4101b5c5ed	Merge "Bugfix in vp8dx_set_reference"	2011-06-01 13:57:23 -07:00
Henrik Lundin	69ba6bd142	Bugfix in vp8dx_set_reference The fb_idx_ref_cnt book-keeping was in error. Added an assert to prevent future errors in the reference count vector. Also fixed a pointer syntax error. Change-Id: I563081090c78702d82199e407df4ecc93da6f349	2011-06-01 21:41:12 +02:00
John Koleszar	5610970fe9	Merge "Fix code under #if CONFIG_INTERNAL_STATS."	2011-06-01 11:14:17 -07:00
Ronald S. Bultje	34ba18760f	Fix code under #if CONFIG_INTERNAL_STATS. Change-Id: Iccbd78d91c3071b16fb3b2911523a22092652ecd	2011-06-01 11:10:13 -07:00
Yaowu Xu	50916c6a7d	remove some magic weights associated with sad_per_bit sad_per_bit has been used for a number of motion vector search routines with different magic weights: 1, 1/2 and 1/4. This commit remove these magic numbers and use 1/2 for all motion search routines, also reformat a number of source code lines to within 80 column limit. Test on cif set shows overall effect is neutral on all metrics. <=0.01% Change-Id: I8a382821fa4cffc9c0acf8e8431435a03df74885	2011-06-01 10:10:44 -07:00
Tero Rintaluoma	61f0c090df	neon fast quantize block pair vp8_fast_quantize_b_pair_neon function added to quantize two adjacent blocks at the same time to improve performance. - Additional 3-6% speedup compared to neon optimized fast quantizer (Tanya VGA@30fps, 1Mbps stream, cpu-used=-5..-16) Change-Id: I3fcbf141e5d05e9118c38ca37310458afbabaa4e	2011-06-01 10:48:05 +03:00
John Koleszar	2289ba4b9c	Merge remote branch 'origin/master' into experimental Change-Id: I1e7ce466bc01e380eb392b964ba677f0bb8cd13b	2011-06-01 00:05:13 -04:00
John Koleszar	f07dec70cd	Merge remote branch 'internal/upstream' into HEAD	2011-06-01 00:05:11 -04:00
Scott LaVarnway	9e4f76c154	Merge "vp8_pick_inter_mode code cleanup"	2011-05-31 12:31:46 -07:00
Scott LaVarnway	1a5a1903ea	vp8_pick_inter_mode code cleanup Small code cleanups before attempting to reduce the size of bmi found in BLOCKD. Change-Id: Ie9c14adb53afd847716a75bcce067d0e6c04f225	2011-05-31 14:24:42 -04:00
John Koleszar	0a72f568ec	Initialize first_time_stamp_ever Misplaced #endif caused first_time_stamp_ever to only be initialized if CONFIG_INTERNAL_STATS was set. Change-Id: I2296a4ab00f7dfb767583edcc5d59b94f48c0621	2011-05-31 12:37:45 -04:00
Tero Rintaluoma	5305e79eae	adds preload for armv6 encoder asm Added preload instructions to armv6 encoder optimizations. About 5% average speed-up on Tegra2 for VGA@30fps sequence. Change-Id: I41d74737720fb71ce7a316f07555357822f3347e	2011-05-30 11:10:03 +03:00
John Koleszar	d63ce5db34	Merge remote branch 'origin/master' into experimental Change-Id: Ie2a4927754a9c220b30a84fc7e1372e565fe9eec	2011-05-28 00:05:12 -04:00
John Koleszar	4070c93bfa	Merge remote branch 'internal/upstream' into HEAD	2011-05-28 00:05:09 -04:00
John Koleszar	4a4ade6dc8	Merge "bug fix check frame buffer index before copy"	2011-05-27 12:35:06 -07:00
James Berry	8795b52512	bug fix check frame buffer index before copy in onyx_if.c update_reference_frames() make sure that frame buffer indexes are not equal before preforming a buffer copy. If two frames share the same buffer the flags will already be set correctly. Change-Id: Ida9b5516d08e3435c90f131d2dc19d842cfb536e	2011-05-27 14:59:29 -04:00
Yunqing Wang	4fb5ce6a92	Merge "Use hex search for realtime mode speed>4"	2011-05-27 11:12:50 -07:00
Yunqing Wang	4d052bdd91	Use hex search for realtime mode speed>4 Test showed using hex search in realtime mode largely speed up encoding process, and still achieves similar quality like the diamond search we have. Therefore, removed the diamond search option. Change-Id: I975767d0ec0539f9f6ed7fdfc09506e39761b66c	2011-05-27 14:05:02 -04:00

... 3 4 5 6 7 ...

1506 Commits