generic-library/vpx

Author	SHA1	Message	Date
John Koleszar	d708e7fbbe	Merge "Fix another multithreaded encoder loopfilter race condition"	2012-05-24 09:40:48 -07:00
Jim Bankoski	57faddb7c5	fix denoiser for temporal patterns and rd This extends the denoiser to work for temporally scalable coding. I believe this also fixes a very rare but really bad bug in the original implementation. Change-Id: I8b3593a8c54b86eb76f785af1970935f7d56262a	2012-05-24 07:44:03 -07:00
John Koleszar	5e39a8c16f	Merge "Make libvpx Chromium build friendly"	2012-05-23 20:56:49 -07:00
Alpha Lam	0f7e4665ae	Make libvpx Chromium build friendly Add PRIVATE macro for adding private_extern directive for yasm to hide global symbols. This is only enabled if -DCHROMIUM is used with YASM. Also fixed a small problem with rtcd_defs.sh to guard TEMPORAL_DENOISING. Change-Id: I9027fce3ebddcf20078293e4b86b396f21da7857	2012-05-23 18:15:05 -07:00
Attila Nagy	4890853010	Fix another multithreaded encoder loopfilter race condition After a key frame encoding, the frame type could change while filtering is still going on. Pass the frame type as parameter to the loopfilter function and don't read it from common storage. vp8cx_set_alt_lf_level has to be done before packing the stream. Currently alt_lf_level is not used so there hasn't been any visible problem here. Change-Id: Ia114162158cd833c2b16e3b89303cc9c91f19165	2012-05-23 13:50:59 -07:00
John Koleszar	410ae576e7	Prevent external frame size changes in two-pass The two-pass code does not support the case where the application changes the frame size dynamically. Add this case to the validation checks in the vpx_codec_enc_config_set() path. Change-Id: Idadc42c7c3bd566ecdbce30d8dd720add097f992	2012-05-23 13:49:05 -07:00
John Koleszar	a419f0f232	Merge changes I38e93fe2,I6d6a0fb6,I51155833,If4c3e5d4,Ia2f40ef2 * changes: Add initial keyframe tests Move all tests to test/ directory Enable unit tests by default Build unit tests monolithically configure: initial support for CXX, CXXFLAGS variables	2012-05-23 13:47:48 -07:00
John Koleszar	cf0970157d	Fix memory leak in vpx_codec_enc_config_set() Resolution changes in calls to vpx_codec_enc_config_set() would cause a memory leak due to failing to release the lookahead and alt ref buffers. Change-Id: I48392ea25e71fe2760d60cfde3fb3874598cc85f	2012-05-23 12:59:40 -07:00
Yunqing Wang	ad479a9b3d	multi-res: modify memory allocation code Reverted part of change in memory alllocation code, which ensures that the function returns 0 and encoder works correctly when CONFIG_MULTI_RES_ENCODING isn't turned on. Change-Id: Id5d5e7f2c8bd9e961a6dca79d257e8185f0d592a	2012-05-23 13:40:24 -04:00
Yaowu Xu	e9818bb697	changed the way that default probs for 8x8 is set. The commit changed how baseline 8x8 coefficient probabilities are initialized, to be consistent with the initialization of baseline 4x4 coefficient probabilities. The commit does not have any effect on compression. Change-Id: Ifb3902b5dc0b0c2e6dc3aa5d4a6589d528e58355	2012-05-23 10:09:41 -07:00
Attila Nagy	ea392d4714	Fix another multithreaded encoder loopfilter race condition After a key frame encoding, the frame type could change while filtering is still going on. Pass the frame type as parameter to the loopfilter function and don't read it from common storage. vp8cx_set_alt_lf_level has to be done before packing the stream. Currently alt_lf_level is not used so there hasn't been any visible problem here. Change-Id: Ia114162158cd833c2b16e3b89303cc9c91f19165	2012-05-23 15:19:48 +03:00
John Koleszar	2d225689d3	Move all tests to test/ directory Consolodate the unit tests under vp8/ to the test/ directory Change-Id: I6d6a0fb60f5e3874a4d6710e9e121dd3e81a93db	2012-05-22 15:00:10 -07:00
John Koleszar	e82d261d10	Build unit tests monolithically Rework unit tests to have a single executable rather than many, which should avoid pollution of the visual studio project namespace, improve build times, and make it easier to use the gtest test sharding system when we get these going on the continuous build cluster. Change-Id: If4c3e5d4b3515522869de6c89455c2a64697cca6	2012-05-22 14:37:30 -07:00
Yunqing Wang	eb5b965b44	Merge "multi-res: force Key frame sychronization"	2012-05-22 08:00:12 -07:00
Christian Duvivier	38ddb426d0	Inline Intrinsic optimized Denoiser Faster version of denoiser, cut cost by 1.7x for C path, by 3.3x for SSE2 path. Change-Id: I154786308550763bc0e3497e5fa5bfd1ce651beb	2012-05-21 07:54:20 -07:00
Paul Wilkins	6c8fb82c14	Merge "Further firstpass.c changes." into experimental	2012-05-18 15:43:58 +00:00
Paul Wilkins	f63894f691	Experimental change to two pass prediction decay calculation. Remove dependency on amount and speed of motion as this may not behave well across different image sizes. Tweak impact of % inter. Add in experimental adjustment based on relative quality of an older second reference frame. Cap range of decay values allowed. Some small + effect on derf but -ve on yt & hd at this stage. Change-Id: I390d6f6ebe67a2eb0b834980d0d4650124980d3e	2012-05-17 09:40:34 +01:00
Paul Wilkins	a29a4e2414	Merge "Move / re-factor some of boost calculation code." into experimental	2012-05-17 08:37:30 +00:00
Yunqing Wang	65dd157c3c	multi-res: force Key frame sychronization In multi-resolution encoding, frame_type decision for each frame is made by the lowest-resolution encoder. For all other higher- resolution encoders, kf_mode is always set to VPX_KF_DISABLED, and they are forced to use the same frame_type picked by the lowest-resolution encoder. Change-Id: Ic4d52ec65bbc012ca9c2d236210e28a295591eaf	2012-05-16 15:06:42 -04:00
Ronald S. Bultje	0b8a95a0b2	Rewrite reference frame costing in the RD loop. I now see I didn't write a very long description, so let's do it here then. We took a pretty big quality hit (0.1-0.2%) from my recent fix of the inversion of arguments to vp8_cost_bit() in the RD reference frame costing. I looked into it and basically the costing prevented us from switching reference frames. This is of course silly, since each frame codes its own prob_intra_coded, so using last frame cost indications as a limiting factor can never be right. Here, I've rewritten that code to estimate costings based partially on statistics from progress on current frame encoding. Overall, this gives us a ~0.2%-0.3% improvement over what we had previously before my argument-inversion-fix, and thus about ~0.4% over current git (on derf-set), and a little more (0.5-1.0%) on HD/STD-HD/YT. Change-Id: I79ebd4ccec4d6edbf0e152d9590d103ba2747775	2012-05-15 15:32:44 -07:00
Paul Wilkins	acf3c729d8	Further firstpass.c changes. base the static image test off a measure of 0,0 motion instead of the decay accumulator value. Change "transition to still detection" to compare the decay rate from successive frames. Minor tweak to the arf extra boost given based on the number of frames affected. Removed unused variable mod_err_per_mb_accumulator. Change-Id: Idd8360083ad409e45f133ce97dd2488259003e64	2012-05-15 18:01:44 +01:00
Deb Mukherjee	c5ddb7f016	Adds new Directional Intra prediction modes. Adds 6 directional intra predictiom modes for 16x16 and 8x8 blocks. Change-Id: I25eccc0836f28d8d74922e4e9231568a648b47d1	2012-05-15 08:54:50 -07:00
Paul Wilkins	520a7a2598	Merge "Firstpass.c refactoring" into experimental	2012-05-15 15:04:48 +00:00
Paul Wilkins	545bc721f3	Merge "Two pass refactoring continued." into experimental	2012-05-15 15:04:14 +00:00
Paul Wilkins	a1680ad324	Merge "Two pass rc refactoring." into experimental	2012-05-15 14:44:34 +00:00
Yaowu Xu	b22cc559b6	Changed to use integer 8x8 dct The commit added an integer version of 8x8 forward DCT, based on the orginal forward DCT from VP6. The constants, roundings, and shifts were adjusted to improve the accuracy. The latest patch has a very similar accuracy in term of round trip error against the floating point version. It should be noted here that the purpose of the patch is to help encoding speed and facilitate all other experiments. There will be futher review in combination with inverse DCT before finalization. configure with "--enable--int_8x8fdct" to use the integer version Change-Id: I5a4f80507429f0e07cf02a13768ec81cbfddc5bc	2012-05-15 07:28:26 -07:00
Paul Wilkins	ae989ae864	Move / re-factor some of boost calculation code. Some marginal impact due to the fact that it makes use of arf more likely / stable even in hard sections. Change-Id: Ic72fda0f63eefc9433914b5d9cd374d515810129	2012-05-15 15:28:02 +01:00
Paul Wilkins	0529320a9e	Firstpass.c refactoring Removed unused function. Added tentative code to take error score of an older frame into account when calculating Q range. However, for now it is disabled pending merging other changes and testing. Change-Id: Ie89955e70319dac31b79e3b833e3352712a061ec	2012-05-15 14:58:13 +01:00
Paul Wilkins	3536ad5bb9	Merge "First pass overhaul preparatory change." into experimental	2012-05-15 10:05:16 +00:00
Yaowu Xu	023304e4fe	Merge "Reversible WHT pair" into experimental	2012-05-14 19:28:21 +00:00
Ronald S. Bultje	5cf8a3272b	Merge "Don't use compound prediction for golden frames based on alt-ref frames." into experimental	2012-05-14 19:25:09 +00:00
Paul Wilkins	e237fd7c5a	Two pass refactoring continued. Remove testing of whether we estimate that it will be possible to code an arf at a lower Q than the ambient Q. This adds quite a bit of extra code and complexity for marginal gain. Factored out some code relating to ARNR selection to a separate function as this is likely to be changed / simplified soon. Change-Id: Ia1cf060405637ef5bbf7018355437be21d12375f	2012-05-14 18:54:05 +01:00
Paul Wilkins	59a5c7d550	Two pass rc refactoring. Removed odd *100 >> 4 factor from boost calculations. Not all the calculations exactly match what was there before so there may be some minor impact on results. Some other minor tidying up in regard to coding conventions. The specific values of factors and thresholds will likely change as part of subsequent patches. Change-Id: Id976321484ac02ba50294cf54fafbc17dda85686	2012-05-14 18:53:14 +01:00
Ronald S. Bultje	959b296a40	Don't use compound prediction for golden frames based on alt-ref frames. These frames can force reference frame (arf), mode (zeromv) and skip, which means that if we use compound prediction (i.e. arf+last), we might use a blend of a perfect (arf) and an imperfect (last) predictor, leading to semi-garbage display and thus a huge drop in SSIM/PSNR (up to 10dB for some frames I analyzed). Gives a +0.2% gain on YT. Change-Id: If1f2b7899ad165684af3808fd379295e82558cbb	2012-05-11 17:48:20 -07:00
Scott Graham	92963df086	fix warnings for building on win32 Change-Id: If6e11ba3d681e831d7d98662c0abdd2ac16b3811	2012-05-11 12:36:50 -07:00
John Koleszar	44d35f7b25	Merge branch 'origin/eider' into master Conflicts: vp8/common/entropymode.c vp8/common/entropymode.h vp8/encoder/encodeframe.c vp8/vp8_cx_iface.c Change-Id: I708b0f30449b9502b382e47b745d56f5ed2ce265	2012-05-11 10:51:05 -07:00
Paul Wilkins	35358320e3	First pass overhaul preparatory change. This is the first patch in a series of changes to the first pass code. (Broken down for ease of testing/merging/review). This patch introduces a new stats element "sr_coded_error". This is the coded error recorded vs the second reference frame (which is updated such that it lags by at least one frame). No use is made of the new structure in this change so this patch should have no material effect. Removed some ifdefs and deprecated code (#if NEW_BOOST). Removed twopass.gf_decay_rate (not used any more) Change-Id: I1be672a73017f7c13fd50fb4f99236aa2ed30916	2012-05-11 18:07:33 +01:00
Yaowu Xu	7968d29fed	Reversible WHT pair This commit changed the forward and the inverse 4x4 Walsh Hadamard transform to a new pair, where the inverse transform can pefectly reconstuct the input to forward transform. It also does so without changing the input and output value range. Even more, it does not change the complexity of the transforms. While it was not expected to improve the results of our current test, it does improve std-hd set by 0.2% on all metrics. No change on derf. Change-Id: Ie4f23ddd3a0f3c5fbe97fb58399f860031f99337	2012-05-10 16:32:47 -07:00
Jim Bankoski	c7ca380832	Merge "vp8e - boolcoder unit test"	2012-05-09 15:41:23 -07:00
Deb Mukherjee	5f320d010f	Improved index remapping for prob updates. Also includes some clean ups and refactoring. Rebased. Change-Id: I268c97fe325b4881103fe19f41ae818569e7ccf7	2012-05-09 11:21:43 -07:00
John Koleszar	c8f4c187b3	Use consistent range for VP8E_SET_NOISE_SENSITIVITY Accept the same range of inputs for the VP8E_SET_NOISE_SENSITIVITY control, regardless of whether temporal denoising is enabled or not. This is important for maintaining compatibility with existing applications. Change-Id: I94cd4bb09bf7c803516701a394cf1a63bfec0097	2012-05-08 15:01:24 -07:00
Yaowu Xu	54cf1d9ad3	a number of fixes to entropy stats collection 1. block types There are only three types of blocks for 8x8 transformed MBs, i.e. Y block with DC does not exist for 8x8 transformed MBs as all MB using 8x8 transform have 2nd order haar transform. This commit introduced a new macro BLOCK_TYPES_8X8 to reflect such fact. 2. context counters This commit also fixed the mixed of context_counters between 4x4 and 8x8 transformed MBs. The mixed use of the counters leads me to think the existing the context probabilities were not properly generated from 8x8 transformed MBs. 3. redundant collecting in recoding The commit also corrected the code that accumulates entropy stats by making sure stats only collected for final packing, not during the recode loop Change-Id: I029f09f8f60bd0c3240cc392ff5c6d05435e322c	2012-05-08 14:13:22 -07:00
Jim Bankoski	9851486b2f	vp8e - boolcoder unit test Adds a unit test to the boolcoder that tests encoding and decoding thousands of different bits, with different probabilities in different patterns. Code borrowed from the webp project - and its committers. Change-Id: Icabbb884d57e666496490c961dd29b246144ab3e	2012-05-08 07:29:19 -07:00
John Koleszar	14d827f44e	fix vp8_ namespace issues Make functions only referenced from one translation unit static. Other symbols with extern linkage get a vp8/vpx prefix. Change-Id: I928c7e0d0d36e89ac78cb54ff8bb28748727834f	2012-05-04 12:24:04 -07:00
John Koleszar	22f56b93e5	Formalize encodeframe.c forward delclarations Change If4321cc5 fixed a bug caused by forward declarations not being kept in sync across C files, resulting in a function call with the wrong arguments. The commit moves the affected function declarations into a header file, along with the other symbols from encodeframe.c that were being sloppily shared. Change-Id: I76a7b4c66d4fe175f9cbef7e52148655e4bb9ba1	2012-05-04 10:44:47 -07:00
Attila Nagy	3e32105d63	Fix multi-resolution threaded encoding mb_row and mb_col was not passed to vp8cx_encode_inter_macroblock in threaded encoding. Change-Id: If4321cc59bf91e991aa31e772f882ed5f2bbb201	2012-05-04 10:44:46 -07:00
John Koleszar	2bf8fb5889	remove deprecated pre-v0.9.0 API Remove a bunch of compatibility code dating back to before the initial libvpx release. Change-Id: Ie50b81e7d665955bec3d692cd6521c9583e85ca3	2012-05-04 10:44:46 -07:00
Attila Nagy	f039a85fd8	Make global data const Removes all runtime initialization of global data. This commit is a squashed version of the following series cherry-picked from master. This is necessary because of a change that was merged to the tester that depends on the scaler being moved to the RTCD framework, which is a worthwhile thing to include in Eider anyway. - `a91b42f02` Makes all global data in entropy.c const - `b35a0db0e` Makes all global data in tokenize.c const - `441cac8ea` Makes all mode token tables const - `5948a0210` Ports vpx_xcaler to new RTCD method - `317d4244c` Makes all mode token tables const part 2 Change-Id: Ifeaea24df2b731e7c509fa6c6ef6891a374afc26	2012-05-04 10:42:21 -07:00
Deb Mukherjee	813c6c3925	Expanding the coefficient encoding contexts This patch expands the set of prev contexts used for video coding from 3 to 4. There is a small improvement of the order of 0.08% for derf and 0.15% on the HD set. The tests were rerun after the various merges last week. There are two columns in each test - the first are the results with the mbskip change, and the second with expanded contexts added on top of that. Derf: http://www.corp.google.com/~debargha/vp8_results/explibvpx_newentropy_expcontext.html HD: http://www.corp.google.com/~debargha/vp8_results/explibvpx_hd_newentropy_expcontext.html Rebased. Broke up 80 char lines. Change-Id: I82d2e72d054e530cbf5ce9aa0e6d85c582965675	2012-05-04 07:11:38 -07:00
Attila Nagy	357800e7cd	Fix multi-resolution threaded encoding mb_row and mb_col was not passed to vp8cx_encode_inter_macroblock in threaded encoding. Change-Id: If4321cc59bf91e991aa31e772f882ed5f2bbb201	2012-05-04 13:32:43 +03:00
John Koleszar	9f9cc8fe71	Merge "Add VPX_TS_ prefix to MAX_LAYERS, MAX_PERIODICITY" into eider	2012-05-03 09:40:50 -07:00
John Koleszar	25a36d6b3c	multi-res: restore v1.0.0 API Move the notion of 0 bitrate implying skip deeper into the codec, rather than doing it at the multi-encoder API level. This preserves v1.0.0 ABI compatibility, rather than forcing a bump to v2.0.0 over a minor change. Also, this allows the case where the application can selectively enable and disable the larger resolution(s) without having to reinitialize the codec instace (for instance, if no target is receiving the full resolution stream). It's not clear how deep to push this check. It may be valuable to allow the framerate adaptation code to run, for example. Currently put the check as early as possible for simplicity, should reevaluate this as this feature gains real use. Change-Id: I371709b8c6b52185a1c71a166a131ecc244582f0	2012-05-02 17:40:08 -07:00
John Koleszar	d8216b19b6	Merge "Fix compiler warnings" into eider	2012-05-02 16:22:34 -07:00
John Koleszar	d46ddd0839	Add VPX_TS_ prefix to MAX_LAYERS, MAX_PERIODICITY Preserved the prior names for compatibility, will remove in the future. Change-Id: I8773f959ebce72f60168a2972f7a8ffe6642b9b2	2012-05-02 16:21:52 -07:00
Yaowu Xu	3b909a6f03	chagned the decoder band to match the encoder missed the decoder side in last commit Change-Id: Ie97f35189e93f78783e3d8072a36eea768beed27	2012-05-02 11:10:25 -07:00
Timothy B. Terriberry	e50c842755	Fix TEXTRELs in the ARM asm. Besides imposing a performance penalty at startup in most configurations, these relocations break the dynamic linker for native Fennec, since it does not support them at all. Change-Id: Id5dc768609354ebb4379966eb61a7313e6fd18de	2012-05-02 10:36:01 -07:00
Timothy B. Terriberry	22ae1403e9	Fix trailing commas in enums. These are warnings in most builds, but show up as compile errors on some platforms when these headers are included from C++ code. Change-Id: I6c523b4dbbc699075fe73830442b51922e5a61d5	2012-05-02 10:35:28 -07:00
Timothy B. Terriberry	28f5451572	Fix trailing commas in enums. These are warnings in most builds, but show up as compile errors on some platforms when these headers are included from C++ code. Change-Id: I6c523b4dbbc699075fe73830442b51922e5a61d5	2012-05-02 10:08:10 -07:00
Attila Nagy	14c9fce8e4	Fix compiler warnings Fix code for following warnings: -Wimplicit-function-declaration -Wuninitialized -Wunused-but-set-variable -Wunused-variable Change-Id: I2be434f22fdecb903198e8b0711255b4c1a2947a	2012-05-02 10:57:57 +03:00
Yaowu Xu	bd69b7d459	slight adjustment to coef band definition This commit adjusted slightly the 4x4 coefficents band definition to better classify coefficients with similar distributions and usages. It helps derf set about .1%, it is alos slightly positive for std-hd set, where 4x4 blocks are used less frequently. The commit also removed a const array not in use. Change-Id: I78d16905d4036641ec905b0c32c190c1def5b249	2012-05-01 19:54:19 -07:00
Ronald S. Bultje	0f68789c24	Fix inversion of probability and value in calls to vp8_cost_bit(). Change-Id: I9f1686249ac812f7b9b872eabe3970d1dfb25e56	2012-04-30 16:33:07 -07:00
Deb Mukherjee	c0d595134e	Turning off filter search for now to improve encode speed. Change-Id: I87291fb40c745f34c36b067f47abdf69774a812f	2012-04-30 12:24:22 -07:00
Deb Mukherjee	fc1a7bd81e	Minor cleanup in tokenize.h Removes a set of spurious declarations that were inadvertently checked in. Change-Id: I2f80b6b66d2ec9ea667c810eaf1a6e7d52478c67	2012-04-30 11:06:04 -07:00
Adrian Grange	87b6f21317	Merge "Removed MV costing from ARNR filtering" into experimental	2012-04-27 22:09:57 +00:00
Adrian Grange	393440db89	Removed MV costing from ARNR filtering The ARNR filter uses a motion compensated temporal filter, but the motion estimation implementation accounts for the cost of the mv in its decision making process. The ARNR filter uses a dummy cost table initialized to 0 as a way to ignore the mv costs (which are irrelevant to the filter). This CL modifies the ARNR filter implementation to so that the mv costing is ignored without the requirement for dummy tables. Change-Id: I0dd9620c3b70682f938b2a70912c11d4d7c9284c	2012-04-27 10:00:20 -07:00
Adrian Grange	3f252e30e4	Modified ARNR MC-filter to ignore ARF frame The ARNR filter uses MC to find the best match between the ARF and other nearby frames in the filter-set. Since the ARF is a member of the filter-set, MC in that case is unnecesssary. This patch modifies the filter so it does not apply MC in this case. Change-Id: Ic0321199c08db2189a57f28d1700b745bc7ff66d	2012-04-27 09:01:17 -07:00
Adrian Grange	f0605f4b7e	Removed MV costing from ARNR filtering The ARNR filter uses a motion compensated temporal filter, but the motion estimation implementation accounts for the cost of the mv in its decision making process. The ARNR filter uses a dummy cost table initialized to 0 as a way to ignore the mv costs (which are irrelevant to the filter). This CL modifies the ARNR filter implementation so that the mv costing is ignored without the requirement for dummy tables. Change-Id: I4196aa5c24da63f858ff54fbaa5fc85ae1f1957f	2012-04-27 08:48:13 -07:00
Attila Nagy	24e7b1b90d	Moves error concealment allocations from common parts to decoder The backup MODE_INFO buffer used in the error concealment was allocated in the codec common parts allocation even though this is a decoder only resource. Moved the allocation to the decoder side. No need to update_mode_info_border as mode_info buffers are zero allocated. This fixes also a potential memory leak as the EC overlaps buffer was not properly released before reallocation after a frame size change. Change-Id: I12803d3e012308d95669069980b1c95973fb775f	2012-04-26 11:49:15 +03:00
Deb Mukherjee	acdda50a0d	Adds search option for best interpolation filter. Adds a speed feature to conduct a brute force search among a set of available interpolation filters for the best one in an RD sense. There is a gain of 0.4% on derf, 1.0% on Std-HD. Patch 2: A macro added to determine if the encoder state is reset for each new filter tried. Patch 3: rebase, also fixes a bug (decodframe.c) introduced by a couple of missing function pointer assignements. Patch 4: rebase. Change-Id: Ic9ccca9d8c35c6af557449ae867391a2f996cc29	2012-04-25 22:37:50 -07:00
Yaowu Xu	a16608aba0	Merge QIMODE experiment This commit merge the QI mode experiment. As the experiment affects the encoding of intra coding modes on key frame only, the overall effect of the experiment on encoding tests is insignificant. Change-Id: I9e4e3933adface88867ad429cee3986e529c511d	2012-04-25 14:18:25 -07:00
Yaowu Xu	c1814d267a	Merge UVINTRA experiment The commit merges the UVINTRA experiment and removed the related macros. The overall effect of the experiment is a small gain (.1% on derf) Change-Id: Ia34b3312fb9b5b34c9ba111bf0fa78c6f78ac80b	2012-04-25 13:47:32 -07:00
Attila Nagy	3939e85b26	Fix loopfilter race condition in multithreaded encoder Race was introduced by https://gerrit.chromium.org/gerrit/15563. If loopfilter related config params were changed between frames, or after a KEY frame, there could be a mismatch between encoder's and decoder's recontructed image. In worst case, when frame buffers are reallocated because of a size change, the loopfilter could do an invalid data access (segmentation fault). Fixes: Sync with the loopfilter before applying any encoder changes in vp8_change_config(). Moved the loopfilter synching to the top of encode_frame_to_data_rate() so that it's done before any alteration of the encoder. Change-Id: Ide5245d2a2aeed78752de750c0110bc4b46f5b7b	2012-04-25 14:26:02 +03:00
Deb Mukherjee	ad06c9f051	Merge "Differential encoding of probability updates" into experimental	2012-04-24 23:07:18 +00:00
Ronald S. Bultje	29f8cbc285	Remove unused header files. Change-Id: I8708358bb37edabcbe5dfc755ed18791d9e143c4	2012-04-24 11:07:27 -07:00
Attila Nagy	0c483d6b68	Simplifies decoder multithread synching Increment the last_row_mb_col counter by nsync after last MB of row is ready. This way we dont need to check for last MB of row when synching. Set last MB of row ready just after row extension is done, This avoids o potential race condition whit the processing of last MB of next row. Change-Id: I19c44fd6041116ee5483be2143b4f4bfcd149eac	2012-04-24 09:36:54 +03:00
Deb Mukherjee	c6f1bf4321	Differential encoding of probability updates Adds differential encoding of prob updates using a subexponential code centered around the previous probability value. Also searches for the most cost-effective update, and breaks up the coefficient updates into smaller groups. Small gain on Derf: 0.2% Change-Id: Ie0071e3dc113e3d0d7ab95b6442bb07a89970030	2012-04-23 23:02:52 -07:00
John Koleszar	504601bb14	Merge "multi-res: restore v1.0.0 API" into eider	2012-04-23 12:07:18 -07:00
Attila Nagy	b41c17d625	Shares one set of RD costs tables between all encoding threads RD costs were local to MACROBLOCK data and had to be copied all the time to each thread's MACROBLOCK data. Tables moved to a common place and only pointers are setup for each encoding thread. vp8_cost_tokens() generates 'int' costs so changed all types to be int (i.e. removed unsigned). NOTE: Could do some more cleaning in vp8cx_init_mbrthread_data(). Change-Id: Ifa4de4c6286dffaca7ed3082041fe5af1345ddc0	2012-04-23 14:15:23 -04:00
Scott LaVarnway	11876faa11	Removed mcomp_filter_type and replaced with use_bilinear_mc_filter. Change-Id: Ie9e9f0bccca4ab7d3e23ae045aefed33536103ff	2012-04-23 13:23:21 -04:00
Scott LaVarnway	88e4a8af0f	Merge "Removes duplication of key frame mode probabilities"	2012-04-23 10:07:50 -07:00
Attila Nagy	175495fe73	Optimizes precalculated decoder block ptrs&offs The block pointers and offset do not need to be calculated for every frame. Block internal predictors can be update once when decoder is allocated. Destination and previous buffer offsets have to be updated just when frame size is changing. Change-Id: I92ca8df0e6aaac4cc35ab890751d446760bf82e2	2012-04-23 15:33:04 +03:00
Attila Nagy	f4126995b7	Removes duplication of key frame mode probabilities Key frame macrobock and block mode probabilities are constant. Remove the allocation of tables for each codec instance and use instead the default const prob tables. Change-Id: I8361798ac491f9b3889e86925a494c58647c753f	2012-04-23 12:58:39 +03:00
Ronald S. Bultje	2210767c3f	Hide some code behind CONFIG_COMP_INTRA_PRED. Change-Id: I7c0597dede20cc71145c053f76bd99aaf759d144	2012-04-20 15:27:47 -07:00
John Koleszar	d72c536ede	multi-res: restore v1.0.0 API Move the notion of 0 bitrate implying skip deeper into the codec, rather than doing it at the multi-encoder API level. This preserves v1.0.0 ABI compatibility, rather than forcing a bump to v2.0.0 over a minor change. Also, this allows the case where the application can selectively enable and disable the larger resolution(s) without having to reinitialize the codec instace (for instance, if no target is receiving the full resolution stream). It's not clear how deep to push this check. It may be valuable to allow the framerate adaptation code to run, for example. Currently put the check as early as possible for simplicity, should reevaluate this as this feature gains real use. Change-Id: I371709b8c6b52185a1c71a166a131ecc244582f0	2012-04-20 11:39:42 -07:00
John Koleszar	8e858f90f3	vp8_change_config: don't force kf with spatial resampling Look for changes in the codec's configured w/h instead of its active w/h when forcing keyframes. Otherwise calls to vp8_change_config() will force a keyframe when spatial resampling is active. Change-Id: Ie0d20e70507004e714ad40b640aa5b434251eb32	2012-04-20 11:09:12 -07:00
Yaowu Xu	ade43d9125	change to allow 8x8 transform always This commit changed to enable the usage 8x8 transform for all frame type, all resolution and all quantizer range. This has an overall benefit .2% to .3% in term of compression, but more importantly, the difficult clips benefits much more, up to 2% to 3% on clips like football, harbour and so on. We observed some weird humps on very high end on a couple of youtube clips, but have determined the underly cause was the aggressive zbin having an effect of lowering rate with lower quality, which have an impact on slide show clips around 60DB. The commit does not change the association between prediction mode and transform size. Change-Id: I33043bdce6207528ae00b4a4b26d8ff63cfea1f4	2012-04-20 09:17:59 -07:00
Yaowu Xu	ecc28cdaa3	added reset of rate estimates for each mode This is to prevent the evaluation of a mode from using values left over from a mode evaluated prior in the loop. Change-Id: Ife2c6ceb76d2f7365fd262515d3ae48229033c2d	2012-04-20 09:17:58 -07:00
Scott LaVarnway	abf1784c31	Merge "Makes all mode token tables const part 2"	2012-04-20 08:22:57 -07:00
Scott LaVarnway	317d4244cb	Makes all mode token tables const part 2 (see Change I9b2ccc88: Makes all mode token tables const) Further remove runtime table initialization and use precalculated const data. Data footprint reduced by 4112 bytes. Change-Id: Ia3ae9fc19f77316b045cabff01f6e5f0876a86ab	2012-04-19 17:35:20 -04:00
John Koleszar	c311b3b3a9	rtcd: serialize function pointer initialization Ensure that RTCD function pointers are set at most once, to silence some data race warnings. Implementation provided for POSIX threads and Win32, with the prior unsynchronized behavior left in place for other platforms. Change-Id: I65c5856df43ef67043b3d5f26ddafddd8fcb2f7e	2012-04-19 14:15:23 -07:00
Ronald S. Bultje	1259b0b22e	Fix splitmv/compound prediction when eightpel is enabled. Change-Id: I9d6083d54e3d478ec20dc6dc48d3f45eb5c7e16b	2012-04-19 10:07:14 -07:00
Attila Nagy	5948a02106	Ports vpx_xcaler to new RTCD method We can get rid of all remaining global initializers now: vp8_scale_machine_specific_config() vp8_initialize() vp8dx_initialize() Change-Id: I2825cea5d1c01ad9f6c45df49a0f86d803bfeb69	2012-04-19 17:40:56 +03:00
Attila Nagy	441cac8ea6	Makes all mode token tables const Mode token tabels precalculated in entropymode.c. Removes vp8_initialize_common()as all common global data is precalculated const now. Change-Id: I9b2ccc883e4f618069e1bc180dad3a823394eb73	2012-04-19 15:46:02 +03:00
Ronald S. Bultje	18433aef17	Compound prediction for splitmv macroblocks. Change-Id: I0af3395500b1cb0ed629249eb6636a0c9322cb18	2012-04-18 14:05:39 -07:00
Adrian Grange	1cc406ab4a	Added update of mode context pointers in decoder With the NEWENTROPY experiment enabled encoding certain clips produced invlid bitstreams, or files that had a high degree of artefacts. This was the results of pointers in MACROBLOCKD not being setup correctly (mode_info_context and prev_mode_info_context). Change-Id: Ice13e1efa8bd122997d2f8f3f1e761c6c16e0403	2012-04-18 13:48:06 -07:00
Johann	97495c5c5c	Merge "Makes all global data in tokenize.c const"	2012-04-18 10:51:46 -07:00
Adrian Grange	1ea2ad1e86	Merge "Added save coding context & modified MV bounds" into experimental	2012-04-18 16:21:16 +00:00
Yaowu Xu	bd49603be4	Merge "Added intra mode probabilites into coding_context" into experimental	2012-04-17 18:37:23 +00:00
Yaowu Xu	31afc77063	Added intra mode probabilites into coding_context These contexts need to be saved and restored for recode, otherwise encoder/decoder mismatch happens for some clips (eg._mobcal 720p) Change-Id: Ic65cfa0bf56ed0472ecab962ce31394d59d344bf	2012-04-17 09:29:27 -07:00
Attila Nagy	b35a0db0e7	Makes all global data in tokenize.c const Removes all runtime initialization of global data in tokenize.c. DCT token and cost tabels are pre-generated. Second patch in a series to make sure code is reentrant. Change-Id: Iab48b5fe290129823947b669413101f22a1bcac0	2012-04-17 15:38:05 +03:00
Attila Nagy	a91b42f022	Makes all global data in entropy.c const Removes all runtime initialization of global data in entropy.c. Precalculated values are used for initializing all entropy related tabels. First patch in a series to make sure code is reentrant. Change-Id: I9aac91a2a26f96d73c6470d772a343df63bfe633	2012-04-17 12:12:58 +03:00
John Koleszar	21173e1999	correct alt-ref contribution to frame rate When producing an invisible ARF, the time stamp counters aren't updated since the last time stamp is seen by the codec twice. The prior code was trapping this case with refresh_alt_ref, but this isn't correct for other uses of the ARF. Instead, use the show_frame flag. Change-Id: If67fff7c6c66a3606698e34e2fb5731f56b4a223	2012-04-16 12:23:33 -07:00
Deb Mukherjee	bb4ed8d8a3	Bug fix introduced from a recent refactoring of skip coding. Change-Id: Iccd0f09a4db02a2bad644deb3c76189c5f5612a3	2012-04-16 11:38:15 -07:00
Adrian Grange	fa589adc5f	Added save coding context & modified MV bounds Added code to save the coding context in vp8_rd_pick_inter_mode when the coding mode is forced to ARF(0,0). Also, modified the MV bounds computation to comply with the change in MV border from 32 to 64 pixels. Change-Id: I96963a6f5f4d04ce84c807ae11e0635177c3ad6c	2012-04-13 10:26:49 -07:00
Scott LaVarnway	3c5ed6f52e	Merge "MB_MODE_INFO size reduction"	2012-04-12 12:03:45 -07:00
Scott LaVarnway	6dc21bce63	Merge "loopfilter improvements"	2012-04-12 12:02:24 -07:00
Scott LaVarnway	e0a80519c7	loopfilter improvements Local variable offsets are now consistent for the functions, removed unused parameters, reworked the assembly to eliminate stalls/instructions. Change-Id: Iaa37668f8a9bb8754df435f6a51c3a08d547f879	2012-04-12 14:22:47 -04:00
Deb Mukherjee	237718dcbd	Turning off interpolation filter selection Turning off the interpolation filter selection based on edge proportion. This heuristics has not been working as well as expected and I have started a more rigorous investigation into this. We can turn this off for now since it is unnecessarily slowing things down. Rebase. Change-Id: Ic5958b2b3a35ec2d8eb73b6d81617ca8fbe07e74	2012-04-12 09:27:25 -07:00
Yaowu Xu	636b2f385e	a set of minor fixes This commit tries to address an issue related to the oddity shown on HD _mobcal clip, where some rather ugly blocks shown in the second frame at low-mid bit rates if the third frame is not made a key frame by he encoder. The fixes include: 1) made calls to sad_16x16 to be consistent with function prototype. 2) remove the error bias to intra and golden in mbgraph search. 3) changed the error accumulation on inter_segment encoding to avoid potential out-of-range. 1) has no effect on encoding results. Encoding test show that the overall effect of the commit helps about .2%(HD) to .3%(cif) Change-Id: I930975a2d0c06252f01c39e0a02351529774e30b	2012-04-12 13:41:54 +01:00
Yaowu Xu	d6f4b71d9f	Adjust the key frame placement condition The commit removed a limit on key frame detection, which caused a big drop in all metric measurements for standard HD clip such as _mobcal. This single change helps two standard HD clips by a huge amount, which help the overall std-hd set by 2.4% (glb psnr), 0.9% (avg_psnr), 2.1% (vpxssim). In the result page: http://pafr9.prod.google.com:26163/?/cns/rc-d/home/on2-prod/sunkaras/borg-test/yaowu 2012_04_02_1649_yaowu_bugfix_std-hd 2012_04_03_1452_yaowu_hump_std-hd represent the encoding test results and std-hd set prior and after this commit respectively. Change-Id: Ie4313e317c737ea0e699c3a7919c1376744baa1a	2012-04-12 11:46:24 +01:00
Yaowu Xu	d56acae660	changed function prototype for macro_block_yrd This commit has made macro_block_yrd_8x8 and macro_block_yrd_8x8 to take same parameters. It also removed a few unnecessary shifts that has the potential to create out-of-range distortion values. Change-Id: I4ec5afb307c3685c2a67a07c2850f0927d214455	2012-04-12 11:40:18 +01:00
Paul Wilkins	d6ac213ce6	Delete unused function. Deleted check_gf_quality(). Change-Id: If75fbd84accb1f6471ad6ea6ff2b65ae99976f5d	2012-04-11 17:14:46 +01:00
Paul Wilkins	13c6d1a8c8	Refactoring of encode loop and bitstream packing Some code re-factored / moved to allow the main pack operation inside the recode loop so that the size estimate is accurate. Deletion of some redundant code relating to one pass. Aproximate improvement over March 27 code base: Derf 0.0%, YT 0.5%, YThd 0.3% Std_hd 0.25% Change-Id: Id2d071794ab44f0b52935f6fcdb5733d09a6bb86	2012-04-11 15:44:14 +01:00
Paul Wilkins	f2ec452fbc	Changes to costing of skip. Update the costing of skip in the recode loop and rd code. Change-Id: I2e5ebbd7ddf201212b32441321e12626cd0423e9	2012-04-11 14:37:48 +01:00
Paul Wilkins	a3392d5718	T8x8 zbin and rate control changes. Some adjustments to zbin for t8x8. Changes to rules for sizing forced key frames. Some extra stats output in tmp.stt. Approximate gain on YT-hd set 0.5% There are still issues in sizing key frames and gf/arf frames when the image is largely static. These in part relate to problems with cost estimates in the recode loop. Change-Id: I6f0159dc8a8faeab4115a19c668d442491619a68	2012-04-11 13:13:28 +01:00
Adrian Grange	9daf3154db	Superblock encoding order This is the first patch to add superblock (32x32) coding order capabilities. It does not yet do any mode selection at the SB level, that will follow in a further patch. This patch encodes rows of SBs rather than MBs, each SB contains 2x2 MBs. Two intra prediction modes have been disabled since they require reconstructed data for the above-right MB which may not have been encoded yet (e.g. for the bottom right MB in each SB). Results on the one test clip I have tried (720p GIPS clip) suggest that it is somewhere around 0.2dB worse than the baseline version, so there may be bugs. It has been tested with no experiments enabled and with the following 3 experiments enabled: --enable-enhanced_interp --enable-high_precision_mv --enable-sixteenth_subpel_uv in each case the decode buffer matches the recon buffer (using "cmp" to compare the dumped/decoded frames). Note: Testing these experiments individually created errors. Some problems were found with other experiments but it is unclear what state these experiments are in: --enable-comp_intra_pred --enable-newentropy --enable-uvintra This code has not been extensively tested yet, so there is every likelihood that further bugs remain. I also intend to do some code cleanup & refactoring in tandem with the next patch that adds the 32x32 modes. Change-Id: I1eba7f740a70b3510df58db53464535ef881b4d9	2012-04-11 10:40:57 +01:00
Deb Mukherjee	6b33ca395f	Fixes to disable MFQE when there is motion. This patch includes: 1. fixes to disable block based termporal mixing when motion is detected (because this version of mfqe only handles zero motion). 2. The criterion used for determining whether to mix or not are changed to use squared differences rather than absolute differences. 3. Additional checks on color mismatch and excessive block flatness added. If the block as decoded has very low activity it is unlikely to yield benefits for mixing. Change-Id: I07331e5ab5ba64844c56e84b1a4b7de823eac6cb	2012-04-10 14:27:28 -07:00
John Koleszar	8106df8f5a	MFQE: apply threshold to subblocks and chroma. In cases where you have a flat background occluded by a moving object of similar luminosity in the foreground, it was likely that the foreground blocks would persist for a few frames after the background is uncovered. This is particularly noticable when the object has a different color than the background, so add the chroma planes in as an additional check. In addition, for block sizes of 8 and 16, the luma threshold is applied on four subblocks independently, which helps when only part of the background in the block has been uncovered. This fixes issue #392, which includes a test clip to reproduce the issue. BUG=392 Change-Id: I2bd7b2b0e25e912dcac342e5ad6e8914f5afd302	2012-04-03 12:05:01 -07:00
Johann	21ac3c8f26	Move variance and SAD RTCD definitions When the functions were moved from encoder/ to common/ the RTCD file was not updated. Change-Id: I1c98715ed51adf1a95aa2492949d8552aec88d1f	2012-04-02 11:06:35 -07:00
John Koleszar	8b15cf6929	Merge "remove unused BOOL_CODER::value"	2012-03-29 14:03:07 -07:00
John Koleszar	3f8349467a	remove unused BOOL_CODER::value Change-Id: Ic7782707afed38c3ec7e996a4a11dc2d55226691	2012-03-29 13:56:48 -07:00
Scott LaVarnway	31322c5faa	MB_MODE_INFO size reduction Reduced the size of the struct by 8 bytes, which would be a memory savings of 64800 bytes for 1080 resolutions. Had an extra byte, so created an is_4x4 for B_PRED or SPLITMV modes. This simplified the mode checks in vp8_reset_mb_tokens_context and vp8_decode_mb_tokens. Change-Id: Ibec27784139abdc34d4d01f73c09f43e9e10e0f5	2012-03-29 16:30:14 -04:00
Scott LaVarnway	a337725625	Updated vp8_build_intra_predictors_mby_s(sse2/ssse3) to work with the latest code. Patch Set 2: aligned the above_row buffers to fix crash Change-Id: I7a6992a20ed079ccd302f8c26215cf3057f8b70c	2012-03-29 14:24:53 -04:00
Deb Mukherjee	78ecbc98e4	Bug fix in probability update savings computation Found this bug while tracking down some anomalies in my experiments. Since vp8_cost_one and vp8_cost_zero return unsigned int, the bit shift by 8 will be incorrect if the value is negative. I am cautiously optimistic that this fix will make the prob updates more correct and somewhat improve results across the board. But the update probabilities will need to be retuned I think. Patch 2: Adding more of the same fixes using a macro. Change-Id: I1a168f040e74e8c67e7225103b1c2af9a611da49	2012-03-29 08:39:02 -07:00
Scott LaVarnway	ccea000c4b	Updated vp8_build_intra_predictors_mbuv_s(sse2/ssse3) to work with the latest code. Change-Id: Ie382bb55d00ea5929bdadba859eea15f696d4cd9	2012-03-26 13:40:14 -04:00
Scott LaVarnway	403966ae00	Removed duplicate vp8_build_intra_predictors_mb y/uv Added y/uv stride as a parameter and remove the duplicate code. Change-Id: I019117a9dd9659a09d3d4e845d4814d3f33341b5	2012-03-26 13:40:14 -04:00
Scott LaVarnway	9e9f5f3d70	New vp8_decode_mb_tokens() This new vp8_decode_mb_tokens() uses a modified version of WebP's GetCoeffs function. For now, the dequant does not occur in GetCoeffs. Tests showed performance improvements up to 2.5% depending on material. Change-Id: Ia24d78627e16ffee5eb4d777ee8379a9270f07c5	2012-03-23 15:50:08 -04:00
Deb Mukherjee	06dc2f6166	Initialize postproc buffer to resolve valgrind warnings Change-Id: I9a7d40b0eac7200796dbe62e75776b2eb77dfdf6	2012-03-22 17:42:41 -07:00
Ronald S. Bultje	9f900a1c64	Only enable compound prediction if multiple reference frames are present. Change-Id: Ia52ac825400eb83ff663e3a05a3fe0b3526bac9a	2012-03-22 10:20:17 -07:00
Deb Mukherjee	66ba79f5fb	Miscellaneous changes in mfqe and postproc modules Adds logic to disable mfqe for the first frame after a configuration change such as change in resolution. Also adds some missing if CONFIG_POSTPROC macro checks. Change-Id: If29053dad50b676bd29189ab7f9fe250eb5d30b3	2012-03-22 09:55:07 -07:00
James Berry	fd9df44a05	Merge "remove __inline for compiler compatibility"	2012-03-21 12:37:17 -07:00
James Berry	3c021e1d79	Merge "bug fix: remove inline from mfqe.c"	2012-03-21 12:36:40 -07:00
Yaowu Xu	2823173ee0	enable 8x8 transform for MBs in intra frames When ac_yquant>171, a key frame is enabled to use 8x8 transform. In such case, MBs with DC_PRED or TM_PRED are selected to use T8x8. This change helped the full STD-HD set by ~.1% or so, which is reasonable considering how often key frame occurs in these encodings. Change-Id: Id17009ef6327252177b19e6bf0d6628827febaf1	2012-03-21 12:25:44 -07:00
Paul Wilkins	c88d335f7d	Only support improved quant Deprecate fast quant and strict_quant code. Small effect on quality as fast was used in first pass but the effect is basically neutral across the derf set. The rationale here is to reduce the number of code paths for now to make experimentation easier. Optimized and fast code options can be re-introduced later along with other encode speed options. Change-Id: Ia30c5daf3dbc52e72c83b277a1d281e3c934cdad	2012-03-21 18:22:33 +00:00
James Berry	451ab0c01e	remove __inline for compiler compatibility __inline removed for broader compiler compatibility Change-Id: I6f2b218dfc808b73212bbb90c69e2b6cc1fa90ce	2012-03-21 14:11:10 -04:00
Yunqing Wang	bcee56bed5	Minor fix: add back a vpx_free call Added back a vpx_free call that was mistakenly removed. Change-Id: Ib662933a8697a4efb8534b5b9b762ee6c2777459	2012-03-21 14:09:33 -04:00
Paul Wilkins	36af2035c8	Merge Exact Quant Change-Id: Id2412a7f24a7c1016ec9fc3b9b0fbd16871f374a	2012-03-21 17:57:09 +00:00
James Berry	921ffdd2c9	bug fix: remove inline from mfqe.c remove inline from mfqe.c for vs compatibility Change-Id: I853f16503d285fcd41a1a12181d8745159156b5c	2012-03-21 13:42:58 -04:00
Deb Mukherjee	475d5d5664	Making subpel filters switchable at frame level Various refactoring to make the subpel motion compensation filters switchable by a frame level field. Two types of 8-tap filters are supported in addition to the existing bilinar and sixtap filters. One is the default 8-tap and the other has a sharper cut-off for use with frames with substantial edge content. Patch 2: Added a preliminary strategy for filter selection based on edginess detecton. Also includes some filter changes. Change-Id: I866085bda5ae143cfdf2ec88157feaabdf7bd63a	2012-03-21 09:17:22 -07:00
Deb Mukherjee	57d953479b	Adding contextual coding of mb_skip_coeff flag. Using contextual coding of the mkb_skip_coeff flag using the values of this flag from the left and above. There is a small improvement of about 0.15% on Derf: http://www.corp.google.com/~debargha/vp8_results/mbskipcontext.html Refactored to use pred_common.c by adding a new context type. Results on HD set (about 0.66% improvement): http://www.corp.google.com/~debargha/vp8_results/mbskipcontext_hd.html Incliding missing refactoring to use the pred_common utilities. Change-Id: I95373382d429b5a59610d77f69a0fea2be628278	2012-03-21 03:55:44 -07:00
Yunqing Wang	9ed1b2f09e	Merge "Add motion search skipping in first pass"	2012-03-16 14:02:51 -07:00
Yunqing Wang	6a819ce4fe	Add motion search skipping in first pass This change added a motion search skipping mechanism similar to what we did in second pass. For a macroblock that is very similar to the macroblock at same location on last frame, we can set its mv to be zero, and skip motion search. This improves first-pass performance for slide shows and video conferencing clips with a slight PSNR loss. Change-Id: Ic73f9ef5604270ddd6d433170091d20361dfe229	2012-03-16 15:59:00 -04:00
Johann	e68953b7c8	Merge "RFC: Reorganize MFQE loops"	2012-03-16 11:35:18 -07:00
Scott LaVarnway	9aa2bd8a03	Merge "x_motion_minq table reduction"	2012-03-16 09:03:47 -07:00
Yaowu Xu	21d3612a2f	added clamp for 2nd motion vector The commit added a clamp to the 2nd motion vector used in compound prediction to insure mv within UMV borders. The clamp is similar to that of the first motion vector except that No SPLITMV is ever used for the 2nd motion vector. Change-Id: I26dd63c304bd66b2e03a083749cc98c641667116	2012-03-14 18:53:26 -07:00
Yaowu Xu	ea43ba4aee	fixed a bug of context overwritten by key frame recoding The recoding loop save and restore frame coding context for recodes. However in recoding of key frames, some of the coding context saved was stale from last encoded inter frame. The save/restore sometimes overwrites the re-inintialized coding context with saved context from last frame, resulting in encoder/decoder mismatch Change-Id: I354ae2f71074d142602d51d06544c05a2462caaf	2012-03-14 17:29:39 -07:00
John Koleszar	422f97d7ab	Merge "fix potential use of uninitialized rate_y"	2012-03-14 12:00:27 -07:00
John Koleszar	20cd3e6b8f	fix potential use of uninitialized rate_y This issue likely doesn't appear in the unmodified encoder, but sufficient hacking on the mode selection loop can expose it. Change-Id: I8a35831e8f08b549806d0c2c6900d42af883f78f	2012-03-14 10:10:30 -07:00
Jim Bankoski	6b66c01c88	Merge "Adds a motion compensated temporal denoiser to the encoder."	2012-03-13 16:18:57 -07:00
Stefan Holmer	9c41143d66	Adds a motion compensated temporal denoiser to the encoder. Some refactoring in rdopt.c and pickinter.c. Change-Id: I4f50020eb3313c37f4d441d708fedcaf219d3038	2012-03-13 15:33:50 -07:00
Jim Bankoski	e9cacfd66d	Merge "Update for key frame target size setting."	2012-03-13 10:13:03 -07:00
Marco Paniconi	301409107f	Update for key frame target size setting. Set an iniital/minimun boost level for the frame rate factor of key frame target size setting. Change-Id: If2586f4ac76a1fa89378aa652a58607356a1f426	2012-03-12 16:23:08 -07:00
Johann	ddf94f6184	Merge "Move SAD and variance functions to common"	2012-03-12 15:08:48 -07:00
Yaowu Xu	3f5feb7d13	fixed .mk files to reflect add/remove of a header file In a previous commit, the duplicate of headerfile defaultcoefcounts.h was identified. This commit updates the .mk file to ensure configure and make works properly for all platforms. Change-Id: I31a39c809a734ba438ee53db700f252e9a03eddd	2012-03-12 14:51:54 -07:00
Scott LaVarnway	7a1590713e	Merge changes I9c26870a,Ifabb0f67 * changes: threading.c refactoring Decoder loops refactoring	2012-03-09 10:48:11 -08:00
Scott LaVarnway	9ed874713f	threading.c refactoring Added recon above/left to MACROBLOCKD Reworked decode_macroblock Change-Id: I9c26870af75797134f410acbd02942065b3495c1	2012-03-08 15:27:41 -05:00
Yaowu Xu	64439c2b77	removed duplicate a head file Removed the copy in encoder and changed to use the one in common Change-Id: Ief0985a50ffd6053a269638fd4816b055ca273ec	2012-03-07 07:21:46 -08:00
Paul Wilkins	68033ca472	Snapshot candidate Pulled out super block code for the snapshot as this is not quite ready and will need an extensive re-merge. Change-Id: I436369b511257447a7b0ea064016cb63f5011849	2012-03-07 11:24:33 +00:00
Johann	fd903902ef	RFC: Reorganize MFQE loops Break MFQE code into it's own file. It is currently only valid for 16x16 and 8x8 Y blocks. It also filters 4x4 U/V blocks. Refactor filtering and add associated assembly. Limited test cases show --mfqe introduces a penalty of ~20% with HD content. The assembly reduces the penalty to ~15% Change-Id: I4b8de6b5cdff5413037de5b6c42f437033ee55bf	2012-03-06 15:20:03 -08:00
Jim Bankoski	157f15e1d3	Imported a change from stable branch https://gerrit.chromium.org/gerrit/#change,17319 fixes cost estimating to take skip_eob into account. No quality difference seen on derf set tests, but about .4% gain on STD_HD set. Change-Id: Ic5fe6d35ee021e664a6fcd28037b8432a0e470ca	2012-03-06 12:20:34 -08:00
Yaowu Xu	98bf413b4b	fix a compiling error when CONFIG_COMP_INTRA_PRED not defined Change-Id: I4159ff365953598d9c9bc3ecacfb5c5f4a315b45	2012-03-06 09:52:15 -08:00
Jim Bankoski	154b4b4196	vp8e - RDLambda fix Last commit went the wrong way. Change-Id: I5e47ee6c25b0893dfa84318229b93c57dfeec24e	2012-03-06 08:47:12 -08:00
Johann	e50f96a4a3	Move SAD and variance functions to common The MFQE function of the postprocessor depends on these Change-Id: I256a37c6de079fe92ce744b1f11e16526d06b50a	2012-03-05 16:50:33 -08:00
Jim Bankoski	888699091d	vp8e - fix coefficient costing Coefficient costing failed to take account of the first branch being skipped ( 0 vs eob) if the previous token is 0. Fixed rd to account for slightly increased token cost & cleaned up warning message Change-Id: I56140635d9f48a28dded5a816964e973a53975ef	2012-03-05 08:20:42 -08:00
Ronald S. Bultje	c3f5b2931b	Use per-MB compound intra prediction. This gives a modest gain on derf overall, although at low bitrates the cost is still too high, so this can be improved further. Patch 2. Re-base and fix 80 column issues Change-Id: Ida2f9fa3fe75370669f6a27b37108dc602231c63	2012-03-05 15:23:10 +00:00
Yaowu Xu	6898e8b7ab	Changed how UV r/d estimates are done for Intra Modes The commit changed to compute UV intra RD estimates for 4x4 and 8x8 separately to be used in mode decision for MB modes associated with the appropriate transform size respectively. Now finally after many other changes related 8x8 quantizer zbin boost and zbin_mode_boost, this change overall helps the HD(with 8x8) by around ~.13%. (avg .13% glb .13% ssim .17%) The commit also has a few changes for eliminating compiler warnings. Change-Id: Ibab35dad44820c87e6b44799c66f8d519cc37344	2012-03-05 13:16:41 +00:00
Yaowu Xu	eaa955ba98	Fixed zbin_mode_boost initialization The commit added the correct Zbin_mode_boost initialization based on Intra Mode before using rate distortion to pick UV intra mode. Change-Id: I8e57878ff356a06672f6fa2431be860bf9b9a5c7	2012-03-05 13:10:49 +00:00
Yaowu Xu	848bccabd5	refactored code that checks if a macroblock is skippable Change-Id: I4ea6d819bbbde312792c4f813ab63ea50cf0cd1d	2012-03-05 11:30:43 +00:00
Paul Wilkins	a0be3faa6e	Allow for frame overheads in min frame bandwidth. Change-Id: I6ade229ff400fe492709010ac5bada37f8afa73e	2012-03-05 11:25:44 +00:00
Yaowu Xu	89ee68b1f7	Merge t8x8 experiments Change-Id: I8e9b6b154e1a0d0cb42d596366380d69c00ac15f	2012-03-01 12:59:11 -08:00
Jim Bankoski	a60461a340	Merge "vp8e - attempt to lessen blockiness"	2012-03-01 11:47:09 -08:00
Yaowu Xu	3ceb43104f	disable usage of 8x8 for resolution below 360p need further investigation on some odd behavior on one youtube clip Change-Id: Iec477986a86b54ef26df2ef69d2f9484e2d1a043	2012-03-01 11:30:49 -08:00
Jim Bankoski	f0f609c2e2	Merge "vp8e - static key boost"	2012-03-01 11:23:10 -08:00
Jim Bankoski	91b5c98b3d	Merge "vp8e - force at least some change in over and under shoots"	2012-03-01 11:22:52 -08:00
Paul Wilkins	8d07a97acc	vp8e - static key boost This seeks to boost the size of the keyframe if the entire section is a single frame clip Change-Id: I3c00268dc155b047dc4b90e514cf403d55a4f8ef	2012-03-01 10:39:41 -08:00
Paul Wilkins	6d84322762	vp8e - force at least some change in over and under shoots Change-Id: Ie1796f272dc33bf5a1c8ac990da625961d272aa9	2012-03-01 10:35:22 -08:00
Deb Mukherjee	e41e5ce5ad	Various bug fixes related to high precision mv Change-Id: Ie5a7c87d71bd4a541463b68704620d89cec142cf	2012-03-01 03:10:21 -08:00
Paul Wilkins	2ad7a4a271	Bug fix in vp8_estimate_entropy_savings() Incorrect scaling of savings for t8x8. Change-Id: If01e08f8c73faa73afc3c70e501e6acc54d7e26f	2012-03-01 01:42:02 +00:00
Attila Nagy	52cf4dcaea	Packing bitstream on-the-fly with delayed context updates Produce the token partitions on-the-fly, while processing each MB. Context is updated at the beginning of each frame based on the previoud frame's counters. Optimally encoder outputs partitions in separate buffers. For frame based output, partitions are concatenated internally. Limitations: - enabled just in combination with realtime-only mode - number of encoding threads has to be equal or less than the number of token partitions. For this reason, by default the encoder will do 8 token partitions. - vpxenc supports partition output (-P) just in combination with IVF output format (--ivf) Performance: - Realtime encoder can be up to 13% faster (ARM) depending on the number of threads and bitrate settings. Constant gain over the 5-16 speed range. - Token buffer reduced from one frame to 8 MBs Quality: - quality is affected by the delayed context updates. This again dependents on input material, speed and bitrate settings. For VC style input the loss seen is up to 0.2dB. If error-resilient=2 mode is used than the effect of this change is negligible. Example: ./configure --enable-realtime-only --enable-onthefly-bitpacking ./vpxenc --rt --end-usage=1 --fps=30000/1000 -w 640 -h 480 --target-bitrate=1000 --token-parts=3 --static-thresh=2000 --ivf -P -t 4 -o strm.ivf tanya_640x480.yuv Change-Id: I127295cb85b835fc287e1c0201a67e378d025d76	2012-02-29 12:13:37 -05:00
Jim Bankoski	b8fa2839a2	vp8e - attempt to lessen blockiness applies a penalty to intra blocks in order to cut down on blockiness in easy sections. Change-Id: Ia9e5df16328b0bf01bf0f2e6e61abcb687316c12	2012-02-29 09:03:13 -08:00
Scott LaVarnway	2578b767c3	Decoder loops refactoring Eliminated some mb branches along with other code cleanups. This is part of an ongoing effort to remove cut/paste code in the decoder. Change-Id: Ifabb0f67cafa6922b5a0e89a0d03a9b34e9e5752	2012-02-29 10:38:14 -05:00
Yaowu Xu	d8670b3c2e	Fix compiling issue when CONFIG_HIGH_PRECISION_MV is defined Change-Id: I3b8a230d5b119910e6a6767331a4d97768089355	2012-02-28 17:45:46 -08:00
Ronald S. Bultje	921b1c3c94	Rename "dual" prediction to "compound" prediction. Change-Id: Ibcd2b9b247ff9f83331dac47f91ec285e8955ff1	2012-02-28 17:43:46 -08:00
Ronald S. Bultje	d476165107	Compound intra prediction (b_pred/4x4 only, for now), Also remove duplicate build_intra_predictors_mby/uv(). Change-Id: I78607e7304952a9b962a5b25af9bb9c48692187b	2012-02-28 17:41:03 -08:00
Yaowu Xu	af5c774b5c	Correct zbinboost lookup for 8x8 quantizer The commit fixed a problem where 8x8 regular quantizer was using the 4x4 zbinboost lookup table that only has 16 entries at each Q. The commit assigned a uniform zbin boost value for all cases that there are more than 16 consective zeros. The change only affects MBs using 8x8 transform. The fix has a slightly positive impact on quality. Test results: http://www.corp.google.com/~yaowu/no_crawl/hd_fixzbinb.html (avg psnr: .26% glb psnr: .21% ssim: .28%) Results on cif size clip are also positive even though gain is smaller http://www.corp.google.com/~yaowu/no_crawl/derf_fixzbinb.html Change-Id: Ibe8f6da181d1fb377fbd0d3b5feb15be0cfa2017	2012-02-28 17:17:47 -08:00
Paul Wilkins	dc825f1e2b	Merge "Merge new loop filter." into experimental	2012-02-28 23:28:04 +00:00
Deb Mukherjee	3e1cad9c69	Initial refactoring of high_precision mv code. This is the first patch for refactoring of the code related to high-precision mv, so that 1/4 and 1/8 pel motion vectors can co-exist in the same bit-stream by use of a frame level flag. The current patch works fine for only use of 1/4th and only use of 1/8th pel mv, but there are some issues with the mode switching in between. Subsequent patches on this change Id will fix the remaining issues. Patch 2: Adds fixes to make sure that multiple mv precisions can co-exist in the bit-stream. Frame level switching has been tested to work correctly. Patch 3: Fixes lines exceeding 80 char Patch 4: http://www.corp.google.com/~debargha/vp8_results/enhinterp.html Results on derf after ssse3 bugfix, compared to everything enabled but the 8-tap, 1/8-subpel and 1/16-subpel uv. Overall the gains are about 3% now. Hopefully there are no more bugs lingering. Apparently the sse3 bug affected the quartel subpel results more than the eighth pel ones (which is understandabale because one bad predictor due to the bug, matters less if there are a lot more subpel options available as in the 1/8 subpel case). The results in the 4th column correspond to the current settings. The first two columns correspond to two settings of adaptive switching of the 1/4 or 1/8 subpel mode based on initial Q estimate. These do not work as good as just using 1/8 all the time yet. Change-Id: I3ef392ad338329f4d68a85257a49f2b14f3af472	2012-02-28 15:09:20 -08:00
Yaowu Xu	3d93ae521c	Merge "Try to enable 8x8 tranform for smaller resolution" into experimental	2012-02-28 22:31:03 +00:00
Paul Wilkins	19b9d28f70	Merge new loop filter. Merge of the NEWLPF configuration experiment so it is always on. Change-Id: I7054772b6eab28bad1ff807bfa54d98f83de9308	2012-02-28 20:58:52 +00:00
Yaowu Xu	42891098f3	Try to enable 8x8 tranform for smaller resolution The commit overall on derf test is break even to very slightly positive comparing to all 4x4 transform. Change-Id: I2a7c19599aa54c2d3a5b35db0dc891ba8a6a2b26	2012-02-28 11:49:12 -08:00
Scott LaVarnway	ce328b855f	Merge changes Ifb450710,I61c4a132 * changes: Eliminated reconintra_mt.c Eliminated vp8mt_build_intra_predictors_mbuv_s	2012-02-28 11:42:45 -08:00
Scott LaVarnway	aab70f4d7a	Merge "Removed duplicate code in threading.c"	2012-02-28 11:25:43 -08:00
Scott LaVarnway	bcba86e2e9	Eliminated reconintra_mt.c Reworked the code to use vp8_build_intra_predictors_mby_s, vp8_intra_prediction_down_copy, and vp8_intra4x4_predict_d_c functions instead. vp8_intra4x4_predict_d_c is a decoder-only version of vp8_intra4x4_predict. Future commits will fix this code duplication. Change-Id: Ifb4507103b7c83f8b94a872345191c49240154f5	2012-02-28 14:12:30 -05:00
Scott LaVarnway	9a4052a4ec	Removed duplicate code in threading.c Change-Id: Id7e44950ceda67b280e410e541510106ef02f1da	2012-02-28 14:00:32 -05:00
Yunqing Wang	b1bfd0ba87	Merge "Only do uv intra-mode evaluation when intra mode is checked"	2012-02-28 10:11:24 -08:00
Yunqing Wang	019384f2d3	Only do uv intra-mode evaluation when intra mode is checked When we encode slide-show clips, for the majority of the time, only ZEROMV mode is checked, and all other modes are skipped. This change delayed uv intra-mode evaluation until intra mode is actually checked. This gave big performance gain for slide-show video encoding (2nd pass gain: 18% to 28%). But, this change doesn't help other types of videos. Also, zbin_mode_boost is adjusted in mode-checking loop, which causes bitstream mismatch before/after this change when --best or --good with --cpu-used=0 are used. Change-Id: I582b3e69fd384039994360e870e6e059c36a64cc	2012-02-28 13:08:17 -05:00
Paul Wilkins	25c127f5f0	Experimental branch code clean up. Removal of some further code relating to partitions and error resilience. Spelling correction. Change-Id: I36067aae67a4a23bec359541dda3400b0bbf26d0	2012-02-28 17:59:21 +00:00
Paul Wilkins	b6f02c8592	Code Simplification Removal of code relating to token partitioning Change-Id: Iaf3c88d6758639a55bd92c3be5c51e6bed407a3c	2012-02-28 17:55:42 +00:00
Yaowu Xu	eb87b56eab	fixed a wrong intialization value The "update" variable was used as a flag in coef_prob update dry run that tests if a frame should encodes update at all. The wrong init value forced the update happening always. fixing this has a minor improvement in low bit rate situation when 8x8 transform is allowed. Change-Id: Icb498e8d6a62fd074dcbc2065b797cba9237cb51	2012-02-28 09:10:34 -08:00
Paul Wilkins	3cdd0a8e75	Merge "Corrected spelling" into experimental	2012-02-28 02:07:49 +00:00
Paul Wilkins	b00ed02a16	Corrected spelling Apparently the correct spelling of segement is segment ! Change-Id: I88593ee0523f251b3a96794c6166ef8c7898a029	2012-02-27 21:42:36 +00:00
James Berry	e2c6b05f9a	bugfix: use oxcf width/height for reinit check use oxcf instead of common in check to Reinit the lookahead buffer if the frame size changes prior behavior would cause assertion fail/crash first observed in: support changing resolution with vpx_codec_enc_config_set Change-Id: Ib669916ca9b4f206d4cc3caab5107e49d39a36aa	2012-02-27 16:10:45 -05:00
Paul Wilkins	88a867c6dd	Merge "Code Cleanup." into experimental	2012-02-27 20:50:21 +00:00
Paul Wilkins	2e9d7d647a	Merge "Removal of temporal re sampling code." into experimental	2012-02-27 20:50:01 +00:00
Yunqing Wang	61c5e31ca1	Merge "Fix skippable evaluation in mode decision"	2012-02-27 11:06:13 -08:00
Paul Wilkins	46ab54abf8	Merge "Code Simplification." into experimental	2012-02-27 17:58:57 +00:00
John Koleszar	02a31e6b3c	Merge "decoder: reset segmentation map on keyframes"	2012-02-27 09:58:29 -08:00
Paul Wilkins	d90b1ee16c	Merge "Further code simplification and clean up." into experimental	2012-02-27 17:58:12 +00:00
Yunqing Wang	84be08b07f	Fix skippable evaluation in mode decision Yaowu fixed the skippable evaluation by correcting 2nd order block's eob. Change-Id: Id47930cbc74a90a046c0c0e324efb03477639ee0	2012-02-27 12:45:12 -05:00
Deb Mukherjee	063d68b7ea	Merge "Changing default 8-tap filter to Langrangian interpolator." into experimental	2012-02-26 06:06:07 +00:00
Paul Wilkins	646e62211e	Code Cleanup. Removal of error_resilient_mode features. The interface has been left in place but does nothing. Change-Id: I2407863bd0d3c98407354507423ca48d29f63b17	2012-02-26 01:15:47 +00:00
Paul Wilkins	80b873e318	Removal of temporal re sampling code. For now the interface elements have been left in place to make sure existing parameter files work but parameters relating to drop frame wont do anything. Change-Id: I579ee614726387381c546845dac4bc03c74c6a07	2012-02-25 18:13:57 +00:00
Deb Mukherjee	88b36eb0d9	Bug fix in ssse3 variance computation. Fixes a bug that was introduced in the high precision mv patch. Change-Id: Ieadb433ebe4c3ef3e0e63944dab11528bf8bd73a	2012-02-24 20:24:54 -08:00
Paul Wilkins	69e80a028c	Code Simplification. Removal of code relating to spatial re sampling Change-Id: Iff1bc651c62cd528f960c4b27f9673b172e68835	2012-02-24 23:58:24 +00:00
Paul Wilkins	3cc5b92c65	Further code simplification and clean up. Change-Id: Ifdb17b56090a317b2aa82cf125d57934902c5298	2012-02-24 23:38:36 +00:00
Deb Mukherjee	c64a27801c	Changing default 8-tap filter to Langrangian interpolator. The Lagrangian interpolation filter is maximally flat in the passband. There is non-trivial improvement with the hd set, while for derf the results are virtually unchanged. See: http://www.corp.google.com/~debargha/vp8_results/enhinterpn.html (derf) http://www.corp.google.com/~debargha/vp8_results/enhinterpn_hd.html (HD) Patch 2: Updated the results for derf in the html above to use the new baseline. There is still about 4% improvement. Will update the hd baseline later (since it takes 9 hours to run on my machine) Patch 3: By mistake the default filter was left at 60 - should be 0 to use the new interpolation filter. Change-Id: If5f64444976562415d68a2aeabb94fdfa0d47890	2012-02-24 11:13:36 -08:00
Paul Wilkins	583f2d8fc7	Deleted code. Removed redundant code for ref frame cost.	2012-02-24 02:16:53 +00:00
Deb Mukherjee	fb472c5b64	Clean ups and minor changes in high precision mv with 8-tap interpolation * Removes EDGE_PIXEL_FILTER for external sanpshot * changes the default 8-tap filter based on high precision results in http://www.corp.google.com/~debargha/vp8_results/enhinterpn.html * changes the default prob tables for high-precision mv encoding to favor zeros in the last bit (i.e. quarter pel). This is only important for short clips. Change-Id: I02bb0de8679d9eec06cdbcc8160dbf073cd847a4	2012-02-23 11:47:18 -08:00
Deb Mukherjee	18e90d744e	Supporting high precision 1/8-pel motion vectors This is the initial patch for supporting 1/8th pel motion. Currently if we configure with enable-high-precision-mv, all motion vectors would default to 1/8 pel. Encode and decode syncs fine with the current code. In the next phase the code will be refactored so that we can choose the 1/8 pel mode adaptively at a frame/segment/mb level. Derf results: http://www.corp.google.com/~debargha/vp8_results/enhinterp_hpmv.html (about 0.83% better than 8-tap interpoaltion) Patch 3: Rebased. Also adding 1/16th pel interpolation for U and V Patch 4: HD results. http://www.corp.google.com/~debargha/vp8_results/enhinterp_hd_hpmv.html Seems impressive (unless I am doing something wrong). Patch 5: Added mmx/sse for bilateral filtering, as well as enforced use of c-versions of subpel filters with 8-taps and 1/16th pel; Also redesigned the 8-tap filters to reduce the cut-off in order to introduce a denoising effect. There is a new configure option sixteenth-subpel-uv which will use 1/16 th pel interpolation for uv, if the motion vectors have 1/8 pel accuracy. With the fixes the results are promising on the derf set. The enhanced interpolation option with 8-taps alone gives 3% improvement over thei derf set: http://www.corp.google.com/~debargha/vp8_results/enhinterpn.html Results on high precision mv and on the hd set are to follow. Patch 6: Adding a missing condition for CONFIG_SIXTEENTH_SUBPEL_UV in vp8/common/x86/x86_systemdependent.c Patch 7: Cleaning up various debug messages. Patch 8: Merge conflict Change-Id: I5b1d844457aefd7414a9e4e0e06c6ed38fd8cc04	2012-02-23 09:25:21 -08:00
James Berry	313bfbb6a2	Merge "Add unit tests for idctllm_test and idctllm_mmx"	2012-02-23 08:50:36 -08:00
Jim Bankoski	2089f26b08	Merge "Remove the frame rate factor for key frame size."	2012-02-23 08:38:44 -08:00
Marco Paniconi	507ee87e3e	Remove the frame rate factor for key frame size. When temporal layers is used (i.e., number_of_layers > 1), we don't use the frame rate boost for setting the key frame target size. The factor was forcing the target size to be always at its minimum (2* per_frame_bandwidth) for low frame rates (i.e., base layer frame rate). Generally we should modify or remove this frame rate factor; for now we turn if off for number_of_layers > 1. Change-Id: Ia5acf406c9b2f634d30ac2473adc7b9bf2e7e6c6	2012-02-22 15:25:32 -08:00
Yaowu Xu	3c872b6c27	Merge "Fixed skippable evaluation in mode decision" into experimental	2012-02-22 17:13:04 +00:00
Yaowu Xu	0f430084e0	Merge "Reduced bias in picking loop filter level" into experimental	2012-02-22 17:12:52 +00:00
Yaowu Xu	7670933386	Merge "a bit code clean-up" into experimental	2012-02-22 15:55:53 +00:00
Yaowu Xu	c54bfcb6f0	Merge "Reworked context conversion between 8x8 and 4x4" into experimental	2012-02-22 15:55:37 +00:00
Yaowu Xu	2b4cd4cc01	Fixed skippable evaluation in mode decision Yunqing fixed an oddity in UVIntra skippable evaluation for stable branch, which brought up the fact that the evaluation is broken. The issue was that for MBs with 2nd order block, the eob for 1st order blocks is set at 1. The previous evaluation did not take that into account. This commit intend to fix the problem. The commit also absorbed Yunqing's fix for UVIntra skippable evalution. Test on hd showed some good gains in combination with LPF bias fix: http://www.corp.google.com/~yaowu/no_crawl/LPFBias_FixSkip.html (avg psnr: .34%, glb psnr: .32%, ssim: .22%) Change-Id: I36af11c8ef7f643e8ff46da7bf3a167b437039d4	2012-02-22 06:49:13 -08:00
Scott LaVarnway	f2bd11faa4	Eliminated vp8mt_build_intra_predictors_mbuv_s Reworked the code to use vp8_build_intra_predictors_mbuv_s instead. This is WIP with the goal of eliminating all functions in reconintra_mt.h Change-Id: I61c4a132684544b24a38c4a90044597c6ec0dd52	2012-02-21 14:59:05 -05:00
James Berry	0c1cec2205	Add unit tests for idctllm_test and idctllm_mmx add unit tests for vp8_short_idct4x4llm_c Change-Id: I472b7c0baa365ba25dc99a3f6efccc816d27c941	2012-02-21 14:52:36 -05:00
John Koleszar	dadc9189ed	Merge changes I0341554f,I64e110c8 * changes: Consolidate C version of token packing functions Multithreaded encoder, late sync loopfilter	2012-02-21 10:09:23 -08:00
Scott LaVarnway	f05feab7b9	Merge "Remove redundant init of segment_counts in vp8_encode_frame"	2012-02-21 09:51:02 -08:00
John Koleszar	02360dd2c2	Merge "Update encoder mb_skip_coeff and prob_skip_false calculation"	2012-02-21 09:48:26 -08:00
Yaowu Xu	737179f275	Reduced bias in picking loop filter level The bias in picklpf intended to bias toward less greedy in getting best frame level psnr while maximize overall quality for a clip. This commit reduced the bias for frames using 8x8 transform to achieve better compression overall. The change improve compression by ~.15% consistently on most of the HD clips tested. http://www.corp.google.com/~yaowu/no_crawl/LPFBias_FixSkip.html Change-Id: Ic30932d2b8eaebd52339b0195f569edc48eed7bc	2012-02-17 16:44:08 -08:00
Yunqing Wang	f93b1e7be1	Merge "Fix incorrect use of uv eobs in intra modes"	2012-02-17 10:43:05 -08:00
Paul Wilkins	4cfb8ed4c9	Code base simplification. Removal of most code to do with 1 pass. Removal of cyclic refresh code. Change-Id: I74971082bc19dd76e795d4d2e781a0424cec5c8c	2012-02-17 16:29:03 +00:00
Yunqing Wang	04b9e0d787	Fix incorrect use of uv eobs in intra modes In vp8_rd_pick_inter_mode(), if total of eobs is zero, rate needs to be adjusted since there are no non-zero coefficients for transmission. The uv intra eobs calculated in rd_pick_intra_mbuv_mode() need to be saved before they are overwritten by inter-mode eobs. Change-Id: I41dd04fba912e8122ef95793d4d98a251bc60e58	2012-02-17 09:15:08 -05:00
Attila Nagy	ce42e79abc	Update encoder mb_skip_coeff and prob_skip_false calculation mode_info_context->mbmi.mb_skip_coeff has to always reflect the existence or not of coeffs for a certain MB. The loopfilter needs this info. mb_skip_coeff is either set by the vp8_tokenize_mb or has to be set to 1 when the MB is skipped by mode selection. This has to be done regardless of the mb_no_coeff_skip value. prob_skip_false is needed just when mb_no_coeff_skip is 1. No need to keep count of both skip_false and skip_true as they are complementary (skip_true+skip_false = total_mbs) Change-Id: I3c74c9a0ee37bec10de7bb796e408f3e77006813	2012-02-17 14:27:40 +02:00
Paul Wilkins	5e1b5bff7d	Merge "Code simplification" into experimental	2012-02-17 11:04:32 +00:00
Attila Nagy	565d0e6feb	Remove redundant init of segment_counts in vp8_encode_frame segment_counts was zero init twice in the beginning of vp8_encode_frame. Change-Id: Ibc29f6896dabd9aab1d0993f3941cf6876022e70	2012-02-17 09:51:24 +02:00
Johann	6b151d436d	Clarify 'max_sad' usage Depending on implementation the optimized SAD functions may return early when the calculated SAD exceeds max_sad. Change-Id: I05ce5b2d34e6d45fb3ec2a450aa99c4f3343bf3a	2012-02-16 15:17:44 -08:00
Yaowu Xu	47d545f166	a bit code clean-up Removed some transform code that is not in use. Change-Id: I9489af7e23d9d7fe052feb6c8bbafa62ebbda39c	2012-02-16 15:15:06 -08:00
Yaowu Xu	b92a96d8ad	Reworked context conversion between 8x8 and 4x4 The commit rationized and simplified the entropy context conversion betwen MB using 8x8 transform and MB using 4x4 transform. The old version had a number of weirdness in how 4x4 transform MB's context is used for 8x8 blocks other than the first 8x8 within a MB. Test showed the change has a gain ~.1% for avg psnr, glb psnr and ssim on the limited HD set. Change-Id: I774536c416baa6845aa741f956d8a69fa40e5d47	2012-02-16 15:00:10 -08:00
John Koleszar	e8223bd250	decoder: reset segmentation map on keyframes Refactoring some of the mode decoding logic introduced a bug where the segmentation maps would not be properly reset on keyframes. http://code.google.com/p/webm/issues/detail?id=378 The text of the bug is somewhat misleading as I initially read it to imply the bug was present in v0.9.7-p1 (Cayuga), but note the text "master", which indicates this was something subsequent. This issue bisects back to v0.9.7-p1-84-ga99c20c, so unfortunately it was broken during the Duclair release. Thanks to Alexei Leonenko for investigating the root cause. Change-Id: I9713c9f070eb37b31b3b029d9ef96be9b6ea2def	2012-02-16 12:22:18 -08:00
Makoto Kato	7989bb7fe7	Support Android x86 NDK build On Android NDK, rand() is inlined function. But, on our SSE optimization, we need symbol for rand() Change-Id: I42ab00e3255208ba95d7f9b9a8a3605ff58da8e1	2012-02-16 12:03:30 -08:00
Deb Mukherjee	6b86208dae	Removing a stray CONFIG_DUALPRED, and a INTERP_EXTEND fix. Change-Id: I7549e424ca6846b07a796f2b9cd4e9d4e550ca9b	2012-02-16 10:40:39 -08:00
Scott LaVarnway	6776bd62b5	Simplify mb_to_x_edge calculation during mode decoding Change-Id: Ibcb35c32bf24c1d241090e24c5e2320e4d3ba901	2012-02-16 13:36:46 -05:00
Scott LaVarnway	a5879f7c81	Merge "decodemv cleanup/improvements"	2012-02-16 09:33:59 -08:00
Paul Wilkins	79d330d7d5	Code simplification Removal of the pickinter.c and .h files and calls to this code. Removal of some code relating to real time and one pass settings though there is more to be done in this regard. However, vp8_set_speed_features() now only supports modes 0 and 1 and speeds up to 3 so rd should always be set. Change-Id: I62c0c1b6154ab499785baef310536080e87bc4d8	2012-02-16 17:21:20 +00:00
Scott LaVarnway	12ee845ee7	decodemv cleanup/improvements Removed unnecessary variables, unrolled functions, eliminated unnecessary mv bounds checks and branches. Change-Id: I02d034c70cd97b65025d59dd67c695e1db529f0b	2012-02-16 11:38:33 -05:00
Deb Mukherjee	9493595533	Merge "Adjusting 8-tap filter + prelim edge pixel filter code." into experimental	2012-02-16 16:32:38 +00:00
Yaowu Xu	f90983e167	revised the rate distortion computation for UV this commit changed the UV r/d calculation in the mode decision process to properly account for the rate of 8x8 transform coefficients. Change-Id: I485f8f35f2b61db0b6539beb32e83481b1cf083b	2012-02-16 07:34:46 -08:00
Yaowu Xu	8b71f3e059	Merge "revised the rate distortion computation for UV" into experimental	2012-02-16 15:34:38 +00:00
Yaowu Xu	bc3dd313ef	Merge "optmized rounding for transforms" into experimental	2012-02-16 15:33:01 +00:00
Yaowu Xu	a96bf2038c	Merge "re-scaled 2nd order haar transform" into experimental	2012-02-16 15:32:33 +00:00
Yaowu Xu	a78a4b4551	Merge "moved scaling from dequantization to inverse transform for T8x8" into experimental	2012-02-16 15:32:17 +00:00
Yaowu Xu	efa9abd028	optmized rounding for transforms the changes are still temporary, the final transforms, especially inverse ones should take in account both accuracy, complexity, and sign-bias, which should be decided at a later time. Change-Id: I116b0c70b25f5ee324ae5713d4564f5d0aa27151	2012-02-16 07:03:57 -08:00
Yaowu Xu	62a78f0342	re-scaled 2nd order haar transform During the work of extend_qrange, we have rolled a factor of 2 from quantization/dequatnization into 2nd order walsh-hadamard transform. This commit does the same for the 2nd order haar transform. so they can share the same quantizaiton process as the 2nd order WHT. Change-Id: I734af4a20ea8149a01b5b1971a065092977dfe33	2012-02-16 07:03:56 -08:00
Yaowu Xu	454c7abc1a	moved scaling from dequantization to inverse transform for T8x8 Previously, the scaling related to extended quantize range happens in dequantization stage, which implies the coefficients form forward transform are in different scale(4x) from dequantization coefficients This worked fine when there was not distortion computation done based on 8x8 transform, but it completely wracked the distortion estimation based on transform coefficients and dequantized transform coefficients introduced in commit `f64725a00` for macroblocks using 8x8 transform. This commit fixed the issue by moving the scaling into the stage of inverse 8x8 transform. TODO: Test&Verify the transform/quantization pipeline accuracy. Change-Id: Iff77b36a965c2a6b247e59b9c59df93eba5d60e2	2012-02-16 07:03:55 -08:00
Attila Nagy	d02e74a073	Consolidate C version of token packing functions Replace inner loops of pack_mb_row_tokens_c and pack_tokens_into_partitions_c with a call to pack_tokens_c. Change-Id: I0341554fb154a14a5dadb63f8fc78010724c2c33	2012-02-16 14:11:28 +02:00
Attila Nagy	78071b3b97	Multithreaded encoder, late sync loopfilter Second shot at this... Sync with loopfilter thread as late as possible, usually just at the beginning of next frame encoding. This returns control to application faster and allows a better multicore scaling. When PSNR packets are generated the final filtered frame is needed imediatly so we cannot delay the sync. Same has to be done when internal frame is previewed. Change-Id: I64e110c8b224dd967faefffd9c93dd8dbad4a5b5	2012-02-16 12:26:39 +02:00
Deb Mukherjee	ef8ade138d	Adjusting 8-tap filter + prelim edge pixel filter code. Results with the new filter coefficients compared with the previous versions are here: http://www.corp.google.com/~debargha/vp8_results/enhinterp.html (derf) http://www.corp.google.com/~debargha/vp8_results/enhinterp_hd.html (HD) Overall, the derf set improves by 0.94% with 8-tap filters while the HD set improves by 0.58%. Patch 1: resolving merge conflict Change-Id: I09db0abdab7b08bb19f86d911de23d2123309748	2012-02-15 16:50:18 -08:00
Ronald S. Bultje	721711fb51	Remove dual prediction frame re-encoding loop. I'm basically not convinced that the concept works at all, let alone that this is the right place to do it. I think if we want something like this at all, I should integrate it with the main encoding loop and re-encode checks in onyx_if.c, and show that it has a significant benefit (which right now, it doesn't; removing this re-encode check actually increases all metrics by ~0.15%). Change-Id: I1b597385dc17f468384a994484fb24813389411f	2012-02-15 16:38:04 -08:00
Ronald S. Bultje	0930dde249	Fix overflows in dual prediction mode selection. Change-Id: I265ad46e01a307bca21e6223725e4055f5e08648	2012-02-15 15:57:49 -08:00
Paul Wilkins	46f9ad2ca5	Experimental code base simplification. Remove error concealment code. Change-Id: I882705174fbfea212e96f7f684e47a671dbe5c67	2012-02-15 16:08:47 +00:00
Yaowu Xu	d327dcf3aa	moved segment based LPF level selection under CONFIG_FEATUREUPDATES This commit moved segment based loop filter level selection into the experiment of CONFIG_FEATUREUPDATES. As previous commit noted, the segment based loop filter selection helps the compression by ~0.1% on cif set, the ongoing experiment CONFIG_FEATUREUPDATES made encoding updates of the segment based LPF level more efficient, hence, another .04% gain on cif set. The commit also fixed an issue previously where encoder/decoder may use different loop filter level for one of the segments. Change-Id: Ia978b14aae95bb107d561ba53a7a2bb6ff01faf3	2012-02-15 07:18:05 -08:00
Yaowu Xu	9b68ad0f30	added 8x8 based Rate estimation for dualpred case This commmit added logic for MB using dual-pred to compute rate estimation based on correct transform size. The section of code was previously located under #if CONFIG_DUALPRED, that was made to be working with T8x8 experiment at the same time. Change-Id: Iebc2518c03f11378b9c2e72905520f088b54d5c0	2012-02-14 09:23:21 +00:00
Paul Wilkins	9a8204d6ee	Simplification of experimental code base. Removed ~CONFIG_REALTIME_ONLY code. Change-Id: I5fafff29a08acd8928699f9ddce8744787024d8c	2012-02-14 09:03:56 +00:00
Jim Bankoski	af8f1928d1	vp8 - config_featureupdates Added a bit to signify that the feature changed since the last time we sent it, or not so that we don't need to send all the databits for every feature change. added config Change-Id: I8d3064ce90d4500bf0d5c6b87c664e46138dfcac	2012-02-13 12:31:12 -08:00
Yaowu Xu	2d1ead342c	Changed how coefficient probability table is updated Added a frame level flag to indicate if coef probabilities are updated at all for the frame. During the experimental work with 8x8 transform, it is discovered that even in the case of no probability is ever update, cost of transmitting "no update" for each of probabilities can run up to become a significant overhead cost. A single bit to indicate no-update for all coef probs is therefore helpful, which is also demonstrated by the test results: 1. On Cif set: http://www.corp.google.com/~yaowu/no_crawl/t8x8/cif_t8x8_updprob.html (avg psnr: .14%, glb psnr: .14% SSIM: .13%) 2. On HD set: http://www.corp.google.com/~yaowu/no_crawl/t8x8/HD_t8x8_updprob.html (avg psnr: .02% glb psnr: .01% SSIM: .02%) It should be noted that the gain on HD is smaller because the average bit rate is much higher in contrast to the overhead bit cost. Change-Id: I46db270e693ee8799fef34a14d8260868ce4cd16	2012-02-13 13:20:02 +00:00
Paul Wilkins	21108d800c	Fixed typo on #define name SE_LVL_EOB => SEG_LVL_EOB Change-Id: I6d10169878a709bc9b82f03e5d5903c629fa7679	2012-02-13 12:06:18 +00:00
John Koleszar	e6df50031e	Merge "support changing resolution with vpx_codec_enc_config_set"	2012-02-10 16:18:00 -08:00
Yaowu Xu	9ded6e375a	fixed an issue related to 2nd order size due to merge artifacts. For 8x8 transformed macroblock, the 2nd order transform is a 2x2 haar transform, here there is only 4 coefficients total. A previous merge changed these to 64, causing crashes when encoding with 8x8 transform enabled. (i.e. when input video image size > 640x360 ) This commit reverts them back to 4 and fixes the crashes. Change-Id: I3290b81f8c0d32c7efec03093a61ea57736c0550	2012-02-10 11:49:22 -08:00
Johann	169823428f	Missed some variance casts Change-Id: I9fb510f9421fb3c317a8e32e3058cee977ddf9fa	2012-02-10 11:07:33 -08:00
Johann	12d45f62f6	Merge "max_sad check is not always implemented"	2012-02-10 10:28:00 -08:00
Paul Wilkins	2615ca5d41	Removal of threading code. For the experimental branch we are trying to slim the codebase down removing features such as threading for now which complicate the process of development and testing. Change-Id: I657c0246aef4d1fa8c8ffc6a1adfeee45bce8e24	2012-02-10 16:23:59 +00:00
Ronald S. Bultje	f64725a009	Improved coding using 8x8 transform In summary, this commit encompasses a series of changes in attempt to improve the 8x8 transform based coding to help overall compression quality, please refer to the detailed commit history below for what are the rationale underly the series of changes: a. A frame level flag to indicate if 8x8 transform is used at all. b. 8x8 transform is not used for key frames and small image size. c. On inter coded frame, macroblocks using modes B_PRED, SPLIT_MV and I8X8_PRED are forced to using 4x4 transform based coding, the rest uses 8x8 transform based coding. d. Encoder and decoder has the same assumption on the relationship between prediction modes and transform size, therefore no signaling is encoded in bitstream. e. Mode decision process now calculate the rate and distortion scores using their respective transforms. Overall test results: 1. HD set http://www.corp.google.com/~yaowu/no_crawl/t8x8/HD_t8x8_20120206.html (avg psnr: 3.09% glb psnr: 3.22%, ssim: 3.90%) 2. Cif set: http://www.corp.google.com/~yaowu/no_crawl/t8x8/cif_t8x8_20120206.html (avg psnr: -0.03%, glb psnr: -0.02%, ssim: -0.04%) It should be noted here, as 8x8 transform coding itself is disabled for cif size clips, the 0.03% loss is purely from the 1 bit/frame flag overhead on if 8x8 transform is used or not for the frame. ---patch history for future reference--- Patch 1: this commit tries to select transform size based on macroblock prediction mode. If the size of a prediction mode is 16x16, then the macroblock is forced to use 8x8 transform. If the prediction mode is B_PRED, SPLITMV or I8X8_PRED, then the macroblock is forced to use 4x4 transform. Tests on the following HD clips showed mixed results: (all hd clips only used first 100 frames in the test) http://www.corp.google.com/~yaowu/no_crawl/t8x8/hdmodebased8x8.html http://www.corp.google.com/~yaowu/no_crawl/t8x8/hdmodebased8x8_log.html while the results are mixed and overall negative, it is interesting to see 8x8 helped a few of the clips. Patch 2: this patch tries to hard-wire selection of transform size based on prediction modes without using segmentation to signal the transform size. encoder and decoder both takes the same assumption that all macroblocks use 8x8 transform except when prediciton mode is B_PRED, I8X8_PRED or SPLITMV. Test results are as follows: http://www.corp.google.com/~yaowu/no_crawl/t8x8/cifmodebase8x8_0125.html http://www.corp.google.com/~yaowu/no_crawl/t8x8/hdmodebased8x8_0125log.html Interestingly, by removing the overhead or coding the segmentation, the results on this limited HD set have turn positive on average. Patch 3: this patch disabled the usage of 8x8 transform on key frames, and kept the logic from patch 2 for inter frames only. test results on HD set turned decidedly positive with 8x8 transform enabled on inter frame with 16x16 prediction modes: (avg psnr: .81% glb psnr: .82 ssim: .55%) http://www.corp.google.com/~yaowu/no_crawl/t8x8/hdintermode8x8_0125.html results on cif set still negative overall Patch 4: continued from last patch, but now in mode decision process, the rate and distortion estimates are computed based on 8x8 transform results for MBs with modes associated with 8x8 transform. This patch also fixed a problem related to segment based eob coding when 8x8 transform is used. The patch significantly improved the results on HD clips: http://www.corp.google.com/~yaowu/no_crawl/t8x8/hd8x8RDintermode.html (avg psnr: 2.70% glb psnr: 2.76% ssim: 3.34%) results on cif also improved, though they are still negative compared to baseline that uses 4x4 transform only: http://www.corp.google.com/~yaowu/no_crawl/t8x8/cif8x8RDintermode.html (avg psnr: -.78% glb psnr: -.86% ssim: -.19%) Patch 5: This patch does 3 things: a. a bunch of decoder bug fixes, encodings and decodings were verified to have matched recon buffer on a number of encodes on cif size mobile and hd version of _pedestrian. b. the patch further improved the rate distortion calculation of MBS that use 8x8 transform. This provided some further gain on compression. c. the patch also got the experimental work SEG_LVL_EOB to work with 8x8 transformed macroblock, test results indicates it improves the cif set but hurt the HD set slightly. Tests results on HD clips: http://www.corp.google.com/~yaowu/no_crawl/t8x8/HD_t8x8_20120201.html (avg psnr: 3.19% glb psnr: 3.30% ssim: 3.93%) Test results on cif clips: http://www.corp.google.com/~yaowu/no_crawl/t8x8/cif_t8x8_20120201.html (avg psnr: -.47% glb psnr: -.51% ssim: +.28%) Patch 6: Added a frame level flag to indicate if 8x8 transform is allowed at all. temporarily the decision is based on frame size, can be optimized later one. This get the cif results to basically unchanged, with one bit per frame overhead on both cif and hd clips. Patch 8: Rebase and Merge to head by PGW. Fixed some suspect 4s that look like hey should be 64s in regard to segmented EOB. Perhaps #defines would be bette. Bulit and tested without T8x8 enabled and produces unchanged output. Patch 9: Corrected misalligned code/decode of "txfm_mode" bit. Limited testing for correct encode and decode with T8x8 configured on derf clips. Change-Id: I156e1405d25f81579d579dff8ab9af53944ec49c	2012-02-10 14:23:27 +00:00
Ronald S. Bultje	e3ca23a361	Reindent some code after merging the dualpred experiment. Change-Id: Idb328dd29ebcd360e39886abe48694f90f2e1140	2012-02-09 16:29:22 -08:00
Ronald S. Bultje	29e4d7e861	Merge dualpred (compound prediction) experiment. Change-Id: Ieaaa07c50eae41118596197f6a4d848135946e41	2012-02-09 16:29:18 -08:00
Johann	8c50a70a95	max_sad check is not always implemented As an optimization some architectures use the max_sad argument to break out early from the SAD. Pass in INT_MAX instead of 0 to prevent this. Change-Id: I653c476834b97771578d63f231233d445388629d	2012-02-09 16:19:10 -08:00
Scott LaVarnway	768ae275dc	x_motion_minq table reduction Reduced by 4080 bytes. Change-Id: I037b55bc9684bf4a54bce238be00e8c4db3f643e	2012-02-09 16:49:34 -05:00
Johann	fea3556e20	Fix variance overflow In the variance calculations the difference is summed and later squared. When the sum exceeds sqrt(2^31) the value is treated as a negative when it is shifted which gives incorrect results. To fix this we cast the result of the multiplication as unsigned. The alternative fix is to shift sum down by 4 before multiplying. However that will reduce precision. For 16x16 blocks the maximum sum is 65280 and sqrt(2^31) is 46340 (and change). PPC change is untested. Change-Id: I1bad27ea0720067def6d71a6da5f789508cec265	2012-02-09 12:38:31 -08:00
Paul Wilkins	d90f0eb4c5	Removal of SEGFEATURES placeholder comments This commit only involves the removal of placeholder comments //#if CONFIG_SEGFEATURES. Change-Id: I94b350daaf998ee0cfdde5aa25b1d3b0522ab816	2012-02-09 17:25:05 +00:00
Paul Wilkins	3e9890a394	Merge Extended Q experiment. Merge the extended Q experiment as indicated by the Change-Id: I02d9e654fff9998cc7e9e2f1f5cd838dad8fb431	2012-02-09 17:22:34 +00:00
Paul Wilkins	cf8af867dd	Merge COMPRED Merged in most of the current common prediction changes that were under the #if CONFIG_COMPRED option. Change-Id: If4e6f61dbe7b86dd449f6effbe93b5eb7e893885	2012-02-09 16:10:46 +00:00
Paul Wilkins	8266abfe96	Dual pred flag Further changes to make experiments with the context used for coding the dual pred flag easier. Current best performing method tested on derf is a two element context based on reference frame. I also tried various combinations of mode and reference frame as shown in commented out case using up to 6 contexts. Derf +0.26 overall psnr +0.15% ssim vs original method. Change-Id: I64c21ddec0abbb27feaaeaa1da2e9f164ebaca03	2012-02-09 15:44:18 +00:00
Paul Wilkins	59a200f1ea	Changes to coding of dual_pred flag. Further use of common prediction functions and experiments with alternate contexts based on mode and reference frame. For the Derf set using reference frame as basis of context gives +0.18% Overall Psnr and +0.08 SSIM Change-Id: Ie7eb76f329f74c9c698614f01ece31de0b6bfc9e	2012-02-09 15:27:20 +00:00
Ronald S. Bultje	915f13bd59	Fix dual prediction recode loop. We should only change the dual prediction mode if we actually entered the recode branch. Else, it may potentially undo beneficial changes to the dual prediction mode in the first encode iteration. Change-Id: I79fc53e5fd0bb551092ed422c797619f1566f002	2012-02-08 14:55:46 -08:00
Ronald S. Bultje	3adcbe2f15	Remove write-only variable "mbs_dual_count". Change-Id: Icf7a6749ca2f8ad6a032f86c34540d1c5880cf68	2012-02-08 14:18:02 -08:00
John Koleszar	2e0d55314c	Merge "Add OS/2 supports"	2012-02-08 11:00:55 -08:00
KO Myung-Hun	2dad8d65d9	Add OS/2 supports Change-Id: I792d5236451905eb20a8ebe444ef5b2274e4f7a4	2012-02-08 09:44:42 -08:00
Ronald S. Bultje	c8ec59d858	Fix dual prediction recode loop. Some conditions were conditional under a threshold, whereas they should always execute. Also, some conditions were testing an array instead of the values within it. Change-Id: Ia6892945cfbbe07322e6af6be42cd864bf9479c1	2012-02-08 10:09:02 +00:00
John Koleszar	51acb01167	support changing resolution with vpx_codec_enc_config_set Allow the application to change the frame size during encoding. This is only supported when not using lagged compress. Change-Id: I89b585d703d5fd728a9e3dedf997f1b595d0db0f	2012-02-07 17:09:40 -08:00
John Koleszar	417b852967	Align internal mfqe framebuffer dimensions MFQE postproc crashed with stream dimensions not a multiple of 16. The buffer was memset unconditionally, so if the buffer allocation fails we end up trying to write to NULL. This patch traps an allocation failure with vpx_internal_error(), and aligns the buffer dimensions to what vp8_yv12_alloc_frame_buffer() expects. Change-Id: I3915d597cd66886a24f4ef39752751ebe6425066	2012-02-07 10:40:26 -08:00
Yunqing Wang	a040eb37e4	Merge "Allow to skip highest-resolution encoding in multi-resolution encoder"	2012-02-06 13:58:11 -08:00
Paul Wilkins	e1050bd3dc	Move update of ref frame probabilities in encode loop. The existing code updated the reference frame probabilities before the test to evaluate the impact of using updated probabilities in vp8_estimate_entropy_savings(). The estimate of cost and savings is still basic and does not reflect the new prediction code but this would require per MB costings and the benefit is probably marginal, as this is really just used for rate estimation in the loop. Change-Id: Id6ba88ae6e11c273b3159deff70980363ccd8ea1	2012-02-06 16:42:00 +00:00
Paul Wilkins	9c9300f56f	Merged NEWNEAR experiment This commit merges the NEWNEAR experiment such that it is effectively always on. The fact that there were changes in the threading code again highlights the need to strip out such features during the bitstream development phase as trying to maintain this code (especially as it is not being tested) slows the development cycle. Change-Id: I8b34950a1333231ced9928aa11cd6d6459984b65	2012-02-06 16:40:57 +00:00
Paul Wilkins	82b865da94	Coding the hybrid dual prediction signal. Initial modifications to make limited use of common prediction functions. The only functional change thus far is that updates to the probabilities are no longer "damped". This was a testing convenience but in fact seems to help by a little over 0.1% over the derf set. Change-Id: I8b82907d9d6b6a4a075728b60b31ce93392a5f2e	2012-02-06 16:38:41 +00:00
Paul Wilkins	c98e9d2882	Moved prob_dualpred to common. Moved the prob_dualpred[] sturcture to common. Created common prediction entry for Dual flag. Change-Id: I9ac3d128bae6114f09e5c18216d4b95cf36453d5	2012-02-06 16:37:11 +00:00
Paul Wilkins	58ec6fe8c3	Modified prediction behavior for reference frame. Trial of a modified prediction function that ranks each possible reference frame based on a combination of local usage and frame level probability. The code is a bit cleaner and simpler. In direct comparison with old unpredicted method with segment level coding turned off for mode,ref & EOB the prediction gives a gain on derf of around 0.4%. There is some further gain from bug fixes over earlier code. With segment coding on the prediction method is slightly -ve on some very easy clips (at low rates) due to slightly higher overheads, but better on harder clips. Overall neutral on derf in direct comparison on latest code base, but compared to earlier code without bug fixes about +0.7% overall psnr +0.3% SSIM. Change-Id: I5b8474658b208134d352d24f6517f25795490789	2012-02-06 16:34:41 +00:00
Yunqing Wang	fa1a9290e6	Allow to skip highest-resolution encoding in multi-resolution encoder Sometimes, a user doesn't have enough bandwidth to send high-resolution (i.e. HD) video even though the camera catches HD video. This change allowed users to skip highest-resolution encoding by setting that level's target bit rate to 0. To test it, modify the following line in vp8_multi_resolution_encoder.c. unsigned int target_bitrate[NUM_ENCODERS]={1400, 500, 100}; To skip the highest-resolution level, change it to unsigned int target_bitrate[NUM_ENCODERS]={0, 500, 100}; To skip the first and second highest resolution levels, change it to unsigned int target_bitrate[NUM_ENCODERS]={0, 0, 100}; This change also fixed a small problem in mapping, which slightly helped quality and performance. Change-Id: I977bae9a9fbfba85c8be4bd5af01539f2b84bc81	2012-02-03 13:39:05 -05:00
Paul Wilkins	f0459549a6	Reference frame prediction: Extended prediction and coding of reference frame where a subset of options are flagged as available at the segment level. Updated copyright notices. Switch to SAD in mbgraph code as SATD problematic for the foreground and background separation as it can ignore large DC shifts. Change-Id: I661dbbb2f94f3ec0f96bb928c1655e5e415a7de1	2012-02-03 12:44:45 +00:00
Scott LaVarnway	d8ebdcd89d	Moved ref_frame_cost from MACROBLOCKD to MACROBLOCK Change-Id: I05788522e9cde4322cfb12032483bdbf184bdf0b	2012-02-02 13:40:08 -05:00
Scott LaVarnway	11c706488b	Removed frames_till_alt_ref_frame from MACROBLOCKD Change-Id: Ieb05270ac332a4cc38ec4b7b995fc0150e0fffdf	2012-02-02 13:34:13 -05:00
Adrian Grange	5d0b5a17d9	Added encoding in Superblock Order As a precursor to encoding 32x32 blocks this cl adds the ability to encode the frame superblock (=32x32 block) at a time. Within a SB the 4 indiviual MBs are encoded in raster-order (NW,NE,SW,SE). This functionality is added as an experiment which can be enabled by ispecifying --enable-superblocks in the command line specified to configure (CONFIG_SUPERBLOCKS macro in the code). To make this work I had to disable the two intra prediction modes that use data from the top-right of the MB. On the tests that I have run the results produce almost exactly the same PSNRs & SSIMs with a very slightly higher average data rate (and slightly higher data rate than just disabling the two intra modes in the original code). NOTE: This will also break the multi-threaded code. This replaces the abandoned change: Iebebe0d1a50ce8c15c79862c537b765a2f67e162 Change-Id: I1bc1a00f236abc1a373c7210d756e25f970fcad8	2012-02-02 10:30:57 -08:00
Scott LaVarnway	e2000cc5ca	Removed frames_since_golden from MACROBLOCKD Change-Id: I10efa441d663fceb6bc97a3bfad518cd3d9a5128	2012-02-02 13:28:41 -05:00
Paul Wilkins	92ffb17cc1	Comment out segref segmentation filter changes. Commented out changes from earlier checking: "Change Iab7f1eff: vpnext use segref segmentation filter" Which in its current state breaks the decoder. Change-Id: I9185098aeda8ce65310f338c4c9375f4a39005d3	2012-02-02 12:43:53 +00:00
Yaowu Xu	27cca3dd94	Fixes a decoder bug The bug was introduced by the commit that added I8X8 intra prediction mode for inter frames, the decoder was not update to accept the additional probability update from encoder. This causes the decoder typicall to crash when encoder sends intra mode probability update. Change-Id: Ib7dc42dc77a51178aa9ece41e081829818a25016	2012-02-01 14:56:14 -08:00
Author: John Koleszar	d24de592a6	Import another decoder bug fix from public stable branch Please see the following for details: https://gerrit.chromium.org/gerrit/#change,10925 Change-Id: Ie692261c255c58d7762df22eeca566a7d13adcba	2012-02-01 14:56:12 -08:00
Scott LaVarnway	caed92d0e5	Import a decoder bug fix from public stable branch Please see the following public commit for details: https://gerrit.chromium.org/gerrit/#change,7608 Change-Id: I589eed0b6078e2c5c9c74e942886e503bd02b273	2012-02-01 14:56:11 -08:00
Adrian Grange	3ff8c7d968	Correctly capped minqtarget to maxq This line of code incorrectly set maxq = maxq rather than capping minqtarget. Change-Id: Ifbc86df8b0ff2779e7b2a5f7349724d04a18bd62	2012-01-31 12:58:34 -08:00
Scott LaVarnway	07c6eb18ad	Merge "Improved uv mv calculations in build inter predictor"	2012-01-31 10:43:49 -08:00
Deb Mukherjee	b72ab88d75	Merge "Refining the 8-tap interpolation filters." into experimental	2012-01-31 16:40:41 +00:00
Scott LaVarnway	749bc98618	BLOCKD structure cleanup Removed redundancies. All of the information can be found in the MACROBLOCKD structure. Change-Id: I7556392c6f67b43bef2a5e9932180a737466ef93	2012-01-31 11:02:39 -05:00
Paul Wilkins	1335ac3071	Implementation of new prediction model for reference frame coding. This check in uses the common prediction interface functions to code reference frame. Some updates made regarding the impact of the new code in rd loop but there remain TODOs in this regard. Change-Id: I9da3ed5dfdaa489e0903ab33258b0767a585567f	2012-01-31 12:54:05 +00:00
Paul Wilkins	56904be19d	Use common prediction interface for segment coding. This does not change any functionality just modifies the code to use the common prediction module interface for coding the segment data. Change-Id: Ifd43e9153573365619774a4f5572215e44fb5aa3	2012-01-31 12:53:49 +00:00
Paul Wilkins	b2f64dff7d	Added common prediction modules. This function adds the common prediction modules, some data structures and a config option but does not use them. It also corrects a bug in clearing down the MODE_INFO border and introduces a new element that indicates if an entry corresponds to an "in image" macro block or is part of the border. Change-Id: Ib69eec0876173ebe9d1de9df9537d0b2447702e0	2012-01-31 12:53:36 +00:00
Paul Wilkins	048e4fd524	Moved some reference frame data structures into common. Encoder side changes Change-Id: I8921800e4fccec5e5a9e4755b80cbd472623107b	2012-01-31 12:52:51 +00:00
Paul Wilkins	5f0f260f0c	Moved some reference frame data structures into common. In this commit only the decoder side was updated. Change-Id: Ia9bd58da07d1a943f028e330f0489344e62b0d02	2012-01-31 12:52:33 +00:00
Paul Wilkins	fe96afa705	Moved some segmentation data structures. Moved some segmentation data structures into VP8_COMMON Change-Id: I59c6e2edf7a0176e35319936eea450027aeb3b39	2012-01-31 12:51:57 +00:00
John Koleszar	57d459ba82	RTCD: remove unimplemented vp8_short_walsh4x4_mmx This function does not exist. Change-Id: I84b72fb17d572d5cccee92220467b84c15842d4d	2012-01-30 12:55:45 -08:00
John Koleszar	8aae246089	RTCD: finalize removal of old RTCD system This is the final commit in the series converting to the new RTCD system. It removes the encoder csystemdependent files and the remaining global function pointers that didn't conform to the old RTCD system. Change-Id: I9649706f1bb89f0cbf431ab0e3e7552d37be4d8e	2012-01-30 12:10:48 -08:00
John Koleszar	109b69a706	RTCD: add arnr functions This commit continues the process of converting to the new RTCD system. It removes the last of the VP8_ENCODER_RTCD struct references. Change-Id: I2a44f52d7cccf5177e1ca98a028ead570d045395	2012-01-30 12:10:48 -08:00
John Koleszar	0b0bc8d098	RTCD: add motion search functions This commit continues the process of converting to the new RTCD system. Change-Id: Ia5828b7ecc80db55b21916704aa3d54cbb98f625	2012-01-30 12:10:47 -08:00
John Koleszar	be8af188d0	RTCD: add block subtraction functions This commit continues the process of converting to the new RTCD system. Change-Id: Id8a287fdd4bd050ea4452e1582ad85520f3081be	2012-01-30 12:10:47 -08:00
John Koleszar	61311e6103	RTCD: add quantizer functions This commit continues the process of converting to the new RTCD system. Change-Id: Iba9df4c03a508e51c37201c621be43523fae87d9	2012-01-30 12:10:46 -08:00
John Koleszar	510e0ab467	RTCD: add FDCT functions This commit continues the process of converting to the new RTCD system. Change-Id: I3f9c07db65eb206f6363d21bdb80e871570da767	2012-01-30 12:10:42 -08:00
John Koleszar	83a91e789c	RTCD: add variance functions This commit continues the process of converting to the new RTCD system. Change-Id: Ie5c1aa480637e98dc3918fb562ff45c37a66c538	2012-01-30 12:08:30 -08:00
John Koleszar	f103dcefaf	RTCD: add subpixel functions This commit continues the process of converting to the new RTCD system. Change-Id: I6c519ab61e4f4e0ebcc796f2df061f945c48cefe	2012-01-30 12:08:29 -08:00
John Koleszar	2a8f57f50d	RTCD: add postproc functions This commit continues the process of converting to the new RTCD system. Change-Id: If54eb5cb5d1b0cac6c4c0633a9e99c93ca860ba2	2012-01-30 12:08:29 -08:00
John Koleszar	fdb61a4531	RTCD: add recon functions This commit continues the process of converting to the new RTCD system. Change-Id: I9bfcf9bef65c3d4ba0fb9a3e1532bad1463a10d6	2012-01-30 12:08:28 -08:00
John Koleszar	ab77b4e898	RTCD: add remaining IDCT functions This commit continues the process of converting to the new RTCD system. Change-Id: I03c4dbf30dfd3558b0e256ff9d3ff4c012aadc80	2012-01-30 12:08:22 -08:00
John Koleszar	55f74c59c7	RTCD: add loopfilter functions This commit continues the process of converting to the new RTCD system. Change-Id: Ic8a4047d72ff3a54ec98977dd90e70c13213db71	2012-01-30 12:06:31 -08:00
John Koleszar	a910049aea	New RTCD implementation This is a proof of concept RTCD implementation to replace the current system of nested includes, prototypes, INVOKE macros, etc. Currently only the decoder specific functions are implemented in the new system. Additional functions will be added in subsequent commits. Overview: RTCD "functions" are implemented as either a global function pointer or a macro (when only one eligible specialization available). Functions which have RTCD specializations are listed using a simple DSL identifying the function's base name, its prototype, and the architecture extensions that specializations are available for. Advantages over the old system: - No INVOKE macros. A call to an RTCD function looks like an ordinary function call. - No need to pass vtables around. - If there is only one eligible function to call, the function is called directly, rather than indirecting through a function pointer. - Supports the notion of "required" extensions, so in combination with the above, on x86_64 if the best function available is sse2 or lower it will be called directly, since all x86_64 platforms implement sse2. - Elides all references to functions which will never be called, which could reduce binary size. For example if sse2 is required and there are both mmx and sse2 implementations of a certain function, the code will have no link time references to the mmx code. - Significantly easier to add a new function, just one file to edit. Disadvantages: - Requires global writable data (though this is not a new requirement) - 1 new generated source file. Change-Id: Iae6edab65315f79c168485c96872641c5aa09d55	2012-01-30 12:06:27 -08:00
Deb Mukherjee	57d521a095	Refining the 8-tap interpolation filters. Fixes a rounding issue with the 8-tap filters, which allows us to use sharper filters that before with a little better performance. Results on derf (0verall gain 0.76): http://www.corp.google.com/~debargha/vp8_results/enhinterp.html Results on a 720P set (Overall gain 0.61): http://www.corp.google.com/~debargha/vp8_results/enhinterp_hd.html Change-Id: I4cd7bdf3583db974dc5589fa64857bc31ac861fa	2012-01-30 09:41:26 -08:00
John Koleszar	9951f46133	Merge "Hook up VP8D_GET_LAST_REF_USED"	2012-01-27 11:31:35 -08:00
John Koleszar	8be41bba80	Hook up VP8D_GET_LAST_REF_USED Commit `892e23a5b` introduced support for the VP8D_GET_LAST_REF_USED, but missed the mapping of the control id to the underlying function, so it was unavailable to applications. In addition, the underlying function vp8_references_buffer() is moved from common/postproc.c to decoder/onyxd_if.c as postproc.c is not built in all configurations. Change-Id: I426dd254e7e6c4c061b70d729b69a6c384ebbe44	2012-01-27 10:13:20 -08:00
Jim Bankoski	a127c940c0	vpnext use segref segmentation filter Goes through set of ref frames used by each macroblock and sets seg_lvl_ref_frame flags accordingly.. http://www.corp.google.com/~jimbankoski/no_crawl/segref.html Change-Id: Iab7f1effd75a839b34eb310d7168692c8f105411	2012-01-27 07:53:15 -08:00
Jim Bankoski	1a9fd54ecd	vpnext: pick loop filter segment by segment Picks a per segment loopfilter. Adapts the algorithm to search for a loopfilter value for each separate segment. Further todo fix the bias Improvements .06 % ov psnr, .11% ssim http://www.corp.google.com/~jimbankoski/no_crawl/segmentedpicklpf.html Change-Id: Ic6a571c16fcd6ec0139f4de1f8061f87c6515a10	2012-01-27 07:48:10 -08:00
Yaowu Xu	9a4dde1b36	Merge "fixed an issue with 8x8 token cost in trellisquant" into experimental	2012-01-27 15:11:18 +00:00
Yaowu Xu	81d16e3f53	fixed an issue with 8x8 token cost in trellisquant changed the token cost for 8x8 transformed macroblock used in trellisquant from those derived from 4x4 transform coefficient distribution to those derived from 8x8 transform coefficient distribution. Test results show this fix help 8x8 transform based compression consistently on cif and hd sets: http://www.corp.google.com/~yaowu/no_crawl/t8x8/cif_cost8x8only.html (avg psnr:.14% glb psnr: .17% ssim: .20%) http://www.corp.google.com/~yaowu/no_crawl/t8x8/hd_cost8x8only.html (avg psnr:.17% glb psnr: .18% ssim: .58%) Note: To test the effect of this change, 8x8 transform was forced to be used only on 16x16 predicted macroblocks on inter frames, the effect would be bigger had all macroblocks been forcd to use 8x8 transform. Change-Id: If9b7868b75357c66541f511e5ee78e4d2d4929a4	2012-01-26 14:50:11 -08:00
John Koleszar	319f7c4d56	Merge changes I17e1a348,Iad710941 * changes: Correct clamping in use of vp8_find_near_mvs() Revert "Multithreaded encoder, late sync loopfilter"	2012-01-26 14:33:28 -08:00
Deb Mukherjee	6fa47a5f16	Adds support for enhanced interpolation for subpel motion using an 8-tap filter. The results with 3 different 8-tap filters on the derf set are in: http://www.corp.google.com/~debargha/vp8_results/enhinterp.html The one that gives the most gain achieves an overall gain of about 0.6%. The results for a set of 12 hd (720p) videos are in: http://www.corp.google.com/~debargha/vp8_results/enhinterp_hd.html with max gain of 0.55% with the same filter. The best filter apparently achieves the best trade-off between pass band ripple and stop band attenuation. Change-Id: I919e28ae245c0493147fa0864f8c9d048a9dd530	2012-01-26 10:24:47 -08:00
John Koleszar	83cef816fd	Correct clamping in use of vp8_find_near_mvs() Commit `e06c242ba` introduced a change to call vp8_find_near_mvs() only once instead of once per reference frame by observing that the only effect that the frame had was on the bias applied to the motion vector. By keeping track of the sign_bias value, the mv to use could be flip-flopped by multiplying its components by -1. This behavior was subtley wrong in the case when clamping was applied to the motion vectors found by vp8_find_near_mvs(). A motion vector could be in-bounds with one sign bias, but out of bounds after inverting the sign, or vice versa. The clamping must match that done by the decoder. This change modifies vp8_find_near_mvs() to remove the clamping from that function. The vp8_pick_inter_mode() and vp8_rd_pick_inter_mode() functions instead track the correctly clamped values for both bias values, switching between them by simple assignment. The common clamping and inversion code is in vp8_find_near_mvs_bias() Change-Id: I17e1a348d1643497eca0be232e2fbe2acf8478e1	2012-01-26 09:37:27 -08:00
Attila Nagy	294aa37745	Rename save_neon_reg.asm as save_reg_neon.asm Easier to filter out all NEON asm. Change-Id: I0022dae8321a9608e864b09d4181414c5fff4610	2012-01-26 09:44:00 +02:00
John Koleszar	630d3b95e2	Revert "Multithreaded encoder, late sync loopfilter" This commit is incomplete, as it does not synchronize the loop filter before returning a handle to the reconstructed frame in vpx_codec_get_preview_frame(), which can cause (false?) failures when running the test_reconstruct_buffer test. This may be related to a bug that does cause visible artifacts, which is also under investigation. This reverts commit `380d64ecb1`. Change-Id: Iad710941e7731d44fc2bde63bc63d6763cc4629e	2012-01-24 15:41:59 -08:00
Yaowu Xu	5a5d24eed6	Merge "changed loop filter for MBs using 8x8 transform" into experimental	2012-01-24 21:45:16 +00:00
Jim Bankoski	91325b8fe7	vpn common -> implicit segmentation This introduces base functions for introducing implicit segmentation. The code that actually stores the results to the segment map isn't here yet. This just prints out the segmentation map results if you call it. Uses connected component labeling technique on mbmi info so that only if 2 mbs are horizontally or vertically touching do they get the same segment. vp8next - plumbing for rotation code to produce taps for rotation ( tapify. py ), code for predicting using rotation ( predict_rotated.c ) , code for finding the best rotation find_rotation.c. didn't checkin code that uses this in the codec. still work in progress. Fixed copyright notice Change-Id: I450c13cfa41ab2fcb699f3897760370b4935fdf8	2012-01-24 11:20:13 -08:00
Fritz Koenig	c14754be1d	Merge "Disconnect ARM tgt_isa from dsp extensions"	2012-01-23 11:13:33 -08:00
Scott LaVarnway	8a6af9f98f	Improved uv mv calculations in build inter predictor Changed calculations to use shifts instead of if-then-else. Eliminates branches. Change-Id: I11b75e8bb305301ffd9cb577fb7df059a3cf9ea4	2012-01-23 11:34:43 -05:00
Fritz Koenig	892102842a	Disconnect ARM tgt_isa from dsp extensions A processor with ARMv7 instructions does not necessarily have NEON dsp extensions. This CL has the added side effect of allowing the ability to enable/disable the dsp extensions cleanly. Change-Id: Ie1e879b8fe131885bc3d4138a0acc9ffe73a36df	2012-01-20 10:38:15 -08:00
Yaowu Xu	aebb16bfa8	changed loop filter for MBs using 8x8 transform This commit added a set of loop filter functions for macroblocks using 8x8 transform. First we turned off the regular loop filtering on 4x4 block boundaries that do not exist in macroblocks using 8x8 transform. Second, we change to use the same loop filter(mask and 7 tap filter) that used for macroblock edge filtering. Change-Id: I3a00460b7674ced116917d86812ffc32578c1d3a	2012-01-20 10:09:24 -08:00
Deb Mukherjee	f357e5e2f7	Merge "Overhauling the thresholds and mixing proportions for mfqe postprocessor."	2012-01-20 08:45:42 -08:00
Deb Mukherjee	47dcd769c1	Overhauling the thresholds and mixing proportions for mfqe postprocessor. Makes the thresholds for the multiframe quality enhancement module depend on the difference between the base quantizers. Also modifies the mixing function to weigh the current low quality frame less if the difference in quantizer is large. With the above modifications mfqe works well for both scalable patterns as well as low quality key frames. Change-Id: If24e94f63f3c292f939eea94f627e7ebfb27cb75	2012-01-20 05:11:00 -08:00
Yaowu Xu	5aab0c3fb7	Added code to prevent I8X8_PRED mode for MBs using 8x8 transform This fixed a conflict introduced by the change of adding 8x8 intra prediction modes. The 8x8 intra prediction mode code assumed the use of 4x4 transform, and causes encoder crashes when the codec is configured with --enable-t8x8. Change-Id: I00cc94df63e9725377ffba9eb51be6b77fe3fcf9	2012-01-19 17:09:40 -08:00
Yaowu Xu	be9af16e16	reverted an accidental code deleting commit `cf561bad` accidentally deleted a line of code that sets the base_qindex for each frame, which leads to every frame is encoded at Q of 0. Change-Id: Ib5f8022e856bf3b3bd0d4147405e46241e3dcf2d	2012-01-19 16:56:46 -08:00
Jeff Faust	ac97b089d1	Merge "Simplify an assignment statement"	2012-01-18 21:14:51 -08:00
John Koleszar	6a4ff6f325	Merge "get_plane_pointers: use u/v planes consistently"	2012-01-18 14:22:55 -08:00
John Koleszar	4753ee4166	get_plane_pointers: use u/v planes consistently The prior commit accidentally used the u plane where it should have used the v plane. Change-Id: Ib6c8443b99061536389f05ac25b8e0a307ace637	2012-01-18 12:50:06 -08:00
Jeff Faust	15c29afeca	Simplify an assignment statement Separated a double assignment that looked suspiciously like an assignment and equality typo. Change-Id: I7813979e9d7ea2539afb3c8ae6074f9df5ebdf52	2012-01-18 12:49:43 -08:00
Jim Bankoski	de368fd0b5	Merge "vp8d - valgrind warnings in mb post processor"	2012-01-18 11:12:37 -08:00
Yaowu Xu	df91ab7b0d	Merge "new loop filter functions for macroblock boundaries" into experimental	2012-01-18 17:54:19 +00:00
Yaowu Xu	5e7d7d3d95	new loop filter functions for macroblock boundaries The commit adds a new set of loop filter for macroblock edge filtering. The new loop filter has a mask to detect so-called "flat" regions. The detection checks 5 pixels of each side of an edge. If the all pixels have value with +/-1 from the edge pixel on the same side, the region is treated as a "flat" region. For such case, a 7 tap filter is used to change 3 pixel values on each side. The 7 taps are: [1, 1, 1, 2, 1, 1, 1]/8 The furthest away pixels used as input are +/-5 away from edge. For non-flat region, we fall back to old filtering. It should be noted here that the thresholds and filter taps may require more optimization for best possible results. Tests on a set of hd clips showed consistent gains: http://www.corp.google.com/~yaowu/no_crawl/mblpf_hd.html (avg psnr: .83% glb psnr: .77% ssim: .82%) Tests on derf set also showed consistent gains: http://www.corp.google.com/~yaowu/no_crawl/mblpf_derf.html (avg psnr: .24% glb psnr: .22% ssim: .48%) Change-Id: I0855b1ff48e79e1175c20b81967137e18b2af352	2012-01-18 09:51:29 -08:00
John Koleszar	0e06bc817a	Merge changes I1ebe76aa,Ia079b52b * changes: rdopt/pickinter: factor out some common setup rdopt: remove unused frame_lf_or_gf	2012-01-18 09:30:46 -08:00
Deb Mukherjee	d3879738bf	Merge "Modifying the base q propagation in the mfqe post processing filter in a way such that when there is a single bad frame, the post-processing is applied not only to just that frame but a few subsequent frames as well."	2012-01-18 09:15:24 -08:00
Paul Wilkins	bd5f384bef	Possible divide by 0 error. Put traps to prevent two possible divide by 0 errors. Change-Id: Ia415b945244253dcdd12f54f1f157f9ca8c94d6b	2012-01-18 11:10:51 +00:00
Jim Bankoski	ed208f7d2f	vp8d - valgrind warnings in mb post processor Solved by extending the border in the postproc buffer as necessary Change-Id: Ic3f61397fe5bc8e4db6fc78050b0b160bd0aee86	2012-01-17 17:27:39 -08:00
Paul Wilkins	cf561bad1d	Rate control on static scenes plus Y2dc delta Q fix. A problem can arise on static clips with force key frames where attempts to avoid popping lead to a progressive reduction in key frame Q that ultimately may lead to unexpected overspend against the rate target. The changes in this patch help to insure that in such clips the quality of the key frames across the clip is more uniform (rather than starting bad and getting better - especially at low target rates). This patch also includes a fix that removes a delta on the Y2DC when the baseline q index < 4 as this is no longer needed. There is also a fix to try and prevent repeat single step Q adjustment in the recode loop leading to lots of recodes, especially where the use of forced skips as part of segmentation has made the impact of Q on the number of bits generated much smaller. Patch 2: Amend "last_boosted_qindex" calculation for arf overlay frames. Change-Id: Ia1feeb79ed8ed014e4239994fcf5e58e68fd9459	2012-01-17 17:42:46 +00:00
Deb Mukherjee	90b9f993c1	Modifying the base q propagation in the mfqe post processing filter in a way such that when there is a single bad frame, the post-processing is applied not only to just that frame but a few subsequent frames as well. Change-Id: Iba5d9896eed77244eb76b4a74692a93f8ecff634	2012-01-17 09:28:03 -08:00
Adrian Grange	e479379abb	Fixed bugs in multi-layer code related to changing params When running multi-layer (ML) encodes and dynamically changing coding parameters on the fly (e.g. frame duration/rate, bandwidths allocated to each layer) the encoder would not produce sensible output. In certain cases the rate targeting would be hideously inaccurate. These fixes make it possible to change these coding parameters correctly and to maintain accurate control of the rate targeting. I also added the specification of the input timebase into the test program, vp8_scalable_patterns.c. Patch 2: Moved declaration to appease MS compiler) Change-Id: Ic8bb5a16daa924bb64974e740696e040d07ae363	2012-01-13 16:52:25 -08:00
Yaowu Xu	483b262bab	Merge "Added an emms to prevent invalid stats output" into experimental	2012-01-11 23:07:54 +00:00
John Koleszar	4ade079633	rdopt/pickinter: factor out some common setup Add new get_predictor_pointers() and get_reference_search_order() functions for code shared between the two implementations. Change-Id: I1ebe76aa8f168b1f5cfabc00d05d8f19a0d4d207	2012-01-11 14:43:52 -08:00
John Koleszar	bd5bfd94b8	rdopt: remove unused frame_lf_or_gf This flag was set but unused. Change-Id: Ia079b52b88ffbe3b16fdbde4b84e2b87304eaa13	2012-01-11 13:02:19 -08:00
Deb Mukherjee	9c2ca8c1ca	Allowing the mfqe post-processing filter to be used in conjunction with deblock or demacroblock filters. When --mfqe is used together with --demacroblock or --deblock, mfqe is applied first and then demacroblock/deblock is applied to the mfqe result. Change-Id: Id83ee01f1b4a33a116f071dcf26d59c7f3497c32	2012-01-10 14:14:41 -08:00
John Koleszar	e6c91b625e	Merge "fix: roundoff initializer is not a constant"	2012-01-10 13:33:26 -08:00
James Berry	6ce1f15dfb	fix: roundoff initializer is not a constant precision used in initialization of roundoff is not a constant updated to use #define MFQE_PRECISION 4 Change-Id: If2fc3d3d633d58a7f4ab34d258c232ec1e5f0a79	2012-01-10 14:19:30 -05:00
Jim Bankoski	892e23a5ba	vp8d - function to check if a reference frame is used. Change-Id: Id683b4d7f46ffa99145fc4b824c7232ab4182f21	2012-01-10 10:10:26 -08:00
Deb Mukherjee	28aa08748e	Merge "Multiframe quality enhancement postprocessing"	2012-01-09 10:23:24 -08:00
James Zern	80528410fc	Merge "Reduce the default kf_max_dist to 128."	2012-01-06 12:32:07 -08:00
John Koleszar	66da859e5e	Merge "Reduced the size of Y1Dequant and friends to [128][2]"	2012-01-06 11:59:06 -08:00
Ralph Giles	2a0d7b1a55	Reduce the default kf_max_dist to 128. The default maximum keyframe interval is 9999, or over five minutes at normal video rates. When the encoder produces streams with such a long interval seeking (with correct output) is more expensive, and live streaming is impossible. Of course the encoding application should set this parameter based on its knowledge of the intended use of the stream, but reducing the default gives better results for applications which do not. Change-Id: I900b15d74ce72ecc3ade4d43f758c5cf97a2098a	2012-01-06 11:34:45 -08:00
Scott LaVarnway	5f25d4c175	Reduced the size of Y1Dequant and friends to [128][2] This patch removes the local copies of the dequantize constants and implements John's idea as described in "Make a local copy of the dequantized data" commit. Change-Id: Ic6b7d681f00bf63263f71ff1e39ab2f80729e8b2	2012-01-06 11:12:00 -08:00
Yaowu Xu	a5ea68447f	Added an emms to prevent invalid stats output In certain hardware configuration, where mmx code is enabled and other simd (sse2/sse3) disabled, lacking of this emms caused invalid internal stats outputs. Change-Id: I77c61cf6e0448d3f3b8c11781aa9e42f31d231c9	2012-01-05 13:25:41 -08:00
Deb Mukherjee	87aa846b47	Multiframe quality enhancement postprocessing Adds a multiframe postprocessing module to enhance the quality of certain frames that are coded at lower quality than preceding frames. The module can be invoked from the commandline by use of the --mfqe option, and will be most beneficial for enhancing the quality of frames decoded using scalable patterns. Uses the vp8_variance_var16x16 and vp8_variance_sad16x16 function pointers to compute SAD and Variance of blocks. Change-Id: Id73d2a6e3572d07f9f8e36bbce00a4fc5ffd8961	2012-01-05 10:21:48 -08:00
Johann	0780f258da	Merge "Improve SSSE3 fast quantizer function"	2012-01-05 10:09:39 -08:00
Scott LaVarnway	b2c8dff727	Merge "Removed unused diff buffer"	2012-01-05 09:06:28 -08:00
Scott LaVarnway	89cdfdb231	Merge "SSE2 optimizations for vp8_build_intra_predictors_mby{,_s}()"	2012-01-05 09:05:19 -08:00
Scott LaVarnway	77119a5cd8	Merge "Improved sse2 version of simple loopfilter"	2012-01-04 13:26:13 -08:00
Scott LaVarnway	5bfa29b6c5	Merge "Make a local copy of the dequantized data"	2012-01-04 07:50:13 -08:00
Yunqing Wang	9f1083e9a0	Merge "Improve vp8cx_init_quantizer()"	2012-01-04 06:22:15 -08:00
Scott LaVarnway	33d9ea5471	Merge "Remove useless g_common.h"	2012-01-03 09:48:35 -08:00
Yunqing Wang	2b2c0c9bda	Improve SSSE3 fast quantizer function Simplified the EOB calculation in the function. Change-Id: I7422f18be40ae270358f5cb0811d66e64436b56f	2011-12-29 12:05:50 -05:00
John Koleszar	3cb92b85b9	Remove unused MACROBLOCK member vector_range Change-Id: Ie2dc0d72363ff38e0f71b59f6e2d1a2d70c5266b	2011-12-28 14:58:38 -08:00
John Koleszar	31e86192ba	Remove unused BLOCK member force_empty Change-Id: I72ed49ce14ca0124dd0d31bfcf4c7630a4681587	2011-12-28 13:57:51 -08:00
Yunqing Wang	b510863f8f	Improve vp8cx_init_quantizer() Except zrun_zbin_boost, 15 AC values are the same for all other parameters. Removed unneccessary calculation. Change-Id: I6101c0fe8080bd2b4387c3b04d7ddedbf6010409	2011-12-28 13:55:55 -05:00
Christian Duvivier	e4ca542a3b	Fix more warnings. Change-Id: Ifadf65026a11bdb5d39840748613880bcfb364bb	2011-12-22 16:33:06 -08:00
John Koleszar	03fadc4b20	Merge "Remove unnecessary ternary constructs"	2011-12-22 13:01:05 -08:00
John Koleszar	d48ea5a2ab	Merge "Remove legacy integer types"	2011-12-22 13:00:23 -08:00
John Koleszar	adb10c47a8	Merge "Use lookup tables for mode_check_freq"	2011-12-22 12:59:47 -08:00
John Koleszar	64c4be2669	Merge "Use lookup tables for thresh_mult"	2011-12-22 10:31:21 -08:00
John Koleszar	0c2b2c79ae	Remove unnecessary ternary constructs The code had a number of constructs like (condition)?1:0, which is redundant with C's semantics. In the cases where a boolean operator was used in the condition, simply remove the ternary part. Otherwise adjust the surrounding expression to remove the condition (eg, for rounding up. See pickinter.c and rdopt.c) Change-Id: Icb2372defa3783cf31857d90b2630d06b2c7e1be	2011-12-22 10:09:46 -08:00
John Koleszar	f56918ba9c	Remove legacy integer types Remove BOOL, INTn, UINTn, etc, in favor of C99-style fixed width types. Change-Id: I396636212fb5edd6b347d43cc940186d8cd1e7b5	2011-12-22 09:58:40 -08:00
John Koleszar	aa8650dd7f	Use lookup tables for mode_check_freq Mostly cosmetic. Trying for a more compact representation of speed selection thresholds. Change-Id: I339e7840049b91ad569aabbdc9c702a496110d3b	2011-12-22 09:43:44 -08:00
John Koleszar	efb4783d36	Use lookup tables for thresh_mult Mostly cosmetic. Trying for a more compact representation of speed selection thresholds. Change-Id: Icaebea632c7bb71ca8e07b4def04a046d4515e27	2011-12-22 09:43:40 -08:00
John Koleszar	a2407935d2	Merge "Remove opaque pointer VP8D_PTR"	2011-12-22 09:36:11 -08:00
Christian Duvivier	a7eb21760f	Fix a couple of warnings.	2011-12-21 15:58:14 -08:00
John Koleszar	0c2f8e77cc	Remove useless g_common.h This file declared a bunch of nonexistent, unreferenced global function pointers. Change-Id: Ic26bb8c7712deba754c49fc01f383b53afc9e728	2011-12-21 15:02:23 -08:00
John Koleszar	bf1a8073c3	Remove opaque pointer VP8D_PTR Use an opaque struct rather than typecasting through VP8D_PTR, an int*. Change-Id: Ia260b7d53d7e0950cfa1e00f4ecead1099bd3b87	2011-12-21 14:48:10 -08:00
James Zern	b651875e24	squash some signed/unsigned comparison warnings Change-Id: Ifc64cf990ae04d77934da3324d0afb3993f061e7	2011-12-21 13:49:19 -08:00
Johann	db389cb804	Make a local copy of the dequantized data Multithreaded encoding was breaking at low bitrates Please review/comment. Not sure if this is the best fix. Change-Id: I87468c765372593fd865bc82e25121ebb8ca6af2	2011-12-21 12:39:39 -08:00
Yaowu Xu	4e9b4a1570	changed mode_context update strategy Previously, the mode context is always udpated based on stats of current frame, when there is no count, 50% is used for both left and right branch. However, it is observed that with such strategy, a small count or no count at all can skew the probability distribution significantly. This commmit changed the mode_context update strategy to prevent small counts from skewing the probability distributions. Tests on derf set showed a small gain: .06% in psnr and .09% in ssim Change-Id: Ic812e64ae5f70251c170b0717f7b7fa587055488	2011-12-21 12:05:10 -08:00
John Koleszar	bb1915274f	Remove unreferenced includes These files are legacy and have no current references. Change-Id: I38224961fafeb33bc3eb6150bb0c2249ccbb4f60	2011-12-21 11:01:11 -08:00
John Koleszar	16a8948c45	Merge "Remove opaque pointer VP8_PTR"	2011-12-21 09:59:22 -08:00
Scott LaVarnway	1d7d18c69c	Improved sse2 version of simple loopfilter Change-Id: Iae406d16fab5bace47fbcf5ef7ed021f08af159d	2011-12-21 12:52:18 -05:00
John Koleszar	63d9c4da5e	Merge "tokenizer: use correct block type context in stuff1st_order_b"	2011-12-21 09:20:21 -08:00
John Koleszar	b0056c3b5e	Remove opaque pointer VP8_PTR Use an opaque struct rather than typecasting through VP8_PTR, an int*. Change-Id: I5ed4d9238ba2e8d51bfa07a8da87a2eb4c8fa43a	2011-12-21 09:13:51 -08:00
Paul Wilkins	5920b520a0	Merge "Extended Q:" into experimental	2011-12-20 11:12:28 +00:00
Paul Wilkins	0d203eff8e	Merge "Extend to 256 Q steps." into experimental	2011-12-20 11:12:08 +00:00
Paul Wilkins	df5a1fca8c	Merge "QRange experiements." into experimental	2011-12-20 11:11:50 +00:00
Paul Wilkins	e92c8ccd92	Merge "Further QIndex realted Fixes:" into experimental	2011-12-20 11:11:09 +00:00
John Koleszar	056bcc8771	remove armv6 files from armv5 build Make bilinearfilter_arm.c compiled only when HAVE_ARMV6, as its definitions are v6 only. This is normally not a problem for static builds as the file is elided at link time, but this was not being done properly for the --enable-shared --enable-pic build. Change-Id: Ic800a7cde751f74f22555c5b247f99f9df5e550d	2011-12-19 13:51:11 -08:00
Johann	080919b3c2	Merge "Avoid heap allocation of firstpass stats"	2011-12-19 10:11:23 -08:00
John Koleszar	c75f0ec379	Merge "fix: make sure ss_err is large enough"	2011-12-19 09:50:12 -08:00
Yunqing Wang	fd294c553a	Merge "Merge mr_pick_inter_mode and pick_inter_mode"	2011-12-19 08:42:20 -08:00
Paul Wilkins	7187c462d8	Extended Q: Cleanup and switch to Q extended at low end too. Change-Id: Ie22676bb9e961097d75dbd1d81745208b63e5f4b	2011-12-19 09:37:23 +00:00
Paul Wilkins	df4e79f7f7	Extend to 256 Q steps. This commit extends the number of Q steps to 256 from 128. The q_trans[] array has been altered to distribute available Q index values (using the current 64 steps available as input parameters) evenly across the available range. This is coupled with the fact that each Q step where possible now equates to a fixed % change in the quantizer. This may want refinement later especially in terms of the granularity at the high quality end but is a reasonable starting point. Change-Id: I2aaa6874fa10ce05c958dd182947ce39f6f1eecb	2011-12-19 09:36:19 +00:00
Paul Wilkins	ec670bc558	QRange experiements. High Q end extended a little. Some clean up. Slightly better on SSIM, Slightly worse on PSNR over derf set. Change-Id: I3dceea8a39e11c26e1a389a40e40b86efc76d28c	2011-12-19 09:35:10 +00:00
Paul Wilkins	fb807776a2	Further QIndex realted Fixes: Added code to support 256 index steps instead of 128 but disabled for now. Replace hard wired table vp8cx_base_skip_false_prob[128] Observed Qindex problem with setting minimum loop filter value. (Experiment code using real Q in place but for now just returning 0. This has a big beneficial effect on some clips, particularly waterfall which shows 5% ssim gain) Change-Id: I2f7117de8adc1797164c106aa13effc900a1467e	2011-12-19 09:27:19 +00:00
Yunqing Wang	c647ec4462	Merge mr_pick_inter_mode and pick_inter_mode Merged multi-resolution motion estimation with regular motion estimation function in order to remove duplicated part. This caused slight changes in multi-resulotion encoder quality & performance. Change-Id: Ib4ecc7acfebfe5eea959b5b91febae6db7b95fd1	2011-12-16 18:02:29 -05:00
James Berry	24196dd987	fix: make sure ss_err is large enough increase size of ss_err by one to make sure there is room for 64 elements. Change-Id: I355cb8c499aa7da3b9675f2326a8d25a74bb88d2	2011-12-16 17:43:55 -05:00
Adrian Grange	0fafd0543f	Reset segment_id to 0 when segmentation is disabled Whilst the encoder explicitly set the segment_id to 0 when segmentation is diabled, the decoder would allow the segment_id to persist from the previous frame. This fix attempts to make the decoder behave the same as the encoder by explicitly setting the segment_id to 0 in this case. Change-Id: I65c3a05247550edb10706eb5d54d306dfb792309	2011-12-16 14:00:36 -08:00
John Koleszar	26c6a44c66	Avoid heap allocation of firstpass stats The total_stats, this_frame_stats, and total_left_stats structures were previously create by a heap allocation, despite being of fixed size. These structures were allocated and deallocated during {de,}allocate_compressor_data, which is reinvoked whenever the frame size changes. Unfortunately, this clobbers the total_stats and total_left_stats data. Historically, these were variable size at one time, due to the first pass motion map, which necessitated their being created by a unique heap allocation. However, this bug with the total_stats being clobbered has probably been present since that initial implementation. These structures are instead moved to be stored within the struct twopass_rc directly, rather than being heap allocated separately. Change-Id: I7f9e519e25c58b92969071f0e99fa80307e0682b	2011-12-16 11:40:23 -08:00
Scott LaVarnway	0ccefd2c8f	Fixed mb_skip_coeff bug When mb_skip_coeff is set, the idct is not necessary. Prior to this patch, the code would call idcts based on leftover eob information. This patch will now skip the idct for SPLIT_MV and clear out the eobs for B_PRED, forcing the idct to be skipped. Change-Id: If5b0d2ed3ebd07789d30ec5160df927485fcaa17	2011-12-16 13:48:01 -05:00
Adrian Grange	b3ade15a26	Fixed stride bug in segmentation code mode_info_context is padded with an additional column of data, so mode_info_stride should be used to move between rows rather than mb_cols. Change-Id: I598559a2cd9df1c486d64aaeccf76b76a7ecf21c	2011-12-15 12:27:38 -08:00
Scott LaVarnway	a53d5a4c44	Moved dequant idct into common These functions are now used by the encoder. This is WIP with the goal of creating a common idct/add for the encoder and decoder. A boost of 1.8% was seen for the HD rt test clip used. [Tero] Added needed changes to ARM side. Change-Id: Ibbb8000be09034203d7adffc457d3c3f8b06a5bf	2011-12-15 14:23:41 -05:00
Adrian Grange	ae63ce248a	Fixed bug to use mode_info_stride rather than mb_cols Both encoder & decoder were using mb_cols to offset from one row of MODE_INFO structures to the next when they should have been using mode_info_stride. Fixing this in both encoder and decoder gives around a 3KB size saving and 0.025dB PSNR improvement on the one 720P clip I tried. (Also removed "index" which was being updated but not used) Change-Id: I413bea802b142886bfcf8d8aa7f5a2f0c524fd4b	2011-12-15 10:00:46 -08:00
Yunqing Wang	c8df1656bd	Merge "Only call vp8_find_near_mvs() once for each macroblock"	2011-12-15 09:53:25 -08:00
Yunqing Wang	e06c242baa	Only call vp8_find_near_mvs() once for each macroblock While doing motion search on a macroblock, we usually call vp8_find_near_mvs once per reference frame. Actually, for different reference frames, the only difference in calculating these near_mvs is they may have different sign_bias, which causes a sign change in resulting near_mvs. In this change, we only do find_near_mvs for the first reference frame. For other reference frames, only need to adjust the near_mvs according to that reference frame's sign_bias value. Change-Id: I661394b49c6ad79fed7d0f2eb2be239b9c56f149	2011-12-15 11:19:18 -05:00
Yunqing Wang	d7e09b6ada	Merge "Force realtime version 1 streams to only use simple loopfilter"	2011-12-15 05:38:57 -08:00
John Koleszar	72f459c77f	Merge "Avoid multiple test for same lvl in auto filter lvl pick"	2011-12-14 16:28:13 -08:00
John Koleszar	4f8f360098	Merge "add check to ensure that cq_level falls within min and max q"	2011-12-14 12:10:06 -08:00
John Koleszar	e542627b0c	Merge "fix: active_worst_quality could be set above 127"	2011-12-14 11:22:00 -08:00
James Berry	c1c47e83b0	add check to ensure that cq_level falls within min and max q Add the notion of deferred validation of parameters. We don't want to validate the cq_level at initialization time, because it won't have been set via set_param() yet. Change-Id: Ia1308395e8c10e0b1dc4e9af3a09b2bd6744cc30	2011-12-14 10:39:06 -08:00
Attila Nagy	51c4f9e6b1	Avoid multiple test for same lvl in auto filter lvl pick Sometimes same level is tested 2-3 times; store and reuse the calculated error value. Change-Id: Ia1c04a2568232edf9a5a62c4e2d8e8a50d85e00e	2011-12-14 15:56:29 +02:00
Attila Nagy	55fbdd58ac	Force realtime version 1 streams to only use simple loopfilter ...regardless of the speed settings. Change-Id: I4b91ac7a7208efd690dfc69e175f8eb769b6ce03	2011-12-14 12:57:49 +02:00
James Berry	f8b431c334	fix: active_worst_quality could be set above 127 add check to set active_worst_quality to 127 if it is set above 127 Change-Id: I7db353d5c1b1c8516a116542b6ed21c0110bb512	2011-12-13 14:58:59 -05:00
John Koleszar	d6020f9d52	tokenizer: use correct block type context in stuff1st_order_b The fast-path for skipped MBs was not correctly respecting the block type during update of the coefficient counts. Extracted this from part of change I365cfb6ac636f19c545f682e3aeac185253abaef Change-Id: I53d8cf0a00a98034b97b0ed3414b703bae74a739	2011-12-13 11:58:56 -08:00
Jim Bankoski	6b2792b0e0	Merge "vp8e - entropy stats per frame type"	2011-12-12 09:08:34 -08:00
Scott LaVarnway	c4aa1d508e	Removed unused diff buffer Change-Id: I9211358cca89b1c4f84b53a202a63ecf9e79ae4c	2011-12-12 11:06:55 -05:00
Scott LaVarnway	afa1b66108	Merge "Improved mmx/sse2 versions of iwalsh"	2011-12-12 06:40:28 -08:00
Paul Wilkins	ae9023a3c9	QINDEX_RANGE fixed tables. Removed a couple more fixed tables for the extended quantizer experiment that depend on QINDEX_RANGE. Change-Id: I2c15ffc7488c2a2b8d6504e2c4b6b2339799d117	2011-12-12 11:18:57 +00:00
Jim Bankoski	6de67cd6e8	vp8e - entropy stats per frame type Change-Id: I4168eb6ea22ae541471738a7a3453e7d52059275	2011-12-09 16:56:18 -08:00
Yaowu Xu	be360d47f4	Enabled adaptive UV intra coding for inter frames Previously, Y-adaptive UV intra coding only enabled on key frames in UVINTRA experiment. This commit enabled the same coding for inter frames, so the encoding of UV intra modes are consistent cross all frame types. Tests on derf set showed a very small overall gain around .04%: http://www.corp.google.com/~yaowu/no_crawl/interUVintra.html The gain looks to be reasonable given inta coded MBs is only a small portion of MBs in inter frames. Change-Id: Ic6fc261923f2c253f4a0c9f8bccf4797557b9e16	2011-12-09 14:44:13 -08:00
Adrian Grange	43a059de71	Merge "Fix out of bounds read in update_mbgraph_frame_stats" into experimental	2011-12-09 21:05:00 +00:00
Adrian Grange	95b4cf059c	Fix out of bounds read in update_mbgraph_frame_stats update_mbgraph_frame_stats used xd->mode_info_context before it had been setup, resulting in potentially random accesses of uninitialized memory. This fix allocates a local MODE_INFO structure to hold the data generated in the function. Change-Id: Ic9e75610008ce0e2d690e8e583c21582fee6fc45	2011-12-09 12:47:57 -08:00
Yaowu Xu	ba1a6619b3	Revised coding using adaptive mode context to depend on frame type A previous commit `76feb965` made the vp8_mode_context adaptive on a frame frame basis, this commit further made the coding context adaptive to two frame types separately. Tests on derf set showed a further small gain on all metrics: avg psnr 0.10%, glb psnr: 0.11%, ssim: 0.08% http://www.corp.google.com/~yaowu/no_crawl/newNearMode_1209.html Change-Id: I7b3e32ec8729de1903d14a3f1213f1624b78cdee	2011-12-09 12:13:42 -08:00
Paul Wilkins	7748d833e8	Experiment with old Q range: Experiment with old Q range but new higher precision quantizer and transform code. Change-Id: Id1ff4cb433e5775d709d0133e2aec0322975c292	2011-12-09 16:19:08 +00:00
Scott LaVarnway	9fa6132fc5	Improved mmx/sse2 versions of iwalsh Removed unnecessary transposes. Change-Id: I029fbaf8afafee34d54a4f3333c22023c15003c3	2011-12-08 14:37:59 -05:00
Yaowu Xu	ebcc6605c1	fixed a crash caused invalid Q choice The commit fixed a problem by capping cpi->active_best_quality to be smaller than cpi->worst_quality. Also fixed a few line of code that was misplaced. Change-Id: Ie908264b72140c669122a0afde5d886619c33474	2011-12-08 07:04:23 -08:00
Yaowu Xu	b70f23caec	Removed #if CONFIG_MULCONTEXT This commit removed the macro CONFIG_MULCONTEXT, which was used to indicate the experiment code for using separate context for altref and normal frames. This commit made the change fully merged in. Change-Id: I525f927f68e2365d37b340ef23b836a136a4f70b	2011-12-07 14:01:07 -08:00
Yaowu Xu	d37cd97682	Removed #if CONFIG_I8X8 This commit removed the macro CONFIG_I8X8, which was used to indicate the 8x8 intra prediction experiment, made the change fully merged in. Change-Id: Iafa4443781ce6e83f5591c12ba615a0e92ce0ea0	2011-12-07 13:48:53 -08:00
Yaowu Xu	76feb965d3	made vp8_mode_context adaptive vp8_mode_contexts[] is an entropy table used to code inter mode choices. It was a fixed constant table. This commit made the entropy context adaptive. Tests on derf set showed very good consistent gains on all metrics: avg psnr .47%, overall psnr .46% and ssim .40%. http://www.corp.google.com/~yaowu/no_crawl/newModeContext.html Change-Id: Ia62b14485c948e2b74586118619c5eb2068b43b2	2011-12-07 11:01:59 -08:00
Yaowu Xu	b1823a7dd2	fixed a crash when MODE_STATS is enabled The MODE_STATS macro was used to #ifdef around code for mode entropy stats collection, this commit fixed a crash when MODE_STATS is on. The commit also changed a number of array definitions to use defined macros instead of hard-coded numbers. Change-Id: I114592f53a1e44e31e455f5725f036ae6168735a	2011-12-07 10:56:39 -08:00
Yaowu Xu	d0e3acf98c	Merge "Minor fixes:" into experimental	2011-12-07 18:52:51 +00:00
Johann	a69810b893	Merge "Reduce mem copies in encoder loopfilter level picking"	2011-12-07 10:41:00 -08:00
Paul Wilkins	79774d108f	Minor fixes: fixed issues caused by conflicts between two experiments. Change-Id: I56a9bd69493e4850c121ea057a6233c55777c2a5	2011-12-07 09:55:27 -08:00
Ronald S. Bultje	73bbdfe506	Rename use_dc_pred to use_16x16_pred. Because the variable doesn't distinguish between DC and non-DC prediction, but rather between 16x16 or 4x4 prediction. Change-ID: Ia4e7dda2bd6230c91515072e3277be2d64e42629	2011-12-07 09:10:26 -08:00
Attila Nagy	e570b0406d	Reduce mem copies in encoder loopfilter level picking Do the test filtering in the existing backup frame buffer instead of the original. Copy the original data into extra buffer before doing the filtering. This way there is no need to restore the original unfiltered frame at the end of level picking process. This came up in some discussions with Johann. Thanks! Change-Id: I495f4301d983854673276c34ec0ddf9a9d622122	2011-12-07 09:59:50 +02:00
Yaowu Xu	b1781b48db	Merge "corrected an enum name" into experimental	2011-12-07 03:25:08 +00:00
Ronald S. Bultje	0072b8bc73	Fix for RD thresholds if both I8X8 and DUALPRED are enabled. Change-Id: I5f9fc894e6a332d9be6d7336c7c5fe11e65b8498	2011-12-06 15:13:11 -08:00
Ronald S. Bultje	60cb39da86	Dual 16x16 inter prediction. This patch introduces the concept of dual inter16x16 prediction. A 16x16 inter-predicted macroblock can use 2 references instead of 1, where both references use the same mvmode (new, near/est, zero). In the case of newmv, this means that two MVs are coded instead of one. The frame can be encoded in 3 ways: all MBs single-prediction, all MBs dual prediction, or per-MB single/dual prediction selection ("hybrid"), in which case a single bit is coded per-MB to indicate whether the MB uses single or dual inter prediction. In the future, we can (maybe?) get further gains by mixing this with Adrian's 32x32 work, per-segment dual prediction settings, or adding support for dual splitmv/8x8mv inter prediction. Gain (on derf-set, CQ mode) is ~2.8% (SSIM) or ~3.6% (glb PSNR). Most gain is at medium/high bitrates, but there's minor gains at low bitrates also. Output was confirmed to match between encoder and decoder. Note for optimization people: this patch introduces a 2nd version of 16x16/8x8 sixtap/bilin functions, which does an avg instead of a store. They may want to look and make sure this is implemented to their satisfaction so we can optimize it best in the future. Change-ID: I59dc84b07cbb3ccf073ac0f756d03d294cb19281	2011-12-06 11:53:02 -08:00
Paul Wilkins	b4ad9b5d50	Some further QIndex issues with extended Q Resolved or factored out some further issues with Q index. Put in a 3rd order polynomial instead of less accurate power function as the best fit on gf and kf boost adjustment. Added avg_q value to use instead of ni_av_qi. Compute segment delta Q values based on avg_q. Fixed bug in adjust_maxq_qrange(). The extended range Q on the derf set, using standard data rates (which do not extend high enough to get big benefits) still show a shortfall of between 0.5 and 1% though so there would appear to be further issues that need to be tracked down. Change-Id: Icfd49b9f401906ba487ef1bef7d397048295d959	2011-12-06 15:43:17 +00:00
Yaowu Xu	0404a5a7e1	corrected an enum name CNT_INTRA has been used for counting (0,0) motion vectos, this commit renames it to CNT_ZEROMV Change-Id: I8f67c5468370090525faf84ba5b3f780d302443f	2011-12-06 07:09:08 -08:00
Yunqing Wang	aa7335e610	Multiple-resolution encoder The example encoder down-samples the input video frames a number of times with a down-sampling factor, and then encodes and outputs bitstreams with different resolutions. Support arbitrary down-sampling factor, and down-sampling factor can be different for each encoding level. For example, the encoder can be tested as follows. 1. Configure with multi-resolution encoding enabled: ../libvpx/configure --target=x86-linux-gcc --disable-codecs --enable-vp8 --enable-runtime_cpu_detect --enable-debug --disable-install-docs --enable-error-concealment --enable-multi-res-encoding 2. Run make 3. Encode: If input video is 1280x720, run: ./vp8_multi_resolution_encoder 1280 720 input.yuv 1.ivf 2.ivf 3.ivf 1 (output: 1.ivf(1280x720); 2.ivf(640x360); 3.ivf(320x180). The last parameter is set to 1/0 to show/not show PSNR.) 4. Decode: ./simple_decoder 1.ivf 1.yuv ./simple_decoder 2.ivf 2.yuv ./simple_decoder 3.ivf 3.yuv 5. View video: mplayer 1.yuv -demuxer rawvideo -rawvideo w=1280:h=720 -loop 0 -fps 30 mplayer 2.yuv -demuxer rawvideo -rawvideo w=640:h=360 -loop 0 -fps 30 mplayer 3.yuv -demuxer rawvideo -rawvideo w=320:h=180 -loop 0 -fps 30 The encoding parameters can be modified in vp8_multi_resolution_encoder.c, for example, target bitrate, frame rate... Modified API. John helped a lot with that. Thanks! Change-Id: I03be9a51167eddf94399f92d269599fb3f3d54f5	2011-12-05 17:59:42 -05:00
John Koleszar	6127af60c1	Merge "Speed selection support for disabled reference frames"	2011-12-05 14:36:54 -08:00
Yaowu Xu	82d99257f2	removed leftover code from a couple merge problems. Change-Id: I17d9c1246d69e102297ec1c3efb359691b3da313	2011-12-05 11:22:35 -08:00
Yaowu Xu	acf5d20ce5	added separate entropy context for alt_ref This commit added code to keep track of separate entropy contexts for normal frames and alt ref frames. The underly assumption was that the two type of frames have different entropy characteristics given they typically have quite different quantization levels. By keeping entropy contexts separate, it helps the entropy context distribution to be more closely adapted to each frame type. Tests on derf set showed a good and very consistent gain on all clips on all metrics, avg psnr: 0.89%, overall psnr: 0.84% and ssim 0.93%. http://www.corp.google.com/~yaowu/no_crawl/mulcontext.html Change-Id: I15bc9697f6ff7829042911fe0c62930585d7e65d	2011-12-02 14:43:33 -08:00
Yaowu Xu	a8fbab8697	enabled 8x8 intra prediction modes on inter frames This commit enabled the usage of 8x8 intra prediction modes on inter frames. There are a few TODO items related to this: 1)baseline entropy need be calibrated; 2)cost of UV need to be done more properly rather than using decision only relying on Y; 3)Threshold for allowing picking 8x8 intra prediction should be lowered to lower than the B_PRED. Even with all the TODOs, tests showed consistent gain on derf set ~0.1% (PSNR:0.08% and SSIM:0.14%). It is assumed that 8x8 intra prediction will help more on large resolution clips, especially with above TODOs addressed. Change-Id: I398ada49dfc32575cfab962a569c2885111ae3ba	2011-12-02 13:44:47 -08:00
Paul Wilkins	8487a68baf	Further work on extended Q range. Fixed some further QIndex related issues and replaced some tables (eg zbin and rounding) Also Added function (currently disabled by default) to populate the main AC and DC quantizer tables. Using the original AC range the resulting computed DC values give behavior broadly comparable on the DERF set. That is not to say that the equations will hold good over a more extended range. The purpose of this code is to make it easier to experiment with further alterations to the Q range and distribution of Q values plus the relative weights given to AC and DC. The function find_fp_qindex() ensures that changes to the Q tables are reflected in the value passed in to the first pass code. Slight experimental adjustment to static segment Q offset. Change-Id: I36186267d55dfc2a3d565d0cff7218ef300d1cd5	2011-12-02 15:30:01 +00:00
Paul Wilkins	2b307b38e3	CR/LF issue. Change-Id: I95fab6f51967008acf1bc9e98fdb7bb56974807f	2011-12-02 15:06:15 +00:00
Yaowu Xu	bba710fcbd	added transform type to MB_MODE_INFO this commit is to add an variable in the macroblock level mode info structure to track the transform size used in each MB, so the information can be used later in the loop filter to change how loop filter works on MBs with different transform sizes. Change-Id: Id0eeaba6cc854c6d1be00ed8d237b3d9e250e447	2011-12-01 07:34:27 -08:00
Paul Wilkins	a917afabbb	MinQ equations. Slight tweaks to the new minq equations to bring results more into line with original lookup tables. Change-Id: I969fc87d95912df549b6775e83ee2345e84d4da0	2011-11-29 18:03:57 +00:00
Paul Wilkins	b9ce9bcbc5	Extended Q Range: Addressed a couple of other QIndex dependencies. Change-Id: I15b224bffd0210d3c7065cb6905156f2ca8e9ea9	2011-11-29 18:02:56 +00:00
Paul Wilkins	99df6bb629	Further work on extended Q range. Fixed bug in firspass.c call to vp8_initialize_rd_consts() This was passing in vp8_dc_quant(cm->base_qindex, cm->y1dc_delta_q) instead of (cm->base_qindex + cm->y1dc_delta_q). It just so happens that for the value 26 used for cm->base_qindex in the unextended Q case, the two give similar results. However, when using the extended Q range the two are very different. Also added more stats output and partly disabled another broken feature. Change-Id: Iddf6cf5ea8467c44b7c133f38e629f6ba6f2581e	2011-11-29 17:59:23 +00:00
Yunqing Wang	06fc0f83b6	Populate q_index in multi-thread encoding This value needs to be copied to each thread's data structure. This fixed artifact problem in multi-thread encoder. Change-Id: Iab6d9745a1d44846aa503184705376f63a505597	2011-11-28 15:58:28 -05:00
Ronald S. Bultje	82733643ca	mbgraph: fix invalid memory access if motion vectors are too big.	2011-11-28 12:39:38 -08:00
Scott LaVarnway	34d7c8b3d4	Added vp8_dequant_idct_add_y_block_sse2 setup In Change I83202ffd, I deleted one too many lines. Change-Id: If05d7c8988eb5c00898dc7c833ad7d99b5eb23e7	2011-11-28 13:06:13 -05:00
Yaowu Xu	643238a3e0	changed find_near_mvs search to include a mb from last frame This is an experiment to include a mv contribution from last frame to nearest and near mv definition. Initial test showed some small though consistent gain. latest patch slightly better result ~.13%-~.18%. TODO: the entropy used to encode the mode choice, i.e. the mv counts based conditional distribution of modes should be re-collected to reflect this change, it is expected that there is some further gain from that. Change-Id: Ief1e284a36d8aa56b49ae5b360c91419ec494fa4	2011-11-28 08:52:08 -08:00
Scott LaVarnway	f46e17fd6f	Merge "Modified the inverse walsh to output directly"	2011-11-28 07:26:07 -08:00
Scott LaVarnway	4a91541c94	Modified the inverse walsh to output directly to the dqcoeff or qcoeff buffer. The encoder would populate the dc coeffs of the y blocks as a separate stage (recon_dcblock) and the decoder would use a special version of the idct. This change eliminates the extra copy and reduces the code footprint. [Tero] Added needed changes to armv6 and NEON assembly. Change-Id: I83202ffdbaf83f6e5dd69f4ba2519fcf0b13b3ba	2011-11-25 09:24:04 +02:00
Johann	e2bacd581a	Merge "Move shared data to shared location"	2011-11-23 11:20:54 -08:00
Paul Wilkins	ee2051f650	Two pass rate control code changes. This comitt brings accross changes from the public branch commit number Icf74d13af77437c08602571dc7a97e747cce5066. The main puurpose of this comit relates to CQ mode but it also includes some refactoring of the two pass code which I hope will make tuning the experimental branch for the new quantizer range a little less painfull. Change-Id: I278e989436a928fc1fe7761068960048f9d7a376	2011-11-23 17:18:31 +00:00
Johann	15ea268d62	Merge "Fix encoder partitioned output on ARM"	2011-11-23 08:44:21 -08:00
Paul Wilkins	a0b7db22e6	Further resolution of QIndex LUTS; This commit resolves further QIndex look up tables to facilitate experimentation with the quantizer range. In some cases rather than remove the look up tables completely I have created functions that are called once to populate them using a formulaic approach base on the actual quantizer. The use of these functions based on best fit of data from the original tables does affect the results on some clips but across the derf test set the effect was broadly neutral. Change-Id: I8baa61c97ce87dc09a6340d56fdeb681b9345793	2011-11-23 11:32:20 +00:00
Attila Nagy	97259b460c	Fix encoder partitioned output on ARM API was not returning correct partition sizes on arm targets. The armv5 token packing functions were not storing the information to the partition size table. As a fix, have one boolcoder instance allocated for each partition so that partition sizes are internally available after all partitions were encoded. This will also allow more flexibility in producing several partitions in parallel. Use buffer validation (overflow check) in all ARM bitpacking functions. Change-Id: I31c8a11d8a7613676f0ff50928cb2a2ab14fd169	2011-11-23 12:29:43 +02:00
John Koleszar	b79879c2e3	Merge "Decoder fixes to better support reference picture selection."	2011-11-22 17:12:06 -08:00
Adrian Grange	08491b8665	Remove redundant code (lf_or_gf and frame_lf_or_gf) Removed unused variables lf_or_gf and frame_lf_or_gf. Change-Id: I88692cd7d53e532d303c4525ee4667c1ecea3026	2011-11-22 08:47:08 +00:00
Paul Wilkins	d39b5d0546	Removal of Qindex LUTS. One of the problems arising when tweaking or adjusting the quantizer tables is that there are a lot of look up tables that depend on the QINDEX. Any adjustment to the link between QINDEX and real quantizer therefore tends to break aspects of for example the rate control. In this check in I have replaced several of the look up tables with functions that approximate the same results as the old Q luts but use a formulaic approach based on real Q values rather than QIndex. This should hopefully make it easier to experiment with changes to the Q tables without always having to go through and hand optimize a set of look up tables. Once things stabilize we may choose to re-instate luts for the sake of performance. Patch 2: Addressed Ronald's comments. vp8_init_me_luts() Added so luts only initialized once. Change-Id: Ic80db2212d2fd01e08e8cb5c7dca1fda1102be57	2011-11-22 08:42:33 +00:00
Paul Wilkins	9bac509ac5	Extended Q range Experiment. Corrected dc lookup table to maintain ac/dc balance close to what it was previously. Firstpass not being passed the adjusted Q index for the extended range. Change-Id: Ic0200dabda445fea03bf81067999cb2670e99b77	2011-11-21 15:53:40 +00:00
Paul Wilkins	54f090b119	Cosmetic clean up. Clean up of vp8_kfread_modes(). Remove unnecessary indentation and enforce line length. Change-Id: I0864d1aff55368126db01bb23efa815786b5245d	2011-11-21 15:51:21 +00:00
Paul Wilkins	19d87e8ed7	Decoder segmentation bug. Fix decoder segmentation bug for temporal coding where the segment map was first initialized on a key frame. in vp8_kfread_modes() after reading the segment id it must be written to the pbi->segmentation_map[] for use in temporal coding on subsequent frames. Change-Id: I1489305efc376564e734a216f69c2844646ee3d3	2011-11-21 15:49:47 +00:00
Paul Wilkins	4f792921e7	CONFIG_T8X8 experiment.: Block the selection of 4x4 modes in key frames if 8x8 is selected. Change-Id: Ie5729ec22a999d9a1996f020bd4b941e29514992	2011-11-21 15:46:32 +00:00
Stefan Holmer	b5ee7b12d2	Decoder fixes to better support reference picture selection. Change-Id: Id3388985d754706b9fd1f079c47121e79a63efdf	2011-11-21 10:25:21 +01:00
Johann	f2cd4ded22	Move shared data to shared location Storing vp8_bilinear_filters_mmx in an mmx file and using it in an sse2 file is bad Moving towards allowing --disable-mmx Change-Id: I20493b35bdedcdcfc0915e6f05fdbe6c81a4a742	2011-11-18 16:23:14 -08:00
John Koleszar	e55974bf86	Speed selection support for disabled reference frames There was an implicit reference frame test order (typically LAST, GOLD, ARF) in the mode selection logic, but this doesn't provide the expected results when some reference frames are disabled. For instance, in real-time mode, the speed selection logic often disables the ARF modes. So if the user disables the LAST and GOLD frames, the encoder was always choosing INTRA, when in reality searching the ARF in this case has the same speed penalty as searching LAST would have had. Instead, introduce the notion of a reference frame search order. This patch preserves the former priorities, so if a frame is disabled, the other frames bump up a slot to take its place. This patch lays the groundwork for doing something smarter in the frame test order, for example considering temporal distance or looking at the frames used by nearby blocks. Change-Id: I1199149f8662a408537c653d2c021c7f1d29a700	2011-11-18 13:53:21 -08:00
Attila Nagy	c84d42f864	Validate encoder buffer writes for single token partition Extend buffer write validation (overflow check) to single token partition packing, both mb and row based functions. Change-Id: I36e19b7d37fc43712d05c70e3ad223d3eb5b973d	2011-11-18 12:49:27 +02:00
Adrian Grange	eb15fe85e0	Clip buffer level to the maximum buffer size in CBR The buffer level was able to increase indefinitely rather than being clipped to the maximum buffer size specified by the user. This change checks the buffrer level and prevents it from going beyond the upper limit of the buffer. Change-Id: Ifff55f79d3c018e4d3d77e554b11ada543cc1654	2011-11-17 15:57:37 -08:00
Scott LaVarnway	3c755577b8	Merge "Added predictor stride argument(s) to subtract functions"	2011-11-17 10:17:53 -08:00
Yaowu Xu	6dddcbc57d	Merge "fixed the scaling in 8x8 trellis quant" into experimental	2011-11-17 14:55:43 +00:00
Yaowu Xu	7f33be9e96	fixed the scaling in 8x8 trellis quant This commit has a few minor fixes to the 8x8 trellis quant, so to make it work regardless if extend_qrange is enabled or not. It also borrowed adaptive RDMULT constants from 4x4 trellis that was missed in the 8x8 trellis quant. Change-Id: I60d7769071f102c699b5084597e62bca87a1f759	2011-11-16 14:29:02 -08:00
Paul Wilkins	cee3d2223a	Header inclusion for Unix build Explicit inclusion of limits.h to satisfy unix build for definition of INT_MAX. Some commented out code removed. Change-Id: I5b5980dfaa9b4d2d12bfd729cfd35bd982106908	2011-11-16 10:34:47 +00:00
Scott LaVarnway	edd98b7310	Added predictor stride argument(s) to subtract functions Patch set 2: 64 bit build fix Patch set 3: 64 bit crash fix [Tero] Patch set 4: Updated ARMv6 and NEON assembly. Added also minor NEON optimizations to subtract functions. Patch set 5: x86 stride bug fix Change-Id: I1fcca93e90c89b89ddc204e1c18f208682675c15	2011-11-15 12:53:01 -05:00
Paul Wilkins	3cdfdb55e4	Merge CONFIGURE_SEGMENTATION experiment. Removal of CONFIGURE_SEGMENTATION ifdefs. Removal of legacy support code fo the old coding mechanism. Use local reference "xd" for MACROBLOCKD structure in encode_frame_to_data_rate() Moved call to choose_segmap_coding_method() out of encode loop as the cost of segmentation is not properly accounted in the loop anyway. If this is desirable in the future it can be moved back. The use of this function to do all the analysis and set the probabilities also removes the need to track segment useage in threading code. Change-Id: I85bc8fd63440e7176c73d26cb742698f9b70cade	2011-11-15 16:15:23 +00:00
Paul Wilkins	6394ef28d7	Further clean up of Segmentation experiment code Changed name and sense of segment_flag to "seg_id_predicted" Added some additional comments and retested. I also did some experimentation with a spatial prediction option using a similar strategy to the temporal mode implemented. This helps in some cases where temporal prediction is bad but I suspect there is more overlap here with work on a larger scale block structure and spatial correlation will likely be better handled through that mechanism. Next check in will remove #ifdefs and legacy mode code. Change-Id: I3b382b65ed2a57bd7775ac0f3a01a9508a209cbc	2011-11-15 15:22:26 +00:00
Paul Wilkins	661b2c2dcf	Further work on Segmentation Experiment: This check in includes quite a lot of clean up and refactoring. Most of the analysis and set up for the different coding options for the segment map (currently simple distribution based coding or temporaly predicted coding), has been moved to one location (the function choose_segmap_coding_method() in segmenation.c). This code was previously scattered around in various locations making integration with other experiments and modification / debug more difficult. Currently the functionality is as it was with the exception that the prediction probabilities are now only transmitted when the temporal prediction mode is selected. There is still quite a bit more clean up work that will be possible when the #ifdef is removed. Also at that time I may rename and alter the sense of macroblock based variable "segment_flag" which indicates (1 that the segmnet id is not predicted vs 0 that it is predicted). I also intend to experiment with a spatial prediction mode that can be used when coding a key frame segment map or in cases where temporal prediction does not work well but there is spatial correlation. In a later check in when the ifdefs have gone I may also move the call to choose_segmap_coding_method() to just before where the bitsream is packed (currently it is in vp8_encode_frame()) to further reduce the possibility of clashes with other experiments and prevent it being called on each itteration of the recode loop. Change-Id: I3d4aba2a2826ec21f367678d5b07c1d1c36db168	2011-11-15 11:13:33 +00:00
John Koleszar	bdd35c13cc	avoid resetting framerate during vpx_codec_enc_config_set() The calculated frame_rate is a state variable in the codec, and shouldn't be maintained in the configuration struct. Move it to the main part of cpi so that it isn't clobbered when the configuration struct is updated. The initial framerate estimate is moved from the vp8_cx_iface.c wrapper into the body of init_config() in onyx_if.c, so that it is only called once and not reset on every call to vp8_change_config(). Change-Id: I8d9a3d1283330d1ee297d07e9d78d1f2875f2465	2011-11-11 14:45:58 -08:00
Paul Wilkins	c9130bdbbc	Segmentation experiment: Added last_segmentation_map[] structure to keep track of what we had before when doing temporal prediction. With this change the existing code does once again appear to be giving a decodable bitstream for both temporal and standard prediction modes. However, it is still somewhat messy and confused and there is no option to take advantage of spatial prediction so it could do with further work. Some housekeeping / clean out. Change-Id: I368258243f82127b81d8dffa7ada615208513b47	2011-11-11 18:33:25 +00:00
Paul Wilkins	bf25d4ad7f	SEGMENTATION experiment: Some initial cleanup to aid testing and debug. Pull code to choose temporal or spatial encoding out of encodeframe.c into a dedicated function in segmentation.c. For now disable broken temporal mode. Move the coding of "temporal_update" flag and only transmit if segment map update is indicated. Rename the functions read_mb_features() and write_mb_features() to read_mb_segid() and read_mb_segid() as they only read and write the macroblock segment id not any of the features. Change-Id: Ib75118520b1144c24d35fdfc6ce46106803cabcf	2011-11-11 18:31:21 +00:00
Yaowu Xu	e01b39254b	changed function name for clarity The dequantizer functions for 2nd order haar block had confusing 8x8 in their names. this commit fixed their name to avoid confusion. Change-Id: I6ae4e7888330865f831436313637d4395b1fc273	2011-11-11 09:01:16 -08:00
Yaowu Xu	0c846f6602	Merge "fixed the decoder when using 8x8 transform" into experimental	2011-11-11 15:38:42 +00:00
Yaowu Xu	982b061dc2	Make 8x8 and extend_qrange to work together This commit added scaling factors to 8x8 transform, quant, dequant and inverse transform pipeline to make 8x8 transform to work when configed with enable-extend_qrange. This commit also disabled the trellis-quant when extend_qrange is configured. Change-Id: Icfb3192e4746f70a4bb35ad18b7b47705b657e52	2011-11-11 07:31:00 -08:00
Yaowu Xu	7189198d53	fixed the decoder when using 8x8 transform updated the decode_macroblock logic to reflect that 8x8 transform is not used for "SPLITMV". Also fixed an issue where 2nd order haar block has wrong dequant/idct process. Change-Id: I1e373f6535c009dfec503b6362c8a5cfc196e1da	2011-11-10 17:15:06 -08:00
Yaowu Xu	a0ed4e6380	Merge "scaled the threshold for 2nd order coefficient reset" into experimental	2011-11-10 16:34:36 +00:00
Yaowu Xu	cbcba9e7c0	scaled the threshold for 2nd order coefficient reset extend_qrange introduces a different scaling factor, this commit takes the scaling difference into account for reset 2nd order coefficients. Change-Id: Ie58bca9f52698fa759e3f88da2aa4d82630fa91a	2011-11-10 08:31:13 -08:00
Paul Wilkins	151b7f25db	Merge "T8x8 experiment merge." into experimental	2011-11-10 09:37:26 +00:00
Yaowu Xu	842dc7ca60	Merge "fixed an encoder bug" into experimental	2011-11-10 00:43:12 +00:00
Yaowu Xu	8974daea11	fixed an encoder bug the bug caused the encoder to produce invalid bitstream when configured with enable_extend_qrange. Change-Id: I1e81c48b13359d0043cbbd480e679380a2da117c	2011-11-09 16:03:23 -08:00
Scott LaVarnway	df49c7c58d	SSE2 optimizations for vp8_build_intra_predictors_mby{,_s}() Ronald recently sent me this patch that he did in April. > From: Ronald S. Bultje <rbultje@google.com> > Date: Thu, 28 Apr 2011 17:30:15 -0700 > Subject: [PATCH] SSE2 optimizations for > vp8_build_intra_predictors_mby{,_s}(). HD decode tests have shown a performance boost up to 1.5%, depending on material. Patch set 3: Fixed encoder crash. Change-Id: Ie1fd1fa3dc750eec1a7a20bfa2decc079dcf48c8	2011-11-09 15:30:35 -05:00
Scott LaVarnway	9532bda0fb	Merge "Relocated idct/add calls for encoder"	2011-11-09 10:17:43 -08:00
Johann	ea2229bab6	Merge "ARMv6 optimized Intra4x4 prediction"	2011-11-09 09:36:33 -08:00
John Koleszar	2999ca3094	Merge "Reset FPU state after calc_plane_error()"	2011-11-09 09:35:08 -08:00
John Koleszar	3fcf0e3668	Merge "Compiler warning fix for const array."	2011-11-09 09:34:50 -08:00
John Koleszar	82e8884ad8	Merge "Remove unused file recon.c"	2011-11-09 09:31:23 -08:00
Paul Wilkins	0789253125	T8x8 experiment merge. For ease of testing and merging experiments I have removed in line code in encode_frame() that assigns MBs to be t8x8 or t4x4 coded segments and have moved the decision point and segment setup to the init_seg_features0 test function. Keeping everything in one place helps make sure for now that experiments using segmentation are not fighting each other. Also made sure mode selection code can't choose 4x4 modes if t8x8 is selected. Patch2: In init_seg_features() add checks for SEG_LVL_TRANSFORM active. Change-Id: Ia1767edd99b78510011d4251539f9bc325842e3a	2011-11-09 15:46:05 +00:00
Scott LaVarnway	861ed6a5c1	Relocated idct/add calls for encoder Call the idct/add after the tokenize. This is WIP with the goal of creating a common idct/add for the encoder and decoder. This move is necessary because the decoder's version of the idct clobbers qcoeff, which is used by the tokenize. Change-Id: I6b08d8e8397cd873647fa4fb9469884e3c876756	2011-11-09 10:41:05 -05:00
Paul Wilkins	b0f9f15dbd	Merging and testing of SEGMENTATION experiment. Removed code in #if CONFIG_SEGMENTATION that enables segmentation and creates a test segmentation map, to avoid conflicts with the other segmentation test code, Change-Id: I7a21a44ed188b814cd80b30dd628c62474eba730	2011-11-09 11:59:20 +00:00
Paul Wilkins	ac2ab02dcf	Segmentation feature logic fix. Bug fix to logic in vp8_pick_inter_mode() and vp8_rd_pick_inter_mode(). The block on the use of segment features for the cm->refresh_alt_ref_frame case was just for testing and is not correct. The special case code for alt ref can be re-enabled as an else clause. Change-Id: Ic9b57cdb5f04ea7737032b8fb953d84d7717b3ce	2011-11-09 11:53:26 +00:00
Tero Rintaluoma	5a2fd63a2a	ARMv6 optimized Intra4x4 prediction Added ARM optimized intra 4x4 prediction - 2x faster on Profiler compared to C-code compiled with -O3 - Function interface changed a little to improve BLOCKD structure access Change-Id: I9bc2b723155943fe0cf03dd9ca5f1760f7a81f54	2011-11-09 09:13:51 +02:00
Yaowu Xu	5883246d01	make debug match release build on win32 with 8x8 transform enabled The 8x8 forward transform makes use of floating operations, therefore requires emms call to reset mmx registers to correct state. Without the resets, the 8x8 forward transform results are indefinite on win32 platform. Change-Id: Ib5b71c3213e10b8a04fe776adf885f3714e7deb1	2011-11-08 20:36:54 -08:00
James Zern	9d60506130	threading: avoid defining _WIN32_WINNT The referenced function (SignalObjectAndWait) isn't used. Reduces the warnings with mingw32-w64 which defines this. Change-Id: I4ce592879ec9372bf196dac640204c4d370bd210	2011-11-08 18:50:45 -08:00
John Koleszar	f89e109f56	Remove unused file recon.c File not referenced from anywhere and no longer compiles. Change-Id: I38b11bd60db615c2c2c9d7ad35caba3a1adf1750	2011-11-08 15:54:56 -08:00
Yunqing Wang	4c14efd234	Fix checks in MB quantizer initialization vp8cx_mb_init_quantizer() needs to be called at least once to get all values calculated. This change added one check to decide if we could skip initialization or not. Change-Id: I3f65eb548be57580a61444328336bc18c25c085b	2011-11-08 12:11:48 -05:00
Yaowu Xu	6e165e86a7	Attempt to fix an issue related to 8x8 transform and segfeature logically this commit should NOT change anything, but seems to help revert the 3DB loss on bowing in the following commit: https://on2-git.corp.google.com/g/#change,6193 This is still debugging in progress. Need further investigation to understand the root cause of the issue. Change-Id: I0b49d1ef3a311dfff58c6acd3eaebdb3bda6257c	2011-11-08 16:15:41 +00:00
Paul Wilkins	a9df4183a6	Segment signaling of TX size Initial attempt at using new segment feature signaling to indicate 4x4 or 8x8 transform. needs --enable-experimental --enable-t8x8 Note this is work in progress. Change-Id: Ib160d46a5d810307bfcbc79853ce1a65b5b870b7	2011-11-08 12:21:08 +00:00
Adrian Grange	b615a6d47f	Third set of checks of buffer level against maximum buffer size Additional check of buffer level to ensure it doesn't exceed the maximum buffer size. Change-Id: I1ba4f8b09bbec89646885040ff47470196af521e	2011-11-07 17:15:28 -08:00
Adrian Grange	fa25a31ed4	Additional clipping of buffer level to maximum buffer size Added additional check of buffer level against maximum buffer size. Change-Id: Iaf1fbaf008601161e402b43ce82c3dbc129bf740	2011-11-07 16:54:40 -08:00
Adrian Grange	9dc95b0a12	Added check to make sure maximum buffer size not exceeded Added code to clip the buffer level to the maximum buffer size. Without this the buffer level would increase unchecked. This bug was found when encoding an essentially static scene at 2Mb/s. The encoder is unable to generate frames consistent with the high data-rate because Q bottoms out at Qmin. As frames generated are consistently undersized the buffer level increases and does not get checked against the maximum size specified by the user (or default). Change-Id: Id8a3c6323d3246da50f7cb53ddbf78b5528032c6	2011-11-07 16:28:13 -08:00
James Zern	f89ea3432f	fix file permissions all of googletest import (0ab00a22) was marked executable Change-Id: Id7b7ee03efc21ab998bb03349bd91644e8af25da	2011-11-04 18:50:35 -07:00
Fritz Koenig	f0c01413fb	Compiler warning fix for const array. Fix compiler warning for passing a non const array to a function expecting a const array by using an intermediary pointer and casting. Change-Id: I9bdd358ebdc926223993fb8fb2098ffedd2f3fc7	2011-11-04 18:19:26 -07:00
Yunqing Wang	e1a55b504a	Merge "Add checks in MB quantizer initialization"	2011-11-04 11:52:27 -07:00
Scott LaVarnway	44b5f76e34	Merge "Fix issue 374: eob read incorrectly"	2011-11-04 11:31:17 -07:00
John Koleszar	7ca6c91732	Merge "Changing decoder input partition API to input fragments."	2011-11-04 09:36:27 -07:00
Yaowu Xu	f82d601f94	Merge "Added context reset when 2nd order coefficients are cleared" into experimental	2011-11-04 16:25:15 +00:00
Paul Wilkins	fe38082f44	Segment Features with 8x8DCT. Temporary check in to turn off other segment features tests when #if CONFIG_T8X8 is set as the assignment of MBs to differnt segments in each case will conflict. The 8x8 code will be modified to use the new segment feature method properly in a later check in. Increase bits allowed for EOB end stop marker to 6 ready for 8x8. Change-Id: I4835bc8d3bf98e1775c3d247d778639c90b01f7f	2011-11-04 11:06:24 +00:00
Paul Wilkins	a258bba1fb	Segment Feature Data Access No change to functionality or output. Updates to the segment feature data structure now all done through functions such as set_segdata() and get_segdata() in seg_common.c. The reason for this is to make changing the structures (if needed) and debug easier. In addition it provides a single location for subsequent addition of range and validity checks. For example valid combination of mode and reference frame. Change-Id: I2e866505562db4e4cb6f17a472b25b4465f01add	2011-11-04 10:42:12 +00:00
Tero Rintaluoma	d497ec688d	Fix issue 374: eob read incorrectly Updated eob changes to check_reset_2nd_coeffs function. Change-Id: Id1b21c91c7f0fd286640b487ffe47867009b717d	2011-11-04 09:36:49 +02:00
Yaowu Xu	2bbde25003	make uv intra mode coding adaptive to Y mode This commit tries to do UV intra mode coding adaptive to Y intra mode. Entropy context is defined as conditional PDF of uv intra mode given the Y mode. All constants are normalized with 256 to be fit in 8 bits. This provides further coding efficiency beyond the quantizer adaptive y intra mode coding. Consistent gains were observed on all clips and all bit rates for HD all key encoding tests. To test, configure with --enable-experimental --enable-uvintra Change-Id: I2d78d73f143127f063e19bd0bac3b68c418d756a	2011-11-03 21:48:08 -07:00
Yaowu Xu	d8afecef71	Added context reset when 2nd order coefficients are cleared As discovered in path 10 of Change Ia12acd2f, reset 2nd order coeffs without reset of above and left coding context may have introduced problem that causes encoder/decoder mismatching. This commit added update to coding context when the 2nd coefficients are cleared. In addition, this commit also introduced early breakout in the checks to speed up when coefficients are too significant to be cleared. Change-Id: I85322a432b11e8af85001525d1e9dc218f9a0bd6	2011-11-03 16:05:29 -07:00
Paul Wilkins	a10a268e58	Segment Features. Removal of #ifdefs Removal of configure #ifdefs so that segment features always available. Removal of code supporting old segment feature method. Still a good deal of tidying up to do. Change-Id: I397855f086f8c09ab1fae0a5f65d9e06d2e3e39f	2011-11-03 17:14:26 +00:00
Scott LaVarnway	46639567a0	Merge "Change use of eob in the encoder"	2011-11-03 08:06:06 -07:00
Tero Rintaluoma	e4f2ec7a52	Change use of eob in the encoder Changed 'int eob' to 'char *eob' in BLOCKD so that both encoder and decoder will use eobs[25] array from MACROBLOCKD structure. In future, this will enable use of the decoder side IDCT in the encoder. Change-Id: I6e1c011628cb8864fd4a0b80f0279ce16a5ca978	2011-11-03 16:08:09 +02:00
Paul Wilkins	2370d440bd	Merge "Segmentation: Reference frames" into experimental	2011-11-03 12:58:43 +00:00
Yaowu Xu	8002c31804	Merge "added code to clear 2nd order block when appropriate"	2011-11-02 08:22:58 -07:00
Paul Wilkins	ab9a4ce065	Merge "Change to prevent encoding of effect-less 2nd order coefficients" into experimental	2011-11-02 14:48:17 +00:00
Paul Wilkins	87ff8620b2	Segmentation: Reference frames Modify reference frame segmentation so that ONE or MORE reference frames may be marked as a available for a given segment. Fixed bugs relating to segment coding of INTRA and some INTER modes at the segment level. Modified Q boost for static areas based on ambient average Q. Strong results now on clips with significant static areas. (some data points in derf set as high as 9% and some static & slide show type content in YT set > 20%) Change-Id: Ia79f912efa84b977f35a23683ae3643251e24f0c	2011-11-02 13:31:54 +00:00
John Koleszar	63bf108731	Merge "Fix: Increase default cx_data_size"	2011-11-01 15:25:47 -07:00
Stefan Holmer	1427205215	Changing decoder input partition API to input fragments. Adding support for several partitions within one input fragment. This is necessary to fully support all possible packetization combinations in the VP8 RTP profile. Several partitions can be transmitted in the same packet, and they can only be split by reading the partition lengths from the bitstream. Change-Id: If7d7ea331cc78cb7efd74c4a976b720c9a655463	2011-11-01 14:44:37 -07:00
Yunqing Wang	e44720af84	Add checks in MB quantizer initialization In some situations (f.g. error-resilient is turned on), vp8cx_mb _init_quantizer() was called once per macroblock. Added checks to avoid calculations when there is no change. Change-Id: Ie4f0a5ade2202041254990a4e9d5b03bd1ac5aea	2011-11-01 17:41:22 -04:00
Adrian Grange	2b450a460f	Deleted repeated code block The block of code skipped testing the current mode if the reference frame is AltRef, the mv is not (0,0) and ARNR filtering is disabled. This block of code has already been tested above if the macro CONFIG_SEGFEATURES is set to 0. Change-Id: I3f5710bb8270caad06c9a0eee59fa0daf1f70776	2011-11-01 08:41:43 -07:00
John Koleszar	9bf3bc9a72	Correct SPLITMV clamping Prior to this fix, the clamping state of the last subblock partition dominated, whereas the correct behavior is to clamp if any partition needs clamping. This bug was introduced by v0.9.6-232-g6b25501 See also: [1]: http://code.google.com/p/webm/issues/detail?id=371 [2]: https://bugzilla.mozilla.org/show_bug.cgi?id=696390 Change-Id: I444db492b4c4f05f039c7da6f4216da8207dc138	2011-10-31 14:42:51 -07:00
Adrian Grange	71fb1f8eab	Fixed this_mode used before set in vp8_pick_inter_mode The variable this_mode was being used before it had been initialized. Moved the line that sets-up this_mode toward the top of the enclosing loop, prior to its first use. The bug would result in tests in the loop lagging the mode that was expected to be tested. Change-Id: If4e51600449ce6b4285f112da17a44c24b4a19fb	2011-10-31 12:42:00 -07:00
Paul Wilkins	795c6dd2c9	Segmentation Entropy and tweaks. Some correction for entropy impact of segment signaled (EOB and ref frame) Other slight tweaks. Derf VBR average gain now over 1% (best over 7%) One YT test clip has gains of circa 30% (VBR) There is still an issue with noisy clips where making the background static and coded with 0,0 can have a negative effect, especially at low Q. This is probably because of the loss of smoothing by fractional pixel filters. Change-Id: I7a225613c98067b96f8fc7a7e36f95d465b2b834	2011-10-31 10:59:25 +00:00
Yaowu Xu	88e24f07ae	added code to clear 2nd order block when appropriate It is discovered that in rare situations the 2nd order block may produce a few small magnitude coefficients that has no effect on reconstruction. The situations are a combination of low quantizer values (high quality) and low energy in residual signals (content dependent). This commit added code to detect such cases and reset the 2nd order block to all 0. Patch 1 to 4 used code to do all-zero-check on idct result buffer, and tests on derf set showed a consistent gain of .12%-.14% on all metrics.But due to a recent change Ie31d90b, the idct result buffer is not longer populated. So patch 5&6 use an alternative method to detect the situations. Tests on derf set now shows a consistent quality gain of .16%-.20%. As suggested by Jim, Patch 7&8 removed the condition of all first order block not having any coefficient, instead we reset 2nd order coefficients to all 0 if sum of absolute value of the coefficients is small. So it does slightly more than just detecting the oddity as discussed above, but tests on derf set now show a consistent gain of .20%-.23% on all metrics. It is worth noting here that this change does not have any effect on mid/high quantizer range, it only affects the quantizer value 18 or blow. Within this range, the change helps compression by up to 2.5% on clips in the derf set. Change-Id: I718e19cf59a4fc2462cb7070832759beb9f7e7dd	2011-10-28 12:07:21 -07:00
Scott LaVarnway	e0309e1509	Merge "Improved decode_split_mv()"	2011-10-28 09:27:17 -07:00
Johann	cd1ef53d12	Merge "Fix ARM build problem introduced by CL I3fab6f2b"	2011-10-27 11:17:54 -07:00
Scott LaVarnway	6064384d59	Improved decode_split_mv() Tests showed ~1.2% performance boost on the HD clip used. Performance will vary based on material. Change-Id: Icbcf1a828750d5b4ae5252bf596b3ef594042e8a	2011-10-27 11:26:30 -04:00
Scott LaVarnway	0db5599957	Merge "Improved mv_bias"	2011-10-27 06:14:00 -07:00
Paul Wilkins	afb52f65f2	Resolve build problem Resolved experimental branch build problem when seg_features not configured. Change-Id: Ia0f9b460a26dc3eac9844ee595a7b196e9faf6a5	2011-10-27 12:35:36 +01:00
Attila Nagy	9452dce181	Fix ARM build problem introduced by CL I3fab6f2b Update ARM asm implementation of vp8_start_encode to new definition. Change-Id: Ic44791c969e351082331ba6146c3384c01a0dfad	2011-10-27 09:06:45 +03:00
Johann	294777b915	Merge "Reduce partial frame copy in encoder's pick_filter_level_fast"	2011-10-26 11:33:14 -07:00
Scott LaVarnway	21970d1dc2	Improved mv_bias Small performance gains. Change-Id: I709b9390a8a27a70f5f23574313b8db85ac7f23d	2011-10-26 11:46:10 -04:00
Scott LaVarnway	efa69d26a1	Merge "Improved read_mb_modes_mv()"	2011-10-26 08:26:30 -07:00
Scott LaVarnway	ff1d170e69	Improved read_mb_modes_mv() Interleaved vp8_find_near_mvs and vp8_mv_ref_probs. 2.5% to 4% performance improvement for the HD clips used. Change-Id: Id888b667cf5ae2f0e19da18743140f055ff7de8d	2011-10-26 10:46:36 -04:00
Attila Nagy	de82809444	Reduce partial frame copy in encoder's pick_filter_level_fast The partial frame copy function used to copy an extra 8 lines above and below. The partial frame filtering can only modify 3 pixel rows above the partial frame. Reduce copy to bare minimum needed, which is 4 lines, so that partial filtering on copied frame is possible. Define the "magic" fraction number for partial filtering in loopfilter.h . Change-Id: I4791ffc541b6884b12759a0d0714a8faf16147ec	2011-10-26 15:25:07 +03:00
Johann	f9dba66877	Merge "remove uninitialized variable warning"	2011-10-25 14:42:21 -07:00
Yaowu Xu	c8ef79d22e	Change to prevent encoding of effect-less 2nd order coefficients similar logic to http://gerrit.chromium.org/gerrit/#change,10359 Change-Id: Ia12acd2f2b3b92ef2a601da43c2497034ef62174	2011-10-25 10:25:02 -07:00
Scott LaVarnway	e03330bd80	Merge "Improved token decoder"	2011-10-25 10:04:11 -07:00
Johann	9409af2083	remove uninitialized variable warning Restructure if statement to clarify the error condition. Trigger the error before clobbering pc-> variables. Change-Id: Id01cab798a341ce9899078fdcec265a0e942a0b7	2011-10-25 09:24:40 -07:00
Yaowu Xu	4081e5b3fe	Merge "added a last stage rounding for 8x8 inverse dct" into experimental	2011-10-25 16:08:51 +00:00
Scott LaVarnway	3579baa115	Merge "Removed read_mv_ref"	2011-10-25 08:01:32 -07:00
Johann	a82cc0205d	remove unused variable warning Change-Id: I4fcd6e4656d9823aead941616cd63501aecbd6e2	2011-10-24 16:33:45 -07:00
Scott LaVarnway	49ea2bc3f4	Removed read_mv_ref Decode the mv mode with if-then-elses instead of traversing the vp8_mv_ref_tree data structure. This will make it easier to interleave vp8_find_near_mvs and vp8_mv_ref_probs. Change-Id: I1e798d6ec40fcaeeff06ccc82f81201978d12f74	2011-10-24 16:16:08 -04:00
Yaowu Xu	a66c945c59	added a last stage rounding for 8x8 inverse dct Prior to the added rounding, tests on randomly generated data showed that forward-inverse transform round trip errors are about 3.02/block for input range [-10,10] and 2.68/block for input range [-256, 255]. The added rounding reduced the errors to 0.031/block for input range [-10,10] and 0.037/block for input range [-256, 255]. Maximum round trip error on for any pixel position is 1. The average errors are calculated based on 100,000 blocks of randomly with the specified ranges. Paul mentioned in discussion that the change was not clear on why we need change the rounding, so Patch 2 intends to make the rationale obvious in code, it merged the two separate shifts into one, and the two separate rounding factors into one. Patch 1 and 2 have same numerical test results. Change-Id: Ic5e2f5463de17253084d8b2398c4a210194b20de	2011-10-24 11:56:47 -07:00
Scott LaVarnway	f182376dd6	Moved the split motion vector decode into a function. Change-Id: Ia023a0587100a52cb084f5d9d5512efa6198dad3	2011-10-24 13:52:15 -04:00
Scott LaVarnway	231339932b	Merge "Removed redundant mv clamps for nearmv and nearestmv"	2011-10-24 10:27:53 -07:00
Paul Wilkins	23701f4f87	Segmentation Features; Only encode sign bit for feature data that can have a sign. Tweaks to the test segmentation rules so that it now actually gives a net benefit on the derf set of about 0.4% though much higher on some clips at the low end. Change-Id: I8e61f1aebf41c9037db7e67e2f8975aa18a0c986	2011-10-24 17:06:29 +01:00
James Berry	bac6c229e5	Fix: Increase default cx_data_size Prior size could be too small in some instances resulting in an error. Change-Id: Ic601e49cbae92c98a0e7fb51ba8c186b352ffba6	2011-10-24 11:50:27 -04:00
Scott LaVarnway	a99c20c0f4	Removed redundant mv clamps for nearmv and nearestmv Did some cleanup as well. Patchset 2: Fixed bug. Will revisit the segmentation logic. Change-Id: Idf9fbcff9aaf467bdace9fbd58ef2cea6c602049	2011-10-24 11:37:52 -04:00
Paul Wilkins	01ce04bc06	Further segment feature extensions. This quite large check in includes the following: Merge in some code from Ronald (mbgraph.c) that scans a Gf/arf group. This is used as a basis for a simple segmentation for the normal frames in a gf/arf group. This code also uses satd functions from Yaowu. Adds functionality for coding the latest possible position of an EOB for blocks in the segment. (Currently 0-15 only, hence just for 4x4 dct). Where the EOB position is 0 this acts like "skip" and the normal coding of skip at the per mb level is disabled. Added functions (seg_common.c) for setting and reading segment feature elements. These may want to be optimized away at some point but while the mecahnism is in a state of flux they provide a single location for making changes and keep things a bit cleaner. This is still proof of concept code. Currently the tested feature set:- Quantizer, Loop Filter level, Reference frame, Prediction Mode, EOB end stop. TBD:- Add functions for setting and reading the feature data with range and validity checking. Handling of signed and unsigned feature data. At the moment all is assumed to be signed and a sign bit is coded but many cannot be negative. Correct handling of EOB feature with intra coded blocks. Testing/trapping of legal/illegal ref frame and mode combinations. Transform size switch plus merge and test with 8c8 DCT work Merge and test with Sumans Segmenation coding optimizations Change-Id: Iee12e83661c7abbd1e0ce6810915eb4ec35e2d8e	2011-10-24 15:52:18 +01:00
Scott LaVarnway	e59d53e999	Merge "Remove unused DETOK structure"	2011-10-21 06:30:57 -07:00
Tero Rintaluoma	bdb4fb8991	Remove unused DETOK structure DETOK structure is not used anymore. Change-Id: Id22e1af78fb85d4bb151237a60290d9364faf217	2011-10-21 09:33:49 +03:00
John Koleszar	2c0b4a24b9	Merge "Fix: check cx_data buffer prior to write"	2011-10-20 17:36:40 -07:00
James Berry	bc7151131d	Fix: check cx_data buffer prior to write check to make sure that cx_data buffer has enough room before writting to it, prior behavior did not which could result in a crash. Change-Id: I3fab6f2bc4a96d7c675ea81acd39ece121738b28	2011-10-20 15:55:00 -04:00
Johann	7cdc986cdf	Don't copy borders for loop_filter_pick During the _pick only the Y plane is examined. In addition, data beyond the borders of the frame is not read. Change-Id: Ic549adfca70fc6e0b55f8aab0efe81f0afac89f9	2011-10-19 18:54:14 -07:00
Johann	f382173225	Merge "enc: save entropy probs only when needed for refresh"	2011-10-19 14:36:29 -07:00
Scott LaVarnway	5e54085703	Improved token decoder Tests showed over 2% improvement on various HD clips. Change-Id: I94a30d209c92cbd5fef285122f9fc570688635fe	2011-10-19 13:38:35 -04:00
Scott LaVarnway	63a77cbed9	Merge "Remove usage of predict buffer for decode"	2011-10-19 10:24:48 -07:00
Scott LaVarnway	ed9c66f584	Remove usage of predict buffer for decode Instead of using the predict buffer, the decoder now writes the predictor into the recon buffer. For blocks with eob=0, unnecessary idcts can be eliminated. This gave a performance boost of ~1.8% for the HD clips used. Tero: Added needed changes to ARM side and scheduled some assembly code to prevent interlocks. Patch Set 6: Merged (I1bcdca7a95aacc3a181b9faa6b10e3a71ee24df3) into this commit because of similarities in the idct functions. Patch Set 7: EC bug fix. Change-Id: Ie31d90b5d3522e1108163f2ac491e455e3f955e6	2011-10-18 12:06:50 -04:00
Yaowu Xu	152ce6b2b9	fixed the wrong rounding in inverse haar transform Given the current forward haar transform: f0 = I0 + I1 + I2 + I3 f1 = I0 + I1 - I2 - I3 f2 = I0 - I1 + I2 - I3 f3 = I0 - I1 - I2 + I3 the output of the inverse haar prior rounding: i0 = f0 + f1 + f2 + f3 = I0 * 4; i1 = f0 + f1 - f2 - f3 = I1 * 4; i2 = f0 - f1 + f2 - f3 = I2 * 4; i3 = f0 - f1 - f2 + f3 = I3 * 4; As all the numbers are 4 multiples, simply >>2 always produces prefect results in term of forward-inverse transform round trip error. Change-Id: Id6658b00ea819ee61cfeef8c5985d4cd3e77f44e	2011-10-14 09:33:54 -07:00
Attila Nagy	a5cd42feb9	Fix: vp8cx_pack_tokens_into_partitions_armv5 crash It was crashing when number of partitions was bigger than the number of MB rows (ex. 128x96 with 8 partitions). Start point was not checked against mb_rows, plus extra "empty" partitions were not written out. Change-Id: I9c2f013b9ec022354b658fab4ef799ff8b1de93d	2011-10-14 10:53:04 +03:00
Adrian Grange	04182a121a	Merge "Added rate-targeted temporal scalability"	2011-10-11 12:54:52 -07:00
Adrian Grange	217591fde5	Added rate-targeted temporal scalability Added the ability to create rate-targeted, temporally scalable, VP8 compatible bitstreams. The application vp8_scalable_patterns.c demonstrates how to use this capability. Users can create output bitstreams containing upto 5 temporally separable streams encoded as a single VP8 bitstream. (previously abandoned as: I92d1483e887adb274d07ce9e567e4d0314881b0a) Change-Id: I156250a3fe930be57c069d508c41b6a7a4ea8d6a	2011-10-11 12:49:12 -07:00
John Koleszar	07ba411914	Reset FPU state after calc_plane_error() Fixes a MMX/SSE2 mismatch when building with --enable-internal-stats. Change-Id: I0c50a1f246f6916b7a5fc6f36864ceb362f25520	2011-10-11 08:43:30 -07:00
James Berry	05bde9d4a4	bug fix - starting/optimal/max and buffer_level changed from int to int64_t buffer_level in VP8_COMP and starting_buffer_level, optimal_buffer_level and maximum_buffer_size in VP8_CONFIG changed from int to int64_t to avoid potential crash issues for larger target bit rates. Change-Id: I0d5ab6c8a44c2fef51f30cd8df4bb4b739c5df26	2011-10-10 12:16:55 -04:00
Attila Nagy	c0de35b413	enc: save entropy probs only when needed for refresh Previous entropy probs need to be saved (and restored) only when current updates are not propagated. Change-Id: Ie6ee0543066e30874e56258be0a6b7d2dd2fdb2b	2011-10-10 13:44:54 +03:00
Yaowu Xu	3ca849691c	fixed a decoder bug When 8x8 transform is enabled, the decoder does an extra reconstruct on MBs that are coded using 8x8. This commit fixed the logic around the decoding of mb encoded with 8x8 transform. Change-Id: I6926557c9ef00eecb375f62946f7e140c660bf6f	2011-10-08 15:48:53 -07:00
Scott LaVarnway	af12c23e8e	Merge "Improved tokenize"	2011-10-04 09:57:42 -07:00
John Koleszar	8f8b526b54	Merge "Fix uninitialized new_mv_count in first pass file"	2011-10-04 07:40:49 -07:00
Yunqing Wang	538865dfa5	Merge "Multithreaded encoder, late sync loopfilter"	2011-10-04 07:04:30 -07:00
John Koleszar	86712c50f2	Fix uninitialized new_mv_count in first pass file Uninitialized data could be written to the first pass file when no motion vectors are present in the frame. Also fix a number of compiler warnings. Change-Id: Icc9f53b6d33da9de4563d86d9fd591910473ea90	2011-10-04 09:50:52 -04:00
Johann	2aa408524c	Merge "Reduce computational complexity of generic C loop filter."	2011-09-30 16:17:56 -07:00
Johann	48b1917112	Merge "combine loopfilter data access"	2011-09-30 15:47:56 -07:00
Scott LaVarnway	ab00d209bc	Improved tokenize For a realtime HD encodings, up to 1.6% gains seen. Change-Id: If45028e23db95124da63f9d38ffe06e05596cc6e	2011-09-30 12:49:46 -04:00
Paul Wilkins	156b221a7f	Segment coding of mode and reference frame. Proof of concept test code that encodes mode and reference frame data at the segment level. Decode-able bit stream but some issues not yet resolved. As it this helps a little on a couple of clips but hurts on most as the basis for segmentation is unsound. To build and test, configure with --enable-experimental --enable-segfeatures Change-Id: I22a60774f69273523fb152db8c31f4b10b07c7f4	2011-09-30 16:45:16 +01:00
Paul Wilkins	45e49e6e19	Experimental: segfeature added. New setting added to configure script	2011-09-30 16:08:37 +01:00
Johann	3556deaca3	combine loopfilter data access The data processed by the loopfilter overlaps. At the block level, this results in some redundant transforms. Grouping the filtering allows for a single 16x16 transpose (and inversion) instead of three 16x8 transposes (and three more inversions). This implementation is x86_64 only. We retain the previous implementation for x86. Improvements are obviously material dependant, but it seems to be ~%1 in tests here. Change-Id: I467b7ec3655be98fb5f1a94b5d145e5e5a660007	2011-09-30 07:38:35 -07:00
Alpha Lam	7bce513afe	Call vp8_find_near_mvs lazily vp8_find_near_mvs() is being called on all possible reference frames but the data computed may be used if the loop exits early, which can be due to x->skip beign set to 1. Optimize this by call vp8_find_near_mvs() laziy only if it is going to be used and not computed yet. Change-Id: Iccdbd4c962a670c9f2c99b8aca8096042ca5dc98	2011-09-30 14:48:18 +01:00
Paul Wilkins	a572ac8327	Merge "CQ and two pass rate control."	2011-09-30 02:57:54 -07:00
Paul Wilkins	b6e27d5f0b	CQ and two pass rate control. Changes to the selection of Q limits for two pass and two pass CQ mode. Allowance made for Mode and motion vector costs. Some refactoring of common code. For Derf and YT sets CQ mode average improvement circa 1% (SSIM and Global PSNR). Some increased tendency to undershoot even when user CQ not reached. Patch2: Removed some test code accidentally merged. Change-Id: Icf74d13af77437c08602571dc7a97e747cce5066	2011-09-30 10:55:52 +01:00
Aaron Watry	69aa303d96	Reduce computational complexity of generic C loop filter. Change-Id: I1e7f9ed3cd907844a495b9e0073bc140b87e5c06	2011-09-29 17:25:48 -05:00
Attila Nagy	380d64ecb1	Multithreaded encoder, late sync loopfilter Sync with loopfilter thread just at the beginning of next frame encoding. This returns control to application faster and allows a better multicore scaling. When PSNR packets are generated the final filtered frame is needed imediatly so we cannot delay the sync. Change-Id: I288d97b5e331d41d6f5bb49d97986fa12ac6f066	2011-09-29 10:06:24 +03:00
John Koleszar	6f9457ec12	Merge "clamp_mvs() using the wrong motion vector information"	2011-09-22 11:54:15 -07:00
John Koleszar	3c85c532bb	Merge changes Ie650e9b8,I2427e494 * changes: vpxenc: get version string programatically Install missing default_coef_probs.h	2011-09-22 11:18:00 -07:00
Johann	9f41a8b0aa	Merge "Replace vpx_ports/config.h with vpx_config.h"	2011-09-22 09:30:18 -07:00
John Koleszar	4a6ac727fe	Install missing default_coef_probs.h Make sure that this header is listed as one of the sources, so that it will be installed if necessary. Change-Id: I2427e494488126b179151dc21043c1e2c8ba5991	2011-09-22 11:08:24 -04:00
Attila Nagy	1a7d25a484	Replace vpx_ports/config.h with vpx_config.h Just a clean-up. Change-Id: Iea5b6dc925dcfa7db548bc1ab1a13d26ed5a2c9a	2011-09-22 13:33:54 +03:00
John Koleszar	305084d5fa	Merge remote branch 'internal/upstream' into HEAD	2011-09-21 00:05:04 -04:00
Fritz Koenig	bd0c3409a8	Move neon only arm functions under arm/neon. These files don't contain generic arm code, so should only be compiled by neon. Change-Id: Ie712823aa04d4235e7cfe7a3b725e73ee4c3e564	2011-09-20 10:51:06 -07:00
Johann	6829e62718	Merge "NEON FDCT updated to match current C code"	2011-09-20 09:51:05 -07:00
Johann	86e07525d5	Merge "NEON walsh transform updated to match C"	2011-09-20 09:50:42 -07:00
Johann	3a16276cf7	Merge "Updated ARMv6 forward transforms to match C"	2011-09-20 09:50:36 -07:00
Johann	fdd51829b1	Merge "Fixed armv5te multiplications"	2011-09-20 09:50:19 -07:00
Tero Rintaluoma	0c2529a812	NEON FDCT updated to match current C code - Removed fast_fdct4x4_neon and fast_fdct8x4_neon - Uses now short_fdct4x4 and short_fdct8x4 - Gives ~1-2% speed-up on Cortex-A8/A9 Change-Id: Ib62f2cb2080ae719f8fa1d518a3a5e71278a41ec	2011-09-20 10:20:55 +03:00
Tero Rintaluoma	3c19bc3fb3	Fixed armv5te multiplications Rd and Rm registers should be different in 'mul'. This register combination results in unpredictable behaviour. GCC will give a warning and RVCT an error in this case. Restriction applies only to armv5 targets and not for armv6 and above. Change-Id: I378d17c51e1f16a6820814fbed43e115aaabb03e	2011-09-20 09:59:27 +03:00
John Koleszar	feea724296	Merge remote branch 'internal/upstream' into HEAD	2011-09-20 00:05:04 -04:00
Stefan Holmer	e529a825f7	Fix necessary for input partitions iface to match the RTP profile These changes fixes a glitch between the RTP profile and the input partitions interface. Since there's no way for the user to know the actual number of partitions, the decoder have to read the multi_token_paritition bits also when input partitions mode is enabled. Included are also a couple of fixes for issues with independent partitions and uninitialized memory reads. Change-Id: I6f93b15287d291169ed681898ed3fbcc5dc81837	2011-09-19 15:00:21 +02:00
Tero Rintaluoma	4c3ad66b7f	Updated ARMv6 forward transforms to match C - Updated walsh transform to match C (based on Change Id24f3392) - Changed fast_fdct4x4 and 8x4 to short_fdct4x4 and 8x4 correspondingly Change-Id: I704e862f40e315b0a79997633c7bd9c347166a8e	2011-09-19 10:26:59 +03:00
Tero Rintaluoma	2a4b2a000c	NEON walsh transform updated to match C Modified original patch If2f07220885c4c3a0cae0dace34ea0e36124f001 according to comments. Scheduled code a little bit to prevent some interlocks. Change-Id: I338f02b881098782f82af63d97f042b85e63e902	2011-09-19 10:15:33 +03:00
John Koleszar	f3fce80954	Merge remote branch 'internal/upstream' into HEAD	2011-09-17 00:05:04 -04:00
Yaowu Xu	1d44e7ce1f	enable selecting&transmitting to for intra mode entropy This commit added a 3 bit index to the bitstream, the index is used to look into the intra mode coding entropy context table. The commit uses the mode stats to calculate the cost of transmitting modes using 8 possible entropy distributions, and selects the distribution that provides the lowest cost to do the actual mode coding. Initial test show this provides additional .2%~.3% gain over quantizer adaptive intra mode coding. So the adaptive intra mode coding provides a total of .5%(psnr) to .6% gain(ssim) combined for all-key-encoding To build and test, configure with --enable-experimental --enable-qimode Change-Id: I7c41cd8bfb352bc1fe7c5da1848a58faea5ed74a	2011-09-16 16:33:19 -07:00
Yaowu Xu	aac2c12663	add quantizer adaptive intra mb mode encoding make intra mode coding entropy distribution adaptive to baseQindex, an encoding test on hd clips with all key frame shows universal gain on all clips in both .2%(psnr) and (ssim).3%. To build and test, configure with --enable-experimental --enable-qimode Change-Id: Iaa69241b984d4fdd8baa6d77ee78c0140f5ac00a	2011-09-16 16:26:35 -07:00

... 11 12 13 14 15 ...

2586 Commits