generic-library/vpx

Author	SHA1	Message	Date
Ronald S. Bultje	55cafb6156	Reindent segmentation code. Indentation was off by 2 spaces for this particular block. Change-Id: I1e587b7ad3eff77ade5521252d20c7bb2daa0f6d	2013-02-06 09:18:25 -08:00
John Koleszar	31cbe2ed9a	Eliminate tautology Unreachable code that does nothing anyway removed forever. Change-Id: I14105d2dd9dbc9d558f36464055e350dbeb45488	2013-02-06 08:22:59 -08:00
Paul Wilkins	8b4e9c5925	Merge "Change definition of NearestMV." into experimental	2013-02-06 04:06:31 -08:00
Ronald S. Bultje	278df745d2	Fix mismatch after merge of the tiling patch. Change-Id: I8ecc178b4d4069e721c7fec6d7631c00e4a3e5d5	2013-02-05 17:15:04 -08:00
Ronald S. Bultje	1407bdc243	[WIP] Add column-based tiling. This patch adds column-based tiling. The idea is to make each tile independently decodable (after reading the common frame header) and also independently encodable (minus within-frame cost adjustments in the RD loop) to speed up hardware & software en/decoders if they use multi-threading. Column-based tiling has the added advantage (over other tiling methods) that it minimizes realtime use-case latency, since all threads can start encoding data as soon as the first SB-row worth of data is available to the encoder. There is some test code that does random tile ordering in the decoder, to confirm that each tile is indeed independently decodable from other tiles in the same frame. At tile edges, all contexts assume default values (i.e. 0, 0 motion vector, no coefficients, DC intra4x4 mode), and motion vector search and ordering do not cross tiles in the same frame. Tile independence is not maintained between frames ATM, i.e. tile 0 of frame 1 is free to use motion vectors that point into any tile of frame 0. We support 1 (i.e. no tiling), 2 or 4 column-tiles. The loopfilter crosses tile boundaries. I discussed this briefly with Aki and he says that's OK. An in-loop loopfilter would need to do some sync between tile threads, but that shouldn't be a big issue. Results: with tiling disabled, we go up slightly because of improved edge use in the intra4x4 prediction. With 2 tiles, we lose ~1% on derf, ~0.35% on HD and ~0.55% on STD/HD. With 4 tiles, we lose another ~1.5% on derf, ~0.77% on HD and ~0.85% on STD/HD. Most of this loss is concentrated in the low-bitrate end of clips, and most of it is because of the loss of edges at tile boundaries and the resulting loss of intra predictors. TODO: - more tiles (perhaps allow row-based tiling also, and max. 8 tiles)?
- maybe optionally (for EC purposes), motion vectors themselves should not cross tile edges, or we should emulate such borders as if they were off-frame, to limit error propagation to within one tile only. This doesn't have to be the default behaviour but could be an optional bitstream flag. Change-Id: I5951c3a0742a767b20bc9fb5af685d9892c2c96f	2013-02-05 15:43:03 -08:00
Ronald S. Bultje	822864131b	Merge "Add SSE3 versions for sad{32x32,64x64}x4d functions." into experimental	2013-02-05 15:40:46 -08:00
Yaowu Xu	c9ae73b251	Merge "rewrite 4x4 idct and fdct" into experimental	2013-02-05 15:26:36 -08:00
Ronald S. Bultje	58c983d109	Add SSE3 versions for sad{32x32,64x64}x4d functions. Overall encoding about 15% faster. Change-Id: I176a775c704317509e32eee83739721804120ff2	2013-02-05 15:21:47 -08:00
John Koleszar	7a07eea13f	Convert subpixel filters to use convolve framework Update the code to call the new convolution functions to do subpixel prediction rather than the existing functions. Remove the old C and assembly code, since it is unused. This causes a 50% performance reduction on the decoder, but that will be resolved when the asm for the new functions is available. There is no consensus for whether 6-tap or 2-tap predictors will be supported in the final codec, so these filters are implemented in terms of the 8-tap code, so that quality testing of these modes can continue. Implementing the lower complexity algorithms is a simple exercise, should it be necessary. This code produces slightly better results in the EIGHTTAP_SMOOTH case, since the filter is now applied in only one direction when the subpel motion is only in one direction. Like the previous code, the filtering is skipped entirely on full-pel MVs. This combination seems to give the best quality gains, but this may be indicative of a bug in the encoder's filter selection, since the encoder could achieve the result of skipping the filtering on full-pel by selecting one of the other filters. This should be revisited. Quality gains on derf positive on almost all clips. The only clip that seemed to be hurt at all datarates was football (-0.115% PSNR average, -0.587% min). Overall averages 0.375% PSNR, 0.347% SSIM. Change-Id: I7d469716091b1d89b4b08adde5863999319d69ff	2013-02-05 14:23:17 -08:00
Yaowu Xu	fa36981ec8	rewrite 4x4 idct and fdct This commit changes the 4x4 iDCT to use same algorithm & constants as other iDCTs. The 4x4 fDCT is also changed to be based on the new iDCT. Change-Id: Ib1a902693228af903862e1f5a08078c36f2089b0	2013-02-05 11:42:49 -08:00
Paul Wilkins	81043e8d62	Change definition of NearestMV. This commit makes the NearestMV match the chosen best reference MV. It can be a 0,0 or non-zero vector, which means the compound nearest mv mode can combine a 0,0 and a non-zero vector. Change-Id: I2213d09996ae2916e53e6458d7d110350dcffd7a	2013-02-05 17:03:25 +00:00
Paul Wilkins	3ab538767c	Re-factor code for rd thresholds. Separate out code to set the main encode speed related rd thresholds. Some values changed from the initial defaults for various new modes. Quality test results pending but even the addition of some further non-zero defaults helps encode speed somewhat in limited testing on derf clips. Adjustment of thresholds for quality / speed tradeoff to follow. Change-Id: I117ee473157e151a1b93193d5f393449328de20d	2013-02-04 18:48:41 +00:00
Yaowu Xu	c1f611be74	Merge "fix a small bug in 16 point forward dct" into experimental	2013-02-01 05:57:41 -08:00
Frank Galligan	f67d740b34	Add support for x64 and win64 yasm flags. Some projects must define only win64 for Windows 64bit builds using yasm. Change-Id: I1d09590d66a7bfc8b4412e1cc8685978ac60b748	2013-01-31 16:25:37 -08:00
Yaowu Xu	ab1cad9bdd	fix a small bug in 16 point forward dct The commit fixes a minor error in 16 point fdct wherein a rotation can produce a result of -1 instead of 0. Change-Id: I45aac4a52bcd06225c6d04e643547a13e1c1aade	2013-01-31 15:39:41 -08:00
Deb Mukherjee	a53be60904	Merge "Adding a frame parallel decoding mode" into experimental	2013-01-30 12:03:45 -08:00
Ronald S. Bultje	b499c24c2f	Merge "don't code the branch for the predicted seg_id if that flag is false." into experimental	2013-01-30 10:02:51 -08:00
Ronald S. Bultje	3a4b18bc67	don't code the branch for the predicted seg_id if that flag is false. Change-Id: Icb6e21dc0c2d9918faa33c8bf70943660df7ad88	2013-01-30 09:30:46 -08:00
Ronald S. Bultje	3febf9707d	Default superblock skip flag to 32x32 for skip-blocks. This is identical to the later decisions made in encode_superblock(). This commit doesn't actually change anything, but makes the mbmi state more consistent between the RD loop and the final encode result. Change-Id: I9e735afb7c5a52e5b61728cb88c67ef9b9bf59be	2013-01-29 21:46:31 -08:00
Ronald S. Bultje	b90996c51b	Reset skip flag in superblock RD loop. This is the superblock equivalent of commit 290b83a. Change-Id: Ib3945dd9e992fa9ec1fdea5a11e17a3cc0e37637	2013-01-29 21:42:56 -08:00
Ronald S. Bultje	5a9da2d906	Merge "Fix block pointer corruption in intra8x8 prediction with 4x4 transform." into experimental	2013-01-29 12:49:42 -08:00
Paul Wilkins	d8e86af263	Merge "Remove eob_max_offset markers." into experimental	2013-01-29 09:29:45 -08:00
Paul Wilkins	5d1c62c639	Merge "Segment Skip Flag" into experimental	2013-01-29 09:29:26 -08:00
Ronald S. Bultje	ffc2e4f4af	Fix block pointer corruption in intra8x8 prediction with 4x4 transform. The RD loop would change the pointer after the first mode (DC) was tested, leading to corrupt block objects being provided for the others. This would essentially render the i8x8 predictor useless. Change-Id: I16c5906ca64fb34878ac32ce59af8974e4582bb8	2013-01-29 09:18:47 -08:00
Paul Wilkins	93762ca9b2	Remove eob_max_offset markers. Remove eob_max_offset markers and replace with the generic skip_block flag to indicate to the quantizer that all coeffs are to be set to 0 and the eob position set to 0. Change-Id: Id477e8f8d4ec1a5562758904071013c24b76bfd7	2013-01-29 13:39:34 +00:00
Paul Wilkins	0ff9b033b0	Segment Skip Flag First step in simplifying the segment mode and segment EOB flags into a simpler segment skip flag that implies 0,0 mv and EOB at position 0. Change-Id: Ib750cac31a7a02dc21082580498efd9f7d8d72a5	2013-01-28 17:28:04 +00:00
Paul Wilkins	5f2429259f	Merge "Simplify Zero bin and zero bin run code." into experimental	2013-01-28 08:35:36 -08:00
Paul Wilkins	8e2c03fbfd	Simplify Zero bin and zero bin run code. Simplification to eliminate a number of very large data structures. All zero run, zbin boosts for different transform sizes are now limited to a maximum run length of 15 before they max out the boost. Some further work still needs to be done to refactor, rationalize and optimize the multiple quantizer functions. The simplification, coupled with tweaks to the 16 element array now used for all transform sizes, has minimal effect on quality. Change-Id: I6f3948b8ca0418b60d4db9030ff19026a34ed423	2013-01-28 13:21:10 +00:00
Deb Mukherjee	dfd89f2eab	Adding a frame parallel decoding mode Adds a flag to disable features that would inhibit frame parallel decoding. This includes backward adaptation and MV sorting based on search in ref frame buffer. Also includes some minor clean-ups. Change-Id: I434846717a47b7bcb244b37ea670c5cdf776f14d	2013-01-25 17:16:19 -08:00
Ronald S. Bultje	3ca5b35ce5	Merge "Remove "update_context" variable from VP9_COMP context." into experimental	2013-01-25 09:43:42 -08:00
Ronald S. Bultje	0a7b3953f0	Remove "update_context" variable from VP9_COMP context. The variable is always zero. Change-Id: Id5cdbecad543bca465a5b1d471badaec7e112c8d	2013-01-24 16:28:53 -08:00
Deb Mukherjee	01cafaab1d	Adds an error-resilient mode with test Adds an error-resilient mode where frames can continue to be decoded even when there are errors (due to network losses) on a prior frame. Specifically, backward updates are turned off and probabilities of various symbols are reset to defaults at the beginning of each frame. Further, the last frame's mvs are not used for the mv reference list, and the sorting of the initial list based on search on previous frames is turned off as well. Also adds a test where an arbitrary set of frames are skipped from decoding to simulate errors. The test verifies (1) that if the error frames are droppable - i.e. frame buffer updates have been turned off - there are no mismatch errors for the remaining frames after the error frames; and (2) if the error frames are non-droppable, there are not only no decoding errors but the mismatch PSNR between the decoder's version of the post-error frames and the encoder's version is at least 20 dB. Change-Id: Ie6e2bcd436b1e8643270356d3a930e8989ff52a5	2013-01-23 21:56:15 -08:00
John Koleszar	2f24ad9e85	Use alt-ref frame context for keyframes This matches the behavior prior to generalizing the frame context selection, and intuitively makes sense in that the first forward ref is immediately after the keyframe, so its quality is improved a bit by using the keyframe's entropy context rather than the default. Change-Id: Ia82cef79382b9d8cfafdc44ba0533d4dc3e44053	2013-01-18 14:40:39 -08:00
Frank Galligan	9ca907b53e	libvpx: Fix some warnings. Change-Id: If8be8b9d28a29631f29c46daea8a226ab3580610	2013-01-18 09:51:57 -08:00
John Koleszar	26bd81b955	Preserve the previous golden frame on golden updates This commit restores the quality lost when the buffer-to-buffer copy logic was removed. Note that this is specific to the current use of golden frames and will need rework when RTC functionality is added. Change-Id: I7324a75acd96eafd9e0f9b8633d782e390d5dc21	2013-01-16 15:57:02 -08:00
John Koleszar	4b65837bc6	Generalize and increase frame coding contexts Previously there were two frame coding contexts tracked, one for normal frames and one for alt-ref frames. Generalize this by signalling the context to use in the bitstream, rather than tying it to the alt ref refresh bit. Also increase the number of contexts available to 4, which may be useful for temporal scalability. Change-Id: I7b66daaddd55c535c20cd16713541fab182b1662	2013-01-16 14:07:27 -08:00
John Koleszar	da832a80e4	Start to anonymize reference frames Remove lst_fb_idx, gld_fb_idx, alt_fb_idx, refresh_last_frame, refresh_golden_frame, refresh_alt_ref_frame from common. Gold/Alt are encode side conventions. From the decoder's perspective, we want to be dealing with numbered references. Updates to active_ref 2 signal mode context switches, vestigial from refresh_alt_ref_frame. This needs some clean up to make sense with increased numbers of reference frames, as well as reimplementing the swapping of alt/golden which was previously done using the buffer-to-buffer copy mechanism removed in an earlier commit. Change-Id: I7334445158b7666f9295d2a2dd22aa03f4485f58	2013-01-16 14:06:23 -08:00
John Koleszar	394b0a6a30	Update encoder to use fb_idx_ref_cnt Do reference counting the same way on the encoder as the decoder does, rather than maintaining the 'flags' member of YV12_BUFFER_CONFIG. Change-Id: I91dc210ffca081acaf9d5c09a06e7461b3c3139c	2013-01-15 17:36:39 -08:00
John Koleszar	b8e027989f	Remove buffer-to-buffer copy logic This is the first in a series of commits to add additional reference frames to the codec. Each frame will be able to update any of the available references, but copying between references is not supported. Change-Id: I5945b5ce6cc3582c495102b4e7eed4f08c44d5a1	2013-01-15 17:36:39 -08:00
Yaowu Xu	9bf73f46f9	fix a number of issues that cause failures during master jenkins verification process Change-Id: I3722b8753eaf39f99b45979ce407a8ea0bea0b89	2013-01-14 18:32:32 -08:00
John Koleszar	24bc1a7189	Use INT64_MAX instead of LLONG_MAX These variables have the type int64_t, not long long. long long could be a larger type than 64 bits. Emulate INT64_MAX for older versions of MSVC, and remove the unreferenced vpx_ports/vpxtypes.h Change-Id: Ideaca71838fcd3849d816d5ab17aa347c97d03b0	2013-01-14 15:57:21 -08:00
Ronald S. Bultje	c9071601a2	Remove compound intra-intra experiment. This experiment gives little gains and adds relatively much code complexity (and it hinders other experiments), so let's get rid of it. Change-Id: Id25e79a137a1b8a01138aa27a1fa0ba4a2df274a	2013-01-14 15:47:25 -08:00
Paul Wilkins	e2c696a7aa	Merge "Fix compiler warnings" into experimental	2013-01-14 14:20:57 -08:00
Adrian Grange	c7576f97ff	Merge "Merge prediction filter" into experimental	2013-01-14 14:18:21 -08:00
Yaowu Xu	113005b11d	Fix compiler warnings The warnings caused verify failure with gerrit for several commits Change-Id: I030df8638bd69b8783a3ac58e720ff9f0bfd546c	2013-01-14 13:56:52 -08:00
Adrian Grange	7bcaac3e64	Merge prediction filter Removed the experimental flag from around the prediction filter. Change-Id: Ic1dd2db8fe8ac17ed5129f83094d4c5cdd5527d2	2013-01-14 12:57:07 -08:00
Ronald S. Bultje	290b83ab62	Reset x->skip for each iteration in the RD loop. This prevents ill-defined behaviour, such as setting x->skip for a mode that is excluded because of frame-level flags (e.g. filter selection, compound prediction selection), then not breaking out of the RD loop because the mode is not allowed, but keeping the flag on. Whatever mode is iterated through next in the RD loop will then carry this flag, and all sorts of bad stuff happen, such as x->skip being set on intra pred modes. Change-Id: I5bec46b36e38292174acb1c564b3caf00a9b4b9a	2013-01-14 12:44:32 -08:00
John Koleszar	76ac5b3937	Fix unused variable warnings Previous commit does not build cleanly on Jenkins with the DWT/DCT hybrid experiment enabled (--enable-dwtdcthybrid). Change-Id: Ia67e8f59d17ef2d5200ec6b90dfe6711ed6835a5	2013-01-14 12:12:43 -08:00
Deb Mukherjee	516db21c2c	Further enhancements/fixes on dct/dwt hybrid txfm Fixes some scaling issues. Adds an option to only compute the dct on the low-low subband for 32x32 and 64x64 blocks using only a single 16x16 dct after 1 and 2 wavelet decomposition levels respectively. Also adds an option to use an 8x8 dct as building block. Currently with the 2/6 filter and with a single 16x16 dct on the low-low band, the results compared to full 32x32 dct are as follows: derf: -0.15% yt: -0.29% std-hd: -0.18% hd: -0.6% These are my current recommended settings, since the 2/6 filter is very simple. Results with 8x8 dct are about 0.3% worse. Change-Id: I00100cdc96e32deced591985785ef0d06f325e44	2013-01-12 16:00:53 -08:00
Paul Wilkins	d27ae620bc	Remove INT64_MAX references. Replace INT64_MAX references with LLONG_MAX for windows build. Change-Id: Ib8b45c1e9c15c043b2f54c27ed83b8682b2be34f	2013-01-11 19:45:26 +00:00


1610 Commits