generic-library/vpx

Author	SHA1	Message	Date
John Koleszar	7a07eea13f	Convert subpixel filters to use convolve framework Update the code to call the new convolution functions to do subpixel prediction rather than the existing functions. Remove the old C and assembly code, since it is unused. This causes a 50% performance reduction on the decoder, but that will be resolved when the asm for the new functions is available. There is no consensus for whether 6-tap or 2-tap predictors will be supported in the final codec, so these filters are implemented in terms of the 8-tap code, so that quality testing of these modes can continue. Implementing the lower complexity algorithms is a simple exercise, should it be necessary. This code produces slightly better results in the EIGHTTAP_SMOOTH case, since the filter is now applied in only one direction when the subpel motion is only in one direction. Like the previous code, the filtering is skipped entirely on full-pel MVs. This combination seems to give the best quality gains, but this may be indicative of a bug in the encoder's filter selection, since the encoder could achieve the result of skipping the filtering on full-pel by selecting one of the other filters. This should be revisited. Quality gains on derf positive on almost all clips. The only clip that seemed to be hurt at all datarates was football (-0.115% PSNR average, -0.587% min). Overall averages 0.375% PSNR, 0.347% SSIM. Change-Id: I7d469716091b1d89b4b08adde5863999319d69ff	2013-02-05 14:23:17 -08:00
John Koleszar	5ca6a3667f	Add 8-tap generic convolver This commit introduces a new convolution function which will be used to replace the existing subpixel interpolation functions. It is much the same as the existing functions, but allows for changing the filter kernel on a per-pixel basis, and doesn't bake in knowledge of the filter to be applied or the size of the resulting block into the function name. Replacing the existing subpel filters will come in a later commit. Change-Id: Ic9a5615f2f456cb77f96741856fc650d6d78bb91	2013-02-05 14:19:28 -08:00
Yaowu Xu	77f889b2e3	fix a build issue with MSVC on windows for idct 16x16 unit test Change-Id: I51da9405c3a4d7bb3f4cdf062aaccaa90b33dca4	2013-02-05 12:12:05 -08:00
Yaowu Xu	fa36981ec8	rewrite 4x4 idct and fdct This commit changes the 4x4 iDCT to use same algorithm & constants as other iDCTs. The 4x4 fDCT is also changed to be based on the new iDCT. Change-Id: Ib1a902693228af903862e1f5a08078c36f2089b0	2013-02-05 11:42:49 -08:00
Paul Wilkins	81043e8d62	Change definition of NearestMV. This commit makes the NearestMV match the chosen best reference MV. It can be a 0,0 or non zero vector which means the the compound nearest mv mode can combine a 0,0 and a non zero vector. Change-Id: I2213d09996ae2916e53e6458d7d110350dcffd7a	2013-02-05 17:03:25 +00:00
Scott LaVarnway	77440d508b	Merge "Added vp9_short_idct1_32x32_c" into experimental	2013-02-05 08:56:05 -08:00
Paul Wilkins	fb4b533da9	Merge "Re-factor code for rd thresholds." into experimental	2013-02-05 02:12:45 -08:00
Scott LaVarnway	5780c4cbd5	Added vp9_short_idct1_32x32_c and called this function in vp9_dequant_idct_add_32x32_c when eob == 1. For the test clip used, the decoder performance improved by 21+%. Based on Yaowu's 16 point idct work. Change-Id: Ib579a90fed531d45777980e04bf0c9b23c093c43	2013-02-04 16:49:17 -08:00
Paul Wilkins	3ab538767c	Re-factor code for rd thresholds. Separate out code to set the main encode speed related rd thresholds. Some values changed from the initial defaults for various new modes. Quality test results pending but even the addition of some further non-zero defaults helps encode speed somewhat in limited testing on derf clips. Adjustment of thresholds for quality / speed tradeoff to follow. Change-Id: I117ee473157e151a1b93193d5f393449328de20d	2013-02-04 18:48:41 +00:00
Yaowu Xu	dea143327e	Added INT16_MIN and INT16_MAX for MSVC builds These macros were not defined in earlier version of MSVC Change-Id: I8270a3abb7c6e9ead1931a653d7e41f877a1017b	2013-02-04 10:21:32 -08:00
Yaowu Xu	ebd5808970	enable 16x16 iDCT unit test test for forward transform will be enabled later after re-do forward transform Change-Id: Ie7c7cf88baf7ecbebbe52fe027e1c3b33d3b9d49	2013-02-04 09:03:32 -08:00
Yaowu Xu	1eb79dc1dc	re-write 8 point idct to be consistent with idct16 and idct32. Change-Id: Ie89dbd32b65c33274b7fecb4b41160fcf1962204	2013-02-04 07:31:25 -08:00
Yaowu Xu	ccaaeb4b5a	a couple of minor fixes fixed a function prototypes to prevent compiler warnings; removed a function not in use; un-capitialize "Refstride" to ref_stride Change-Id: Ib4472b6084f357d96328c6a06e795b6813a9edba	2013-02-04 07:19:32 -08:00
KO Myung-Hun	7f5e4fd7bd	Use smartalign for long nops with NASM 'CPU amdnop' is supported by YASM only. Change-Id: Ia3f7c2ba6d3bdf2889b62f5c6127fd515d7c7394	2013-02-03 21:51:05 +09:00
KO Myung-Hun	dd8d0134e0	Disable USE_POSIX_MAP on OS/2 Change-Id: Ib88ab619fa4e1593e85ca325555f2c4648ac9bc7	2013-02-03 21:50:58 +09:00
Yaowu Xu	af4c9d2f88	Merge "Changes 16 point idct" into experimental	2013-02-01 08:22:20 -08:00
Yaowu Xu	c1f611be74	Merge "fix a small bug in 16 point forward dct" into experimental	2013-02-01 05:57:41 -08:00
Yaowu Xu	91e0e80142	Changes 16 point idct This commit changes the inverse 16 point dct to use the same algorithm as the one for 32 point idct. In fact, now 16 point dct uses the exact version of the souce code for even portion of the 32 point idct. Tests showed current implementation has significant better accuracy than the previous version. With this implementation and the minor bug fix on forward 16 point dct, encoding tests showed about 0.2% better compression of CIF set, test results on std-hd setting pending. Change-Id: I68224b60c816ba03434e9f08bee147c7e344fb63	2013-01-31 19:52:18 -08:00
John Koleszar	226c57e4fa	Merge "Add support for x64 and win64 yasm flags."	2013-01-31 17:05:33 -08:00
Frank Galligan	f67d740b34	Add support for x64 and win64 yasm flags. Some projects must define only win64 for Windows 64bit builds using yasm. Change-Id: I1d09590d66a7bfc8b4412e1cc8685978ac60b748	2013-01-31 16:25:37 -08:00
Yaowu Xu	ab1cad9bdd	fix a small bug in 16 point forward dct The commit fixes a minor error in 16 point fdct where in a rotation can produce result of -1 instead of 0. Change-Id: I45aac4a52bcd06225c6d04e643547a13e1c1aade	2013-01-31 15:39:41 -08:00
Marco Paniconi	ec6cf493ff	Fix for divide by zero in vp8_adjust_key_frame. Change-Id: I3bf9bdd95abfd287fbcb644f4fb85fb9204be95a	2013-01-31 10:53:06 -08:00
Yaowu Xu	c94e55add0	Merge "A fix point implementation of 32x32 idct" into experimental	2013-01-31 10:48:01 -08:00
Yaowu Xu	5149d7f7bd	A fix point implementation of 32x32 idct This commit changes the 32x32 idct to use integer only. The algorithm was taken directly from "A Fast Computational Algorithm for the Discrete Cosine Tranform" by W. Chen, et al., which was published in IEEE Transaction on Communication Vol. Com.-25 No. 9, 1977. The signal flow graph in the original paper is for a 32 point forward dct, the current implementation of inverse DCT was done by follow the graph in reversed direction. With this implementation, the 32 point inverse dct contains a 16 point inverse dct in its even portion, similarly the 16 point idct further contains 8 point and 4 point inverse dcts. As of patch 4, encoding tests showed there is no compression loss when compared against the floating point baseline. Numbers even showed very small postives. (cif: .01%, std-hd: .05%). Change-Id: I2d2d17a424b0b04b42422ef33ec53f5802b0f378	2013-01-31 09:45:49 -08:00
Jim Bankoski	14301116e2	Merge "WIP: Multiple decoder instances support"	2013-01-30 18:59:55 -08:00
Deb Mukherjee	a53be60904	Merge "Adding a frame parallel decoding mode" into experimental	2013-01-30 12:03:45 -08:00
Scott LaVarnway	75f647fe8a	WIP: Multiple decoder instances support Started adding support for multiple internal decoder instances. Also added code to limit the vp8 config options available when using frame-based multithreading. Change-Id: I0f1ee7abcfcff59204f50162e28254b8dd6972eb	2013-01-30 10:27:26 -08:00
Ronald S. Bultje	b499c24c2f	Merge "don't code the branch for the predicted seg_id if that flag is false." into experimental	2013-01-30 10:02:51 -08:00
Ronald S. Bultje	3a4b18bc67	don't code the branch for the predicted seg_id if that flag is false. Change-Id: Icb6e21dc0c2d9918faa33c8bf70943660df7ad88	2013-01-30 09:30:46 -08:00
Ronald S. Bultje	4d53a95a34	Merge "Default superblock skip flag to 32x32 for skip-blocks." into experimental	2013-01-30 09:12:17 -08:00
Ronald S. Bultje	de6718a3b9	Merge "Reset skip flag in superblock RD loop." into experimental	2013-01-30 09:12:02 -08:00
Deb Mukherjee	d28750537e	Merge "Further improvement on compound inter-intra expt" into experimental	2013-01-30 08:38:17 -08:00
Ronald S. Bultje	3febf9707d	Default superblock skip flag to 32x32 for skip-blocks. This is identical to the later decisions made in encode_superblock(). This commit doesn't actually change anything, but makes the mbmi state more consistent between the RD loop and the final encode result. Change-Id: I9e735afb7c5a52e5b61728cb88c67ef9b9bf59be	2013-01-29 21:46:31 -08:00
Ronald S. Bultje	b90996c51b	Reset skip flag in superblock RD loop. This is the superblock equivalent of commit `290b83a`. Change-Id: Ib3945dd9e992fa9ec1fdea5a11e17a3cc0e37637	2013-01-29 21:42:56 -08:00
Ronald S. Bultje	2f6fce3e5a	Write only visible area (for better comparison with rec.yuv). Change-Id: I32bf4ee532a15af78619cbcd8a193224029fab50	2013-01-29 16:58:52 -08:00
Frank Galligan	0524f33108	libvpx: Fix warnings on windows. Warnings found when tyring to build libvpx in Chromium. Change-Id: I5824d9e2c06351e0cf46e9f5fa102cc8b04cf963	2013-01-29 13:57:09 -08:00
Scott LaVarnway	8b22a9d377	Merge "Use FRAGMENT_DATA struct in pbi"	2013-01-29 13:42:54 -08:00
Ronald S. Bultje	5a9da2d906	Merge "Fix block pointer corruption in intra8x8 prediction with 4x4 transform." into experimental	2013-01-29 12:49:42 -08:00
Ronald S. Bultje	64401f838f	Merge "Fix overread/write reported by valgrind if (mb_cols) & 3 != 0." into experimental	2013-01-29 12:49:22 -08:00
Scott LaVarnway	2146c68dfd	Use FRAGMENT_DATA struct in pbi for fragment information. Change-Id: Idc83625591a1e4ca6f551dcfb7fc0428f6f37351	2013-01-29 10:34:35 -08:00
Paul Wilkins	d8e86af263	Merge "Remove eob_max_offset markers." into experimental	2013-01-29 09:29:45 -08:00
Paul Wilkins	5d1c62c639	Merge "Segment Skip Flag" into experimental	2013-01-29 09:29:26 -08:00
Scott LaVarnway	8b7eced6fe	Merge "Added eob == 0 check to vp9_dequant_idct_add_32x32_c" into experimental	2013-01-29 09:19:58 -08:00
Ronald S. Bultje	ffc2e4f4af	Fix block pointer corruption in intra8x8 prediction with 4x4 transform. The RD loop would change the pointer after the first mode (DC) was tested, leading to corrupt block objects being provided for the others. This would essentially render the i8x8 predictor useless. Change-Id: I16c5906ca64fb34878ac32ce59af8974e4582bb8	2013-01-29 09:18:47 -08:00
Paul Wilkins	93762ca9b2	Remove eob_max_offset markers. Remove eob_max_offset markers and replace with the generic skip_block flag to indicate to the quantizer that all coeffs to be set to 0 and eob position set to 0; Change-Id: Id477e8f8d4ec1a5562758904071013c24b76bfd7	2013-01-29 13:39:34 +00:00
Deb Mukherjee	3b04d467ac	Further improvement on compound inter-intra expt Adds a special combination mode specific to intra prediciton mode D45. Current results with the compound inter/intra experiment: derf: 0.2% yt: 0.55% std-hd: 0.75% hd: 0.74% Change-Id: I8976bdf3b9b0b66ab8c5c628bbc62c14fc72ca86	2013-01-29 00:21:29 -08:00
Johann	cdc18067a4	obj_int_extract.bat is not a generated file Trying to create Visual Studio project files would fail with: make[1]: *** No rule to make target `obj_int_extract.bat', needed by `.projects'. Stop. Change-Id: Ie55458427ddea199a3de9973eaf2a37f711f839e	2013-01-28 18:19:17 -08:00
Paul Wilkins	0ff9b033b0	Segment Skip Flag First step in simplifying the segment mode and segment EOB flags into a simpler segment skip flag that implies 0,0 mv and EOB at position 0. Change-Id: Ib750cac31a7a02dc21082580498efd9f7d8d72a5	2013-01-28 17:28:04 +00:00
Paul Wilkins	5f2429259f	Merge "Simplify Zero bin and zero bin run code." into experimental	2013-01-28 08:35:36 -08:00
Paul Wilkins	8e2c03fbfd	Simplify Zero bin and zero bin run code. Simplification to eliminate a number of very large data data structures. All zero run, zbin boosts for different transform sizes are now limited to a maximum run length of 15 before they max out the boost. Some further work still needs be done to refactor, rationalize and optimize the multiple quantizer functions. The simplification coupled with tweaks to the 16 element array now used for all transform sizes, has minimal effect on quality. Change-Id: I6f3948b8ca0418b60d4db9030ff19026a34ed423	2013-01-28 13:21:10 +00:00

... 7 8 9 10 11 ...

4127 Commits