generic-library/vpx

Author	SHA1	Message	Date
Yue Chen	a49d80bfc8	Squash commits from master to playground Moving RD-opt related code from vp9_encoder.h to vp9_rdopt.h. Squashed-Change-Id: I8fab776c8801e19d3f5027ed55a6aa69eee951de gen_msvs_proj: fix in tree configure under cygwin strip trailing '/' from paths, this is later converted to '\' which causes execution errors for obj_int_extract/yasm. vs10+ wasn't affected by this issue, but make the same change for consistency. gen_msvs_proj: + add missing '"' to obj_int_extract call unlike gen_msvs_vcproj, the block is duplicated missed in: `1e3d9b9` build/msvs: fix builds in source dirs with spaces Squashed-Change-Id: I76208e6cdc66dc5a0a7ffa8aa1edbefe31e4b130 Improve vp9_rb_bytes_read Squashed-Change-Id: I69eba120eb3d8ec43b5552451c8a9bd009390795 Removing decode_one_iter() function. When superframe index is available we completely rely on it and use frame size values from the index. Squashed-Change-Id: I0011d08b223303a8b912c2bcc8a02b74d0426ee0 iosbuild.sh: Add vpx_config.h and vpx_version.h to VPX.framework. - Rename build_targets to build_framework - Add functions for creating the vpx_config shim and obtaining preproc symbols. Squashed-Change-Id: Ieca6938b9779077eefa26bf4cfee64286d1840b0 Implemented vp9_denoiser_{alloc,free}() Squashed-Change-Id: I79eba79f7c52eec19ef2356278597e06620d5e27 Update running avg for VP9 denoiser Squashed-Change-Id: I9577d648542064052795bf5770428fbd5c276b7b Changed buf_2ds in vp9 denoiser to YV12 buffers Changed alloc, free, and running average code as necessary. Squashed-Change-Id: Ifc4d9ccca462164214019963b3768a457791b9c1 sse4 regular quantize Squashed-Change-Id: Ibd95df0adf9cc9143006ee9032b4cb2ebfd5dd1b Modify non-rd intra mode checking Speed 6 uses small tx size, namely 8x8. max_intra_bsize needs to be modified accordingly to ensure valid intra mode checking. Borg test on RTC set showed an overall PSNR gain of 0.335% in speed -6. This also changes speed -5 encoding by allowing DC_PRED checking for block32x32. Borg test on RTC set showed a slight PSNR gain of 0.145%, and no noticeable speed change. Squashed-Change-Id: I1502978d8fbe265b3bb235db0f9c35ba0703cd45 Implemented COPY_BLOCK case for vp9 denoiser Squashed-Change-Id: Ie89ad1e3aebbd474e1a0db69c1961b4d1ddcd33e Improved vp9 denoiser running avg update. Squashed-Change-Id: Ie0aa41fb7957755544321897b3bb2dd92f392027 Separate rate-distortion modeling for DC and AC coefficients This is the first step to rework the rate-distortion modeling used in rtc coding mode. The overall goal is to make the modeling customized for the statistics encountered in the rtc coding. This commit makes encoder to perform rate-distortion modeling for DC and AC coefficients separately. No speed changes observed. The coding performance for pedestrian_area_1080p is largely improved: speed -5, from 79558 b/f, 37.871 dB -> 79598 b/f, 38.600 dB speed -6, from 79515 b/f, 37.822 dB -> 79544 b/f, 38.130 dB Overall performance for rtc set at speed -6 is improved by 0.67%. Squashed-Change-Id: I9153444567e5f75ccdcaac043c2365992c005c0c Add superframe support for frame parallel decoding. A superframe is a bunch of frames that bundled as one frame. It is mostly used to combine one or more non-displayable frames and one displayable frame. For frame parallel decoding, libvpx decoder will only support decoding one normal frame or a super frame with superframe index. If an application pass a superframe without superframe index or a chunk of displayable frames without superframe index to libvpx decoder, libvpx will not decode it in frame parallel mode. But libvpx decoder still could decode it in serial mode. Squashed-Change-Id: I04c9f2c828373d64e880a8c7bcade5307015ce35 Fixes in VP9 alloc, free, and COPY_FRAME case Squashed-Change-Id: I1216f17e2206ef521fe219b6d72d8e41d1ba1147 Remove labels from quantize Use break instead of goto for early exit. Unbreaks Visual Studio builds. Squashed-Change-Id: I96dee43a3c82145d4abe0d6a99af6e6e1a3991b5 Added CFLAG for outputting vp9 denoised signal Squashed-Change-Id: Iab9b4e11cad927f3282e486c203564e1a658f377 Allow key frame more flexibility in mode search This commit allows the key frame to search through more prediction modes and more flexible block sizes. No speed change observed. The coding performance for rtc set is improved by 1.7% for speed -5 and 3.0% for speed -6. Squashed-Change-Id: Ifd1bc28558017851b210b4004f2d80838938bcc5 VP9 denoiser bugfixes s/stdint.h/vpx\/vpx_int.h Added missing 'break;'s Also included other minor changes, mostly cosmetic. Squashed-Change-Id: I852bba3e85e794f1d4af854c45c16a23a787e6a3 Don't return value for void functions Clears "warning: 'return' with a value, in function returning void" Squashed-Change-Id: I93972610d67e243ec772a1021d2fdfcfc689c8c2 Include type defines Clears error: unknown type name 'uint8_t' Squashed-Change-Id: I9b6eff66a5c69bc24aeaeb5ade29255a164ef0e2 Validate error checking code in decoder. This patch adds a mechanism for insuring error checking on invalid files by creating a unit test that runs the decoder and tests that the error code matches what's expected on each frame in the decoder. Disabled for now as this unit test will segfault with existing code. Squashed-Change-Id: I896f9686d9ebcbf027426933adfbea7b8c5d956e Introduce FrameWorker for decoding. When decoding in serial mode, there will be only one FrameWorker doing decoding. When decoding in parallel mode, there will be several FrameWorkers doing decoding in parallel. Squashed-Change-Id: If53fc5c49c7a0bf5e773f1ce7008b8a62fdae257 Add back libmkv ebml writer files. Another project in ChromeOS is using these files. To make libvpx rolls simpler, add these files back unitl the other project removes the dependency. crbug.com/387246 tracking bug to remove dependency. Squashed-Change-Id: If9c197081c845c4a4e5c5488d4e0190380bcb1e4 Added Test vector that tests more show existing frames. Squashed-Change-Id: I0ddd7dd55313ee62d231ed4b9040e08c3761b3fe fix peek_si to enable 1 byte show existing frames. The test for this is in test vector code ( show existing frames will fail ). I can't check it in disabled as I'm changing the generic test code to do this: https://gerrit.chromium.org/gerrit/#/c/70569/ Squashed-Change-Id: I5ab324f0cb7df06316a949af0f7fc089f4a3d466 Fix bug in error handling that causes segfault See: https://code.google.com/p/chromium/issues/detail?id=362697 The code properly catches an invalid stream but seg faults instead of returning an error due to a buffer not having been initialized. This code fixes that. Squashed-Change-Id: I695595e742cb08807e1dfb2f00bc097b3eae3a9b Revert 3 patches from Hangyu to get Chrome to build: Avoids failures: MSE_ClearKey/EncryptedMediaTest.Playback_VP9Video_WebM/0 MSE_ClearKey_Prefixed/EncryptedMediaTest.Playback_VP9Video_WebM/0 MSE_ExternalClearKey_Prefixed/EncryptedMediaTest.Playback_VP9Video_WebM/0 MSE_ExternalClearKey/EncryptedMediaTest.Playback_VP9Video_WebM/0 MSE_ExternalClearKeyDecryptOnly/EncryptedMediaTest.Playback_VP9Video_WebM/0 MSE_ExternalClearKeyDecryptOnly_Prefixed/EncryptedMediaTest.Playback_VP9Video_WebM/0 SRC_ExternalClearKey/EncryptedMediaTest.Playback_VP9Video_WebM/0 SRC_ExternalClearKey_Prefixed/EncryptedMediaTest.Playback_VP9Video_WebM/0 SRC_ClearKey_Prefixed/EncryptedMediaTest.Playback_VP9Video_WebM/0 Patches are This reverts commit `9bc040859b` This reverts commit `6f5aba069a` This reverts commit `9bc040859b` I1f250441 Revert "Refactor the vp9_get_frame code for frame parallel." Ibfdddce5 Revert "Delay decreasing reference count in frame-parallel decoding." I00ce6771 Revert "Introduce FrameWorker for decoding." Need better testing in libvpx for these commits Squashed-Change-Id: Ifa1f279b0cabf4b47c051ec26018f9301c1e130e error check vp9 superframe parsing This patch insures that the last byte of a chunk that contains a valid superframe marker byte, actually has a proper superframe index. If not it returns an error. As part of doing that the file : vp90-2-15-fuzz-flicker.webm now fails to decode properly and moves to the invalid file test from the test vector suite. Squashed-Change-Id: I5f1da7eb37282ec0c6394df5c73251a2df9c1744 Remove unused vp9_init_quant_tables function This function is not effectively used, hence removed. Squashed-Change-Id: I2e8e48fa07c7518931690f3b04bae920cb360e49 Actually skip blocks in skip segments in non-rd encoder. Copy split from macroblock to pick mode context so it doesn't get lost. Squashed-Change-Id: Ie37aa12558dbe65c4f8076cf808250fffb7f27a8 Add Check for Peek Stream validity to decoder test. Squashed-Change-Id: I9b745670a9f842582c47e6001dc77480b31fb6a1 Allocate buffers based on correct chroma format The encoder currently allocates frame buffers before it establishes what the chroma sub-sampling factor is, always allocating based on the 4:4:4 format. This patch detects the chroma format as early as possible allowing the encoder to allocate buffers of the correct size. Future patches will change the encoder to allocate frame buffers on demand to further reduce the memory profile of the encoder and rationalize the buffer management in the encoder and decoder. Squashed-Change-Id: Ifd41dd96e67d0011719ba40fada0bae74f3a0d57 Fork vp9_rd_pick_inter_mode_sb_seg_skip Squashed-Change-Id: I549868725b789f0f4f89828005a65972c20df888 Switch active map implementation to segment based. Squashed-Change-Id: Ibb841a1fa4d08d164cf5461246ec290f582b1f80 Experiment for mid group second arf. This patch implements a mechanism for inserting a second arf at the mid position of arf groups. It is currently disabled by default using the flag multi_arf_enabled. Results are currently down somewhat in initial testing if multi-arf is enabled. Most of the loss is attributable to the fact that code to preserve the previous golden frame (in the arf buffer) in cases where we are coding an overlay frame, is currently disabled in the multi-arf case. Squashed-Change-Id: I1d777318ca09f147db2e8c86d7315fe86168c865 Clean out old CONFIG_MULTIPLE_ARF code. Remove the old experimental multi arf code that was under the flag CONFIG_MULTIPLE_ARF. Squashed-Change-Id: Ib24865abc11691d6ac8cb0434ada1da674368a61 Fix some bugs in multi-arf Fix some bugs relating to the use of buffers in the overlay frames. Fix bug where a mid sequence overlay was propagating large partition and transform sizes into the subsequent frame because of :- sf->last_partitioning_redo_frequency > 1 and sf->tx_size_search_method == USE_LARGESTALL Squashed-Change-Id: Ibf9ef39a5a5150f8cbdd2c9275abb0316c67873a Further dual arf changes: multi_arf_allowed. Add multi_arf_allowed flag. Re-initialize buffer indices every kf. Add some const indicators. Squashed-Change-Id: If86c39153517c427182691d2d4d4b7e90594be71 Fixed VP9 denoiser COPY_BLOCK case Now copies the src to the correct location in the running average buffer. Squashed-Change-Id: I9c83c96dc7a97f42c8df16ab4a9f18b733181f34 Fix test on maximum downscaling limits There is a normative scaling range of (x1/2, x16) for VP9. This patch fixes the maximum downscaling tests that are applied in the convolve function. The code used a maximum downscaling limit of x1/5 for historic reasons related to the scalable coding work. Since the downsampling in this application is non-normative it will revert to using a separate non-normative scaler. Squashed-Change-Id: Ide80ed712cee82fe5cb3c55076ac428295a6019f Add unit test to test user_priv parameter. Squashed-Change-Id: I6ba6171e43e0a43331ee0a7b698590b143979c44 vp9: check tile column count the max is 6. there are assumptions throughout the decode regarding this; fixes a crash with a fuzzed bitstream $ zzuf -s 5861 -r 0.01:0.05 -b 6- \ < vp90-2-00-quantizer-00.webm.ivf \ \| dd of=invalid-vp90-2-00-quantizer-00.webm.ivf.s5861_r01-05_b6-.ivf \ bs=1 count=81883 Squashed-Change-Id: I6af41bb34252e88bc156a4c27c80d505d45f5642 Adjust arf Q limits with multi-arf. Adjust enforced minimum arf Q deltas for non primary arfs in the middle of an arf/gf group. Squashed-Change-Id: Ie8034ffb3ac00f887d74ae1586d4cac91d6cace2 Dual ARF changes: Buffer index selection. Add indirection to the section of buffer indices. This is to help simplify things in the future if we have other codec features that switch indices. Limit the max GF interval for static sections to fit the gf_group structures. Squashed-Change-Id: I38310daaf23fd906004c0e8ee3e99e15570f84cb Reuse inter prediction result in real-time speed 6 In real-time speed 6, no partition search is done. The inter prediction results got from picking mode can be reused in the following encoding process. A speed feature reuse_inter_pred_sby is added to only enable the resue in speed 6. This patch doesn't change encoding result. RTC set tests showed that the encoding speed gain is 2% - 5%. Squashed-Change-Id: I3884780f64ef95dd8be10562926542528713b92c Add vp9_ prefix to mv_pred and setup_pred_block functions Make these two functions accessible by both RD and non-RD coding modes. Squashed-Change-Id: Iecb39dbf3d65436286ea3c7ffaa9920d0b3aff85 Replace cpi->common with preset variable cm This commit replaces a few use cases of cpi->common with preset variable cm, to avoid unnecessary pointer fetch in the non-RD coding mode. Squashed-Change-Id: I4038f1c1a47373b8fd7bc5d69af61346103702f6 [spatial svc]Implement lag in frames for spatial svc Squashed-Change-Id: I930dced169c9d53f8044d2754a04332138347409 [spatial svc]Don't skip motion search in first pass encoding Squashed-Change-Id: Ia6bcdaf5a5b80e68176f60d8d00e9b5cf3f9bfe3 decode_test_driver: fix type size warning like vpx_codec_decode(), vpx_codec_peek_stream_info() takes an unsigned int, not size_t, parameter for buffer size Squashed-Change-Id: I4ce0e1fbbde461c2e1b8fcbaac3cd203ed707460 decode_test_driver: check HasFailure() in RunLoop avoids unnecessary errors due to e.g., read (Next()) failures Squashed-Change-Id: I70b1d09766456f1c55367d98299b5abd7afff842 Allow lossless breakout in non-rd mode decision. This is very helpful for large moving windows in screencasts. Squashed-Change-Id: I91b5f9acb133281ee85ccd8f843e6bae5cadefca Revert "Revert 3 patches from Hangyu to get Chrome to build:" This patch reverts the previous revert from Jim and also add a variable user_priv in the FrameWorker to save the user_priv passed from the application. In the decoder_get_frame function, the user_priv will be binded with the img. This change is needed or it will fail the unit test added here: https://gerrit.chromium.org/gerrit/#/c/70610/ This reverts commit `9be46e4565`. Squashed-Change-Id: I376d9a12ee196faffdf3c792b59e6137c56132c1 test.mk: remove renamed file vp90-2-15-fuzz-flicker.webm was renamed in: `c3db2d8` error check vp9 superframe parsing Squashed-Change-Id: I229dd6ca4c662802c457beea0f7b4128153a65dc vp9cx.mk: move avx c files outside of x86inc block same reasoning as: `9f3a0db` vp9_rtcd: correct avx2 references these are all intrinsics, so don't depend on x86inc.asm Squashed-Change-Id: I915beaef318a28f64bfa5469e5efe90e4af5b827 Dual arf: Name changes. Cosmetic patch only in response to comments on previous patches suggesting a couple of name changes for consistency and clarity. Squashed-Change-Id: Ida3a359b0d5755345660d304a7697a3a3686b2a3 Make non-RD intra mode search txfm size dependent This commit fixes the potential issue in the non-RD mode decision flow that only checks part of the block to estimate the cost. It was due to the use of fixed transform size, in replacing the largest transform block size. This commit enables per transform block cost estimation of the intra prediction mode in the non-RD mode decision. Squashed-Change-Id: I14ff92065e193e3e731c2bbf7ec89db676f1e132 Fix quality regression for multi arf off case. Bug introduced during multiple iterations on: I3831* gf_group->arf_update_idx[] cannot currently be used to select the arf buffer index if buffer flipping on overlays is enabled (still currently the case when multi arf OFF). Squashed-Change-Id: I4ce9ea08f1dd03ac3ad8b3e27375a91ee1d964dc Enable real-time version reference motion vector search This commit enables a fast reference motion vector search scheme. It checks the nearest top and left neighboring blocks to decide the most probable predicted motion vector. If it finds the two have the same motion vectors, it then skip finding exterior range for the second most probable motion vector, and correspondingly skips the check for NEARMV. The runtime of speed -5 goes down pedestrian at 1080p 29377 ms -> 27783 ms vidyo at 720p 11830 ms -> 10990 ms i.e., 6%-8% speed-up. For rtc set, the compression performance goes down by about -1.3% for both speed -5 and -6. Squashed-Change-Id: I2a7794fa99734f739f8b30519ad4dfd511ab91a5 Add const mark to const values in non-RD coding mode Squashed-Change-Id: I65209fd1e06fc06833f6647cb028b414391a7017 Change-Id: Ic0be67ac9ef48f64a8878a0b8f1b336f136bceac	2014-06-26 14:22:05 -07:00
Adrian Grange	fd6bf31b8a	vp9_convolve.c: cleanup -wextra warnings Change-Id: I04930aca2293ebbaeb96dfedd2f9c5a55762fd2e	2014-05-13 09:57:24 -07:00
Tom Finegan	bf79a4da77	vp9/common: Silence MSVC warning in vp9_convolve.c. Added cast to int to silence MSVC warning. Change-Id: I9ef4709d2e4cf0db070d9e52385c1b3f138b00a5	2014-02-07 10:13:57 -08:00
James Zern	cca4276dac	vp9_filter.h: rename interp_kernel type -> InterpKernel avoids conflicts in variable names, fixing the build with various toolchains. broken since: `8691565` Removing subpix_fn_table struct. Change-Id: Ib5f6fdbcb494a97b62c75b99d4d826ff25d4c981	2014-02-03 16:48:38 -08:00
Dmitry Kovalev	4264c93844	Renaming INTERPOLATION_TYPE to INTERP_FILTER. Corresponding renames: subpel_kernel => interp_kernel vp9_get_filter_kernel() => vp9_get_interp_kernel() pred_filter_type => pred_interp_filter adaptive_pred_filter_type => adaptive_pred_interp_filter mcomp_filter_type => interp_filter read_interp_filter_type() => read_interp_filter() write_interp_filter_type() => write_interp_filter() fix_mcomp_filter_type() => fix_interp_filter() Change-Id: I1fa61fa1dc81ebbf043457c3ee2d8d4515bee6d3	2014-01-24 15:57:28 -08:00
Dmitry Kovalev	629fb85f17	vp9_convole.c cleanup. Making overall logic more clear, moving "hacked" calculation of base filter array pointer to get_filter_base() function. Change-Id: Ibbd38a9f937e48d35bbbfef3ad933ab36664cccb	2013-12-12 11:14:06 -08:00
Jim Bankoski	56af13a1b1	cpplint issue with convolve resolved Change-Id: I38b2100f1a64cb067c63f4e1662c36914b3569df	2013-10-07 15:55:42 -07:00
Yaowu Xu	9be0bb19df	Replace memcpy with vpx_memcpy Also removed obselete comment Change-Id: Iae1664777d76383639c637ee786e0d50fc45819a	2013-09-24 10:56:06 -07:00
Yaowu Xu	a783da80e7	Silence a bunch of MSVC warnings Change-Id: I16633269582a640809dca27572bbe99efa6369fc	2013-09-17 12:08:51 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spacial scalable encoder Sample app: vp9_spatial_scalable_encoder vpx_codec_control extensions: VP9E_SET_SVC VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER VP9E_SET_MIN_Q, VP9E_SET_MAX_Q expanded buffer size for vp9_convolve modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS) Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
Tero Rintaluoma	e326cecf18	Fix intermediate height in convolve_c - Intermediate height was not correct i.e. when block size is 4 and y_step_q4 is 6. In this case intermediate height was (4*6) >> 4 = 1 and vertical interpolation needs two source pixels plus 7 extra pixels for taps. - Also if the current output block is 16x16 and we are using 4x upscaling we need only 12 rows after horizontal filtering instead of 16. Patch Set 2: Intermediate_height updated after CL 66723 "Fix bug in convolution functions (filter selection)" Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589	2013-08-30 10:31:21 +03:00
James Zern	c8ba8c513c	cosmetics: strip 'VP9_' from defines in vp9 only code Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c	2013-08-23 19:16:49 -07:00
Adrian Grange	3f10831308	Fix bug in convolution functions (filter selection) (In response to Issue 604: https://code.google.com/p/webm/issues/detail?id=604) There were bugs in the convolution code for two cases: 1. Where the filter table was assumed to be aligned to a 256 byte boundary. The offset of the pixel in the source buffer was computed incorrectly. 2. Where no such alignment assumption was made. An incorrect address for the filter table base was used. To fix both problems, I now assume that the filter table is 256-byte aligned and modify the pixel offset calculation to match. A later patch should remove the restriction that the filter table is aligned to a 256-byte boundary. There was also a bug in the ConvolveTest unit test (convolve_test.cc). (Bug & initial fix suggestion submitted by Tero Rintaluoma and Sami Pietilä). Change-Id: I71985551e62846e55e40de9e7e3959d4805baa82	2013-08-23 11:16:08 -07:00
Dmitry Kovalev	2612b99cc7	Adding VP9_FILTER_BITS constant. Removing VP9_FILTER_WEIGHT, VP9_FILTER_SHIFT, BLOCK_WIDTH_HEIGHT constants. Using ROUND_POWER_OF_TWO for rounding. Change-Id: I2e8d6858dcd600a87096138209731137d7decc24	2013-08-20 00:42:25 -07:00
Dmitry Kovalev	9a31d05e24	Removing unused convolve_avg_c function + cleanup. Change-Id: Id2b126c6456627c25e4041a82e304d0151d951ba	2013-08-12 14:28:00 -07:00
Ronald S. Bultje	decead7336	Replace copy_memNxM functions with a generic copy/avg function. Change-Id: I3ce849452ed4f08527de9565a9914d5ee36170aa	2013-07-10 18:27:24 -07:00
Tero Rintaluoma	18303b1263	Fix intermediate height in convolve intermediate_height for horizontal filtering must be at least 8 pixels to be able to do vertical filtering correctly. Currently it can be less for small block and y_step_q4 sizes. Change-Id: I2ee28b0591b2041c2fa9844d0ae2ff8a1a59cc21	2013-07-05 14:58:25 +03:00
Deb Mukherjee	735febf1ce	Removing the implicit compound inter experiment Removing this experiment for now, since it has been broken with the latest code changes. Change-Id: I1be2181b56de490fcb577f5905b5e147a8ed82d8	2013-04-22 16:46:54 -07:00
John Koleszar	a9ebbcc338	convolve: support larger blocks, fix asm saturation bug Updates the common convoloution code to support blocks larger than 16x16, and rectangular blocks. This uncovered a bug in the SSSE3 filtering routines due to the order of application of saturation. This commit fixes that bug, adjusts the unit test to bias its random values towards the extremes, and adds a test to ensure that all filters conform to the expected pairwise addition structure. Change-Id: I81f69668b1de0de5a8ed43f0643845641525c8f0	2013-04-18 13:57:59 -07:00
Deb Mukherjee	23144d2345	Implicit weighted prediction experiment Adds an experiment to use a weighted prediction of two INTER predictors, where the weight is one of (1/4, 3/4), (3/8, 5/8), (1/2, 1/2), (5/8, 3/8) or (3/4, 1/4), and is chosen implicitly based on consistency of the predictors to the already reconstructed pixels to the top and left of the current macroblock or superblock. Currently the weighting is not applied to SPLITMV modes, which default to the usual (1/2, 1/2) weighting. However the code is in place controlled by a macro. The same weighting is used for Y and UV components, where the weight is derived from analyzing the Y component only. Results (over compound inter-intra experiment) derf: +0.18% yt: +0.34% hd: +0.49% stdhd: +0.23% The experiment suggests bigger benefit for explicitly signaled weights. Change-Id: I5438539ff4485c5752874cd1eb078ff14bf5235a	2013-03-26 16:58:56 -07:00
John Koleszar	6fd7dd1a70	Use 256-byte aligned filter tables This avoids duplicating all the filters twice. Includes fixups to the convolve routines and associated tests to make this work. Change-Id: I922f86021594e55072ddb63b42b2313605db6e00	2013-02-27 08:22:39 -08:00
John Koleszar	eb939f45b8	Spatial resamping of ZEROMV predictors This patch allows coding frames using references of different resolution, in ZEROMV mode. For compound prediction, either reference may be scaled. To test, I use the resize_test and enable WRITE_RECON_BUFFER in vp9_onyxd_if.c. It's also useful to apply this patch to test/i420_video_source.h: --- a/test/i420_video_source.h +++ b/test/i420_video_source.h @@ -93,6 +93,7 @@ class I420VideoSource : public VideoSource { virtual void FillFrame() { // Read a frame from input_file. + if (frame_ != 3) if (fread(img_->img_data, raw_sz_, 1, input_file_) == 0) { limit_ = frame_; } This forces the frame that the resolution changes on to be coded with no motion, only scaling, and improves the quality of the result. Change-Id: I1ee75d19a437ff801192f767fd02a36bcbd1d496	2013-02-26 23:54:23 -08:00
John Koleszar	6a4f708c25	Refactor inter recon functions to support scaling Ensure that all inter prediction goes through a common code path that takes scaling into account. Removes a bunch of duplicate 1st/2nd predictor code. Also introduces a 16x8 mode for 8x8 MVs, similar to the 8x4 trick we were doing before. This has an unexpected effect with EIGHTTAP_SMOOTH, so it's disabled in that case for now. Change-Id: Ia053e823a8bc616a988a0af30452e1e75a739cba	2013-02-26 10:03:29 -08:00
Christian Duvivier	094e2572df	Faster convolve8_avg. Implement convolve8_avg using common functions which are already optimized instead of using more obscure ones which have only C versions. Encoder overall speed-up of about 12%. Change-Id: I8c57aa76936c8a48f22b115f19f61d9f2ae1e4b6	2013-02-11 16:53:11 -08:00
John Koleszar	7a07eea13f	Convert subpixel filters to use convolve framework Update the code to call the new convolution functions to do subpixel prediction rather than the existing functions. Remove the old C and assembly code, since it is unused. This causes a 50% performance reduction on the decoder, but that will be resolved when the asm for the new functions is available. There is no consensus for whether 6-tap or 2-tap predictors will be supported in the final codec, so these filters are implemented in terms of the 8-tap code, so that quality testing of these modes can continue. Implementing the lower complexity algorithms is a simple exercise, should it be necessary. This code produces slightly better results in the EIGHTTAP_SMOOTH case, since the filter is now applied in only one direction when the subpel motion is only in one direction. Like the previous code, the filtering is skipped entirely on full-pel MVs. This combination seems to give the best quality gains, but this may be indicative of a bug in the encoder's filter selection, since the encoder could achieve the result of skipping the filtering on full-pel by selecting one of the other filters. This should be revisited. Quality gains on derf positive on almost all clips. The only clip that seemed to be hurt at all datarates was football (-0.115% PSNR average, -0.587% min). Overall averages 0.375% PSNR, 0.347% SSIM. Change-Id: I7d469716091b1d89b4b08adde5863999319d69ff	2013-02-05 14:23:17 -08:00
John Koleszar	5ca6a3667f	Add 8-tap generic convolver This commit introduces a new convolution function which will be used to replace the existing subpixel interpolation functions. It is much the same as the existing functions, but allows for changing the filter kernel on a per-pixel basis, and doesn't bake in knowledge of the filter to be applied or the size of the resulting block into the function name. Replacing the existing subpel filters will come in a later commit. Change-Id: Ic9a5615f2f456cb77f96741856fc650d6d78bb91	2013-02-05 14:19:28 -08:00

26 Commits