generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	ac50b75e50	Use balanced model for intra prediction mode coding This commit replaces the previous table based intra mode model coding with a more balanced entropy coding system. It reduces the decoder lookup table size by 1K bytes. The key frame compression performance is about even on average. There are a few points where the compression performance is improved by over 5%. Most test points are fairly close to the lookup table approach. Change-Id: I47154276c0a6a22ae87de8845bc2d494681b95f6	2015-06-23 16:42:56 -07:00
Jingning Han	81c389e790	Make tx partition entropy coder account for block size This commit allows the entropy coder for transform block partition to account for its relative position with respect to the block size. Change-Id: I2b5019c378bfb58c11b926fa50c0db1933f35852	2015-06-18 21:56:30 +00:00
Jingning Han	0a42a1efd4	Add max_tx_size to MB_MODE_INFO Refactor the recursive transform block partition to reduce repeated computation maximum transform block size per block. Change-Id: Ib408c78dc6923fe7d337dc937e74f2701ac63859	2015-06-18 14:54:49 -07:00
Jingning Han	7cbea06386	Update transform block partition information for intra blocks If a block is coded in the intra modes, update the transform block partition information as maximum block size. Change-Id: I5ea440c700fc887ff2fe84fabde77a9d896d16f4	2015-06-15 15:53:19 -07:00
Jingning Han	63c0d8df9f	Assign largest transform block size to skip block If a block has all coefficients quantized to zero, the codec will assume that it uses largest transform block size. Change-Id: I1a32527e50026e8e4759ad8de474189cd20e89c8	2015-06-11 11:01:44 -07:00
Jingning Han	9ce132ac37	Refactor transform block partition entropy coding This commit refactors the transform block partition entropy coding process to improve the encoding speed. There is no change in the compression statistics. Change-Id: I237466fd95c1b888df432babfa36e01f74240eef	2015-06-11 09:41:20 -07:00
Jingning Han	87a0d5436b	Account for context information for partition rate estimate This commit allows the encoder to account for the boundary block information to estimate the transform block partitiion rate cost in the rate-distortion optimization scheme. Change-Id: Idb79cf936d96cdd15bcba27e47318295413a5f5d	2015-06-09 15:53:55 -07:00
Jingning Han	948c6d882e	Enable transform block partition entropy coding Select the probability model for transform block partition coding conditioned on the neighbor transform block sizes. Change-Id: Ib701296e59009bad97dbd21d8dcd58bc5e552f39	2015-06-09 12:30:52 -07:00
Jingning Han	cd4aca5959	Add decoder support to recursive transform block partition It allows the decoder to recursively parse and use the transform block size for inter coded blocks. Change-Id: I12ceea48ab35501ac1a3447142deb2a334eff3b8	2015-05-22 16:45:34 -07:00
Jingning Han	64f3820f80	Refactor bit-stream syntax support to transform partition Make the bit-stream syntax elelment coding ready to support variable transform coding block sizes. Change-Id: I07ae4ab62d1ecd46c4a5ae45702fc14bd1d4b07d	2015-05-22 12:13:29 -07:00
Jingning Han	5f6fe83ac5	Syntax coding support for transform block coding This commit re-designs the bitstream syntax to support recursive transform block partition. It disables the decoder vector unit tests. Change-Id: I6cac24c4f1e44f29ffcc9b87ba1167eeb32d1b69	2015-05-18 15:43:02 -07:00
Frank Galligan	6eaca27df2	Refactor read_intra_frame_mode_info Change-Id: I56b0614154408e8ec613784b2007374df00fbf17	2015-03-09 16:25:01 -07:00
hkuang	9d44fd6bc5	Remove some unnecessary code in thread context copy. Change-Id: Iddf098e1bae9c10fc2f325f84156f50a0bd0055a	2015-03-06 10:29:15 -08:00
Yunqing Wang	85a9bc04d4	vp9_dthread: pass frame counts to decoder functions The current multi-threaded tile decoder requires that the videoes are encoded with frame_parallel_decoding_mode = 1. This requirement is not necessary, and is better to be removed. This patch includes the first part of the work. Change-Id: Ic7695fb3cfe13f9022582c9f0edd2aa6e2e36d28	2015-02-03 09:39:15 -08:00
hkuang	be6aeadaf4	Try again to merge branch 'frame-parallel' into master branch. In frame parallel decode, libvpx decoder decodes several frames on all cpus in parallel fashion. If not being flushed, it will only return frame when all the cpus are busy. If getting flushed, it will return all the frames in the decoder. Compare with current serial decode mode in which libvpx decoder is idle between decode calls, libvpx decoder is busy between decode calls. Current frame parallel decode will only speed up the decoding for frame parallel encoded videos. For non frame parallel encoded videos, frame parallel decode is slower than serial decode due to lack of loopfilter worker thread. There are still some known issues that need to be addressed. For example: decode frame parallel videos with segmentation enabled is not right sometimes. * frame-parallel: Add error handling for frame parallel decode and unit test for that. Fix a bug in frame parallel decode and add a unit test for that. Add two test vectors to test frame parallel decode. Add key frame seeking to webmdec and webm_video_source. Implement frame parallel decode for VP9. Increase the thread test range to cover 5, 6, 7, 8 threads. Fix a bug in adding frame parallel unit test. Add VP9 frame-parallel unit test. Manually pick "Make the api behavior conform to api spec." from master branch. Move vp9_dec_build_inter_predictors_* to decoder folder. Add segmentation map array for current and last frame segmentation. Include the right header for VP9 worker thread. Move vp9_thread.* to common. ctrl_get_reference does not need user_priv. Seperate the frame buffers from VP9 encoder/decoder structure. Revert "Revert "Revert "Revert 3 patches from Hangyu to get Chrome to build:""" Conflicts: test/codec_factory.h test/decode_test_driver.cc test/decode_test_driver.h test/invalid_file_test.cc test/test-data.sha1 test/test.mk test/test_vectors.cc vp8/vp8_dx_iface.c vp9/common/vp9_alloccommon.c vp9/common/vp9_entropymode.c vp9/common/vp9_loopfilter_thread.c vp9/common/vp9_loopfilter_thread.h vp9/common/vp9_mvref_common.c vp9/common/vp9_onyxc_int.h vp9/common/vp9_reconinter.c vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decodeframe.h vp9/decoder/vp9_decodemv.c vp9/decoder/vp9_decoder.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_encoder.c vp9/encoder/vp9_pickmode.c vp9/encoder/vp9_rdopt.c vp9/vp9_cx_iface.c vp9/vp9_dx_iface.c This reverts commit a18da9760a74d9ce6fb9f875706dc639c95402f5. Change-Id: I361442ffec1586d036ea2e0ee97ce4f077585f02	2015-01-30 21:00:13 -08:00
Johann	a18da9760a	Revert "Merge branch 'frame-parallel' to enable frame parallel decode in master branch." This reverts commit bde04ce5039cbcf86c8b34bdb4127e18d7e1d0c7 Change-Id: I053dae04c761b04a36dc239558503905a14d2470	2015-01-23 08:42:02 -08:00
hkuang	bde04ce503	Merge branch 'frame-parallel' to enable frame parallel decode in master branch. In frame parallel decode, libvpx decoder decodes several frames on all cpus in parallel fashion. If not being flushed, it will only return frame when all the cpus are busy. If getting flushed, it will return all the frames in the decoder. Compare with current serial decode mode in which libvpx decoder is idle between decode calls, libvpx decoder is busy between decode calls. VP9 frame parallel decode is >30% faster than serial decode with tile parallel threading which will makes devices play 1080P VP9 videos more easily. * frame-parallel: Add error handling for frame parallel decode and unit test for that. Fix a bug in frame parallel decode and add a unit test for that. Add two test vectors to test frame parallel decode. Add key frame seeking to webmdec and webm_video_source. Implement frame parallel decode for VP9. Increase the thread test range to cover 5, 6, 7, 8 threads. Fix a bug in adding frame parallel unit test. Add VP9 frame-parallel unit test. Manually pick "Make the api behavior conform to api spec." from master branch. Move vp9_dec_build_inter_predictors_* to decoder folder. Add segmentation map array for current and last frame segmentation. Include the right header for VP9 worker thread. Move vp9_thread.* to common. ctrl_get_reference does not need user_priv. Seperate the frame buffers from VP9 encoder/decoder structure. Revert "Revert "Revert "Revert 3 patches from Hangyu to get Chrome to build:""" Conflicts: test/codec_factory.h test/decode_test_driver.cc test/decode_test_driver.h test/invalid_file_test.cc test/test-data.sha1 test/test.mk test/test_vectors.cc vp8/vp8_dx_iface.c vp9/common/vp9_alloccommon.c vp9/common/vp9_entropymode.c vp9/common/vp9_loopfilter_thread.c vp9/common/vp9_loopfilter_thread.h vp9/common/vp9_mvref_common.c vp9/common/vp9_onyxc_int.h vp9/common/vp9_reconinter.c vp9/decoder/vp9_decodeframe.c vp9/decoder/vp9_decodeframe.h vp9/decoder/vp9_decodemv.c vp9/decoder/vp9_decoder.c vp9/decoder/vp9_decoder.h vp9/encoder/vp9_encoder.c vp9/encoder/vp9_pickmode.c vp9/encoder/vp9_rdopt.c vp9/vp9_cx_iface.c vp9/vp9_dx_iface.c Change-Id: Ib92eb35851c172d0624970e312ed515054e5ca64	2015-01-22 18:18:53 -08:00
James Zern	953dd1894d	vp9: add per-tile longjmp error handling this avoids longjmp'ing from another thread on error which will cause undesired behavior Change-Id: Ic9074ed8cc4243944bf2539d6e482f213f4e8c86	2014-12-19 11:50:04 -08:00
hkuang	dde819599b	Clean up the logic of handling corrupted frame. No more checking of corrupted reference frame as we skip decoding any non-intra frame in case of frame corrupted. Change-Id: I77d41bbb02fc5f61972740e2d411441eb6a17073	2014-12-04 15:07:59 -08:00
Hui Su	2c95a3f374	Merge "Simplify interface of write_selected_tx_size and read_tx_size"	2014-11-05 13:33:09 -08:00
Hui Su	709c634b84	Simplify interface of write_selected_tx_size and read_tx_size Change-Id: Ia2b2a895deefaaf7b34bf26df86add56dbab082c	2014-11-04 16:11:50 -08:00
hkuang	55577431ae	Bind motion vectors with frame buffer structure. This will save a lot of memory for decoder due to removing of prev_mi, but prev_mi is still needed in encoder. So this will increase a little bit memory for encoder. Change-Id: I24b2f1a423ebffa55a9bd2fcee1077dac995b2ed	2014-10-31 17:01:08 -07:00
Yunqing Wang	7c7e4d4eb8	vp9_ethread: allocate frame contexts outside VP9_COMMON struct This patch allocated frame contexts outside VP9_COMMON. This allows multiple threads to share the same copy of frame contexts, and reduces the overhead. It also guarantees the correct update of these contexts during bitstream packing. This patch doesn't change encoding result. Change-Id: Ic181a2460b891d1d587278a6d02d8057b9dbd353	2014-10-22 15:03:12 -07:00
Hangyu Kuang	9ce3a7d76c	Implement frame parallel decode for VP9. Using 4 threads, frame parallel decode is ~3x faster than single thread decode and around 30% faster than tile parallel decode for frame parallel encoded video on both Android and desktop with 4 threads. Decode speed is scalable to threads too which means decode could be even faster with more threads. Change-Id: Ia0a549aaa3e83b5a17b31d8299aa496ea4f21e3e	2014-10-22 10:50:58 -07:00
hkuang	c38a8edf16	Merge "Remove extra line."	2014-10-14 11:05:01 -07:00
hkuang	dbe91de6d4	Remove extra line. Change-Id: I5e79c276d8953ae17cd35b2846e6e40660c037c3	2014-10-10 14:59:04 -07:00
hkuang	3304d4e6ca	Optimize the code to set the refernce frame right after reading the header. Change-Id: I495cf4a366e06e3220ed132500b1ba1c8448f708	2014-10-09 16:32:36 -07:00
hkuang	15a3e5f742	Remove unnecessary scale check in set_ref. Scale check has been done in read_inter_block_mode_info. Change-Id: I6c86f93bd579109ed30ff13a04a30e35f5ae6fc5	2014-10-09 12:19:55 -07:00
hkuang	c70cea97ac	Remove mi_grid_* structures. mi_grid_* are arrays of pointer to pointer. They save the pointers that point to the MIs in cm->mi. But they are unnecessary and complicated. The original goal was to remove MODE_INFO_t copy. But with an extra MODE_INFO_t pointer inside MODE_INFO_t, same goal could be achieved. This commit totally removes the mi_grid_* structures. But there are still many dummy MODE_INFO_t inside cm->mi which are a waste of memory. Next commit will do on-demand MODE_INFO_t allocation in order to save these memories. Change-Id: I3a05cf1610679fed26e0b2eadd315a9ae91afdd6	2014-09-19 21:27:11 -07:00
hkuang	7eca086707	Add segmentation map array for current and last frame segmentation. The original implementation only allocates one segmentation map and this works fine for serial decode. But for frame parallel decode, each thread need to have its own segmentation map and the last frame segmentation map should be provided from last frame decoding thread. After finishing decoding a frame, thread need to serve the old segmentation map that associate with the previous decoded frame. The thread also need to use another segmentation map for decoding the current frame. Change-Id: I442ddff36b5de9cb8a7eb59e225744c78f4492d8	2014-07-28 10:44:02 -07:00
Yaowu Xu	9261e1aa6e	Changed validation of reference frame size A previous change, https://gerrit.chromium.org/gerrit/#/c/70632, introduced a size validation for reference frames to insuare the input stream is a valid VP9 stream. However, the logic requiring all reference frames have valid size turned out to be too strict. In this commit, we modify the validation to require one of the reference frame has valid dimension. In addition, the decoder reports error whenever it detects the use of reference frame with invalid scalig ratio. Change-Id: If8efc312244087556cfe00f1fcbdff811268ebad	2014-07-24 14:58:01 -07:00
Dmitry Kovalev	e608418899	Renaming MB_PREDICTION_MODE to PREDICTION_MODE. Actually, it would be great to have two separate enums INTRA_MODES and INTER_MODES in future. Change-Id: I6c4147cf0002853da9c1e03fe9514eab876f01c8	2014-04-22 17:48:31 -07:00
Dmitry Kovalev	86f44a91f4	Renaming two members in MACROBLOCKD struct. Renames: mi_8x8 -> mi mode_info_stride -> mi_stride Change-Id: I66f3e5fd1e7b7f46f108af5bb711c5fd9493c1be	2014-04-01 17:46:40 -07:00
Dmitry Kovalev	9347e55f12	Making c++ compiler happier. Change-Id: Ie224e968589bdb0774dc112e6f6df56cc0447465	2014-03-21 14:37:01 -07:00
Dmitry Kovalev	b8bc2d337a	Fixing warnings/errors from c++ compiler. Change-Id: Ia561dda53f2dd10e3a10a2df2adb8027ab19397a	2014-03-18 10:47:51 -07:00
Dmitry Kovalev	aa7ec14c9a	Merge "Speeding up reading of intra block modes."	2014-03-13 13:45:32 -07:00
Dmitry Kovalev	ba54a886c3	Speeding up reading of intra block modes. Reimplementing sub8x8-reading of intra block modes in read_intra_frame_mode_info() and read_intra_block_mode_info(). Code looks more readable as well. Change-Id: Ia42fc7d0dad708bc0c7a8bff1f8b37809b843f40	2014-03-12 12:32:09 -07:00
Dmitry Kovalev	ff935ff781	Removing last_mi from MACROBLOCKD struct. Change-Id: Ied12b39c55667b26fd3bf90eb331e601c53a10f6	2014-03-10 16:02:03 -07:00
Dmitry Kovalev	f8f8c6d44c	Adding reusable get_y_mode_prob() function. Change-Id: Iebd182d7aeebc0f8964b6fd35057449bb25b00c1	2014-03-10 10:50:16 -07:00
Dmitry Kovalev	ea88da7492	Removing vp9_onyxd_int.h file. Moving VP9Decompressor struct from vp9_onyxd_int.h to vp9_onyxd.h. Change-Id: Ic86c15e44130541a7f692db43ef9109293f99ae8	2014-03-05 10:39:29 -08:00
Dmitry Kovalev	8fc8583a4c	Merge "Consistent names for reference_mode functions."	2014-02-25 11:04:37 -08:00
Dmitry Kovalev	69fd030dc8	Consistent names for reference_mode functions. Change-Id: I48c9e5e4ca21e11740c750ca2eabf7e8a51c52d2	2014-02-19 15:33:59 +01:00
Dmitry Kovalev	9b75f381cf	Adding is_mv_valid() function. Change-Id: I9d036244b558765b252d8c6681b22721cb2e51bb	2014-02-19 13:57:18 +01:00
Dmitry Kovalev	004c8c636e	Renaming skip_coeff to skip for consistency. Change-Id: I036e815ca63d00cba71202ae09ba0f6ef745dcb8	2014-02-12 17:44:12 -08:00
Jim Bankoski	9dec7712ab	static function convert to inline or global vp9_blockd.h Change-Id: Ifdd951f24932839f06d1c700371662511dde6ebe	2014-01-31 19:50:40 -08:00
Dmitry Kovalev	b107f2c470	Renaming "mbskip" to "skip". Change-Id: I27a30b43eae026a77f92958e2238d02d9cdf7832	2014-01-29 14:48:42 -08:00
Dmitry Kovalev	4264c93844	Renaming INTERPOLATION_TYPE to INTERP_FILTER. Corresponding renames: subpel_kernel => interp_kernel vp9_get_filter_kernel() => vp9_get_interp_kernel() pred_filter_type => pred_interp_filter adaptive_pred_filter_type => adaptive_pred_interp_filter mcomp_filter_type => interp_filter read_interp_filter_type() => read_interp_filter() write_interp_filter_type() => write_interp_filter() fix_mcomp_filter_type() => fix_interp_filter() Change-Id: I1fa61fa1dc81ebbf043457c3ee2d8d4515bee6d3	2014-01-24 15:57:28 -08:00
Jingning Han	318e177f4a	Deprecate the use of best_mv in decoding process This commit removes the use of best_mv in the decoding process. This variable can be replaced with nearest_mv. It saves a few cycles on assigning the values for best_mv. Change-Id: Ic183f9c1fb615c54efd7e6ccfedcf09d493435e4	2014-01-16 18:04:58 -08:00
Dmitry Kovalev	1e8b5bf4ac	Merge "Removing vp9_findnearmv.{h, c} files."	2013-12-26 13:38:38 -08:00
Dmitry Kovalev	f69b5609ff	Renaming vp9_dboolhuff.{h, c} to vp9_reader.{h, c}. Change-Id: I50c009ff8108bda1c57427f23d63a79c04f7e776	2013-12-20 12:53:03 -08:00

1 2 3 4 5 ...

319 Commits