generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	50f97cf7fb	Removing function pointers from inter prediction. Removing foreach_predicted_block_visitor and calling build_inter_predictors directly. Change-Id: I11bb3c872b99b47c2680b01b0dbcc01c558c4a2b	2013-11-11 18:37:00 -08:00
Jingning Han	34b6abefa2	Enable dual buffer rd search and encoding scheme This commit enables the dual buffer rate-distortion optimization and encoding scheme. It stacks the original transform coefficients, quantized levels, and reconstructed coefficients, in the rate- distortion optimization search process, hence eliminates the need to re-run residual generation, forward transform, and quantization in the encoding stage. Change-Id: I011bfad3a59a380a869ee552e91dae0394ec492e	2013-11-11 18:32:55 -08:00
Jingning Han	e5741c56d1	Merge "Allocate dual buffer sets for encoding"	2013-11-11 18:00:57 -08:00
Dmitry Kovalev	42b1f62085	Removing redundant assignment. xd->mi_8x8 is assigned inside set_offsets() for each prediction block. Change-Id: I20e5974a9eaf105e5a04fc7f99b7a93bd50e3d0a	2013-11-11 17:39:43 -08:00
Dmitry Kovalev	3740d67d76	Syncing write_modes_{b, sb} implementation with decode_modes_{b, sb}. Change-Id: Iaee740ec3bfb2b5328c24f4641c285e5a4a046dc	2013-11-11 17:29:31 -08:00
Yunqing Wang	b45438181c	Rewrite filter_selectively_horiz for parallel loopfiltering Added loop filter mask checking, and made the caller function ready for implementation of parallel loopfiltering in horizontal direction. Next, we need to go through the loopfilter functions (both c and optimized versions), and provide 16-byte wide loopfiltering for each filter type. Change-Id: Ifef47e7ef9086ebc2fd6ca7ede8f27c9bbf79e66	2013-11-11 17:06:01 -08:00
Dmitry Kovalev	4e39d530f0	Merge "Cleaning up joint_motion_search function."	2013-11-11 16:34:39 -08:00
Jingning Han	3b3aea6834	Allocate dual buffer sets for encoding Allocate memory space of dual buffer sets that store the coeff, qcoeff, dqcoeff, and eobs. Connect the pointers of macroblock_plane and macroblockd_plane to the actual buffer in use accordingly. Change-Id: I2f0b5f482ca879fae39095013eaf8901db20a5a4	2013-11-11 16:24:39 -08:00
Dmitry Kovalev	14f2cf1757	Cleaning up joint_motion_search function. Change-Id: I70a0878b23bda0ac3ff8733b4c96d5c636bc551c	2013-11-11 16:04:02 -08:00
Dmitry Kovalev	3551e25099	Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK. We use {sb, mb, b, ab}_index only inside encoder, so moving them into appropriate data structure. Change-Id: Ib5c1036716354d9d321e11a60c1634c1cb8f9716	2013-11-11 15:58:57 -08:00
Jingning Han	d8b4c79270	Decouple macroblockd_plane buffer usage Make the macroblockd_plane contain dynamic buffer pointers instead static pointers to the memory space allocated therein. The decoder uses the buffer allocated in pbi, while encoder will use a dual buffer approach for rate-distortion optimization search. Change-Id: Ie6f24be2dcda35df7c15b4014e5ccf236fb3f76c	2013-11-11 15:26:10 -08:00
Dmitry Kovalev	94d4add1f7	Replacing raster_block with block in the encoder. We only used "ib" to call get_scan() function, which in turn calls get_tx_type_4x4() function. The latter one only needs block index if bsize < BLOCK_8X8 -- under that condition raster_block == block. Change-Id: I697306a0c3cf937acdd4f5e623d4367c5acc0b2f	2013-11-11 15:18:48 -08:00
hkuang	c689a126ed	Fix a bug in the assembly code. Change-Id: Ic416e3f8a11e82ee298e6f709b2119a9ddf1e2f8	2013-11-11 12:49:12 -08:00
Dmitry Kovalev	ec8128e27f	Merge "Replacing (raster_block >> tx_size) with (block >> (tx_size << 1))."	2013-11-11 12:27:42 -08:00
Dmitry Kovalev	c53a9c70fb	Merge "Localizing NEARESTMV special cases in the code."	2013-11-11 11:12:06 -08:00
Dmitry Kovalev	f6baa62cd8	Merge "Cleaning up vp9_quantize_b_c() function."	2013-11-11 11:02:32 -08:00
Dmitry Kovalev	3aa4b42a35	Merge "Cleaning up read_mv_probs() function."	2013-11-11 11:01:35 -08:00
Dmitry Kovalev	974a27131e	Merge "Adding read_reference_mode() function."	2013-11-11 11:00:51 -08:00
Yaowu Xu	af2559c0d6	Merge "[BITSTREAM]Fix row tile mode_info pointer setup"	2013-11-09 13:59:19 -08:00
Yaowu Xu	cae7e0741a	[BITSTREAM]Fix row tile mode_info pointer setup This commit fixes the assignment of mode_info pointer per tile. It makes recognition of tiles in both row and column formats and properly arrange the use of mode_info. The bug was first introduced in I6226456dd11f275fa991e4a7a930549da6675915 https://gerrit.chromium.org/gerrit/#/c/67492/ Change-Id: Ie12cd209f53241513728c461ee3d7b9599ddb860	2013-11-08 22:06:53 -08:00
Yaowu Xu	ee1e4e645a	Merge "Correct a couple of typos"	2013-11-08 16:17:38 -08:00
Dmitry Kovalev	22a001988b	Optimizing set_contexts() function. Inlining set_contexts_on_border() into set_contexts(). The only difference is the additional check that "has_eob != 0" in addition to "xd->mb_to_right_edge < 0" and "xd->mb_to_right_edge < 0". If has_eob == 0 then memset does the right thing and works faster. Change-Id: I5206f767d729f758b14c667592b7034df4837d0e	2013-11-08 12:44:56 -08:00
Yaowu Xu	a596975eb6	Correct a couple of typos Change-Id: Ic470c6c9ce27b615c9645b9cb0d67526417bc374	2013-11-08 12:43:51 -08:00
Yunqing Wang	e731b2ba2c	Merge "Improve vp9_idct4x4_1_add_sse2"	2013-11-08 12:00:36 -08:00
Yunqing Wang	49cf335e7f	Improve loopfilter function This patch continued the work done in "Rewrite loop_filter_info_n struct"(commit:00dbd369c70270428d56da6d15ea5486fc821c52) to further improve loopfilter function. 1. Instead of storing pointers to thresholds, store loopfilter levels within 64x64 SB; 2. Since loopfilter levels are already calculated in setup_mask, we don't need call build_lfi to look up them again. Just save loopfilter levels in setup_mask. 3. Reorganized and simplified filter_block_plane(). Tests showed a ~0.8% decoder speedup. Change-Id: I723c7779738bbc2afcb9afa2c6f78580ee6c3af7	2013-11-08 11:48:31 -08:00
Yaowu Xu	814112d0f6	Merge "Disable zeroblock forcing for lossless coding mode"	2013-11-08 11:11:39 -08:00
Dmitry Kovalev	614effc0f6	Merge "Unifying tile decoding for both direct and inverse tile order."	2013-11-08 10:59:02 -08:00
Yaowu Xu	a4a5a210cb	Disable zeroblock forcing for lossless coding mode This to make sure that prediction residue always get coded in lossless mode. This commit also fixed lossless unit test Change-Id: I537726ee55328d4e4cf0a0196393a67e12bfcde1	2013-11-08 10:32:44 -08:00
Yunqing Wang	283427c053	Merge "Remove TEXTREL from 32bit encoder"	2013-11-08 08:26:30 -08:00
Paul Wilkins	0ed606fd40	Merge "Removed unused rate parameter."	2013-11-08 04:24:51 -08:00
Dmitry Kovalev	d28f30ef4e	Replacing (raster_block >> tx_size) with (block >> (tx_size << 1)). The new expression is much more logical than previous one. Surprisingly both expressions give exactly the same set of dependent values -- have_top, have_left, have_right -- in vp9_predict_intra_block. Change-Id: I63eb1b592b8c37883b3a0dbb1f3daa271e446109	2013-11-07 15:26:57 -08:00
hkuang	a6462990e6	Merge "Add back vp9_short_idct32x32_1_add_neon which is deleted in cleanup I63df79a13cf62aa2c9360a7a26933c100f9ebda3."	2013-11-07 14:42:29 -08:00
Yunqing Wang	d7289658fb	Remove TEXTREL from 32bit encoder This patch fixed the issue reported in "Issue 655: remove textrel's from 32-bit vp9 encoder". The set of vp9_subpel_variance functions that used x86inc.asm ABI didn't build correctly for 32bit PIC. The fix was carefully done under the situation that there was not enough registers. After the change, we got $ eu-findtextrel libvpx.so eu-findtextrel: no text relocations reported in 'libvpx.so' Change-Id: I1b176311dedaf48eaee0a1e777588043c97cea82	2013-11-07 13:39:40 -08:00
Jingning Han	abdefeaa89	Merge "Fix the variable naming in encode_block"	2013-11-07 11:39:04 -08:00
Deb Mukherjee	6cf3b98ac2	Merge "Miscelleneous changes in detokenize for speed"	2013-11-07 09:40:07 -08:00
Jingning Han	e91d770511	Fix the variable naming in encode_block The term x represents macroblock pointer across encode_block. Change the two local variable names to avoid confusion. Change-Id: Ic732e73023525d673c0a678ed2708ac1edf5a3f9	2013-11-07 08:59:14 -08:00
Paul Wilkins	84b3b03705	Removed unused rate parameter. Change-Id: I6e4a266fdbad1d222eb45d45b67bbb82d091821a	2013-11-07 09:59:45 +00:00
Dmitry Kovalev	672ba3ddf5	Unifying tile decoding for both direct and inverse tile order. Now tile decoding consists of two stages: 1. Find tile buffer start and its size, put this info into tile_buffers. 2. Decode each tile based on information from tile_buffers. It seems that stage 1 can also be reused by multithreaded tile decoder. Change-Id: If0cdaefdd6d10bb41c63561346c9ae4cfac081dd	2013-11-06 18:15:33 -08:00
Ivan Maltz	741c14fcf0	Merge "Move SVC per-frame loop from sample app into libvpx proper"	2013-11-06 17:24:05 -08:00
Dmitry Kovalev	af36c1f2e7	Merge "Using pd->dqcoeff instead of pd->qcoeff in the decoder."	2013-11-06 16:59:57 -08:00
Dmitry Kovalev	a1dc97beb1	Using pd->dqcoeff instead of pd->qcoeff in the decoder. It is more logical to use dqcoeff buffer to put there dequantized transform coefficients (inside inverse_transform_block and decode_coefs functions). Dequantization happens inside WRITE_COEF_CONTINUE macro. qcoeff buffer should be only used in the encoder for quantized transform coefficients. Change-Id: Ifd54bef272bbf5311ced6669c4f1079f998af5d7	2013-11-06 16:14:45 -08:00
Ivan Maltz	1ed0e1beb5	Move SVC per-frame loop from sample app into libvpx proper SVC multiple layer per frame encoding is invoked with vpx_svc_init and vpx_svc_encode. These interfaces are designed to be invoked from ffmpeg. Additional improvements: - make dummy frame handling a bit more explicit - fixed bug with single layer encodes - track individual frame sizes and psnrs instead of averages - parameterized quantizer, 16th scalefactors, more logging, - enabled single layer encodes to generate baseline - include new mode for 3 layer I frame with 5 total layers Change-Id: I46cfa600d102e208c6af8acd6132e0cc25cda8d4	2013-11-06 14:49:27 -08:00
Dmitry Kovalev	bfdfeb62a7	Merge "Replacing mi_{width,height}_log2 with num_8x8_blocks_{wide,high}_lookup."	2013-11-06 14:18:33 -08:00
Dmitry Kovalev	7b011c5467	Replacing mi_{width,height}_log2 with num_8x8_blocks_{wide,high}_lookup. Change-Id: I04c55daef89bca2b85cb7db0850f9b052abc5a7c	2013-11-06 13:34:23 -08:00
Deb Mukherjee	e52d3efe8a	Merge "Removes stack allocation of token_cache"	2013-11-06 13:12:47 -08:00
Yaowu Xu	2f4bade348	Merge "Missing _ means no sse3 for vp9_h_predictor_32x32."	2013-11-06 13:04:28 -08:00
Deb Mukherjee	8810db303c	Removes stack allocation of token_cache Removes stack-allocation of token_cache in the tokenize function in the encoder. Change-Id: Ifdbab76dc2b23384da0080d2e9390533e489db8c	2013-11-06 11:50:27 -08:00
Yaowu Xu	103c4603d2	Merge "optimize decode_bool operation"	2013-11-06 10:25:52 -08:00
Paul Wilkins	0c39318a8b	Missing _ means no sse3 for vp9_h_predictor_32x32. Error in script means vp9_h_predictor_32x32 sse3 version is not enabled. Change-Id: Ia43672740da1ecdfb7fcd420490ef424b04accc4	2013-11-06 13:57:55 +00:00
Dmitry Kovalev	d54ff9dc3d	Cleaning up vp9_quantize_b_c() function. Change-Id: I42c75530a8c9cff68480657f074131e6b60d9fca	2013-11-05 17:41:56 -08:00
Dmitry Kovalev	4a96e64dc2	Using max_tx_size instead of bsize when possible. Change-Id: I246364bc4270ca13aefb4bc3445bcf102b3170dc	2013-11-05 17:36:43 -08:00
Yaowu Xu	278856319b	Simply a macro Change-Id: I1c0c1b92aab33b1e4321b8e584a934fe58e0b324	2013-11-05 17:13:00 -08:00
hkuang	6b16f63332	Add back vp9_short_idct32x32_1_add_neon which is deleted in cleanup I63df79a13cf62aa2c9360a7a26933c100f9ebda3. Change-Id: I034848cf05031618818f7df2e7f9c35102686948	2013-11-05 14:57:32 -08:00
Dmitry Kovalev	6c217d97df	Adding read_reference_mode() function. Change-Id: I517ad7e1a48910f0d2e95feacd23f143d28b42d6	2013-11-05 14:57:05 -08:00
Dmitry Kovalev	d172201403	Cleaning up read_mv_probs() function. Change-Id: I7a1e88b5977076e22736c0d6af1d4dcc19d39a6a	2013-11-05 14:43:35 -08:00
Dmitry Kovalev	648e30bca6	Merge "Removing old code."	2013-11-05 14:28:43 -08:00
Dmitry Kovalev	815189613b	Localizing NEARESTMV special cases in the code. Removing special case handling from vp9_tree_probs_from_distribution(), tree_merge_probs(), and vp9_tokens_from_tree_offset() functions. Replacing inter_mode_offset() function with macro INTER_OFFSET which is used now for vp9_inter_mode_tree definition. Change-Id: Iff75a1499d460beb949ece543389c8754deaf178	2013-11-05 11:58:57 -08:00
Dmitry Kovalev	69cb7dc839	Merge "Unified approach for backward probability update."	2013-11-05 10:59:04 -08:00
Deb Mukherjee	551829d8f3	Miscelleneous changes in detokenize for speed Changes to reduce branching in detokenization in order to improve decode speed. Change-Id: I23a74d072cc14d99f6fc054c1e6a72021e777903	2013-11-05 10:27:36 -08:00
Deb Mukherjee	3a833ea38f	token_cache changes in decoder Removes stack-alocation of token_cache in decode_coefs function Seems to achieve about 1% decode speed improvement as tested on 25 480p videos. Change-Id: I8e7eb3361fa09d9654dfad0677a6d606701fdc6e	2013-11-05 09:32:58 -08:00
Deb Mukherjee	be8a4cbbdd	Merge "Remove one shot q experiment"	2013-11-05 09:29:31 -08:00
Jingning Han	f3efb9903c	Merge "Fix the use case of plane_block_idx in sub8x8 RD"	2013-11-05 09:27:05 -08:00
Jingning Han	4fc8320f4c	Merge "Avoid mv cost check for invalid mv values"	2013-11-05 09:01:05 -08:00
Adrian Grange	a8260a38e0	Merge "Remove unused members from VP9_COMP"	2013-11-05 07:45:03 -08:00
Jingning Han	c2217f2238	Avoid mv cost check for invalid mv values The compound inter prediction could potentially run with initial motion vectors of invalid value and check the mv_cost, which triggers overheap read. This commit resolves this issue by forcing a motion vector value check for compound inter modes of both superblock and sub8x8 block sizes. Change-Id: I4f4fc19ce83c8272782bc382f12c82a3f03212fc	2013-11-04 19:01:42 -08:00
Yaowu Xu	9f37d89037	optimize decode_bool operation Change-Id: I7fe44633d29046ddd38c88c00e8520d27ac748bf	2013-11-04 17:10:18 -08:00
Dmitry Kovalev	ac115a1f9a	Removing old code. Change-Id: Ie05cc5e2d8ce12eacdf482a8b75e5a6ce6f59f57	2013-11-04 16:38:15 -08:00
Dmitry Kovalev	c622e1d18f	Unified approach for backward probability update. Replacing update_mode_probs() and adapt_probs() with tree_merge_probs(). Change-Id: I50b2c968d67c9265f5216c700cbeba25fb014654	2013-11-04 16:12:29 -08:00
Dmitry Kovalev	dde8069e57	Splitting partition_probs array into two arrays. We only update partition_probs for inter frames but they are constant for key frames. It is not necessary to have constants inside frame context and copy them every time. This change reduces FRAME_CONTEXT size by at least 48 bytes. Change-Id: If70a53be51043f37fe7d113853217937710932a7	2013-11-04 14:26:16 -08:00
Adrian Grange	44e25155f7	Remove unused members from VP9_COMP Removed: goldfreq, avg_encode_time, avg_pick_mode_time, cpu_freq, interquantizer member variables from VP9_COMP since they are no longer used in the code. Change-Id: I010a82c217d0da03c3f53d1858d3462190c12dcf	2013-11-04 12:32:17 -08:00
Adrian Grange	dd45530ee6	Merge "Remove unused member variables from VP9_COMP"	2013-11-04 11:39:23 -08:00
Dmitry Kovalev	dd209fae3a	Merge "Removing 'new' probability calculation from convert_distribution()."	2013-11-04 11:14:58 -08:00
Adrian Grange	a0a6590e0f	Remove unused member variables from VP9_COMP Removed three members from the VP9_COMP data structure: inter_zz_count, gf_bad_count, gf_update_recommended. These were part of the VP8 real-time mode implementation that was removed from the initial VP9 codecbase. Change-Id: I866b083b88ef02c74837277d50ce532ca88492f3	2013-11-04 11:01:43 -08:00
Jingning Han	c96f94c63a	Merge "Add second ref frame check back in rdcost hist"	2013-11-03 15:17:08 -08:00
Jingning Han	4efa6a0176	Fix the use case of plane_block_idx in sub8x8 RD This commit fixes the use case of plane_block_idx, which determines the plane (Y/U/V) index based on block index. When block idx >= 4 in sub8x8 block loop, it should be of chroma components. Change-Id: I072705aa7b35445524ac607089ca8ce54b7ba478	2013-11-03 11:19:33 -08:00
Jingning Han	2de7cbe980	Add second ref frame check back in rdcost hist Update best_inter_rd and best_inter_ref_frame only in single ref frame case. Change-Id: Id56825b231a62d6852bd83811410c05a7569f715	2013-11-03 09:15:41 -08:00
James Zern	152181b25c	Merge "vp9 ssse3 d207_predictor_32x32: add missing GLOBAL()"	2013-11-02 12:25:47 -07:00
James Zern	2d980b803a	vp9 ssse3 d207_predictor_32x32: add missing GLOBAL() removes a textrel for sh_b23456789abcdefff Change-Id: I80cb9dfd8e49a0fe884c8ff76472275b3a00cb57	2013-11-01 20:33:22 -07:00
Dmitry Kovalev	df19c6b64c	Removing 'new' probability calculation from convert_distribution(). We don't have to calculate 'new' probability in convert_distribution() because it is enough to calculate only 'new' counters which could be used to calculate probability if necessary. That's why removing a lot of unused temporary probability arrays and reducing number of get_binary_prob() calls. Change-Id: I4e14eb7203d1ace61bbddefd6b9b6326be83ba63	2013-11-01 15:09:43 -07:00
Yaowu Xu	333345cd26	Merge "Convert filter kernel choice to lookup"	2013-11-01 13:43:09 -07:00
Yaowu Xu	0f76ba5523	Convert filter kernel choice to lookup Also removed unused declaration related 6 tap filter Change-Id: Ic17f516141d885157918505f4204081e4c951fad	2013-11-01 13:03:18 -07:00
Dmitry Kovalev	340b2b076e	Merge "Cleanup. Adding const to function pointer arguments."	2013-11-01 10:57:03 -07:00
Dmitry Kovalev	5113d9c3c9	Merge "Cleaning up read_skip_coeff() function."	2013-11-01 10:56:55 -07:00
Yaowu Xu	a272530bf0	Two optimizations: 1. Reduced the size memset based on eob for 32x32 transform. The reset of non-zero coefficient should probably go into where they are read in inverse transform functions. (TODO) 2. Removed a redundant level of indirection. vp9_iht4x4_add() checks transform type and call vp9_iht4x4_16_add() for tranforms other than DCT_DCT. In this case, the DCT_DCT case has been already handled here. Change-Id: Iacbc77da761f0b308df5acea0f20c9add9f33d20	2013-11-01 07:24:07 -07:00
Yaowu Xu	f0446164fb	Merge "simplify read_coef_prob()"	2013-11-01 06:59:17 -07:00
Dmitry Kovalev	0e1756330b	Merge "Removing is_intra_mode() function."	2013-10-31 18:06:53 -07:00
Yaowu Xu	a49e77af50	simplify read_coef_prob() Change-Id: I529c634db4f81ba5386092c126f53312b1e51b2b	2013-10-31 16:39:08 -07:00
Dmitry Kovalev	970eb39bdc	Cleaning up read_skip_coeff() function. Making code easier to read and avoiding calculation of skip context twice. Change-Id: I42c376b1a1811bc842bf6420bf81d2de7a1bf980	2013-10-31 14:53:18 -07:00
Dmitry Kovalev	25a2740abb	Merge "Reducing the number of foreach_transformed_block() calls."	2013-10-31 14:34:47 -07:00
Dmitry Kovalev	7c524bbef4	Cleanup. Adding const to function pointer arguments. Change-Id: I12c67c8c0fa1aa7fb3f7d6cc2ef65be29c4ea292	2013-10-31 14:34:21 -07:00
Dmitry Kovalev	47b6030dda	Reducing the number of foreach_transformed_block() calls. The change doesn't affect the bitstream. It changes the order or function calls and affects how we reconstruct intra- and inter-blocks. Speed up is about 1...1.5%. For intra-blocks: Before: for each transform block read tokens for each transform block do prediction for each transform block do inverse transform Now: for each transform block read tokens do prediction do inverse transform For inter-blocks: Before: for each transform block read tokens for each transform block do inverse transform Now: for each transform block read tokens do inverse transform Change-Id: I12a79bf1aa5a18c351b8010369bd3ff1deae1570	2013-10-31 13:52:08 -07:00
Yaowu Xu	d515716140	Merge "mb_lpf_horizontal_edge AVX2 optimization"	2013-10-31 10:43:57 -07:00
Yunqing Wang	d03b3cbdd7	Merge "Fix x_offset_q4/y_offset_q4 calculation"	2013-10-31 09:47:54 -07:00
Tamar Levy	54f9205653	mb_lpf_horizontal_edge AVX2 optimization This CL contains two AVX2 optimized loop filter functions, mb_lpf_horizontal_edge_w_avx2_8 and mb_lpf_horizontal_edge_w_avx2_16. Change-Id: I604e4fe6e99752b7800c2ea98721d97f7e0b931b	2013-10-31 10:26:15 -06:00
Deb Mukherjee	1df7ef2974	Remove one shot q experiment The experiment is no longer used and can be removed. Change-Id: I9feab378fc895c120aa375353c68f93cad090609	2013-10-31 00:20:55 -07:00
Parag Salasakar	d5a52edc11	Added optimized vp9_idct32x32_34_add_dspr2 Change-Id: I2ba9467525b87a8e4a58f0c546e63031b4e38a4e	2013-10-31 12:12:34 +05:30
Marco Paniconi	b26ce8b1be	Updates to 1-pass: -Don't reduce maxQ for gold/alt in CBR mode. -Fix to min/maxQ for first/initial key frame. -Add more speeds to datarate test and reduce the starting bitrate for test. Change-Id: Id2a333d76dd3f6a51b322ca984588e2a22159c58	2013-10-30 16:52:46 -07:00
Jingning Han	a2a5c6f821	Merge "Enable all-zero coeff block index for sub8x8 blk"	2013-10-30 16:00:57 -07:00
Dmitry Kovalev	ca39a00822	Merge "Reducing the number of recursive calls."	2013-10-30 15:14:18 -07:00
Dmitry Kovalev	6761872e49	Replacing (SWITCHABLE_FILTERS + 1) with SWITCHABLE_FILTER_CONTEXTS. Change-Id: I9781a62bc1a4cd9176554d1271d87dbcafda9cb0	2013-10-30 14:40:34 -07:00
Jingning Han	8c8381d543	Enable all-zero coeff block index for sub8x8 blk This commit makes zcoeff_blk cache the case where the entire block is quantized to be zero (without applying zero-forcing) in the rate- distortion optimization loop, and skip the forward DCT, quantization, inverse DCT, and reconstruction process in the encode_block stage. It now works for all the block sizes, including sub8x8 blocks. Change-Id: I5ae60a9c436ba3637d11666733554bec4580ef98	2013-10-30 14:01:06 -07:00
Dmitry Kovalev	2901bf2d00	Reducing the number of recursive calls. Both decode_modes_sb and decode_modes_b had conditions to immediately return at the beginning. Eliminating these conditions here and calling these functions only to do a real work. Also unrolling loop for PARTITION_SPLIT. Change-Id: I2fc41cb74ac491f045a2f04fe68d30ff4aaa555d	2013-10-30 12:17:05 -07:00
James Zern	54c2854fe2	vp9/decode: align tile worker data allocation fixes a crash in assembly on 32-bit linux/windows Change-Id: I0c27e6c0ece9732b5eb2ee5b59ff42c3c8016c50	2013-10-30 08:33:09 +01:00
Johann	2a67a34f4a	Merge "vp9_decodframe.c: use vpx_memset instead of cast"	2013-10-29 18:40:18 -07:00
Yunqing Wang	9ed2d0a577	Fix x_offset_q4/y_offset_q4 calculation "<< SUBPEL_BITS" needs to be added in the calculation. Call set_scaled_offsets() to calculate x_offset_q4 and y_offset_q4. Change-Id: Ied130ea771510e918f51cd1dc3abe57f4c0962b5	2013-10-29 17:46:55 -07:00
James Zern	ce053e7006	Merge "vp9: add multi-threaded tile decoder"	2013-10-29 17:44:22 -07:00
James Zern	3b47e05908	Merge "vp9/decode: add get_tile()"	2013-10-29 17:34:56 -07:00
James Zern	fb484524bd	vp9: add multi-threaded tile decoder tiles are decoded in parallel within a single frame Change-Id: I7aca87cb1c239b74eceef72bdc9f672faebac373	2013-10-30 01:00:20 +01:00
James Zern	6b00202f1b	vp9/decode: add get_tile() factorizes the code in decode_tiles(). reading the offsets backwards wasn't doing anything to prove tile independence Change-Id: I0395d3c77205852ebdc55efedc68291e93cef85c	2013-10-30 01:00:07 +01:00
Dmitry Kovalev	1bea58e4a8	Merge "Adding const to vp9_quantize_b_{32x32,} parameters."	2013-10-29 16:57:52 -07:00
Erik Niemeyer	27b8040c76	Merge "CL for adding AVX-AVX2 support in libvpx."	2013-10-29 15:55:54 -07:00
Dmitry Kovalev	065972f959	Adding const to vp9_quantize_b_{32x32,} parameters. Change-Id: I56f8c50ac382202f66040cd9cfaa05d889572fc7	2013-10-29 15:25:19 -07:00
Erik Niemeyer	e6863ef318	CL for adding AVX-AVX2 support in libvpx. Change-Id: Idc03f3fca4bf2d0afd33631ea1d3caf8fc34ec29	2013-10-29 15:11:16 -07:00
Dmitry Kovalev	cd94eee45f	Fixing clang warning. Warning was: "implicit conversion from enumeration type 'VPX_SCALING_MODE' (aka 'enum vpx_scaling_mode_1d') to different enumeration type 'VPX_SCALING'". Change-Id: I45689e439a8775bc1e7534d0ea1ff7c729f2c7f5	2013-10-29 14:50:27 -07:00
Dmitry Kovalev	b4139d582d	Merge "Fixing wrongly initialized tx_type variable."	2013-10-29 14:07:16 -07:00
Johann	dc799a875b	vp9_decodframe.c: use vpx_memset instead of cast Fix warning with -Wstrict-aliasing=1 Change-Id: Idfac09be1ab328923883e63436577f1018c895b8	2013-10-29 13:52:48 -07:00
Dmitry Kovalev	e6dcf2aeb6	Fixing wrongly initialized tx_type variable. Wrong value was used in get_tx_type_4x4() function, so making initialization before that call. Change-Id: Ief30bb1e0c03b2f23d993bbf9ae18d7150ba9a83	2013-10-29 12:18:53 -07:00
Dmitry Kovalev	156de9c3ef	Correct handling of show_bit in uncompressed header. "keyframe" variable in the current code actually means that previous frame is a keyframe because cm->frame_type has not been initialized in read_uncompressed_header. Change-Id: I5645b0816c70abdef5dfc70113018d06276dac77	2013-10-29 11:24:08 -07:00
Dmitry Kovalev	a8e31d624a	Merge "Cleaning up vp9_regular_quantize_b_4x4."	2013-10-29 10:49:52 -07:00
Dmitry Kovalev	e5956258dd	Merge "Making get_tx_counts() similar to get_tx_probs()."	2013-10-29 10:48:50 -07:00
Yunqing Wang	c634ec6a56	Merge "Rewrite loop_filter_info_n struct"	2013-10-29 09:49:36 -07:00
James Zern	7795c1911e	Merge "vp9_decode_frame: group assignments/setup calls"	2013-10-29 03:34:10 -07:00
James Zern	d39f279daa	vp9_decode_frame: group assignments/setup calls group error checking at the top followed by allocations, setup then decode. Change-Id: I877d21326bb767885520511ecea70e5fd1e28054	2013-10-29 11:03:50 +01:00
Dmitry Kovalev	aa76cd1e49	Removing is_intra_mode() function. It is enough to check just block type: intra or inter. Intra block implies intra prediction mode, and inter block implies inter mode. Change-Id: I3cf98731a3935f670a3cd8e2b2443483eb944be4	2013-10-28 20:00:55 -07:00
Dmitry Kovalev	fa1ac00aee	Making get_tx_counts() similar to get_tx_probs(). Change-Id: I5b17f40e515c4bcf9ebef5380270a214af4e0115	2013-10-28 19:52:38 -07:00
Dmitry Kovalev	19cf72eddc	Adding {read, write}_partition() instead of check_bsize_coverage(). Making partition read/write logic more clear. Change-Id: I1981e90327257d37095567c62d72a103cda1da33	2013-10-28 15:14:45 -07:00
Dmitry Kovalev	8253532c2d	Cleaning up vp9_regular_quantize_b_4x4. Passing scan & iscan as parameters, adding useful local variables. Change-Id: Ia2a87906941db9557350d273669ce5c3cdb7235d	2013-10-28 14:28:28 -07:00
James Zern	58a0f6dbdd	vp9: add TileInfo replaces use of cur_tile_mi_(row\|col)_(start\|end) by VP9_COMMON, making it less stateful and more reusable for parallel tile decoding Change-Id: I1df09382b4567a0e5f4434825d47c79afe2399be	2013-10-28 20:54:43 +01:00
James Zern	f0eabfd432	vp9_decodframe: limit scope of private function params (2) replace VP9D_COMP usage with the (slightly) more targeted VP9_COMMON/MACROBLCKD structures. Change-Id: Ifdd9034f44d69eb94e232dd03c922de763b96a30	2013-10-28 20:48:59 +01:00
James Zern	3ffa41aae3	Merge changes If9b16f7d,I75aab21c,I9cbb768c,If5cea3d3,I96940657,I025595d8,Ie0bc3935,I3ebb172d * changes: vp9: remove partition+entropy contexts from common vp9: add above/left_context to MACROBLOCKD vp9: add above/left_seg_context to MACROBLOCKD vp9: add above/left_context to encoder vp9: add above/left_seg_context to encoder vp9: pass entropy context directly to set_skip_context vp9: pass context directly to partition functions vp9/decode: add alloc_tile_storage()	2013-10-28 12:45:11 -07:00
Dmitry Kovalev	ded951793c	Merge "Replacing is_inter_mode with is_inter_block."	2013-10-28 10:07:06 -07:00
James Zern	50c4026e93	Merge "vp9 decode: defer loop filter allocation"	2013-10-28 08:00:59 -07:00
James Zern	7b9ca3caa7	vp9: remove partition+entropy contexts from common these are now handled separately by the encoder and decoder Change-Id: If9b16f7d734e992fb94a510a6d88f2690d7fb7cb	2013-10-28 11:34:20 +01:00
James Zern	e571d3badc	vp9: add above/left_context to MACROBLOCKD Change-Id: I75aab21c1692cbad717564cbb436578fddbc348d	2013-10-28 11:34:18 +01:00
James Zern	d9a317c8b2	vp9: add above/left_seg_context to MACROBLOCKD Change-Id: I9cbb768c5f857a096cf6c29d6755d0e5e6728435	2013-10-28 11:32:16 +01:00
Dmitry Kovalev	b133cbf5f3	Merge "Adding fht{4x4, 8x8, 16x16} functions."	2013-10-26 03:03:27 -07:00
James Zern	8f177bb0b6	vp9 decode: defer loop filter allocation wait until do_loopfilter_inline is true before committing the resources Change-Id: I01661bd40599b47362bb3fb534668471f2a9d8d7	2013-10-26 11:57:44 +02:00
Dmitry Kovalev	ae2f732e8c	Adding fht{4x4, 8x8, 16x16} functions. Adding these functions to encapsulate tx_type check. Changing TX_TYPE to int to match the declaration in vo9_rtch.h. Change-Id: I6f3a2df6e35595ca73b6aaa9e3909ee7bc3fd16f	2013-10-25 17:55:07 -07:00
Dmitry Kovalev	07502f1963	Merge "Adding get_frame_new_buffer() function to replace duplicated code."	2013-10-25 15:25:13 -07:00
Dmitry Kovalev	ddfc87c6f3	Merge "Making input pointer constant for all fdct/fht functions."	2013-10-25 15:14:49 -07:00
Yunqing Wang	00dbd369c7	Rewrite loop_filter_info_n struct Restructured the storing of loopfilter information. Deleted loop_filter_info struct and reduced copying happened in every superblock. Tests showed a 0.5% ~ 0.8% decoder speed gain. Change-Id: Ie6a8e46bae71dc3a3cd8c6054f5de540b8e0ef5e	2013-10-25 14:56:28 -07:00
James Zern	ce2c337261	vp9: add above/left_context to encoder Change-Id: If5cea3d389bb1135ee490d273e57cc2c43325d01	2013-10-25 22:01:14 +02:00
James Zern	d72dfab296	vp9: add above/left_seg_context to encoder Change-Id: I969406574c6658936e9f6db5752f1b295025aab5	2013-10-25 22:01:14 +02:00
James Zern	d2bf696ee0	vp9: pass entropy context directly to set_skip_context this will allow for separate storage to be used in tile decoding Change-Id: I025595d83118bdc82a545dae69bc6602e8d2a6e3	2013-10-25 22:01:13 +02:00
James Zern	88d79eabdc	vp9: pass context directly to partition functions update_partition_context / partition_plane_context: this will allow for separate storage to be used in tile decoding Change-Id: Ie0bc393531ab7e9d2ce35c95111849b294aad4ed	2013-10-25 22:01:13 +02:00
James Zern	71097d9cf2	vp9/decode: add alloc_tile_storage() Change-Id: I3ebb172d4f2ae7db73b72fb42eb93833a295fb55	2013-10-25 22:01:13 +02:00
Dmitry Kovalev	d5ac877f7f	Adding COLOR_SPACE enum. Change-Id: If5711eb166609cce0a88b3cb5b56b3afeebc4fb0	2013-10-25 12:35:20 -07:00
Yunqing Wang	47665452f0	Merge "Add 32x32 idct function for eob<=34 case"	2013-10-25 09:34:46 -07:00
Jingning Han	39b6ab6a29	Merge "Separate encode_block for pass 1 and 2"	2013-10-24 21:39:27 -07:00
Yunqing Wang	f88315cb29	Add 32x32 idct function for eob<=34 case When only upper-left 8x8 area has non-zero dct coefficients, we could skip 1D IDCT for 9th to 32th rows to save operations. This function is called when eob <= 34. Change-Id: I9684b75947bdde346cfe3720f08a953aa7a13fb5	2013-10-24 16:13:21 -07:00

1 2 3 4 5 ...

3432 Commits