generic-library/vpx

Author	SHA1	Message	Date
Dmitry Kovalev	6f1bb2246c	Reading diff update flag inside vp9_diff_update_prob. Change-Id: I5ae659c1bfb132428a7272d094b5287d144ec7c8	2013-10-03 10:55:36 -07:00
Dmitry Kovalev	ad6ed536d5	Merge "Removing vpx_codec_impl_{top, bottom}.h files."	2013-10-03 10:44:16 -07:00
Paul Wilkins	b03d3da9c1	Merge "Speed setting review."	2013-10-03 09:49:00 -07:00
Paul Wilkins	fa71882e63	Merge "make use last partition consider motion"	2013-10-03 09:48:49 -07:00
Johann	fd6c4c71d6	Merge "mips dsp-ase r2 vp9 decoder convolve module optimizations"	2013-10-03 09:41:16 -07:00
Dmitry Kovalev	6cb6987d4d	Merge "BITSTREAM - RESTORING BILINEAR INTERPOLATION FILTER SUPPORT"	2013-10-03 09:34:26 -07:00
Yunqing Wang	ed22179a82	Rewrite HORIZx4 and HORIZx8 in subpixel filter functions In subpixel filters, prefetched source data, unrolled loops, and interleaved instructions. In HORIZx4, integrated the idea in Scott's CL (commit: `d22a504d11`), which was suggested by Erik/Tamar from Intel. Further tweaking was done to combine row 0, 2, and row 1, 3 in registers to do more 2-row-in-1 operations until the last add. Test showed a ~2% decoder speedup. Change-Id: Ib53d04ede8166c38c3dc744da8c6f737ce26a0e3	2013-10-03 09:04:02 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to insure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Jim Bankoski	f1d3e5e4d6	make use last partition consider motion This commit causes use last partition to consider whether a 64x64 has motion that might make a new partitioning worth while. Change-Id: I3a57bedef4f3cd961fadbfa96651c206fa36da4a	2013-10-03 10:22:39 +01:00
Paul Wilkins	ece99b3da0	Merge "Improved auto_partition_range."	2013-10-03 02:06:13 -07:00
Dmitry Kovalev	68a3e4a888	BITSTREAM - RESTORING BILINEAR INTERPOLATION FILTER SUPPORT Adding appropriate test vector vp90-2-06-bilinear.webm. Change-Id: Ia3bbf57318e0cc61a1b724fe751e3f9c7e11b337	2013-10-02 18:04:12 -07:00
A.Mahfoodh	5215b83aea	Simplifying and inlining k_cvtlo_epi16 and k_cvthi_epi16 Simplify the k_cvtlo_epi16 and k_cvthi_epi16 to only two instructions. Then inlined them. quoting from intel MMX_App_Compute_16bit_Vector.pdf‎ "The PMADDWD instruction multiplies four pairs of 16-bit numbers and produces partial sums of the results and can do so once per clock (with a three-clock latency)." so I am assuming that there will be three clock overhead after the last _mm_madd_pi16 command. Even with the overhead the number of clocks in general should be smaller. I am not sure though becasue I could not find information about number of clocks required for instructions in k_cvtlo_epi16 and k_cvthi_epi16. I will run a test and compare the execution time. Change-Id: Ieda4aa338f69ad3dd196ac6e7892da3cf1b47ea7	2013-10-02 20:02:03 -04:00
Parag Salasakar	40edab5e39	mips dsp-ase r2 vp9 decoder convolve module optimizations Change-Id: I401536778e3c68ba2b3ae3955c689d005e1f1d59	2013-10-02 16:58:37 -07:00
Dmitry Kovalev	43e979db3b	Merge "Adding const to function arguments."	2013-10-02 16:26:20 -07:00
Dmitry Kovalev	7fa14f42c1	Merge "Removing unused vp9_coeff_stats_model typedef."	2013-10-02 16:26:09 -07:00
Dmitry Kovalev	a88a0e88a4	Merge "Moving get_token_alloc function from common to the encoder."	2013-10-02 16:26:00 -07:00
Jim Bankoski	f5bcc372c9	unused typedef in vp9_variance.h Change-Id: I15f79c9de34c723c1dd419b8da96c3ff948c5e03	2013-10-02 15:59:31 -07:00
Dmitry Kovalev	be7eec79be	Moving all idct/iht functions in one place. Moving functions from vp9_idct_blk to vp9_idct because these functions are used from both encoder and decoder. Removing duplicated code from vp9_encodemb.c and reusing existing functions. Change-Id: Ia0a6782f8c4c409efb891651b871dd4bf22d5fe8	2013-10-02 14:13:33 -07:00
Scott LaVarnway	20a09d928a	d153 intra prediction (16x16) ssse3 using bytes Change-Id: I8a106dd61b0a2520fae792d87d6348e662649b2d	2013-10-02 16:34:05 -04:00
Dmitry Kovalev	d958c0486a	Merge "Removing memset calls inside idct/iht functions."	2013-10-02 12:45:27 -07:00
Dmitry Kovalev	c4d1ab573a	Removing memset calls inside idct/iht functions. Making appropriate memset inside decode_block now. Change-Id: I8e944194668c830de08271c8fb6e413251c201d8	2013-10-02 11:48:08 -07:00
Jingning Han	54bc73151b	Deprecate unused mode count variables Remove mode_check_freq and mode_test_hit_counts from VP9_COMP. Change-Id: Iabfd9f841444cd9bf19ac761a9795f140082ce0b	2013-10-02 11:07:14 -07:00
Jingning Han	6d3bd96607	BITSTREAM - CLARIFICATION OF MV SIZE RANGE The codec should effectively run with motion vector of range (-2048, 2047) in full pixels, for sequences of 1080p and below. Add assertions to clarify this behavior. Change-Id: Ia0cac28249f587d8f8882205228fa480263ab313	2013-10-02 10:29:45 -07:00
Dmitry Kovalev	6c2082db71	Merge "Adding read_intra_mode_{y, uv} functions for clarity."	2013-10-02 09:17:10 -07:00
Dmitry Kovalev	3c4e9e341f	Adding SSE2 optimized vp9_short_idct32x32_1_add function. Change-Id: I4b1c6bb9ff615f5872b96ed07dbf0f5e18e63643	2013-10-01 18:34:36 -07:00
Dmitry Kovalev	771f3ef5ad	Adding read_intra_mode_{y, uv} functions for clarity. Change-Id: I92fd32476c472e54f52b8d7602a98262b25e6eaf	2013-10-01 17:55:48 -07:00
Jim Bankoski	e83ebc8992	Merge "vp9_thread nolintify lint issue I can't fix easily"	2013-10-01 16:15:03 -07:00
Jim Bankoski	825b7c301d	Merge "vp9_block.h cpplint issues resolved"	2013-10-01 16:14:58 -07:00
Jim Bankoski	691177842c	Merge "cpplint issue in vp9_rdopt.h"	2013-10-01 15:45:35 -07:00
Jim Bankoski	d0308b7daa	Merge "cpplint issues in vp9_onyx_int.h"	2013-10-01 15:45:02 -07:00
Dmitry Kovalev	aeb603f2af	Making decode_modes_b function more straightforward. Moving out decode_tokens function calls and adding decode_blocks boolean variable. We only have to decode if eobtotal > 0, i.e. we have at least one non-zero coefficient. Also inlining and remove vp9_set_pred_flag_mbskip function. Change-Id: I7be38b12ee8206faf0beea2bbf4d52be42575b03	2013-10-01 15:41:30 -07:00
Jim Bankoski	c52d85442c	vp9_thread nolintify lint issue I can't fix easily Change-Id: Ib19dabe697656e4d7e8403d91bedca7cd31d36bf	2013-10-01 15:19:39 -07:00
Jim Bankoski	5491a1f33e	vp9_block.h cpplint issues resolved Change-Id: Icc6a76a5be77f3e19918155bab3998e0aa32ccf5	2013-10-01 15:17:39 -07:00
Jim Bankoski	c4627a9ff1	cpplint issues in vp9_onyx_int.h Change-Id: I6c4058aebe834e1a12b7a3fb10484b9ebe60b349	2013-10-01 15:14:39 -07:00
Jim Bankoski	b6e2f9b752	cpplint issue in vp9_rdopt.h Change-Id: I84209d382ca5dfc537ee533cd792d8caa0e25cee	2013-10-01 15:09:32 -07:00
Matthew Heaney	6b78f11a03	Merge "Fix linker warnings for bilinear filters"	2013-10-01 14:42:38 -07:00
Matthew Heaney	dcab9896e8	Fix linker warnings for bilinear filters The declaration of the bilinear filters specified an alignment clause in the implementation file but not in the header. This turned out to be harmless, but it did cause linker warnings to be emitted when building on Windows. The (extern) declaration in the header was changed, to match the declaration in the implementation. Change-Id: I44be89b1572fe9a50fa47a42e4db9128c4897b04	2013-10-01 14:40:05 -07:00
Yunqing Wang	03698aa6d8	Merge "Modify HORIZx16 macro in subpixel filter functions"	2013-10-01 14:18:10 -07:00
Yunqing Wang	df8e156432	Modify HORIZx16 macro in subpixel filter functions Interleaved the instructions, reduced register dependency, and prefetched the source data. This improved the decoder speed by 0.6% - 2%. Change-Id: I568067aa0c629b2e58219326899c82aedf7eccca	2013-10-01 12:49:25 -07:00
Dmitry Kovalev	0a5e9ee054	Moving get_token_alloc function from common to the encoder. Also renaming mb_row -> mi_row, mb_col -> mi_col arguments and calculate mb_rows/mb_cols values from mi_rows/mi_cols. Change-Id: I6919a279f560648e23bc9a12f507d17c21ffd5d7	2013-10-01 11:54:10 -07:00
Yaowu Xu	5c66f6f5eb	fix build with MSVC near is a key word, changed to use nearmv instead. Change-Id: Ib54438c431b2b2521a62fc7b61a9c127dd7bc01e	2013-10-01 09:51:59 -07:00
Scott LaVarnway	27b390e1a1	d153 intra prediction ssse3 using bytes byte version of ronalds d153 ssse3 optimizations for 4x4 and 8x8 (commit: fc91a2a112238a1aee568f3b840585de4e928fca) Change-Id: Iec4426032311483f615fd9e0dceba3ee85ddebd7	2013-10-01 09:05:20 -04:00
Dmitry Kovalev	c982a73b9f	Removing unused vp9_coeff_stats_model typedef. Change-Id: I6973e7121b6393379b5759f288632e8eab763d3e	2013-09-30 15:10:00 -07:00
Dmitry Kovalev	c64e23832f	Adding const to function arguments. Function list: tx_counts_to_branch_counts_32x32 tx_counts_to_branch_counts_8x8 tx_counts_to_branch_counts_8x8 update_ct update_ct2 update_mode_probs Change-Id: I120d8945a34378cf285d6bd415e23de1d522cf2f	2013-09-30 14:50:15 -07:00
Dmitry Kovalev	40047bef5d	Merge "Using array of motion vectors instead of separate variables."	2013-09-30 13:16:45 -07:00
Dmitry Kovalev	cd945c7bd9	Merge "Removing vp9_add_constant_residual_{8x8, 16x16, 32x32} functions."	2013-09-30 13:16:34 -07:00
Jingning Han	195061feda	Fix rectangular partition check in speed 1 Make encoder skip rectangular partition check in speed 1 and above, when early termination was triggered in partition split. Thanks Guillaume (gmartres@) for catching this issue. This change makes bus_cif at 2000kbps speed 1 runtime goes down from 25612ms to 23438ms (about 9% speed-up), at the expense of -0.235% performance down. Change-Id: I98613fad081a261d30d5fa206f934ca70601c180	2013-09-30 12:14:36 -07:00
Dmitry Kovalev	c151bdd412	Using array of motion vectors instead of separate variables. Change-Id: I7380a089105f658257bbb3e30a525da168e76952	2013-09-30 12:11:46 -07:00
Dmitry Kovalev	e288c6015e	Removing vpx_codec_impl_{top, bottom}.h files. It doesn't seem reasonable to have these files as our API part. Just inlining them in the source. Change-Id: Iff970bb25e72e49e7ac21990824dbf4ef8bfd2e2	2013-09-30 11:10:54 -07:00
Dmitry Kovalev	1a9d4fedf3	Merge "Using size_t for memory buffer size."	2013-09-30 11:10:08 -07:00

... 4 5 6 7 8 ...

6864 Commits