generic-library/vpx

Author	SHA1	Message	Date
Johann	2104454607	block error avx2: use tran_low_t Change-Id: Ic5f3a1f569d6f82afeaf4fcd7235374bb460db3c	2017-02-16 12:39:02 -08:00
Johann Koenig	cc43012674	Merge changes I267050a5,Iebade0ef,Id96a8df3 * changes: quantize_fp_32x32 highbd ssse3: enable existing function quantize_fp highbd ssse3: use tran_low_t for coeff quantize_fp highbd sse2: use tran_low_t for coeff	2017-02-16 20:34:48 +00:00
Yi Luo	72a43e2378	Add idct32x32_135_add SSSE3 intrinsics - Replace the corresponding assembly code. - No user level speed performance degrade. - Unit tests passed. Change-Id: Idd0c5a4bad4976f1617c34100cb46e75e3b961e5	2017-02-16 11:29:34 -08:00
Yunqing Wang	0bf6b51572	Merge "Structured the mode ordering code to avoid redundant memcpy"	2017-02-16 16:22:54 +00:00
Johann	ff37a911ce	quantize_fp_32x32 highbd ssse3: enable existing function This was created as part of the quantize_fp_ssse3 change. Both functions use the same source file with different macro parameters. Change-Id: I267050a559426a85955d215aa0aaca270439c5ab	2017-02-16 07:40:56 -08:00
Johann	4682130b60	quantize_fp highbd ssse3: use tran_low_t for coeff Change-Id: Iebade0efc0efbb0a80a0f3adbef4962e3a2f25e8	2017-02-16 07:40:56 -08:00
Johann	ac3996a6d1	quantize_fp highbd sse2: use tran_low_t for coeff Change-Id: Id96a8df33354a7987ce890a3d6798c7375ffa4aa	2017-02-16 07:40:55 -08:00
Johann	44600442dc	bitdepth conversion: really use num elements The previous implementation confused bit/bytes/elements. It was using '32' as the multiplier but that was mistakenly adopted because a 32x32 transform embedded the stride. Change-Id: Ieeb867a332416b9a40580b5e7c9b20088e9e691a	2017-02-16 15:02:48 +00:00
Ranjit Kumar Tulabandu	5127e58dab	Structured the mode ordering code to avoid redundant memcpy Change-Id: I4f5d6b54018bd1928cd9e5e42619e6f55b334803	2017-02-16 14:12:33 +00:00
Paul Wilkins	60a10116d1	Merge "Disconnect ARF breakout from frame boost."	2017-02-16 10:02:09 +00:00
Paul Wilkins	543ebc900f	Merge "Remove unnecessary factor."	2017-02-16 10:01:58 +00:00
Paul Wilkins	9216ba58d8	Merge "Bug in scale_sse_threshold()"	2017-02-16 10:01:46 +00:00
Paul Wilkins	e6c1993f1b	Merge "Additional first pass stats."	2017-02-16 09:39:29 +00:00
Kaustubh Raste	fddf66b741	Fix mips vpx_post_proc_down_and_across_mb_row_msa function Added fix to handle non-multiple of 16 cols case for size 16 Change-Id: If3a6d772d112077c5e0a9be9e612e1148f04338c	2017-02-16 13:17:00 +05:30
Johann Koenig	b63e88e506	Merge "Use 'packssdw' for loading tran_low_t values"	2017-02-16 02:41:00 +00:00
Johann Koenig	61d05c1e67	Merge "vp8_dx_iface: remove unused 'else' condition"	2017-02-16 01:00:45 +00:00
James Zern	cc04ae1565	Merge "vpx_temporal_svc_encoder.sh: remove FUNCNAME bashism"	2017-02-16 00:21:19 +00:00
Marco Paniconi	e6cf741ae6	Merge "vp9: Some code cleanup for aq-mode = 3."	2017-02-15 23:03:27 +00:00
Marco	158b300952	vp9: Some code cleanup for aq-mode = 3. The weight segment needs to only be computed once per frame, so remove it from the funciton vp9_cyclic_refresh_rc_bits_per_mb(), which is called within a loop inside vp9_rc_regulate_q. Change-Id: Ia0e18b89abb97e42c466d4dbc47700d7f76555db	2017-02-15 14:07:04 -08:00
Jerome Jiang	2865de86ec	vpx_temporal_svc_encoder: Expose error resilient control to cmd line. Change-Id: Ic74a8690b136ffbc370080f70b2d5a6b1572bf63	2017-02-15 21:45:52 +00:00
Linfeng Zhang	d12f25f216	Merge "cosmetics,dsp/inv_txfm.c: reorder functions"	2017-02-15 20:18:23 +00:00
Marco Paniconi	725606a678	Merge "vp9. Use same source_sad threshold for all speeds."	2017-02-15 20:07:19 +00:00
Linfeng Zhang	106c342659	cosmetics,dsp/inv_txfm.c: reorder functions Change-Id: Ie0f7689ebe230c68eadb22a32b14838c1a7543a6	2017-02-15 11:40:35 -08:00
Linfeng Zhang	d5edf56bb5	Merge "Add vpx_highbd_idct16x16_38_add_neon()"	2017-02-15 19:34:18 +00:00
Marco	f82280820a	vp9. Use same source_sad threshold for all speeds. Only affects real-time mode. Change-Id: Iba836f110c4da936f5173cc0f54424d5b6121bff	2017-02-15 11:28:26 -08:00
Marco	716c1d5ff5	Vp9: Speed 8 aq-mode=3: Reduce computation in estimating bits per mb. vp9_compute_qdelta_by_rate has almost 2% overhead in profiling on Nexus 6. Reduce the calling of that function in speed 8 by estimating the delta-q. Both rtc and rtc_derf show little/no change in avg psnr/ssim. Encoding speed is 2~3% faster on Nexus 6. Change-Id: If25933715783f31104a18a5092ea347b1221b5f5	2017-02-15 09:28:16 -08:00
Linfeng Zhang	81914ce68a	Add vpx_highbd_idct16x16_38_add_neon() BUG=webm:1301 Change-Id: Ic6cd8c1e63e1b7a997cbed221e20fff4c599e0fe	2017-02-15 09:12:02 -08:00
Linfeng Zhang	ccada0636b	Merge "Add vpx_highbd_idct16x16_38_add_c()"	2017-02-15 17:06:17 +00:00
paulwilkins	cfc79a357a	Disconnect ARF breakout from frame boost. This small change replaces the frame boost check in the arf group length break out clause with a test against a prediction decay value. The boost value is in fact partly dependent on the decay value but this change means that the per frame boost calculation can be adjusted without influencing the group length calculation. The value chosen gives a close match on all the test sets with the previous code (on average) but it was noted that a lower threshold was slightly better for 1080P and up and a slightly higher value for small image sizes. Change-Id: I4d5b9f67d5b17b0d99ea3f796d3d6202fd61ee0c	2017-02-15 10:46:14 +00:00
paulwilkins	b89ba05ab4	Remove unnecessary factor. Removed unnecessary scaling factor to simplify. Change-Id: I3fc9c5975a2597e72f1324e09dd586dea1facfa7	2017-02-15 10:45:43 +00:00
paulwilkins	76550dfdc0	Bug in scale_sse_threshold() The function scale_sse_threshold() returns a threshold scaled if necessary for use with 10 and 12 bit from an 8 bit baseline. SSE error values would be expected to rise for the 10 and 12 bit cases where there are more bits of precision. Hence the threshold used for the test should also be scaled up. Change-Id: I4009c98b6eecd1bf64c3c38aaa56598e0136b03d	2017-02-15 10:45:03 +00:00
paulwilkins	945ccfee59	Additional first pass stats. Added counts that split the intra coded blocks into low and high variance. Change-Id: Ic540144b34d5141659081bb22f7ee16fd6861f14	2017-02-15 10:44:37 +00:00
Paul Wilkins	7635ee0f37	Merge "Aggressive VBR method."	2017-02-15 10:37:02 +00:00
James Zern	1cd926d665	vpx_temporal_svc_encoder.sh: remove FUNCNAME bashism replace with an explicit output file prefix that matches the function name Change-Id: I7f6a4105adb34327b1099a5fbf132aa8d1ad5b90	2017-02-14 23:44:00 -08:00
Johann Koenig	61927ba4ac	Merge "vp9 fdct higbd neon: connect existing highbd calls"	2017-02-15 01:33:00 +00:00
Linfeng Zhang	e07e74fb0f	Add vpx_highbd_idct16x16_38_add_c() When eob is less than or equal to 38 for high-bitdepth 16x16 idct, call this function. BUG=webm:1301 Change-Id: I09167f89d29c401f9c36710b0fd2d02644052060	2017-02-14 17:25:52 -08:00
Yunqing Wang	f2c1aea118	Merge "Row based multi-threading of encoding stage"	2017-02-15 00:54:10 +00:00
Ranjit Kumar Tulabandu	71061e9332	Row based multi-threading of encoding stage (Yunqing Wang) This patch implements the row-based multi-threading within tiles in the encoding pass, and substantially speeds up the multi-threaded encoder in VP9. Speed tests at speed 1 on STDHD(using 4 tiles) set show that the average speedups of the encoding pass(second pass in the 2-pass encoding) is 7% while using 2 threads, 16% while using 4 threads, 85% while using 8 threads, and 116% while using 16 threads. Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de	2017-02-15 00:49:34 +00:00
Linfeng Zhang	615566aa81	Merge "Replace 14 with DCT_CONST_BITS in idct NEON functions' shifts"	2017-02-15 00:46:29 +00:00
Johann	86fed469ec	vp8_dx_iface: remove unused 'else' condition Clears up static clang analysis warning regarding a dead store. Change-Id: If4fe7a9a7f94c6e2001d46136944f90712e543b4	2017-02-15 00:05:41 +00:00
Johann	327a02d77e	Use 'packssdw' for loading tran_low_t values This matches bitdepth_conversion_sse2.asm and produces substantially better assembly. The old way had lots of 'movzwl' and 'shl' and storing back to memory before loading into an xmm register. Change-Id: Ib33e35354dfd691a4f8b1e39f4dbcbb14cd5302b	2017-02-14 22:39:49 +00:00
Johann	3e7aa8fda9	vp9 fdct higbd neon: connect existing highbd calls Change-Id: Ia8f822bd6e70b3911bc433a5a750bfb6f9a3a75c	2017-02-14 22:11:49 +00:00
Johann Koenig	9c2bb7f342	Merge "quantize_fp highbd neon: use tran_low_t for coeff"	2017-02-14 21:28:23 +00:00
Linfeng Zhang	429e652809	Replace 14 with DCT_CONST_BITS in idct NEON functions' shifts Change-Id: I2a39a3bb87516b04d273bc1c0f4a634e3fb6f0f6	2017-02-14 13:08:41 -08:00
clang-format	4b402746ca	apply clang-format Change-Id: I75e4a9e0b37bd4586f26c8d6c1fa27f3f6ff1bce	2017-02-14 12:45:52 -08:00
James Zern	f670628ca5	.clang-format: update to 3.9.1 Change-Id: Ia51f2201df897651067d09122075953382b59139	2017-02-14 12:39:54 -08:00
Yi Luo	c1a90dc160	Merge "Replace idct32x32_34_add_ssse3 assembly with intrinsics"	2017-02-14 20:13:27 +00:00
Yi Luo	bd86de1ac8	Replace idct32x32_34_add_ssse3 assembly with intrinsics - No user-level speed performance change. - Pass unit tests. Change-Id: Idfc598e00f354265e41f6b3219f4734216c115c6	2017-02-14 10:38:36 -08:00
Johann	2b24aa87d9	quantize_fp highbd neon: use tran_low_t for coeff Change-Id: I90fd815f15884490ad138f35df575a00d31e8c95	2017-02-14 10:26:10 -08:00
Johann	25301a84a8	vp8 onyx_if: assert divide by zero Clears up static clang analysis warning regarding divide by zero. Trying to explain to the compiler how it's impossible to avoid incrementing num_blocks at least once is difficult. Change-Id: Ibaae43be572e5cd7a689b440dcd341c17d33443b	2017-02-14 04:27:31 +00:00

... 3 4 5 6 7 ...

17028 Commits