vpx/vpx_dsp/x86
Yi Luo a3452996a1 High bit depth inter prediction horizontal/vertical filters AVX2
User level speed improvement on i7-6700, cpu-used=1,
  x86_64 Linux, bitrate, 1080p, 8Mbps, 4K, 16Mbps:
- Decoder:
  1080p: ~4%
  4K: ~5%
- Encoder:
  1080p: ~1%
  4K: ~3%

Change-Id: I51b48f9c5de0d62487d5a11aa579c97bd03dd640
2017-05-03 12:18:01 -07:00
..
add_noise_sse2.asm postproc : fix function parameters for noise functions. 2016-07-15 08:27:34 -07:00
avg_intrin_sse2.c highbd x86: consolidate tran_low_t conversions 2017-02-06 10:43:26 -08:00
avg_pred_sse2.c vpx_comp_avg_pred: sse2 optimization 2017-04-13 08:44:52 -07:00
avg_ssse3_x86_64.asm bitdepth conversion: really use num elements 2017-02-16 15:02:48 +00:00
bitdepth_conversion_avx2.h block error avx2: use tran_low_t 2017-02-16 12:39:02 -08:00
bitdepth_conversion_sse2.asm quantize_fp highbd ssse3: use tran_low_t for coeff 2017-02-16 07:40:56 -08:00
bitdepth_conversion_sse2.h correct bitdepth_conversion_sse2.h header guard 2017-02-16 12:43:33 -08:00
convolve.h Update highbd convolve functions arguments to use uint16_t src/dst 2017-04-25 14:22:19 -07:00
deblock_sse2.asm Fix segmentation fault caused by denoiser working with spatial SVC. 2017-02-21 09:38:28 -08:00
fwd_dct32x32_impl_avx2.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
fwd_dct32x32_impl_sse2.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
fwd_txfm_avx2.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
fwd_txfm_impl_sse2.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
fwd_txfm_sse2.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
fwd_txfm_sse2.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
fwd_txfm_ssse3_x86_64.asm Rework forward 8x8 2D-DCT ssse3 implementation 2017-01-10 12:50:55 -08:00
highbd_convolve_avx2.c High bit depth inter prediction horizontal/vertical filters AVX2 2017-05-03 12:18:01 -07:00
highbd_intrapred_sse2.asm Code clean of highbd_tm_predictor_32x32 2015-12-22 16:51:57 -08:00
highbd_loopfilter_sse2.c Unify loopfilter function names 2016-09-29 16:25:42 -07:00
highbd_quantize_intrin_sse2.c Fix warnings reported by -Wshadow: Part1: vpx_dsp directory 2016-10-17 19:25:19 -07:00
highbd_sad4d_sse2.asm Use newer x86inc.asm 2015-08-07 16:44:44 -07:00
highbd_sad_sse2.asm Use newer x86inc.asm 2015-08-07 16:44:44 -07:00
highbd_subpel_variance_impl_sse2.asm Fix for issue 1114 compile error 2015-12-18 09:43:22 +00:00
highbd_variance_impl_sse2.asm Move variance functions to vpx_dsp 2015-05-26 12:01:52 -07:00
highbd_variance_sse2.c Resolve -Wshorten-64-to-32 in highbd variance. 2017-04-05 17:34:02 -07:00
intrapred_sse2.asm *.asm: normalize label format 2016-06-27 19:46:57 -07:00
intrapred_ssse3.asm Slow pshufb removal in 3 intra prediction functions. 2016-06-02 10:55:58 -07:00
inv_txfm_sse2.c inv_txfm_sse2: clear conversion warning in hbd build 2017-03-17 01:16:38 -07:00
inv_txfm_sse2.h Replace idct32x32_34_add_ssse3 assembly with intrinsics 2017-02-14 10:38:36 -08:00
inv_txfm_ssse3.c Make butterfly_self() signature consistent with butterfly() 2017-03-21 09:36:35 -07:00
inv_wht_sse2.asm bitdepth conversion: really use num elements 2017-02-16 15:02:48 +00:00
loopfilter_avx2.c Unify loopfilter function names 2016-09-29 16:25:42 -07:00
loopfilter_sse2.c Unify loopfilter function names 2016-09-29 16:25:42 -07:00
quantize_avx_x86_64.asm Optimize vpx_quantize_{b,b_32x32} assembler. 2015-10-20 10:11:19 +01:00
quantize_sse2.c highbd x86: consolidate tran_low_t conversions 2017-02-06 10:43:26 -08:00
quantize_ssse3_x86_64.asm quantize ssse3: remove unused pxor 2017-01-30 17:02:57 -08:00
sad4d_avx2.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
sad4d_sse2.asm Code clean of sad4xNx4D_sse 2015-12-17 17:43:46 -08:00
sad_avx2.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
sad_sse2.asm sad_sse2: fix sad4xN(_avg) on windows 2015-12-18 19:19:32 -08:00
sad_sse3.asm Move shared SAD code to vpx_dsp 2015-05-06 16:58:20 -07:00
sad_sse4.asm Move shared SAD code to vpx_dsp 2015-05-06 16:58:20 -07:00
sad_ssse3.asm Move shared SAD code to vpx_dsp 2015-05-06 16:58:20 -07:00
ssim_opt_x86_64.asm ssim: Replace unsigned long with uint32_t. 2015-08-07 11:48:31 -07:00
subpel_variance_sse2.asm Code clean of sub_pixel_variance4xh -- 2 2016-05-24 04:44:05 -07:00
subtract_sse2.asm Use newer x86inc.asm 2015-08-07 16:44:44 -07:00
sum_squares_sse2.c vp9_rdopt: correct size to vpx_sum_squares_2d_i16 2017-03-22 12:04:33 -07:00
txfm_common_sse2.h vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
variance_avx2.c variance_avx2: sync variance functions with c-code 2016-09-19 16:19:29 -07:00
variance_impl_avx2.c variance_impl_avx2: restore table layout 2016-08-12 11:52:53 -07:00
variance_sse2.c Resolve -Wshorten-64-to-32 in variance. 2016-07-28 10:16:31 -07:00
vpx_asm_stubs.c vpx_dsp: apply clang-format 2016-07-25 14:14:19 -07:00
vpx_convolve_copy_sse2.asm Clean CONVERT_TO_BYTEPTR/SHORTPTR in convolve 2017-04-19 12:13:49 -07:00
vpx_high_subpixel_8t_sse2.asm Code refactor on InterpKernel 2015-07-31 10:27:33 -07:00
vpx_high_subpixel_bilinear_sse2.asm Code refactor on InterpKernel 2015-07-31 10:27:33 -07:00
vpx_subpixel_8t_intrin_avx2.c vpx_subpixel_8t_intrin_avx2: tolerate unversioned clang 2016-09-16 07:14:17 +00:00
vpx_subpixel_8t_intrin_ssse3.c add vpx high bitdepth convolve8 NEON intrinsics optimization 2016-10-17 15:23:54 -07:00
vpx_subpixel_8t_sse2.asm Code refactor on InterpKernel 2015-07-31 10:27:33 -07:00
vpx_subpixel_8t_ssse3.asm Update vpx subpixel 1d filter ssse3 asm 2016-06-29 13:48:41 -07:00
vpx_subpixel_bilinear_sse2.asm Code refactor on InterpKernel 2015-07-31 10:27:33 -07:00
vpx_subpixel_bilinear_ssse3.asm improve vpx_filter_block1d* based on replace paddsw+psrlw to pmulhrsw 2016-06-27 17:50:45 +00:00