generic-library/vpx

Author	SHA1	Message	Date
Jingning Han	08a453b9de	Replace vp9_ prefix with vpx_ prefix in vpx_dsp function names This commit clears the function naming convention in vpx_dsp. It replaces vp9_ prefix of global functions with vpx_ prefix. It also removes the vp9_ prefix from static functions. Change-Id: I6394359a63b71a51dda01342eec6a3cc08dfeedf	2015-08-04 13:46:11 -07:00
Jingning Han	bfad9d2fe6	Move inverse transfrom dspr2 functions from vp9 to vpx_dsp Change-Id: Ia9cf7c31cab4ba3dd6b9bb668c4b3e84bd55cf69	2015-08-03 11:59:50 -07:00
Jingning Han	7723b8df6a	Merge "Add common_dspr2.c file to vpx_dsp/mips"	2015-08-03 18:58:37 +00:00
Yaowu Xu	9d9273726b	Merge "Correct the allocation size for ssim_vars"	2015-08-03 18:43:55 +00:00
Jingning Han	92b08f516a	Add common_dspr2.c file to vpx_dsp/mips Move the declaration of commonly referenced variable to vpx_dsp/mips/common_dspr2.c. Change-Id: Ia51287b02e2ac5cfae0fba98c721f0810618f28e	2015-08-03 10:53:47 -07:00
Yaowu Xu	8f089cbd2e	Correct the allocation size for ssim_vars Ssim_vars is used to accumulate stats based 4x4 pixel blocks, this commit changes the allocations size to be based on mi_rows and mi_cols to avoid out-of-bound memory access for larger size videos. The hard coded 720x480 can only work for image size up to 2880x1920. Change-Id: Id9d07f3f777385b448ac88a6034b7472e4cf3c79	2015-08-03 10:46:12 -07:00
Jingning Han	a68356202d	Remove vpx_ prefix from the dspr2 file name in vpx_dsp/mips Make it consistent with other formats. Change-Id: I28f0d05ff7c5bf2b815989b3f1bd6c6b25608677	2015-08-03 09:59:14 -07:00
Jingning Han	d10fc5af8f	Merge "Add vpx_dsp_rtcd.h to inv_txfm_sse2.c"	2015-08-03 16:03:09 +00:00
Jingning Han	b096db5ad4	Merge "Remove vp9_common.h from idct16x16_neon.c"	2015-08-03 16:03:02 +00:00
Jingning Han	0b0eba728d	Add _dspr2 to local function names It avoids symbol conflicts between function names of various implementation versions. Change-Id: Iad79ebcb8e289457801812a7745c8380b5b06a46	2015-08-02 20:21:59 -07:00
Jingning Han	da7dc59837	Merge "Factor out mips/msa inverse transform implementations"	2015-08-03 03:18:39 +00:00
Jingning Han	0fcfc613c6	Merge "Add x86inc flag guard to inv_txfm_sse2.asm"	2015-08-02 21:56:09 +00:00
Jingning Han	6eabf229e2	Remove vp9_common.h from idct16x16_neon.c Change-Id: I3df35a99900ef8ce549d315866849a10db1a4c7b	2015-08-02 09:57:25 -07:00
Jingning Han	4f7a7d29fa	Add x86inc flag guard to inv_txfm_sse2.asm Fix the VS build failure. Change-Id: I4fb9d1c83980c4b52d5a848a9cb02ec72493dccb	2015-08-02 08:43:51 -07:00
Jingning Han	80ae856c8b	Add vpx_dsp_rtcd.h to inv_txfm_sse2.c Change-Id: Ibab434fb4bd6da02dba087582ed74811f555c3ed	2015-08-02 08:25:13 -07:00
James Zern	22a8474fe7	vpx_convolve_copy_sse2: fix win64 xmm6-7 need to be stored Change-Id: I6c51559598d335946ec91be6246b49589c63b724	2015-08-01 11:45:49 -07:00
Jingning Han	44849516d4	Factor out mips/msa inverse transform implementations Move mips/msa inverse transform implementations from vp9 folder to vpx_dsp. Change-Id: Ic4cf3f05247c3c63db7b532a0e5000017a962391	2015-08-01 09:25:12 -07:00
Jingning Han	b37494cfb5	Merge "Use precise header files in inverse transform msa implementations"	2015-08-01 16:20:43 +00:00
Jingning Han	b4c7d0523a	Merge "Factor inverse transform functions into vpx_dsp"	2015-08-01 16:20:24 +00:00
Parag Salasakar	c1b233dd43	Merge "mips msa vp8 temporal filter optimization"	2015-08-01 02:12:20 +00:00
Jingning Han	4dc390b15d	Merge "Add dynamic range notes to vp9_vector_var_c"	2015-08-01 01:01:37 +00:00
Aℓex Converse	fd22c492f7	Merge "Turn off simple_model_rd_from_var at speed 4."	2015-07-31 23:51:01 +00:00
Jingning Han	36a9a33b90	Add dynamic range notes to vp9_vector_var_c Change-Id: If536ad31046ecd9e2ecd9c21f52f8192c8153ad7	2015-07-31 16:42:09 -07:00
Jingning Han	56c2cb7553	Use precise header files in inverse transform msa implementations Change-Id: Ie8a79d9e2837842c3f60776b661cd42782b108d5	2015-07-31 23:24:54 +00:00
James Zern	d8642d831f	Merge "VP9_COPY_CONVOLVE_SSE2 optimization"	2015-07-31 23:22:34 +00:00
Jingning Han	e8b133c79c	Factor inverse transform functions into vpx_dsp This commit moves the module inverse transform functions from vp9 to vpx_dsp folder. The hybrid transform wrapper functions stay in the vp9 folder, since it involves codec-specific data structures. Change-Id: Ib066367c953d3d024c73ba65157bbd70a95c9ef8	2015-07-31 16:21:00 -07:00
Alex Converse	af6d2c7d42	Turn off simple_model_rd_from_var at speed 4. This got erroneously changed during the refactor. This fixes SvcTest.TwoPassEncode2TemporalLayersWithMultipleFrameContextsAndTiles. Change-Id: Ifa5ab0e098396c5e2d10478db87df256eadfa4c7	2015-07-31 15:50:17 -07:00
James Zern	e184b613b9	Merge changes Iecdbbc34,I8b4db93f * changes: Android.mk: fix *_rtcd.h deps for armeabi-v7a Android.mk: add a dep on vpx_config.asm for x86_64	2015-07-31 22:22:48 +00:00
Scott LaVarnway	a5e97d874b	VP9_COPY_CONVOLVE_SSE2 optimization This function suffers from a couple problems in small core(tablets): -The load of the next iteration is blocked by the store of previous iteration -4k aliasing (between future store and older loads) -current small core machine are in-order machine and because of it the store will spin the rehabQ until the load is finished fixed by: - prefetching 2 lines ahead - unroll copy of 2 rows of block - pre-load all xmm regiters before the loop, final stores after the loop The function is optimized by: copy_convolve_sse2 64x64 - 16% copy_convolve_sse2 32x32 - 52% copy_convolve_sse2 16x16 - 6% copy_convolve_sse2 8x8 - 2.5% copy_convolve_sse2 4x4 - 2.7% credit goes to Tom Craver(tom.r.craver@intel.com) and Ilya Albrekht(ilya.albrekht@intel.com) Change-Id: I63d3428799c50b2bf7b5677c8268bacb9fc29671	2015-07-31 14:51:51 -07:00
Jingning Han	6025c6d65b	Merge "Fix compiler warning in mips/dspr2"	2015-07-31 21:29:50 +00:00
Aℓex Converse	dd4b416412	Merge "Compute skippable inside the block_rd_txfm loop."	2015-07-31 21:19:11 +00:00
Jingning Han	135b43ccf3	Fix compiler warning in mips/dspr2 This commit fixes the mix declaration and definition warning when mips/dspr2 is turned on. Change-Id: I633d6fe42368b9ac35b106786ebac6969ad53552	2015-07-31 12:34:34 -07:00
Aℓex Converse	90e563d91f	Merge changes Ic1ce346a,Ic0b4e92c * changes: Simplify model_rd_for_sb HBD ifdefs Simplify dist_block HBD ifdefs	2015-07-31 19:05:54 +00:00
Alex Converse	ab20c98e84	Compute skippable inside the block_rd_txfm loop. Change-Id: Iaa43aeeb7a2074495e00cdb83bb551c3f13d3ed2	2015-07-31 11:45:59 -07:00
Zoe Liu	7f8dd35329	Merge "Refactor mips/dspr2 on convolution."	2015-07-31 18:23:19 +00:00
Zoe Liu	873a158f14	Merge "Code refactor on InterpKernel"	2015-07-31 18:20:14 +00:00
Alex Converse	c62228f273	Simplify model_rd_for_sb HBD ifdefs Change-Id: Ic1ce346a053800ae3b2d77178f46e6a388357f6d	2015-07-31 11:16:59 -07:00
Alex Converse	da9c73c293	Simplify dist_block HBD ifdefs Change-Id: Ic0b4e92cbaf813bcca8a8e9052c936c2e025e114	2015-07-31 11:04:01 -07:00
Aℓex Converse	8abd0c2a12	Merge "Short circuit rate_block in block_rd_txfm."	2015-07-31 17:59:22 +00:00
Zoe Liu	7cfdc00337	Refactor mips/dspr2 on convolution. Change-Id: If59a39d5a92c261537342726f94bb7f7f26dfff3	2015-07-31 10:27:42 -07:00
Zoe Liu	7186a2dd86	Code refactor on InterpKernel It in essence refactors the code for both the interpolation filtering and the convolution. This change includes the moving of all the files as well as the changing of the code from vp9_ prefix to vpx_ prefix accordingly, for underneath architectures: (1) x86; (2) arm/neon; and (3) mips/msa. The work on mips/drsp2 will be done in a separate change list. Change-Id: Ic3ce7fb7f81210db7628b373c73553db68793c46	2015-07-31 10:27:33 -07:00
Alex Converse	4ac5058afc	Give skip_txfm constants names. This is using a define instead of an enum to keep byte packing. Change-Id: I3abb07c8bfe377e19be4531b624af7b7b4207792	2015-07-31 10:08:08 -07:00
Alex Converse	73422d3b2d	Short circuit rate_block in block_rd_txfm. Don't run rate_block (cost_coeffs) if distortion alone is enough to surpass best_rd. This decreases 2nd pass runtime on HD at speed 2 by about 2%. There is zero effect on output if tx_cache is removed. Change-Id: Ia3b1cc77bfbe6ee988c395fde06c0eb92940b784	2015-07-31 10:05:51 -07:00
Parag Salasakar	8fbc641540	mips msa vp8 temporal filter optimization average improvement ~2x-3x Change-Id: I05593bed583234dc7809aaec6cab82773a29505d	2015-07-31 12:03:19 +05:30
Parag Salasakar	0e3f494b21	mips msa vp8 block subtract optimization average improvement ~2x-3x Change-Id: I30abf4c92cddcc9e87b7a40d4106076e1ec701c2	2015-07-31 09:29:10 +05:30
Parag Salasakar	e3ee8c292b	Merge "mips msa vp8 quantize optimization"	2015-07-31 03:44:03 +00:00
Yunqing Wang	3b2e73b9a4	Remove tx cache and speed up tx size selection 1. The RD scores obtained during the tx size selection were stored in the tx cache, and used to help make the tx decision for the following frames. This wasn't used anymore in VP9 encoder. Recovered the related decision making code from 1.5+ years ago, and borg tests didn't show any quality gain. This patch removed it to lower the complexity. 2. An optimization was done after the above refactoring. If the tx_mode is not TX_MODE_SELECT, we only need to test the chosen tx size instead of all posible tx sizes. This gave a 1.5% average speed gain at speed 2, and a 1% average speed gain at speed 3. Change-Id: Id8cd650e066a8cef33829d8c15388a8138adc78c	2015-07-30 18:53:40 -07:00
Aℓex Converse	eb6b443bd2	Merge "Convert simple_model_rd_from_var from a speed check to a speed feature."	2015-07-30 23:04:28 +00:00
Hui Su	a71c5c0ee9	Merge "Exclude vpx intra prediction functions in vp8-only build"	2015-07-30 22:29:35 +00:00
Alex Converse	c827c59eaf	Convert simple_model_rd_from_var from a speed check to a speed feature. Change-Id: I8877025e172fff29bc4e270790211463b676b4d7	2015-07-30 13:53:26 -07:00

1 2 3 4 5 ...

13943 Commits