25 Commits

Yushin Cho
77bba8d30a New experiment: Perceptual Vector Quantization from Daala
PVQ replaces the scalar quantizer and coefficient coding with a new
design originally developed in Daala. It currently depends on the
Daala entropy coder although it could be adapted to work with another
entropy coder if needed:
./configure --enable-experimental --enable-daala_ec --enable-pvq
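
As a rough illustration of the gain-shape idea behind PVQ (this sketch, its
names, the greedy pulse search, and the gain quantizer step are assumptions
for the example, not the code in av1/common/pvq.c):

  /* Conceptual gain-shape (pyramid) vector quantization sketch. */
  #include <math.h>
  #include <stdlib.h>

  /* Quantize the n coefficients in x as a scalar gain plus a "shape":
   * an integer vector y whose absolute values sum to k pulses. */
  static void pvq_sketch(const double *x, int n, int k, double gain_step,
                         int *qgain, int *y) {
    double gain = 0, xy = 0, yy = 0;
    for (int i = 0; i < n; i++) gain += x[i] * x[i];
    gain = sqrt(gain);
    *qgain = (int)floor(gain / gain_step + 0.5);  /* scalar-quantized gain */
    for (int i = 0; i < n; i++) y[i] = 0;
    /* Greedy pulse search: place k unit pulses so that y points in nearly
     * the same direction as x, i.e. maximize (x.y)^2 / (y.y). */
    for (int p = 0; p < k; p++) {
      int best = 0;
      double best_score = -1;
      for (int i = 0; i < n; i++) {
        double num = xy + fabs(x[i]);
        double den = yy + 2 * abs(y[i]) + 1;
        double score = num * num / den;
        if (score > best_score) { best_score = score; best = i; }
      }
      xy += fabs(x[best]);
      yy += 2 * abs(y[best]) + 1;
      y[best] += (x[best] < 0) ? -1 : 1;
    }
  }

The decoder side would dequantize the gain and rescale the shape vector; the
real design described in the links below adds prediction via a Householder
reflection and activity masking, which this sketch omits.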

The version of PVQ in this commit is adapted from the following
revision of Daala:
fb51c1ade6

More information about PVQ:
- https://people.xiph.org/~jm/daala/pvq_demo/
- https://jmvalin.ca/papers/spie_pvq.pdf

The following files are copied as-is from Daala with minimal
adaptations; therefore, we disable clang-format on those files
to make it easier to synchronize the AV1 and Daala codebases in the future:
 av1/common/generic_code.c
 av1/common/generic_code.h
 av1/common/laplace_tables.c
 av1/common/partition.c
 av1/common/partition.h
 av1/common/pvq.c
 av1/common/pvq.h
 av1/common/state.c
 av1/common/state.h
 av1/common/zigzag.h
 av1/common/zigzag16.c
 av1/common/zigzag32.c
 av1/common/zigzag4.c
 av1/common/zigzag64.c
 av1/common/zigzag8.c
 av1/decoder/decint.h
 av1/decoder/generic_decoder.c
 av1/decoder/laplace_decoder.c
 av1/decoder/pvq_decoder.c
 av1/decoder/pvq_decoder.h
 av1/encoder/daala_compat_enc.c
 av1/encoder/encint.h
 av1/encoder/generic_encoder.c
 av1/encoder/laplace_encoder.c
 av1/encoder/pvq_encoder.c
 av1/encoder/pvq_encoder.h

Known issues:
- Lossless mode is not supported; '--lossless=1' will give the same result as
'--end-usage=q --cq-level=1'.
- High bit depth is not supported by PVQ.

Change-Id: I1ae0d6517b87f4c1ccea944b2e12dc906979f25e
2016-11-06 22:18:01 -08:00
Debargha Mukherjee
6a47cff882 Further work on 64x64 fwd/inv transform support
For higher level fwd and inv transform functions.

Change-Id: I91518250a0be7d94aada7519f6c9e7ed024574fb
2016-11-03 14:32:54 -07:00
Jingning Han
9fe31390ca Support rectangular tx_size in the common lib
Change-Id: I4128ab932a967a3d657bb1f95f0fa2af20a06469
2016-11-02 11:48:31 -07:00
Debargha Mukherjee
67d134772c Adding 64x64 forward and inverse transforms
Change-Id: I213f3111fc0656aecd1303a8b871ecded2b92bc2
2016-11-02 09:48:46 -07:00
Jingning Han
6a503e4110 Merge "Make rectangular transform block available in the common lib" into nextgenv2
2016-11-02 16:17:00 +00:00
Jingning Han
ec419e0771 Make rectangular transform block available in the common lib
This prepares the integration of rectangular transform block size
with recursive transform block partition system.

Change-Id: Id96aa3790dace15619c665f438241938992d1730
2016-11-01 22:25:54 -07:00
Yi Luo
fb77385fd0 Merge "Remove unused copies of transform related source code" into nextgenv2
2016-11-02 01:43:19 +00:00
Yi Luo
ea1167c33f Remove unused copies of transform related source code
- Library size reduction: 165 kB, or 292 kB with HBD.

Change-Id: I50cb630dde326bd2a28c0db4b7e2d53c2fd94a2a
2016-11-01 15:07:46 -07:00
Yi Luo
7317200002 Hybrid inverse transforms 16x16 AVX2 optimization
- Add unit tests to verify the bit-exact result.
- User level time reduction (EXT_TX):
    encoder: 3.63%
    decoder: 2.36%
- Also add tx_type=V_DCT...H_FLIPADST SSE2 for 16x16 inv txfm.

Change-Id: Idc6d9e8254aa536e5f18a87fa0d37c6bd551c083
2016-11-01 13:38:20 -07:00
Yi Luo
e4abb97ba3 Merge "Fix the overflow of av1_fht32x32() in 2D DCT_DCT" into nextgenv2
2016-10-21 16:13:18 +00:00
Yi Luo
157e45a44b Fix the overflow of av1_fht32x32() in 2D DCT_DCT
- Use a range check function to avoid DCT_DCT overflow (a minimal sketch of
  such a check follows this list). The column txfm scaling/rounding still
  needs to be re-developed; for now we prefer to maintain the current
  BDRate level.
- Encoder user level time reduction <1% owing to av1_fht32x32_avx2.
- Add MemCheck unit test and fdct32() unit test.
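
A minimal sketch of this kind of intermediate range check (a hypothetical
helper for illustration, not the actual function in the codebase):

  #include <assert.h>
  #include <stdint.h>

  /* Assert that every intermediate value fits in a signed `bit`-bit range,
   * so that later transform stages cannot overflow. */
  static void range_check_sketch(const int32_t *buf, int size, int bit) {
    for (int i = 0; i < size; i++) {
      assert(buf[i] >= -(1 << (bit - 1)) && buf[i] < (1 << (bit - 1)));
    }
  }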

Change-Id: I1e67030f67bc637859798ebe2f6698afffb8531c
2016-10-20 09:22:24 -07:00
hui su
5db9743fbb Separate FILTER_INTRA from EXT_INTRA experiment
Prepare for the av1/nextgenv2 merge.

Coding gain (%):

               lowres     midres
ext-intra       0.69       0.97
filter-intra    0.67       0.83
both            1.05       1.48

Change-Id: Ia24d6fafb3e484c4f92192e0b7eee5e39f4f4ee6
2016-10-19 21:40:49 -07:00
Yaowu Xu
0dd046371f Fix build issues when --enable-aom-qm
Change-Id: I1a462675c06c4b2a5f8b4b347f23fec67feccdd0
2016-10-19 12:26:53 -07:00
Debargha Mukherjee
a720f4b3b5 Merge "Add sse2 forward and inverse 16x32 and 32x16 transforms" into nextgenv2
2016-10-14 02:49:20 +00:00
Yaowu Xu
98e9ce923b Merge "Add SSE4.1 code for deringing functions." into nextgenv2
2016-10-13 18:02:59 +00:00
Michael Bebenita
7227b65c4c Add SSE4.1 code for deringing functions.
Change-Id: I363f7fb610a5c86ea9f417e34b57c6373af877e5
2016-10-13 18:02:19 +00:00
David Barker
33231d4801 Add sse2 forward and inverse 16x32 and 32x16 transforms
Change-Id: I1241257430f1e08ead1ce0f31db8272b50783102
2016-10-13 14:01:22 +01:00
Yi Luo
fed8e1c06d Hybrid forward transform 32x32 AVX2 optimization
- av1_fht32x32 AVX2 function level time reduction ~89% compared to C.

- av1_fht32x32_avx2() on DCT_DCT improves 42.62% over aom_fdct32x32_avx2(),
  but function replacement must go together with the corresponding inverse txfm.

- No obvious user level time reduction due to 32x32 TX_TYPE selection.

- Zero high 128b YMM to avoid AVX-SSE transition penalties
  (fix 16x16 case); see the sketch after this list.

- Added 32x32 AVX2 unit tests to verify bit-exactness.

- AVX2 optimization summary:
  On CPU i7-6700, based on 16x16/32x32 fwd txfm optimization results:
  C to AVX2: function level time reduction, ~86-89%.
  SSE2 to AVX2: function level time reduction, ~51%.
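
On the AVX-SSE transition point above, a generic illustration (not a libaom
function) of ending an AVX2 routine with _mm256_zeroupper() so that
following legacy-SSE code does not pay the transition penalty:

  #include <immintrin.h>
  #include <stdint.h>

  /* Illustrative only: add eight 32-bit lanes at a time with AVX2, then
   * clear the upper halves of the YMM registers before returning. */
  void add_i32_avx2(const int32_t *a, const int32_t *b, int32_t *out, int n) {
    int i = 0;
    for (; i + 8 <= n; i += 8) {
      __m256i va = _mm256_loadu_si256((const __m256i *)(a + i));
      __m256i vb = _mm256_loadu_si256((const __m256i *)(b + i));
      _mm256_storeu_si256((__m256i *)(out + i), _mm256_add_epi32(va, vb));
    }
    for (; i < n; i++) out[i] = a[i] + b[i];
    _mm256_zeroupper();  /* avoids the AVX-SSE transition penalty afterwards */
  }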

Change-Id: Idd0cd8bf066a61c7117140ef15ab6c1f8eb4b036
2016-10-12 14:19:53 -07:00
David Barker
4d03d6fc6f Add sse2 forward / inverse 4x8 and 8x4 transforms
Change-Id: I89ed93fb20cf975c2b463cff58879521ceaa4163
2016-10-10 09:02:45 -07:00
Yi Luo
3a8217f21b Merge "Hybrid forward transforms 16x16 AVX2 optimization" into nextgenv2
2016-10-07 01:52:11 +00:00
Yi Luo
e8e8cd8f1b Hybrid forward transforms 16x16 AVX2 optimization
- Unit tests are added for AVX2 SIMD.
- Encoder speed improvement:
  AV1 baseline and EXT_TX, three 1080p sequences at 800 Kbps, 2 Mbps,
  and 6 Mbps, on an i7-6700 CPU; average user-level time reduction: 3.86%.

Change-Id: Ibbd7837ee3a831c6b1e4e471bf6c8d3fa3a19ff4
2016-10-06 15:33:15 -07:00
Peter de Rivaz
1baecfeb03 Added sse2 inverse 8x16 and 16x8 transforms
Change-Id: I43628407b11e5c8e6af4df69f2acdc67ac827834
2016-10-06 11:23:14 -07:00
Geza Lore
1a800f6539 Add SSE2 versions of av1_fht8x16 and av1_fht16x8
Encoder speedup ~2% with ext-tx + rect-tx

Change-Id: Id56ddf102a887de31d181bde6d8ef8c4f03da945
2016-09-09 11:29:41 -07:00
James Zern
9fa47587d9 fix 'dist' & other decode-only builds
common/av1_fwd_txfm.[hc] are encode-only; add a TODO to relocate them

Change-Id: I28cf8d0b22632b04066bcb72f3d2252ee7eb153e
2016-09-08 14:53:42 +00:00
Yaowu Xu
f883b42cab Port renaming changes from AOMedia
Cherry-Picked the following commits:
0defd8f Changed "WebM" to "AOMedia" & "webm" to "aomedia"
54e6676 Replace "VPx" by "AVx"
5082a36 Change "Vpx" to "Avx"
7df44f1 Replace "Vp9" w/ "Av1"
967f722 Remove kVp9CodecId
828f30c Change "Vp8" to "AOM"
030b5ff AUTHORS regenerated
2524cae Add ref-mv experimental flag
016762b Change copyright notice to AOMedia form
81e5526 Replace vp9 w/ av1
9b94565 Add missing files
fa8ca9f Change "vp9" to "av1"
ec838b7  Convert "vp8" to "aom"
80edfa0 Change "VP9" to "AV1"
d1a11fb Change "vp8" to "aom"
7b58251 Point to WebM test data
dd1a5c8 Replace "VP8" with "AOM"
ff00fc0 Change "VPX" to "AOM"
01dee0b Change "vp10" to "av1" in source code
cebe6f0 Convert "vpx" to "aom"
17b0567 rename vp10*.mk to av1_*.mk
fe5f8a8 rename files vp10_* to av1_*

Change-Id: I6fc3d18eb11fc171e46140c836ad5339cf6c9419
2016-08-31 18:19:03 -07:00