Debargha Mukherjee
0fc82ea1cf
Refactoring and cosmetic changes to ext-inter expt
...
Change-Id: Icd457480744b7734b3c412c9fed43be738373334
2016-04-05 15:16:18 -07:00
Yi Luo
e6b089be43
Merge "Optimized HBD 4x4 variance calculation" into nextgenv2
2016-04-05 18:41:09 +00:00
Geza Lore
8917146a12
Fix supertx with ext-tile.
...
Change-Id: Ic2135c3812be009085c7c8e8dc15ee2ba618a67e
2016-04-05 12:49:55 +01:00
Angie Chiang
ff8c490b9a
Branch dct to new implementation for bd12
...
Change-Id: I9281935653aacce22ac3100f79fb956c249e2bf3
2016-04-04 12:40:10 -07:00
Yi Luo
250935cab3
Optimized HBD 4x4 variance calculation
...
vpx_highbd_8/10/12_variance4x4_sse4_1 improves performance ~7%-11%.
Change-Id: Ida22bb2a2f7a58037cfd73e186d4f6267a960c02
2016-04-04 11:28:59 -07:00
Angie Chiang
f1060f5bc4
Change dct32x32's range
...
Bitdepth 10/12:
Fit coefficient range into 32 bits
Fit codfficient * const range into 32 bits
Bitdepth 8:
Fit coefficient range into 16 bits
Fit codfficient * constant range into 32 bits
Change-Id: I50b5a3132e8a9f5155c971ab0f6eb52876d2b5ca
2016-04-04 11:21:11 -07:00
Angie Chiang
39b3c025fa
Fit dct's stage range into 32-bit when bitdepth is 12
...
Change-Id: I807e60c6dcacc50c087adcbdb1df022f8541efc5
2016-04-04 11:13:44 -07:00
Geza Lore
f0290cd127
Refactor get_partition to be universal.
...
Change-Id: I3a2fe4073bb94c5afc24d9274e6edcdb3aed934f
2016-04-04 15:22:25 +01:00
Geza Lore
e0dbfdeedc
Minor refactoring of partition type processing.
...
Change-Id: Idcb1e94298d4b7d8832d285548ec2d2ced4b2988
2016-04-04 14:51:10 +01:00
Angie Chiang
75ae90f7a9
Pass tx_type into get_tx_scale
...
Change-Id: I8a8df9fdefa492f66cf2cd29b0b081ad69b5d85e
2016-04-01 12:53:10 -07:00
Alex Converse
d649065ea3
Merge "Remove duplicate ans parameter in bitstream functions." into nextgenv2
2016-04-01 18:35:11 +00:00
Alex Converse
c961bcc594
Merge "ANS experiment: Use ANS everywhere." into nextgenv2
2016-04-01 18:34:50 +00:00
Alex Converse
0f68d80420
Remove duplicate ans parameter in bitstream functions.
...
Change-Id: Icd459209dae328f90c9a875259fe5d201b2a4e45
2016-04-01 11:33:06 -07:00
Alex Converse
fb9186d68d
ANS experiment: Use ANS everywhere.
...
Use ANS for all entropy coded data in VP10 including the compressed header and
modes and motion vectors. ANS tokens continue to be used for DCT tokens.
Change-Id: Idf709a747150601e4d95d81ecfb3dc7253d349df
2016-04-01 11:32:31 -07:00
Debargha Mukherjee
2fba8189de
Merge "Loopfilter fix" into nextgenv2
2016-04-01 17:48:09 +00:00
Angie Chiang
9f879b3c5f
Merge "change vp10_fwd_txfm2d_#x#_sse2 to vp10_fwd_txfm2d_#x#_sse4_1" into nextgenv2
2016-04-01 17:25:23 +00:00
Angie Chiang
2c2b9bd455
Merge "Remove redundant code from vp10_fwd_txfm2d.c" into nextgenv2
2016-04-01 17:25:13 +00:00
Angie Chiang
1b755039c6
Merge "Simplify rounding in vp10_[fwd/inv]_txfm[1/2]d_#x#" into nextgenv2
2016-04-01 17:24:50 +00:00
Angie Chiang
0a9eedfbef
Merge "Add vp10_fwd_txfm2d_sse2" into nextgenv2
2016-04-01 17:24:34 +00:00
Debargha Mukherjee
f7457f5e89
Loopfilter fix
...
Fixes mismatch introduced in
https://chromium-review.googlesource.com/#/c/336645
Change-Id: I15cded221c18dbf87b5029bc464e975d5c7c40e3
2016-03-31 19:57:42 -07:00
Yaowu Xu
a416d5bd2d
Fix a build issue
...
Change-Id: Ifdb32c487632098496bf59fcc76c518f8f0426d2
2016-03-31 16:06:24 -07:00
Debargha Mukherjee
2a6389bb8b
Merge "Fix interpolation values and decouple interintra" into nextgenv2
2016-03-31 21:47:10 +00:00
Debargha Mukherjee
4aac819a18
Fix some static analysis issues with resize
...
Change-Id: I2e8ada1ae95bd5577344d6f898e6ad4723e38fbd
2016-03-31 14:39:34 -07:00
Debargha Mukherjee
2be211e971
Fix interpolation values and decouple interintra
...
Decouples interintra modes and probability models from regular
intra modes, to enable creating/optimizing new interintra modes.
Also, fixes interpolation values for 128x128 interintra and obmc.
Change-Id: I5c2016db49b8f029164e5fe84c6274d4e02ff90e
2016-03-31 12:12:51 -07:00
Debargha Mukherjee
6d3fc82b7f
Merge changes Id20526d0,Iee08d975 into nextgenv2
...
* changes:
Refactor loopfilter level arrays to 2D.
Rename MI_BLOCK_SIZE and MI_MASK macros.
2016-03-31 18:48:20 +00:00
Jingning Han
aae7e0f6a4
Merge "Refactor the sub8x8 block motion search control" into nextgenv2
2016-03-31 15:50:38 +00:00
Geza Lore
10232eda8e
Refactor loopfilter level arrays to 2D.
...
Change-Id: Id20526d0b6d1371dc9f45cb8b5f24b6974da7bc4
2016-03-31 15:52:12 +01:00
Geza Lore
511da8cbe5
Rename MI_BLOCK_SIZE and MI_MASK macros.
...
Rename MI_BLOCK_SIZE.* -> MAX_MIB_SIZE.* (MIB is for MI Block).
Rename MI_MASK.* -> MAX_MIB_MASK.*
There are no functional changes.
This is in preparation for coding the superblock size at the frame
level, which will require some of these constants to become variables.
The new names better reflect future semantics, and hence make the code
clearer.
Change-Id: Iee08d97554cf4cc16a5dc166a3ffd1ab91529992
2016-03-31 09:57:41 +01:00
Alex Converse
615482af92
Merge "Use write_modes_b_wrapper throughout." into nextgenv2
2016-03-31 06:04:03 +00:00
Hui Su
cce6688c31
Merge "Set block size upper bound for Palette mode" into nextgenv2
2016-03-31 00:23:11 +00:00
Geza Lore
9d288bf698
Use write_modes_b_wrapper throughout.
...
Change-Id: Ifbef3aa6e6b0dbc3701a9ef91b8b685a918d84f4
2016-03-30 22:26:54 +00:00
Angie Chiang
c7c40d2329
Generalize txfm scale in highbd quantizer
...
Change-Id: I359aa49c09b244e0d44ebd09442e365a3d22556c
2016-03-30 15:25:26 -07:00
Angie Chiang
25520d8dc3
change vp10_fwd_txfm2d_#x#_sse2 to vp10_fwd_txfm2d_#x#_sse4_1
...
The speed performance for running 20k times is as follows
Notice that the vp10_highbd_fdct#x#_sse2 version is
16-bit version plus range check
The rest are 32-bit version
vp10_fwd_txfm2d_4x4_c (2 ms)
vp10_fwd_txfm2d_8x8_c (9 ms)
vp10_fwd_txfm2d_16x16_c (45 ms)
vp10_fwd_txfm2d_32x32_c (233 ms)
vp10_fwd_txfm2d_4x4_sse4_1 (2 ms)
vp10_fwd_txfm2d_8x8_sse4_1 (3 ms)
vp10_fwd_txfm2d_16x16_sse4_1 (16 ms)
vp10_fwd_txfm2d_32x32_sse4_1 (80 ms)
vp10_highbd_fdct4x4_c (1 ms)
vp10_highbd_fdct8x8_c (3 ms)
vp10_highbd_fdct16x16_c (17 ms)
highbd_fdct32x32_c (160 ms)
vp10_highbd_fdct4x4_sse2 (0 ms)
vp10_highbd_fdct8x8_sse2 (2 ms)
vp10_highbd_fdct16x16_sse2 (8 ms)
highbd_fdct32x32_sse2 (105 ms)
Change-Id: I24daf1e0d4d66e91e4ce61ef71cefa7b70ee90ce
2016-03-30 15:25:26 -07:00
Angie Chiang
c75f64780b
Remove redundant code from vp10_fwd_txfm2d.c
...
Change-Id: I87ae5e93957616c0f5160a4f679e42f77092c33f
2016-03-30 15:25:26 -07:00
Angie Chiang
f2b311f580
Simplify rounding in vp10_[fwd/inv]_txfm[1/2]d_#x#
...
Change-Id: I24ce46e157dc5b9c0d75000a1a48e9c136ed4ee1
2016-03-30 15:25:26 -07:00
Angie Chiang
11d2bb5429
Add vp10_fwd_txfm2d_sse2
...
Change-Id: Idfbe3c7f5a7eb799c03968171006f21bf3d96091
2016-03-30 15:25:26 -07:00
Angie Chiang
64413a6ca7
Parameterize transform scale for quantizer
...
This is to facilitate changing transform scale later
Change-Id: Ic8ca5afba57d2489ebd191ccc40c1b31605a0d8c
2016-03-30 15:25:26 -07:00
hui su
cbb8be769d
Set block size upper bound for Palette mode
...
Avoid buffer overflow in case of such new experiments as
128 x 128 superblock size.
Change-Id: Ib775f3925a85fc87227c0ddd9b6a6110a12ef196
2016-03-30 14:39:44 -07:00
Debargha Mukherjee
8d3a4aa891
Some fixes/speed-ups on inter-intra part of ext-inter
...
Fixes an issue with rectangular inter-intra blocks.
Includes various other refactoring and cleanups to enable fast mixing
of inter and intra predictors.
Uses only the best single inter reference so far for the inter-intra
search.
About 30% speed-up with a 0.1% hit in performance.
This is part one of overhauling on the ext-inter experiment. To be
continued in subsequent patches.
Change-Id: Id10ee100c78c6e00009a3a4f930a4435ef403a95
2016-03-30 14:39:29 -07:00
Debargha Mukherjee
91707ac79e
Merge "Extend superblock size fo 128x128 pixels." into nextgenv2
2016-03-30 20:55:32 +00:00
Geza Lore
552d5cd715
Extend superblock size fo 128x128 pixels.
...
If --enable-ext-partition is used at build time, the superblock size
(sometimes also referred to as coding unit (CU) size) is extended to
128x128 pixels.
Change-Id: Ie09cec6b7e8d765b7555ff5d80974aab60803f3a
2016-03-30 18:23:06 +01:00
Debargha Mukherjee
e467627f33
Merge "Fix for ext_interp experiment" into nextgenv2
2016-03-30 14:44:39 +00:00
Jingning Han
b6238b413e
Refactor the sub8x8 block motion search control
...
Change-Id: Ia340e66e0a61403070adf8e4f18f00eab143f8f7
2016-03-29 09:53:55 -07:00
Hui Su
aa6f5724ec
Merge "Palette mode: record selected transform type" into nextgenv2
2016-03-29 16:23:07 +00:00
Alex Converse
21ce8b9671
Merge "Force the VPX boolcoder trees in the ANS test." into nextgenv2
2016-03-29 16:23:00 +00:00
Yaowu Xu
37241e6f95
Merge "Merge branch 'masterbase' into nextgenv2" into nextgenv2
2016-03-29 16:05:53 +00:00
Julia Robson
068e799459
Fix for ext_interp experiment
...
Amends previous commit to also handle subsampling correctly.
Change ID of prev commit: I6b07e6cf9b287ba4b5bd6599af4a7412e50b3bdc
Was causing occassional failures for 422 streams due to accessing
elements beyond the extent of the bmi array.
Change-Id: I37ebabf4c01ca84bcd1851428172bdf753805d98
2016-03-29 16:09:49 +01:00
hui su
4ab00912c4
Palette mode: record selected transform type
...
Change-Id: I4c3d3224571176ac924d79ddfaba56990fc4000e
2016-03-28 20:43:59 -07:00
Jingning Han
78ee83125b
Merge "Fix a rdcost computation issue in sub8x8 block mode search" into nextgenv2
2016-03-29 00:51:01 +00:00
Yaowu Xu
c810740c36
Merge branch 'masterbase' into nextgenv2
...
Conflicts:
vp9/encoder/vp9_encoder.c
vpx_dsp/x86/convolve.h
Change-Id: I60c3532936bedd796a75dfe78245a95ec21e2e55
2016-03-28 17:44:28 -07:00