Yaowu Xu
1309679e96
Merge "Rename quantize_sse2.c to quantize_sse2_intrinsics.c to avoid collision." into experimental
2013-03-04 13:13:29 -08:00
Ronald S. Bultje
f60f6db716
Rename quantize_sse2.c to quantize_sse2_intrinsics.c to avoid collision.
...
Change-Id: I5637d491eb6a9b7633f72e03fd9df72131eeb121
2013-03-04 12:25:01 -08:00
Yunqing Wang
e8bc9f4220
Optimize vp9_short_idct4x4llm function
...
Wrote a SSE2 vp9_short_idct4x4llm to improve the decoder
performance.
Change-Id: I90b9d48c4bf37aaf47995bffe7e584e6d4a2c000
2013-03-04 12:01:27 -08:00
James Zern
a5762191d7
Merge "vp8_init: fix NULL dereference on allocation error"
2013-03-04 11:10:06 -08:00
Jingning Han
5957b2b514
Support 16K sequence coding
...
Fixed a couple of variable/function definitions, as well as header
handling to support 16K sequence coding at high bit-rates.
The width and height are each specified by two bytes in the header.
Use an extra byte to explicitly indicate the scaling factors in
both directions, each ranging from 0 to 15.
Tested coding up to 16400x16400 dimension.
Change-Id: Ibc2225c6036620270f2c0cf5172d1760aaec10ec
2013-03-04 11:08:41 -08:00
James Zern
798c5b148d
vp8_init: fix NULL dereference on allocation error
...
Change-Id: I15a6cd014fce8090cdb0441723bd1a90b562579c
2013-03-02 14:46:06 -08:00
James Zern
08348d9cab
prefix vp8 asm_{com,dec,enc}_offsets files
...
make them symmetrical with the generated output and their vp9
counterparts
Change-Id: I72cc97c4d33d713dff620a6d7cc25955266216fc
2013-03-02 14:45:40 -08:00
John Koleszar
2d3e879fcc
Merge changes If5896507,I06b5ba5c,I2712f99e into experimental
...
* changes:
Add unit test for x4 multi-SAD functions
Add VP9 1 block SAD functions to unit test
Merge master branch into experimental
2013-03-01 20:50:33 -08:00
John Koleszar
1cfc86ebe0
Add unit test for x4 multi-SAD functions
...
Update the function prototypes to match between VP9 and VP8.
Change-Id: If58965073989e87df3b62b67a030ec6ce23ca04f
2013-03-01 18:14:02 -08:00
John Koleszar
6b653cba02
Add VP9 1 block SAD functions to unit test
...
Change-Id: I06b5ba5c457944cfa4cd9f53c3bd8cda132439c2
2013-03-01 18:04:19 -08:00
Dmitry Kovalev
b5a9795d25
Code cleanup and simplification of build_4x4uvmvs function.
...
Change-Id: Iab0176f058045181821ded95ff1cf423af1625f9
2013-03-01 17:50:55 -08:00
Dmitry Kovalev
135428e954
Code cleanup.
...
Removing redundant 'extern' keyword, lowercase variable names.
Change-Id: I608e8d8579aba8981f5fac3493f77b4481b13808
2013-03-01 17:39:31 -08:00
Johann
403145032d
Merge "Use intrinsics for sse2 regular quantize"
2013-03-01 17:20:26 -08:00
Ronald S. Bultje
e189edfeb1
Initialize pass variable in tile test.
...
Change-Id: I7977694223521404fc69f29ae2cff03e36e87299
2013-03-01 12:43:10 -08:00
John Koleszar
69c67c9531
Merge master branch into experimental
...
Picks up some build system changes, compiler warning fixes, etc.
Change-Id: I2712f99e653502818a101a72696ad54018152d4e
2013-03-01 11:06:05 -08:00
Yaowu Xu
db4dc6f0c0
Merge "Adjust the max_gf_interval initialization" into experimental
2013-03-01 11:02:23 -08:00
Yunqing Wang
67dbc8fe55
Merge "Add eob<=10 case in idct32x32" into experimental
2013-03-01 08:58:19 -08:00
Yaowu Xu
cea8cd08d3
Adjust the max_gf_interval initialization
...
to be a fixed value of 15.
Test results:
cif: .124%, .068%, .081%
std-hd: 2.809%, 3.174%, 2.705%
Change-Id: I380c8152c973506094da15eab59e3aa22b75a983
2013-03-01 06:38:35 -08:00
Johann
eca59cad0b
Use intrinsics for sse2 regular quantize
...
Remove dependency of this function on asm_offsets. ssse3/sse4 next.
Change quant_shift calculation so it be done using SIMD. Pre-calculate
as much as possible to simplify EOB selection.
Take advantage of qcoeff being zero'd by tying the if statements
together.
Speed parity with previous implementation with gcc x86_64 linux
Change-Id: Ife97556a1eca3a74b09def1a3d04084974dff1fb
2013-02-28 18:06:15 -08:00
Dmitry Kovalev
852ca19e4b
Merge "Code cleanup." into experimental
2013-02-28 17:22:51 -08:00
Yunqing Wang
c550bb3b09
Add eob<=10 case in idct32x32
...
Simplified idct32x32 calculation when there are only 10 or less
non-zero coefficients in 32x32 block. This helps the decoder
performance.
Change-Id: If7f8893d27b64a9892b4b2621a37fdf4ac0c2a6d
2013-02-28 16:40:29 -08:00
Dmitry Kovalev
253886413a
Merge changes I9be9c990,Ic3b97339 into experimental
...
* changes:
Ignoring test video sequences in the source tree.
Code cleanup.
2013-02-28 16:07:45 -08:00
James Zern
a07bed2b2b
firstpass.c: correct casting around gf_group_bits
...
gf_group_bits is int64_t remove casts to int.
Change-Id: I3b4225905041fac9af9fdfcbcb6f1c357ea4b593
2013-02-28 15:45:29 -08:00
John Koleszar
17c221687f
Merge "Fix use of uninitialized memory in CONFIG_ABOVESPREFMV" into experimental
2013-02-28 15:18:50 -08:00
Jim Bankoski
078f5bf439
Merge "mv dct_sse2.c dct_sse2_intrinsics.c to avoid collision" into experimental
2013-02-28 15:16:44 -08:00
Dmitry Kovalev
b8d19664fc
Ignoring test video sequences in the source tree.
...
Adding /*.ivf and /*.ivf.md5 filename patterns to .gitignore.
Change-Id: I9be9c99023e6b59bcc9cfbb6c9d6cdc8e500fc4f
2013-02-28 15:10:48 -08:00
Dmitry Kovalev
dcbdda8e15
Code cleanup.
...
Lower case variable names, converting while loops to for loops.
Change-Id: Ic3b973391eef7472a99d18d02fe79cfef5e04e62
2013-02-28 14:40:20 -08:00
Yunqing Wang
72b146690a
Merge "Refactor vp9_dequant_idct_add function" into experimental
2013-02-28 14:34:27 -08:00
Yunqing Wang
6193bc3ba8
Refactor vp9_dequant_idct_add function
...
Provided a wrapper and removed duplicate code.
Change-Id: Iaef842226ec348422e459202793b001d0983ea30
2013-02-28 14:18:46 -08:00
Scott LaVarnway
aa8fb070b8
Removed vp9_dequantize_b
...
Change-Id: Ie89bd00d58e30bf4094cb748a282f1dfa81a31d8
2013-02-28 14:08:12 -08:00
Jim Bankoski
8f270acfb2
mv dct_sse2.c dct_sse2_intrinsics.c to avoid collision
...
Change-Id: Id786be31da3c91d95d2955aa569ecdc6e66650df
2013-02-28 13:58:15 -08:00
John Koleszar
2eab4372fc
Fix use of uninitialized memory in CONFIG_ABOVESPREFMV
...
The ABOVESPREFMV experiment uses four pixels to the left of the
current block, which don't exist for the left-most column.
Change-Id: I4cf0b42ae8f54c0b3e7b1ed8755704b74fafc39c
2013-02-28 13:48:58 -08:00
Dmitry Kovalev
40fec9b588
Merge "Dequantization code cleanup." into experimental
2013-02-28 13:46:43 -08:00
Dmitry Kovalev
c43906e2e9
Dequantization code cleanup.
...
Removing redundant variables, using x *= y instead x = x * y, moving
variable declarations into inner blocks.
Change-Id: I884f95c755f55d51b7c1c6585f10296919063e41
2013-02-28 13:28:05 -08:00
Dmitry Kovalev
0d9cc0a9f0
Code cleanup.
...
Removing redundant 'extern' keyword, better formatting, code
simplification.
Change-Id: I132fea14f08c706ee9ea147d19464d03f833f25b
2013-02-28 13:18:02 -08:00
John Koleszar
b6a3062d81
Fix incorrect comparison of frame size
...
The width and height stored in the reference frames are padded out to
a multiple of 16. The Width and Height variables in common are the
displayed size, which may be smaller. The incorrect comparison was
causing scaling related code to be called when it shouldn't have
been. A notable case where this happens is 1080p, since 1088 != 1080.
Change-Id: I55f743eeeeaefbf2e777e193bc9a77ff726e16b5
2013-02-28 11:33:02 -08:00
Johann
67978d1380
Merge "vp8 fast quantizer with intrinsics"
2013-02-28 11:32:03 -08:00
Jim Bankoski
714aa9f3c0
this commit converts all sad ptrs to uint32
...
sse4_1 code used uint16_t for returning sad, but that
won't work for 32x32 or 64x64. This code fixes the
assembly for those and also reenables sse4_1 on linux
Change-Id: I5ce7288d581db870a148e5f7c5092826f59edd81
2013-02-28 08:46:35 -08:00
Jim Bankoski
b715e371c0
fix to parameters to match rtcd
...
Change-Id: I919e2dd72292fe44f2e53ada56bd42287d50cdeb
Signed-off-by: Jim Bankoski <jimbankoski@google.com>
2013-02-28 08:10:08 -08:00
Christian Duvivier
f02769b872
Merge "Faster vp9_short_fdct8x8." into experimental
2013-02-27 17:50:27 -08:00
John Koleszar
d962bd530d
Merge "Fix rollover and pass 1 time estimate" into experimental
2013-02-27 17:43:03 -08:00
Christian Duvivier
c129203f7e
Faster vp9_short_fdct8x8.
...
Scalar path is about 1.4x faster (4% overall encoder speedup).
SSE2 path is about 7x faster (13% overall encoder speedup).
Change-Id: I7e85d8225a914a74c61ea370210414696560094d
2013-02-27 17:23:08 -08:00
Dmitry Kovalev
c6421433c8
Merge "Code cleanup." into experimental
2013-02-27 16:43:04 -08:00
Dmitry Kovalev
347f3a0aa8
Code cleanup.
...
Fixing code style, using array lookup instead of switch statements for
forward hybrid transforms (in the same way as for their inverses).
Consistent usage of ROUND_POWER_OF_TWO macro in appropriate places.
Change-Id: I0d3822ae11f928905fdbfbe4158f91d97c71015f
2013-02-27 13:51:04 -08:00
John Koleszar
889ce83390
Merge changes Idc1c490f,I6b5fe1a4 into experimental
...
* changes:
convolve test: validate 1D filters are 1D
Run all filters through convolve test
2013-02-27 13:45:42 -08:00
Dmitry Kovalev
9d771f948f
Merge "Motion vectors code cleanup." into experimental
2013-02-27 13:34:56 -08:00
Yunqing Wang
bbc7b6a86a
Merge "Remove unused file" into experimental
2013-02-27 13:00:10 -08:00
John Koleszar
ebf8b9fc6d
Fix rollover and pass 1 time estimate
...
Fixes a rollover of the cx_time variable for encodes that take
over ~4200 seconds. Also enable the time estimate in first pass.
Change-Id: Ib5a98ee71bccd79a804d709cec7260651d0b7141
2013-02-27 12:29:25 -08:00
John Koleszar
5ac141187a
Merge "Remove unused vp9_copy32xn" into experimental
2013-02-27 12:23:45 -08:00
Yunqing Wang
d6ff6fe2ed
Merge "Remove unused file" into experimental
2013-02-27 11:58:29 -08:00