Jim Bankoski
68ef7a6b8a
configure with internal stats not working
...
Change-Id: I5dea4570cb05df27a522abf6e7b695998654284a
2013-07-10 15:07:53 -07:00
Ronald S. Bultje
3f210f10eb
Remove unused iwalsh4x4 MMX/SSE2 functions.
...
Change-Id: I2d22577911a37ed7d8c7e08cac20764842267652
2013-07-10 14:52:47 -07:00
Ronald S. Bultje
48c53233fd
Remove unused 16x3/3x16 sad SSE2 functions.
...
Change-Id: I30a597c0cc366e34c9a3e2afe32d70e044f95ca4
2013-07-10 14:52:47 -07:00
Ronald S. Bultje
e6f955251f
Merge "SSSE3 assembly for 4x4/8x8/16x16/32x32 H intra prediction."
2013-07-10 14:52:23 -07:00
Ronald S. Bultje
6a60249071
Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 TM intra prediction."
2013-07-10 14:52:19 -07:00
Jim Bankoski
865ca76604
Merge "remove warnings when NDEBUG is set"
2013-07-10 14:39:39 -07:00
Jim Bankoski
6591cf2f7e
remove warnings when NDEBUG is set
...
Change-Id: Ie0cb732fdcb98616a422c4463bff80642248d136
2013-07-10 14:27:20 -07:00
Deb Mukherjee
53ff43adc3
Prunes out full-rd computation based on modeled rd
...
Adds a speed feature to eliminate full-rd computation if the modeled
rd or rd based on a different parameter in the same mode is already
a lot larger than the best rd yet.
Specifically, only search the sharp and smooth filters if the modeled
rd cost based on the regular filter is within a certain factor of the
best rd cost so far. Also, skip full-rd computation of non splitmv
inter modes if the modeled rd cost based on pred error is within the
same factor of the best rd cost so far.
Also adds some enhancements in the rd search for splitmv mode to
speed things up by early breakouts. Negligible impact on performance.
Resuts on derfraw300:
psnr: -0.013% with the splitmv enhancements, -0.24% with the rd
breakout feature on.
speedup: 6% with splitmv enhancements, 20% with also residual breakout
(tested on football sequence at 600 Kbps)
Change-Id: I37abc308ea9f110c1679ce649b6a7e73ab1ad5fc
2013-07-10 13:49:49 -07:00
James Zern
82f5935111
Merge "msvc: set a more useful debug format"
2013-07-10 13:02:22 -07:00
James Zern
9a8524d5ba
Merge "test_libvpx: disable pthreads in gtest for win targets"
2013-07-10 13:01:52 -07:00
Jingning Han
114423538f
SSE2 16x16 ADST/DCT hybrid transform
...
This commit enables 16x16 ADST/DCT forward hybrid transform using SSE2
operations. It reduces the runtime from 5433 cycles to 1621 cycles, at
no compression performance loss.
Change-Id: I75fd7f1984e9e28846af459f810ff0d6ae125230
2013-07-10 12:14:53 -07:00
Dmitry Kovalev
417df1d42e
Merge "Adding encode_tiles function to vp9_bitstream.c."
2013-07-10 11:43:50 -07:00
Yaowu Xu
e52eec490c
Merge "Add a feature to reduce chrome intra mode search"
2013-07-10 11:35:47 -07:00
Jingning Han
82c415328c
Merge "Add unit test for 16x16 forward ADST/DCT"
2013-07-10 11:16:39 -07:00
Scott LaVarnway
25b4909076
Merge "Bug fix: set frame_parallel_decoding_mode"
2013-07-10 11:09:30 -07:00
John Koleszar
d1f8dd518c
Merge "Fix intermediate height in convolve"
2013-07-10 11:04:40 -07:00
Dmitry Kovalev
704afd0c7a
Adding read_compressed_header function.
...
Splitting setup_txfm_mode into read_tx_mode and read_tx_probs.
Change-Id: I5b4fe48698d56490857d32eafcaeb4291f208479
2013-07-10 10:27:50 -07:00
Ronald S. Bultje
44b29a769c
Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 V intra prediction."
2013-07-10 10:24:16 -07:00
Ronald S. Bultje
89810bfd71
Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 DC intra prediction."
2013-07-10 10:13:16 -07:00
Ronald S. Bultje
4823307720
Merge "Remove memcpy() in handle_inter_mode() filter selection."
2013-07-10 10:13:07 -07:00
Dmitry Kovalev
20986c81b3
Merge "Removing vp9_maskingmv.c and corresponding assembly file."
2013-07-10 10:05:06 -07:00
Jingning Han
cf768b2d80
Add unit test for 16x16 forward ADST/DCT
...
Unit tests on the functional accuracy of forward ADST/DCT.
Change-Id: I81afff866bdeacbd457b0af96993a035741657f6
2013-07-10 09:40:46 -07:00
Ronald S. Bultje
7fd643264a
SSSE3 assembly for 4x4/8x8/16x16/32x32 H intra prediction.
...
Change-Id: Iad70966b986f65259329070e258f76ef0af816b4
2013-07-10 09:28:03 -07:00
Ronald S. Bultje
8dade638a1
SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 TM intra prediction.
...
Change-Id: I3441c059214c2956e8261331bbf521525a617a86
2013-07-10 09:28:03 -07:00
Ronald S. Bultje
75b33c68c7
SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 V intra prediction.
...
Change-Id: I55a6cfa2daba738cbc0c4a02f806893f7e556997
2013-07-10 09:28:03 -07:00
Ronald S. Bultje
92c5d3665d
SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 DC intra prediction.
...
Change-Id: Ibe1690afc5459f3b3beca401e7734fcd03da6dd0
2013-07-10 09:28:03 -07:00
Ronald S. Bultje
b1df674a99
Remove memcpy() in handle_inter_mode() filter selection.
...
Encode time of first 50 frames of bus (speed 0) @ 1500kbps goes from
2min4.9 to 2min3.1, i.e. a 1.4% speedup overall.
Change-Id: I9b25e87974430cb942caa276410bb2eda815bd83
2013-07-10 09:27:56 -07:00
Yaowu Xu
bed27a960a
Add a feature to reduce chrome intra mode search
...
Change-Id: I721ebdeef2b53ce3e5c3eba3f7462ae2103c95a8
2013-07-10 08:59:18 -07:00
Jim Bankoski
863204e64d
mi_width_log2 & mi_height_log2
...
converted to lookup to avoid unnecessary code
Change-Id: I2ee6a01f06984cc2c4ba74b3fffd215318f749d2
2013-07-10 07:26:08 -07:00
Jim Bankoski
6c8170af52
b_width_log2 and b_height_log2 lookups
...
Replace case statement with lookup.
Small speed gain at low speed settings but at speed 2+ where the
number of motion searches etc. falls the impact rises to ~3-4%.
Change-Id: Idff639b7b302ee65e042b7bf836943ac0a06fad8
Change-Id: I5940719a4a161f8c26ac9a6753f1678494cec644
2013-07-10 07:19:09 -07:00
Jim Bankoski
fb027a7658
removing case statements around prediction entropy coding
...
Removes SEG_ID
Removes MBSKIP
Removes SWITCHABLE_INTERP
Removes INTRA_INTER
Removes COMP_INTER_INTER
Removes COMP_REF_P
Removes SINGLE_REF_P1
Removes SINGLE_REF_P2
Removes TX_SIZE
Change-Id: Ie4520ae1f65c8cac312432c0616cc80dea5bf34b
2013-07-09 20:10:16 -07:00
Yaowu Xu
059f2929e9
Merge "Revert "Remove memcpy() in handle_inter_mode() filter selection.""
2013-07-09 20:10:06 -07:00
James Zern
c2a4b2059e
Merge "remove unused VP8 com/dec asm offsets"
2013-07-09 19:13:49 -07:00
James Zern
dac57fece6
Merge "Remove all asm offset files from VP9"
2013-07-09 19:13:37 -07:00
Dmitry Kovalev
2824048a56
Merge "Loop filter code cleanup."
2013-07-09 18:56:19 -07:00
Yaowu Xu
205efbc153
Revert "Remove memcpy() in handle_inter_mode() filter selection."
...
This reverts commit fcf7998a47
.
Change-Id: Ic6532223faec9f1483b78adb2e37b79c7b1a0efb
2013-07-09 17:42:10 -07:00
James Zern
9718e708f3
msvc: set a more useful debug format
...
pdb vs. c7; works better with test_libvpx
Change-Id: I67d18e328dd8e7734d3710f3912e9b179d368a62
2013-07-09 17:28:22 -07:00
Yaowu Xu
d3d6ddcee5
Merge "Added a lossless test"
2013-07-09 17:15:09 -07:00
Dmitry Kovalev
d82f459d1a
Adding encode_tiles function to vp9_bitstream.c.
...
Change-Id: Ie44824ec25fd8fdb25d7c8124a9b28c26d802029
2013-07-09 15:59:19 -07:00
Frank Galligan
53971d86ea
Merge "Add Neon horizontal and vertical vp9_mbloop_filter"
2013-07-09 15:38:44 -07:00
Yaowu Xu
9ce6de195b
Added a lossless test
...
It does encodings with min and max q set at 0, and check to make sure
output PSNR at MAX_PSNR (100).
Change-Id: Ia2418353cccf6e487204ea4ff874a7e71e55cb3e
2013-07-09 14:40:20 -07:00
James Zern
f89335f7ca
remove unused VP8 com/dec asm offsets
...
Change-Id: Ib3b26ee27f04b2dcbbd32b3127afb45e9f50cfcf
2013-07-09 14:33:49 -07:00
John Koleszar
f0d9f10d24
Remove all asm offset files from VP9
...
The files are empty and unused.
Change-Id: Ieb4242d14273efdf24149bda33f9591540bba06a
2013-07-09 14:26:53 -07:00
Scott LaVarnway
5900d13183
Merge "Removed unnecessary xd->mode_info_context assignment"
2013-07-09 12:45:32 -07:00
Frank Galligan
198fa6d0a0
Add Neon horizontal and vertical vp9_mbloop_filter
...
- The vp9 mbfilter C code will branch on flat and mask. This CL
will perform both branches and combine the data. A later CL will
perform a check to see if all patch will take one branch.
- These functions are about 1.75 times faster than the C code on
Nexus 7.
PS #3
- Changed all functions to dub limit, blimit, and thresh from
vld {dx[]}, freeing up r4-r6.
- Changed code to use vbif to reduce one instruction and free
up a d register.
Change-Id: I028dae0e434dc9891c3677bdb182e201ffb04777
2013-07-09 12:40:05 -07:00
Dmitry Kovalev
ec68d25521
Merge "Adding update_tx_ct function, removing duplicated code."
2013-07-09 12:26:11 -07:00
Dmitry Kovalev
aeed28f143
Removing vp9_maskingmv.c and corresponding assembly file.
...
Change-Id: I9842d02d61d78d17dc3449bae8ffbe60f4b3ecb3
2013-07-09 11:22:56 -07:00
Dmitry Kovalev
92a9eaef50
Loop filter code cleanup.
...
Using MAX_LOOP_FILTER constant instead of number 63.
Change-Id: If91e0c198331b3041e7cd0707a5948479e9209d8
2013-07-09 11:18:09 -07:00
Scott LaVarnway
69d1d1d865
Removed unnecessary xd->mode_info_context assignment
...
mi is xd->mode_info_context
Change-Id: Ib101be922b695205ec57b5ce1828ba19bde5b41c
2013-07-09 13:41:34 -04:00
Ronald S. Bultje
204d1b7058
Merge "Unbreak lossless."
2013-07-09 09:54:48 -07:00