hkuang
3476404912
Merge "Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf."
2013-09-05 17:37:13 -07:00
Jim Bankoski
736114f44b
fix loop filter setup_mask could reach out of bounds issue
...
Change-Id: Ic8446c4f26b6782a6dc482c19ea73c77646df418
2013-09-05 15:53:31 -07:00
Jingning Han
1c263d6918
Merge "Use saturated addition in SSSE3 of 32x32 quant"
2013-09-05 14:09:40 -07:00
Jim Bankoski
2156ccaa4a
Merge "resolve clang warnings : uninitialized vars in vp9_entropy.h"
2013-09-05 12:55:32 -07:00
Jingning Han
458c2833c0
Use saturated addition in SSSE3 of 32x32 quant
...
The 32x32 forward transform can potentially reach peak coefficient
value close to 32700, while the rounding factor can go upto 610.
This could cause overflow issue in the SSSE3 implementation of 32x32
quantization process.
This commit resolves this issue by replacing the addition operations
with saturated addition operations in 32x32 block quantization.
Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70
2013-09-05 12:49:12 -07:00
Jim Bankoski
9fc3d32a50
Merge "faster accounting of inc_mv"
2013-09-05 12:38:56 -07:00
Jim Bankoski
2e4ca9d1a5
resolve clang warnings : uninitialized vars in vp9_entropy.h
...
This helps clear out some of the warnings
Change-Id: Ie7ccaca8fd92542386a7f1b257398e1bdf2f55dc
2013-09-04 18:38:41 -07:00
Jim Bankoski
e8feb2932f
Merge "wrap non420 loop filter code in macro"
2013-09-04 17:20:53 -07:00
hkuang
01c4e04424
Speed up idct8x8 by rearrange instructions.
...
Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf.
Change-Id: I3e2cc818ec14b432204ff43732f39b6438db685d
2013-09-04 15:57:22 -07:00
hkuang
3c05bda058
Merge "Add neon optimize vp9_short_iht4x4_add."
2013-09-04 13:35:09 -07:00
hkuang
3b8614a8f6
Add neon optimize vp9_short_iht4x4_add.
...
Change-Id: I42c497b68ae1ee645b59c9968ad805db0a43e37e
2013-09-04 12:37:58 -07:00
Jim Bankoski
872c6d85c0
Merge "speed up inc_mv_component"
2013-09-04 10:35:51 -07:00
Jim Bankoski
bb2313db28
Merge "make vp9 postproc a config option"
2013-09-04 10:35:26 -07:00
Jim Bankoski
c3c21e3c14
wrap non420 loop filter code in macro
...
Change-Id: I62bca0e7a4bffc1a78b750dbb9df9d2378e92423
2013-09-04 10:24:42 -07:00
Jim Bankoski
79401542f7
make vp9 postproc a config option
...
Vp9 postproc is disabled for now as its not been shown to help and
may be merged with vp8.
Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057
2013-09-04 10:02:08 -07:00
Jim Bankoski
532179e845
faster accounting of inc_mv
...
Moves counting of mv branches to where we have a new mv, instead of after
the whole frame is summed.
Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd
2013-09-04 09:47:57 -07:00
Jim Bankoski
5dda1d2394
speed up inc_mv_component
...
Convert mv_class if statements to look up. re order to avoid ifs...
Change-Id: I76966a21bf517bb1f9a7957c08c476c7bb3e9a63
2013-09-04 07:11:30 -07:00
James Zern
1cf2272347
Merge "Fix intermediate height in convolve_c"
2013-09-03 15:50:33 -07:00
Jingning Han
010c0ad0eb
Merge "Fix 32x32 forward transform SSE2 version"
2013-09-03 08:58:03 -07:00
Scott LaVarnway
948aaab4ca
Merge "Improved mb_lpf_horizontal_edge_w_sse2_8"
2013-09-03 05:44:01 -07:00
Jingning Han
3cf46fa591
Fix 32x32 forward transform SSE2 version
...
This commit fixed the potential overflow issue in the SSE2
implementation of 32x32 forward DCT. It resolved the corrupted
coded frames in the border of scenes.
Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9
2013-08-31 18:47:08 -07:00
Tero Rintaluoma
e326cecf18
Fix intermediate height in convolve_c
...
- Intermediate height was not correct i.e. when block size is 4 and
y_step_q4 is 6. In this case intermediate height was
(4*6) >> 4 = 1 and vertical interpolation needs two source pixels
plus 7 extra pixels for taps.
- Also if the current output block is 16x16 and we are using 4x upscaling
we need only 12 rows after horizontal filtering instead of 16.
Patch Set 2: Intermediate_height updated after CL 66723
"Fix bug in convolution functions (filter selection)"
Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589
2013-08-30 10:31:21 +03:00
Jim Bankoski
1d44fc0c49
Merge "rework filter_block_plane"
2013-08-29 20:11:09 -07:00
Jim Bankoski
bc50961a74
rework filter_block_plane
...
Change-Id: I55c3b60c4c0f4910d3dfb70e3edaae00cfa8dc4d
2013-08-29 17:00:05 -07:00
Jingning Han
c86c5443eb
Merge "Fix overflow issue in SSSE3 32x32 quantization"
2013-08-29 16:49:04 -07:00
James Zern
d765df2796
consistently name VP9_COMMON variables #3
...
stragglers
Change-Id: Ib1e853f9a331b7b66639dc34d79568d84d1930f1
2013-08-29 13:27:41 -07:00
James Zern
aa05321262
consistently name VP9_COMMON variables #2
...
oci -> cm
Change-Id: Ifd75c809d9cc99034d3c2fccc4653a78b3aec21f
2013-08-29 13:25:58 -07:00
James Zern
924d74516a
consistently name VP9_COMMON variables #1
...
pc -> cm
Change-Id: If3e83404f574316fdd3b9aace2487b64efdb66f3
2013-08-29 13:25:57 -07:00
Jingning Han
abff678866
Fix overflow issue in SSSE3 32x32 quantization
...
The 32x32 quantization process can potentially have the intermediate
stacks over 16-bit range, thereby causing enc/dec mismatch. This commit
fixes this overflow issue in the SSSE3 implementation, as well as the
prototype, of 32x32 quantization.
This fixes issue 607 from webm@googlecode.
Change-Id: I85635e6ca236b90c3dcfc40d449215c7b9caa806
2013-08-29 11:00:54 -07:00
Scott LaVarnway
22dc946a7e
Improved mb_lpf_horizontal_edge_w_sse2_8
...
This patch is a reformatted version of optimizations done by
engineers at Intel (Erik/Tamar) who have been providing
performance feedback for VP9. For the test clips used (720p, 1080p),
up to 1.2% performance improvement was seen.
Change-Id: Ic1a7149098740079d5453b564da6fbfdd0b2f3d2
2013-08-29 08:30:17 -04:00
Dmitry Kovalev
851a2fd72c
Renaming txfm_size to tx_size.
...
Change-Id: I752e374867d459960995b24d197301d65ad535e3
2013-08-27 19:47:53 -07:00
Dmitry Kovalev
1d3f94efe2
Merge "Adding get_entropy_context function."
2013-08-27 17:02:36 -07:00
Frank Galligan
7d058ef86c
Merge "Fix winodws warning."
2013-08-27 15:39:58 -07:00
Frank Galligan
f1560ce035
Fix winodws warning.
...
Const is not needed on the function parameter.
Change-Id: I38c2a7317cb6f42f70bbddfde9a2cd18d65ceb1c
2013-08-27 15:19:55 -07:00
Dmitry Kovalev
a93992e725
Adding get_entropy_context function.
...
Moving common code from encoder and decoder to this function.
Change-Id: I60fa643fb1ddf7ebbff5e83b6c4710137b0195ef
2013-08-27 14:17:53 -07:00
hkuang
3a679e56b2
Add neon optimize vp9_short_idct16x16_1_add.
...
Change-Id: Ib9354c1d975d03e8081df20d50b6a77dfe2dc7e5
2013-08-27 14:00:27 -07:00
hkuang
ce04b1aa62
Merge "Add neon optimize vp9_short_idct8x8_1_add."
2013-08-27 12:10:07 -07:00
Dmitry Kovalev
7b95f9bf39
Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the encoder.
...
Change-Id: I62bb07c377f947cb72fac68add7a6b199e42c6b9
2013-08-27 11:05:08 -07:00
Dmitry Kovalev
12e5931a9a
Merge "Using existing functions instead of raw expressions."
2013-08-27 10:33:34 -07:00
Dmitry Kovalev
bfebe7e927
Merge "Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder."
2013-08-27 10:15:21 -07:00
Dmitry Kovalev
78e670fcf8
Merge "Renaming D27 to D207."
2013-08-27 10:03:57 -07:00
hkuang
36e9b82080
Add neon optimize vp9_short_idct8x8_1_add.
...
Change-Id: I0b15d5e3b0eb97abb9ab5ec08e88b61f8723aaf4
2013-08-26 16:28:57 -07:00
hkuang
69384f4fad
Add neon optimize vp9_short_idct4x4_1_add.
...
Change-Id: I6ecb5c4a1a472feb8e84e9f3352b536d5e28a4a5
2013-08-26 15:55:16 -07:00
Dmitry Kovalev
45870619f3
Renaming BLOCK_SIZE_TYPE to BLOCK_SIZE in the common/decoder.
...
Adding temporary "typedef BLOCK_SIZE BLOCK_SIZE_TYPE" which will go away
after encoder's patch.
Change-Id: I06ec6a6f079401439843ec981d1496234fd7775c
2013-08-26 11:33:16 -07:00
Jingning Han
4681197a58
Merge "Temporarily disable SSSE3 quant_32x32"
2013-08-26 11:19:53 -07:00
Jingning Han
166dc85bed
Temporarily disable SSSE3 quant_32x32
...
Make the current head working properly, while working on fixing an
issue in the SSSE3 implementation of 32x32 quantization.
Change-Id: Ic029da3fd7f1f5e58bc641341cbd226ec49a16bc
2013-08-26 10:45:59 -07:00
James Zern
c8ba8c513c
cosmetics: strip 'VP9_' from defines in vp9 only code
...
Change-Id: I481d9bb2fa3ec72b6a83d5f04d545ad8013f295c
2013-08-23 19:16:49 -07:00
Dmitry Kovalev
50ee61db4c
Renaming D27 to D207.
...
I've already renamed d27_predictor to d207_predictor but forgot about the
corresponding constant.
Change-Id: Id312aa80fc5b5a1ab8a709a33418a029552a6857
2013-08-23 17:33:48 -07:00
Dmitry Kovalev
480dd8ffbe
Using existing functions instead of raw expressions.
...
Change-Id: Ifa50b04bac1a6ff2abef989073cbf1f37a89eb50
2013-08-23 17:26:53 -07:00
Dmitry Kovalev
e6c435b506
Merge "Cleanup in mvref_common.{h, c}."
2013-08-23 17:09:49 -07:00