Debargha Mukherjee
01a2b40e95
Merge "Spatial SVC crash fix"
2015-12-01 21:24:46 +00:00
Debargha Mukherjee
d3409bad9a
Fix a spatial svc bug related to scaling
...
Fixes bug introduced in
https://chromium-review.googlesource.com/#/c/299482/5
Change-Id: If542c1a917380465dd9bc4ce5e32b0adbb20e340
2015-12-01 10:40:59 -08:00
Marco
1abf575f32
vp9 denoiser: Re-evaluate mode selection for golden reference.
...
Under certain denoising conditons, check for re-evaluation of
zero_last mode if best mode was golden reference.
Change-Id: Ic6cdfd175eef2f7d68606300c7173ab6654b3f6e
2015-12-01 09:39:01 -08:00
Jian Zhou
c7fae5d893
Speed up tm_predictor_16x16
...
Reduce mem access to left. Speed up by 10% in ./test_intra_pred_speed
with the same instruction size.
Change-Id: Ia33689d62476972cc82ebb06b50415aeccc95d15
2015-11-30 17:46:40 -08:00
Marco
f78b7daec4
Condition use of minmax in variance partition on speed setting.
...
For non-rd variance partition: only allow minmax computation
(which currently has no arm-neon optimization) for speeds < 8.
Performance loss is small: On RTC set with speed 8, few clips lose ~2/3%,
average loss is < 1%.
Change-Id: Ia9414f4d0b77dc83c3e73ca8de5d903f64b425ce
2015-11-30 17:23:32 -08:00
Scott LaVarnway
2669e05949
Merge "VPX: x86 asm version of vpx_idct32x32_1024_add()"
2015-11-30 23:28:27 +00:00
Marco Paniconi
23831545a0
Merge "vp9 denoiser: Fix to re-evaluate mode selection."
2015-11-30 19:00:39 +00:00
Jian Zhou
9d29d76280
SSE2 speed up of h_predictor_4x4
...
Relocate h_predictor_4x4 from SSSE3 to SSE2 with XMM registers.
Speed up by ~25% in ./test_intra_pred_speed.
Change-Id: I64e14c13b482a471449be3559bfb0da45cf88d9d
2015-11-30 10:08:05 -08:00
Marco
f1f74a4e6c
vp9: Update to noise estimation for denoising.
...
Change initial state of noise level, and only update
denoiser with noise level when estimate is done.
Change-Id: If44090d29949d3e4927e855d88241634cdb395dc
2015-11-30 10:03:20 -08:00
Marco
ad7e765319
vp9 denoiser: Fix to re-evaluate mode selection.
...
This fix allows to enable reuse_inter_pred.
Change-Id: I53f2bf1163bb0036ffb6df92117a86debdca11d1
2015-11-30 08:59:10 -08:00
Scott LaVarnway
0148e20c3c
VPX: x86 asm version of vpx_idct32x32_1024_add()
...
Change-Id: I3ba4ede553e068bf116dce59d1317347988b3542
2015-11-25 10:11:29 -08:00
James Zern
1138b986c9
test.mk: enable test_intra_pred_speed unconditionally
...
vpx_dsp is currently included in all configurations
Change-Id: If3253a87d27f3e1abc94fbfe76f978c1172f3762
2015-11-24 22:29:12 -08:00
Marco Paniconi
610b413d7b
Merge "vp9 denoiser: Re-evaluate ZEROMV after denoiser filtering."
2015-11-25 04:24:00 +00:00
Jian Zhou
901d20369a
Merge "Speed up tm_predictor_8x8"
2015-11-25 02:34:07 +00:00
James Zern
adb033b57b
Merge "configure: simplify x86 asm dependencies"
2015-11-25 02:19:47 +00:00
James Zern
fd51d90159
Merge changes Iaf8cbe95,I6748183d,I2a49811d
...
* changes:
add vp9_satd_neon
fix vp9_satd_sse2
vp9_satd: return an int
2015-11-25 01:48:53 +00:00
Marco
5b0ddb931d
vp9 denoiser: Re-evaluate ZEROMV after denoiser filtering.
...
For denoising, and for noise level above threshold, re-evaluate
ZEROMV for mode selection after denoising.
Current change only does this check if selected best mode (before denoising)
was intra.
Change-Id: I4b1435b68d26c78f7597b995ee7bff0ddd5f9511
2015-11-24 17:30:32 -08:00
Debargha Mukherjee
e807517a93
Spatial SVC crash fix
...
Fixes a spatial_svc breakage introduced in
https://chromium-review.googlesource.com/#/c/305228/3 .
Change-Id: I7f2cecbdca980addb85d5e58b58b5454f4730ada
2015-11-24 16:40:27 -08:00
Alex Converse
022c848b4d
Change highbd variance rounding to prevent negative variance.
...
Always round sum error and sum square error toward zero in variance
calculations. This prevents variance from becoming negative.
Avoiding rounding variance at all might be better but would be far
more invasive.
Change-Id: Icf24e0e75ff94952fc026ba6a4d26adf8d373f1c
2015-11-24 16:32:01 -08:00
James Zern
eb1d0f8d60
add vp9_satd_neon
...
~60-65% faster at the function level across block sizes
Change-Id: Iaf8cbe95731c43fdcbf68256e44284ba51a93893
2015-11-24 16:09:10 -08:00
Jian Zhou
f4621c5c8d
Speed up tm_predictor_8x8
...
Left neighbor read from memory only once.
Speed up by ~20% in ./test_intra_pred_speed.
Change-Id: Ia1388630df6fed0dce9a6eeded6cb855bbc43505
2015-11-24 16:07:06 -08:00
Marco
fbd245c598
vp9-denoiser: Fix to reset frame_stats.
...
zeromv_lastref_sse was not reset.
Change-Id: I23c12e804d63dc7dc18514f6efe71de1d1acbd6a
2015-11-24 15:58:28 -08:00
Marco Paniconi
e99e4a64e0
Merge "vp9 non-rd pickmode: Fix logic in reference masking."
2015-11-24 19:14:35 +00:00
Alex Converse
b84fa548fb
Merge "bitreader/writer: Change shift to signed"
2015-11-24 18:33:45 +00:00
Alex Converse
4b038ad2ef
Merge "Deduplicate some high bit depth tables"
2015-11-24 18:24:32 +00:00
Marco
eb43c8ebfc
vp9 non-rd pickmode: Fix logic in reference masking.
...
This change makes sure last reference with zero mv
is always checked for mode selection.
No change in metrics.
Change-Id: Iaf01877bf34272b966c78bfe18daad882a0a419e
2015-11-24 10:10:03 -08:00
Scott LaVarnway
b16a164c97
Merge "VPX: Removed unnecessary pmulhrsw in IDCT32X32_34"
2015-11-23 23:37:13 +00:00
Scott LaVarnway
26eb806342
Merge "VP9: Only zero counts when !frame_parallel_decoding_mode (2)"
2015-11-23 23:36:46 +00:00
Scott LaVarnway
2c3b737af6
VP9: Only zero counts when !frame_parallel_decoding_mode (2)
...
The counts are never used when frame_parallel_decoding_mode
is set.
Change-Id: I293aa68abadcdd30973adacb9f5f5a3aecf8daa2
2015-11-23 14:42:15 -08:00
Marco
b0027b96ae
vp9-svc: Fix to allow setting qp-max/min per spatial and temporal layer.
...
Change-Id: Ic0ec32c1d7f7c08c9f956592dccbfd9060b1f624
2015-11-23 10:46:34 -08:00
Scott LaVarnway
97e6cc6198
VPX: Removed unnecessary pmulhrsw in IDCT32X32_34
...
and fixed macro name.
Change-Id: I306b98a2b4ec80b130ae80290b4cd9c7a5363311
2015-11-23 10:24:09 -08:00
James Zern
16eba81f69
Revert "Speed up h_predictor_4x4"
...
This reverts commit d76032ae87e535be5b924d9e88bbd67189380534.
breaks 32-bit builds
Change-Id: If6266ec2a405b5a21d615112f0f37e8a71193858
2015-11-20 22:25:29 -08:00
James Zern
073dc71cd0
Merge "Use Interlocked calls in win32 once() implementation."
2015-11-21 01:40:11 +00:00
James Zern
1b10753ad7
Merge "Speed up h_predictor_4x4"
2015-11-21 01:12:42 +00:00
Marco
131c1600a9
vp9 denoiser: Bias to last reference for temporal filter.
...
Change-Id: I6a360a12e8da8cdcb8a779647512591612d64f31
2015-11-20 15:38:32 -08:00
James Zern
60760f710f
fix vp9_satd_sse2
...
accumulate satd in 32-bits
+ add unit test
Change-Id: I6748183df3662ddb9d635f9641f9586f2fd38ad5
2015-11-20 14:35:46 -08:00
James Zern
3e0138edb7
vp9_satd: return an int
...
the final sum may use up to 26 bits
+ add a unit test
+ disable the sse2 as the result will rollover; this will be fixed in a
future commit
Change-Id: I2a49811dfaa06abfd9fa1e1e65ed7cd68e4c97ce
2015-11-20 14:35:38 -08:00
Marco Paniconi
64a60ce3ba
Merge "vp9-svc: Fix the setting of is_key_frame."
2015-11-20 18:29:15 +00:00
Alex Converse
612e3c8a0e
Merge "Fix a signed shift overflow in vpx_rb_read_inv_signed_literal."
2015-11-20 17:42:05 +00:00
Alex Converse
d37c78819a
Merge "Fix unsigned overflow in rd_variance_adjustment."
2015-11-20 17:41:58 +00:00
Marco
80a3e2615a
vp9-svc: Fix the setting of is_key_frame.
...
Change on affects 1 pass CBR.
On key frame, temporal layer_id is reset to 0 for 1 pass CBR,
but since "layer" is reset, the svc.layer_context[layer].is_key_frame
was not correspondingly set properly.
Change-Id: I08f6da0a55ac7429ccfbaddfb7be14479e43543b
2015-11-20 08:51:13 -08:00
Scott LaVarnway
e7fc39fdf5
Merge "VPX: x86 asm version of vpx_idct32x32_34_add()"
2015-11-20 15:11:00 +00:00
Alex Converse
6aa2163b69
bitreader/writer: Change shift to signed
...
Silences several legal but suspicious unsigned overflows found with
clang -fsanitize=integer.
Change-Id: I69399751492a183167932b0a10751c433c32ca7b
2015-11-19 15:13:39 -08:00
Alex Converse
42b7c44b2f
Fix a signed shift overflow in vpx_rb_read_inv_signed_literal.
...
Found with clang -fsanitize=integer
Change-Id: I17cb2166c06ff463abfaf9b0e6bc749d0d6fdf94
2015-11-19 15:04:20 -08:00
Alex Converse
b1fcd1751e
Fix unsigned overflow in rd_variance_adjustment.
...
Found with clang -fsanitize=integer
Change-Id: I2538e7483cb2d5f06bceecbd3326bdd88bfecfa1
2015-11-19 15:00:59 -08:00
Jian Zhou
d76032ae87
Speed up h_predictor_4x4
...
Modify h_predictor_4x4 with XMM registers.
Speed up by ~25% in ./test_intra_pred_speed.
Change-Id: Id01c34c48e75b9d56dfc2e93af12cf0c0326a279
2015-11-19 11:34:22 -08:00
Paul Wilkins
f3f6b6fe3e
Merge "Changes to best quality settings."
2015-11-19 16:13:43 +00:00
Jian Zhou
4993158ee5
Merge "Speed up tm_predictor_4x4"
2015-11-19 02:32:48 +00:00
Jian Zhou
79b68626ae
Speed up tm_predictor_4x4
...
tm_predictor_4x4 is implemented with SSE2 using XMM registers.
Speed up by ~25% in ./test_intra_pred_speed.
Change-Id: I25074b78d476a2cb17f81cf654bdfd80df2070e0
2015-11-18 16:44:25 -08:00
Marco
eed5494fc6
vp9-svc: Fix to key frame counter for spatial layers.
...
Existing condition only applied to temporal layers.
Change-Id: Icef20a59d0afc61d4e14dea01aff4786fa9e41ae
2015-11-18 14:31:37 -08:00