Johann Koenig
6c375b9cd0
Merge "fdct neon: 32x32_rd"
2017-07-07 14:05:51 +00:00
Johann
e4e08556db
sad neon: avg for 64x[32,64]
...
BUG=webm:1425
Change-Id: Id84d97807a6a0fbcc889c4dfe11929d54f85493d
2017-07-07 07:04:04 -07:00
Johann
6ae8f8dbe8
sad neon: macroize 64xN definitions
...
Change-Id: Iaa6ea75b10e75784f31b1e08637eecf0dcb5cff9
2017-07-07 07:04:04 -07:00
Johann
67cffc1ef6
sad neon: avg for 32x[16,32,64]
...
BUG=webm:1425
Change-Id: I3362e0dded3b46ca032caa7f44db42f324bc596d
2017-07-07 07:04:04 -07:00
Johann
b0d15713be
sad neon: macroize 32xN definitions
...
Change-Id: I0020a49e77d27514375a03095d5821dc0aa7d128
2017-07-07 07:04:04 -07:00
Johann
527e0c9b1c
sad neon: avg for 16x[8,16,32]
...
BUG=webm:1425
Change-Id: Ia42e4f36547c5fe12114fb58379e34bce82eb2f2
2017-07-07 07:04:04 -07:00
Johann
3c18acf452
sad neon: macroize 16xN definitions
...
Change-Id: I5aea6ffbfa48eb1970afe3be54f0bba275d7fa58
2017-07-07 07:04:04 -07:00
Johann Koenig
9b253f9f0a
Merge changes I7b36a57e,If2ab51e3,Ifc685a96
...
* changes:
sad neon: macroize 8xN definitions
sad neon: avg for 8x[4,8,16]
sad neon: avg for 4x4 and 4x8
2017-07-07 14:03:13 +00:00
Marco Paniconi
2075af4b16
Merge "vp9: Nonrd mode: use content_state_sb for high motion."
2017-07-07 03:00:59 +00:00
James Zern
80b83c73ba
cosmetics,vp9/: normalize inv/fwd_txfm naming
...
+ vpx_dsp/, test/
itxfm -> inv_txfm, ftxfm -> fwd_txfm
Change-Id: I3aacdb65143576d64cfe5c9b14dd358c17c1fe7e
2017-07-06 18:35:44 -07:00
James Zern
777ca80f0a
Merge changes from topic 'rm-dec-frame-parallel'
...
* changes:
vp9: remove FrameWorkerData & vp9_dthread.h
vp9: remove (un)lock_buffer_pool
2017-07-06 23:31:30 +00:00
James Zern
a15c6a7ebf
vp8cx,cosmetics: correct VP9_SET_TILE_COLUMNS docs
...
this has been set to max since:
f5c36a5ce
VP9: turn on tile-columns and frame-parallel-mode by default
~v1.4.0
Change-Id: Ic796fc05abe73a58700ec50e3f8e72d3462898ec
2017-07-06 16:24:35 -07:00
Marco
8c3f18efa1
vp9: Nonrd mode: use content_state_sb for high motion.
...
In the content_state for a superblock is set to HighSad,
use that to bias some decisions in variance partition and
nonrd pickmde: use int_pro_motion for sad computation in
choose_partitioning, and set large_block in pickmode based
on the content_state_sb.
Only affects speed >= 7.
Immprovement for high motion content.
Small gain (~1%) in RTC metrics.
Speedup of ~5 for high motion clip on android (speed 8, 1 thread).
Change-Id: I5774c4854f012b89c8e969f6129b60988c2ce11c
2017-07-06 15:05:19 -07:00
James Zern
26a9a4cd64
vp8cx,cosmetics: correct VP9_SET_FRAME_PARALLEL_DECODING docs
...
this has been on by default since:
f5c36a5ce
VP9: turn on tile-columns and frame-parallel-mode by default
~v1.4.0
Change-Id: I52017ab0157feaf429dce3d9e1af8a53bb5c1b65
2017-07-06 10:40:18 -07:00
Johann
d6423b3166
sad neon: macroize 8xN definitions
...
Change-Id: I7b36a57e893c1795a37ba7994995bec7ff021409
2017-07-06 07:51:59 -07:00
Johann
63bdc574e5
sad neon: avg for 8x[4,8,16]
...
BUG=webm:1425
Change-Id: If2ab51e3050e078b0011b174efe41fcb65a15f44
2017-07-06 07:43:09 -07:00
Johann
6bac3f80ee
sad neon: avg for 4x4 and 4x8
...
BUG=webm:1425
Change-Id: Ifc685a96cb34f7fd9243b4c674027480564b84fb
2017-07-06 07:12:47 -07:00
Johann
75b00592c7
fdct neon: 32x32_rd
...
About 40% faster than the non-rd version.
BUG=webm:1424
Change-Id: Ia99d14eb9532302eeaab8cd3e503395b0374b5a2
2017-07-06 06:30:50 -07:00
James Zern
5227b8200b
vp9: remove FrameWorkerData & vp9_dthread.h
...
the file was empty after the struct removal. the only remaining use was
within vp9_dx_iface, but the wrapper became unnecessary after the
removal of frame_parallel_decode.
BUG=webm:1395
Change-Id: I515ab585d701e77d388d12b2802d844c424f9bcd
2017-07-05 22:32:00 -07:00
James Zern
48c4a038eb
vp9: remove (un)lock_buffer_pool
...
there is no threaded access to this pool after the removal of
frame_parallel_decode
BUG=webm:1395
Change-Id: I710769b87102edc898c59eb9a2e7a91d8c49107f
2017-07-05 21:07:00 -07:00
James Zern
af3cab7b24
Merge changes from topic 'rm-dec-frame-parallel'
...
* changes:
vp9_onyxc_int,RefCntBuffer: rm unused members
remove vp9_dthread.c
vp9: reduce FRAME_BUFFERS by 3
2017-07-06 04:06:30 +00:00
James Zern
4ffd8350be
Merge changes from topic 'rm-dec-frame-parallel'
...
* changes:
VP9_COMMON: rm frame_parallel_decode
VP9Decoder: rm frame_parallel_decode
vp9_dx: rm worker thread creation
2017-07-05 23:53:22 +00:00
James Zern
0d245d42c4
Merge "test_vector_test,vp8: correct thread range"
2017-07-05 22:33:51 +00:00
Yaowu Xu
f2b1dc529f
Merge "Further refactoring of mod error calculation."
2017-07-05 21:43:50 +00:00
Yaowu Xu
e3cafbc8df
Merge "Fix incorrect index test in GF group rate assignment."
2017-07-05 21:43:43 +00:00
James Zern
24d0391efb
Merge "googletest: suppress unsigned overflow in the LCG"
2017-07-05 21:19:44 +00:00
Johann Koenig
9a05f9771a
Merge "test/buffer.h: move range checking to compiler"
2017-07-05 21:15:13 +00:00
James Zern
a22bb9809e
Merge "dct_partial_test: cover vpx_fdct8x8_1_msa in hbd"
2017-07-05 21:08:46 +00:00
Hui Su
3e08a88854
Merge "level tests: allow level undershoot"
2017-07-05 20:47:20 +00:00
James Zern
23d60be414
dct_partial_test: cover vpx_fdct8x8_1_msa in hbd
...
this was enabled in:
5ac88162b
partial fdct test
Change-Id: Ibae2031ec1308fe3a3b84a1ce6e7bacda3a7cb82
2017-07-05 13:01:41 -07:00
James Zern
a6531cbc54
Merge changes from topic 'missing-proto'
...
* changes:
fwd_txfm_msa.c: add missing vpx_dsp_rtcd.h
vpx_convolve_*_msa.c: add missing vpx_dsp_rtcd.h
loopfilter_*_msa.c: add missing vpx_dsp_rtcd.h
2017-07-05 20:00:25 +00:00
Johann Koenig
b6321025cd
Merge "partial fdct neon: maintain neon registers"
2017-07-05 19:12:38 +00:00
Johann
da2ad47d66
test/buffer.h: move range checking to compiler
...
Pass low/high values as type T. Out of range values should be caught by
static analysis instead.
Change-Id: I0a3ee8820af05f4c791ab097626174e2206fa6d5
2017-07-05 11:21:18 -07:00
paulwilkins
a1af335f44
Further refactoring of mod error calculation.
...
Further refactoring to support alternative error distributions.
Change-Id: I0f7fa3fd6f3baa4b0a1e53c6aa3be63966e97b82
2017-07-05 16:49:37 +01:00
paulwilkins
b0459ec8ea
Fix incorrect index test in GF group rate assignment.
...
Correct test for middle frame in the group.
Change-Id: I1ee49fa33968eb3c4a01d6a27a60bb1409e3e68c
2017-07-05 16:45:36 +01:00
James Zern
7d526c1654
Merge "buffer.h: incorrect RandRange results"
2017-07-02 03:48:53 +00:00
Johann
6cb3178192
buffer.h: incorrect RandRange results
...
'low' was promoted to unsigned, triggering a ubsan warning
Change-Id: Id49340079d39c105da93cf13e96cf852a93a94ba
2017-07-01 20:01:22 -07:00
James Zern
fb135ff050
Merge changes I4ed1312f,Id2673eec
...
* changes:
ppc: Add vpx_idct8x8_64_add_vsx
ppc: Add vpx_idct4x4_16_add_vsx
2017-07-02 02:38:39 +00:00
Alexandra Hájková
c757d6dde4
ppc: Add vpx_idct8x8_64_add_vsx
...
Change-Id: I4ed1312f365509e0595dcc09890ecb050f6f2069
2017-07-01 12:55:47 -07:00
Alexandra Hájková
d8c277030c
ppc: Add vpx_idct4x4_16_add_vsx
...
Change-Id: Id2673eece32027fb245919c7a5c81994a4a19fd8
2017-07-01 12:32:18 -07:00
Alex Converse
f7645138d4
googletest: suppress unsigned overflow in the LCG
...
Local application of:
https://github.com/google/googletest/pull/1066
Suppress unsigned overflow instrumentation in the LCG
The rest of the (covered) codebase is already integer overflow clean.
TESTED=gtest_shuffle_test goes from fail to pass with -fsanitize=integer
Change-Id: I8a6db02a7c274160adb08b7dfd528b87b5b53050
2017-07-01 12:24:32 -07:00
James Zern
3dd993e4be
highbd_idct8x8_add_sse4: make << of neg. val a multiply
...
left shifting a negative value is undefined; quiets a ubsan warning.
this is applied to a constant, no change in the generated code.
Change-Id: Ia17a7672d4832463decbc4afd6cd42974d02698e
2017-07-01 11:56:56 -07:00
Johann
3ae458f2f3
partial fdct neon: maintain neon registers
...
Finish the calulations in neon registers. This avoids a potentially
expensive move from neon to gp and allows at least clang to store
directly to memory.
BUG=webm:1424
Change-Id: Idef25eec95f7610947167818e9194bde8b00d282
2017-07-01 09:29:38 -07:00
James Zern
a876d04072
fwd_txfm_msa.c: add missing vpx_dsp_rtcd.h
...
+ only expose compatible functions in high-bitdepth build
quiets -Wmissing-prototypes warnings
Change-Id: I8ef7db08a34c5c54b5cde6e732c0d70f4287c89a
2017-06-30 18:53:30 -07:00
James Zern
8710c6d884
vpx_convolve_*_msa.c: add missing vpx_dsp_rtcd.h
...
quiets -Wmissing-prototypes warnings
Change-Id: I1ab5b8ae4a62f54e0f9eb3fc81371c9b99972c30
2017-06-30 18:50:56 -07:00
James Zern
329dabf57e
loopfilter_*_msa.c: add missing vpx_dsp_rtcd.h
...
+ make some functions static
quiets -Wmissing-prototypes warnings
Change-Id: I2130e06142e71a004a1eb30e173feba4f6fe68a0
2017-06-30 18:50:52 -07:00
James Zern
27e37e1a8a
fwd_txfm_msa.c: correct vpx_fdct8x8_1_msa prototype
...
this makes the function compatible with high-bitdepth and fixes test
failures since:
5ac88162b
partial fdct test
Change-Id: Ib630694608237f0c515948942e05dbea259ba338
2017-06-30 18:50:47 -07:00
James Zern
af3ab45867
test_vector_test,vp8: correct thread range
...
testing::Range does not include the end parameter in the set of values.
also adjust the start to 2 as the single threaded case is already
covered in another instantiation
Change-Id: Iae3bf3ed4363dd434eccfa5ad4e3c5e553fbee60
2017-06-30 16:21:06 -07:00
James Zern
5a8e4110c7
Merge "gen_msvs_sln: fix solution version for 2015/17"
2017-06-30 22:05:32 +00:00
James Zern
37e03b1d13
Merge "cosmetics,vp9/encoder: s/txm/txfm/"
2017-06-30 21:57:16 +00:00