Ronald S. Bultje
5ebe47747d
Merge "Don't re-allocate comp_pred buffers for each call to comp motion search."
2013-06-25 12:00:36 -07:00
Dmitry Kovalev
5ae096778e
Merge "Removing unused code."
2013-06-25 11:50:55 -07:00
Jingning Han
cd6932db77
Merge "Add 8x8 dct/adst unit tests"
2013-06-25 11:21:17 -07:00
Yaowu Xu
c2e3ee13e7
Merge "Changed size of mb_mode_context to 8 bits"
2013-06-25 10:44:47 -07:00
Scott LaVarnway
855e23ce8c
Merge "Small mode_info_context cleanup in filter_block_plane"
2013-06-25 10:34:19 -07:00
Dmitry Kovalev
87ee34aacb
Removing unused code.
...
Removing block index (ib) parameter from get_tx_type_{8x8, 16x16}
functions.
Change-Id: Ia213335aae7a7cb027f97b9cc9b04519840250f1
2013-06-25 10:17:19 -07:00
Dmitry Kovalev
70e9622185
Merge "Removing find_seg_id and using vp9_get_pred_mi_segid instead."
2013-06-25 10:16:06 -07:00
Dmitry Kovalev
529679bd52
Merge "Transforming scale_mv_component_q4 into scale_mv_q4 function."
2013-06-25 10:15:33 -07:00
Jingning Han
ab362621fe
Add 8x8 dct/adst unit tests
...
This commit enables 8x8 DCT and hybrid transform unit tests. It
also tunes the forward hybrid transform rounding opertions for
more precise round-trip performance.
Change-Id: If05c1ce59d75d641b9c6c91527d02d3a6ef498c3
2013-06-25 09:57:01 -07:00
Jingning Han
67365520e7
Merge "Use aligned buffer operations in 8x8/16x16 2D-DCT"
2013-06-25 09:49:03 -07:00
Scott LaVarnway
c787f40bc4
Small mode_info_context cleanup in filter_block_plane
...
Unnecessary updates to xd->mode_info_context.
Change-Id: I36d2d68ca48366f727548526726b1b5437f62968
2013-06-25 12:28:50 -04:00
Yaowu Xu
b9c934df8e
Merge "Enable sse2 implmentation of 8x8 ADST/DCT"
2013-06-25 09:13:22 -07:00
Yaowu Xu
ca976db44d
Merge "change to enable use_largest_txform feature"
2013-06-25 09:07:01 -07:00
Jingning Han
82d504b50f
Use aligned buffer operations in 8x8/16x16 2D-DCT
...
This reduces 16x16 2D-DCT runtime from 865 cycles to 837 cycles.
Change-Id: I137758b81cd127b936175284310e81378db64552
2013-06-24 19:56:23 -07:00
Jingning Han
a32a086d23
Enable sse2 implmentation of 8x8 ADST/DCT
...
This commit makes use of the butterfly structure to enable the sse2
version implementation of 8x8 ADST/DCT hybrid transform coding.
The runtime of hybrid transform module goes down from 1170 cycles
to 245 cycles. Overall speed-up around 1.5%.
Change-Id: Ic808ffd21ece8a9d0410d8c0243d7b6c28ac3b3f
2013-06-24 18:41:33 -07:00
Yaowu Xu
e371cd73a3
change to enable use_largest_txform feature
...
for all regular inter frames at speed 1
Change-Id: I0a8b301273ecf2b8730ab1f6b7a05f89f4d498e0
2013-06-24 16:43:26 -07:00
John Koleszar
ece724ae16
Merge "Remove unused vp9_build_intra_predictors_sb{y,uv}_s"
2013-06-24 15:08:58 -07:00
John Koleszar
ee4a7e4e46
Merge "Remove unused vp9_model_to_full_probs_sb()"
2013-06-24 15:08:54 -07:00
Scott LaVarnway
dfa2ecc3f1
Changed size of mb_mode_context to 8 bits
...
This reduced the size of the MODE_INFO array (mip and prev_mip)
by 425,568 bytes each for 1080p resolutions.
Change-Id: Ifa513ec2d0a49e8ec0867ec90620762fb7f1261d
2013-06-24 17:11:16 -04:00
Ronald S. Bultje
4dc70fa7f9
Don't re-allocate comp_pred buffers for each call to comp motion search.
...
Instead, just allocate a few bytes on the stack, this is 4k, which isn't
all that much.
Change-Id: I82af6ee89e6ed01faaa23ff891ee7ced76df8c16
2013-06-24 14:05:13 -07:00
Yaowu Xu
93f88ab55a
Merge "Fix loopfilter of leftmost 4x4 edges in SB"
2013-06-24 09:55:21 -07:00
John Koleszar
858475a03a
Fix loopfilter of leftmost 4x4 edges in SB
...
For cases where there's no transform set in bit 0 (the left edge of
the SB) but bit 0 of mask_4x4_int is set (the edge 4 pixels from the
left edge needs filtering), it was incorrectly being skipped before.
This situation only happens on the leftmost edge of the image, as
the edge at column 0 is intentionally skipped since there aren't
pixels to the left to read.
Change-Id: Ib2fbbcb40166e90af31b1a0e13b85b68c226cbd3
2013-06-24 08:26:00 -07:00
Ronald S. Bultje
4eb8c56587
Merge "Allocate memory using appropriate expected alignment in unit tests."
2013-06-21 21:22:55 -07:00
Ronald S. Bultje
ac6ea2ab91
Allocate memory using appropriate expected alignment in unit tests.
...
Fixes crashes of test_libvpx on 32-bit Linux.
Change-Id: If94e7628a86b788ca26c004861dee2f162e47ed6
2013-06-21 17:03:57 -07:00
John Koleszar
0c8e13d2f8
Merge "Add some unaligned test vectors"
2013-06-21 16:31:18 -07:00
John Koleszar
9e7019f7df
Remove unused vp9_build_intra_predictors_sb{y,uv}_s
...
The functions no longer referenced.
Change-Id: If2705dfbc607f79ec8ec2242d5e03bec27a35aaf
2013-06-21 16:10:05 -07:00
Ronald S. Bultje
98188e0e82
Merge "Remove emms - that shouldn't be there."
2013-06-21 15:53:25 -07:00
John Koleszar
5c32215e27
Remove unused vp9_model_to_full_probs_sb()
...
This function never referenced.
Change-Id: I1c42cd355bfa88e17d169f7335a44be682af58cc
2013-06-21 15:38:55 -07:00
Dmitry Kovalev
f27f76dfb3
Transforming scale_mv_component_q4 into scale_mv_q4 function.
...
Using MV instead of int_mv for function arguments.
Change-Id: Ic25e13dccbc98fac1fa1b3255127e00cca2a57f6
2013-06-21 15:34:29 -07:00
Ronald S. Bultje
fc033b38ee
Remove emms - that shouldn't be there.
...
Change-Id: I8fcab81e390f93dc17e9666bbf8f77883b5aa897
2013-06-21 14:45:04 -07:00
James Zern
cc774c8bb0
variance_test: use REGISTER_STATE_CHECK
...
Change-Id: Id54ad9a781634f075e990d5bade5be8490959975
2013-06-21 14:30:08 -07:00
Dmitry Kovalev
40141681c0
Removing find_seg_id and using vp9_get_pred_mi_segid instead.
...
Change-Id: Ia40229903c08f14020e90e94cfdf494aba1be827
2013-06-21 13:05:10 -07:00
Ronald S. Bultje
ba42c02654
Add missing SECTION .text marker in assembly file.
...
Fixes a crash on Windows when building with MSVC.
Change-Id: I124ac756a1be55d190fadda5fcc46d23b1445dbf
2013-06-21 12:55:46 -07:00
Ronald S. Bultje
54b2a59623
Implement SSE2 block_error.
...
Change vp9_block_error() to return a 64bit error variable, change all
callers to expect a 64bit return value (this will prevent overflows,
which we basically don't check for at all right now). Remove duplicate
block_error() function, which fixed that through truncation. Remove
old (incompatible) mmx/sse2 block_error SIMD versions and replace with
a new one that returns a 64bit value.
Encoding time of first 50 frames of bus @ 1500kbps goes from 3min29 to
3min23, i.e. a 3% overall speedup.
Change-Id: Ib71ac5508b5ee8a80f1753cd85d72df1629abe68
2013-06-21 12:54:52 -07:00
Ronald S. Bultje
7756e9892b
Merge "Add subtract_block SSE2 version and unit test."
2013-06-21 12:49:50 -07:00
Ronald S. Bultje
9a480482cb
Merge "SSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance()."
2013-06-21 12:49:43 -07:00
Ronald S. Bultje
25c588b1e4
Add subtract_block SSE2 version and unit test.
...
3% faster overall (3min35.0 to 3min28.5).
Change-Id: I5ff8a5c2c91586b6632ca5009ad1ea51ce94af5e
2013-06-21 09:35:37 -07:00
Yaowu Xu
869d770610
Merge "Get some speed back for cpuused 1"
2013-06-20 22:37:01 -07:00
Yaowu Xu
45e25a7814
Get some speed back for cpuused 1
...
and remove unused code.
Change-Id: If380440c4450294b5450b7a9eeb94a376846ec01
2013-06-20 19:05:18 -07:00
Yaowu Xu
61721181ec
Merge "rename variables to avoid build error in MSVC"
2013-06-20 19:04:30 -07:00
Yaowu Xu
ee07a261a0
rename variables to avoid build error in MSVC
...
Change-Id: I7960178c95c54d5c4497e44cfc8c493566294b34
2013-06-20 18:31:48 -07:00
Yaowu Xu
e6cd5ed307
Merge "Implement sse2 and ssse3 versions for all sub_pixel_variance sizes."
2013-06-20 17:42:50 -07:00
Ronald S. Bultje
1e6a32f1af
SSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance().
...
Encoding of bus @ 1500kbps (first 50 frames) goes from 3min57 to
3min35, i.e. approximately a 10.5% speedup. Note that the SIMD versions
which use a bilinear filter (x_offset & 7 || y_offset & 7) aren't
perfectly interleaved, and can probably be improved further in the
future. I've marked this with a few TODOs/FIXMEs in the code.
Change-Id: I5c9e900c0f0d32e431a50fecae213b510b2549f9
2013-06-20 15:59:48 -07:00
Jim Bankoski
84490a1f3d
Merge "clean out libvpx-srcs.txt if built"
2013-06-20 15:10:16 -07:00
Jim Bankoski
975df8c729
clean out libvpx-srcs.txt if built
...
Change-Id: Idfd69e66e8982275eb00d8007a55efd1a4f86a98
2013-06-20 15:05:42 -07:00
James Zern
43d04ef93b
Merge "Revert "test_libvpx: disable pthreads in gtest""
2013-06-20 15:02:27 -07:00
Frank Galligan
c259af4f73
Fix win64 warning.
...
- size_t vs int.
Change-Id: Ib47ebd932a4b69db9f52a43000bb69d0a96b9134
2013-06-20 14:07:11 -07:00
James Zern
f2dc38256d
Revert "test_libvpx: disable pthreads in gtest"
...
This reverts commit 90a9900abb
Seems to break the Mac build:
src/include/gtest/internal/gtest-port.h:1208:: pthread_mutex_lock(&mutex_)failed with error 22
Abort trap: 6
Change-Id: Icbe31161d7c27f1b0a28d33409e7712430bbf0ae
2013-06-20 12:49:15 -07:00
Jingning Han
4f4713b417
Merge "Add unit tests for 4x4 ADST"
2013-06-20 10:22:40 -07:00
Johann
0373e517f7
Merge "Cast value to avoid size_t/int warning on win64"
2013-06-20 10:19:39 -07:00