11 Commits

Author SHA1 Message Date
cduvivier@google.com
9d94116697 Optimization of 'rftbsub':
* scalar optimization, vectorization.
* 0.5% AEC overall speedup for the straight C path.
* 2.8% AEC overall speedup for the SSE2 path.
Review URL: http://webrtc-codereview.appspot.com/48008

git-svn-id: http://webrtc.googlecode.com/svn/trunk@137 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-30 19:19:37 +00:00
leozwang@google.com
8ec2231979 Add aec_rdft.c to android build
Review URL: http://webrtc-codereview.appspot.com/58001

git-svn-id: http://webrtc.googlecode.com/svn/trunk@136 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-30 18:34:09 +00:00
cduvivier@google.com
20cb6b684b Optimization of 'rftfsub':
* scalar optimization, vectorization (including new file for SSE2 code
  and path selection mechanism).
* 0.5% AEC overall speedup for the straight C path.
* 3.0% AEC overall speedup for the SSE2 path.
Review URL: http://webrtc-codereview.appspot.com/46005

git-svn-id: http://webrtc.googlecode.com/svn/trunk@134 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-30 01:22:19 +00:00
cduvivier@google.com
181f543de4 AEC specific version of " Real Discrete Fourier Transform".
Lots of AEC CPU usage is coming from calls to 'rdft'. To optimize this,
deep changes (modification of memory layout, ...) have to be done and it
is not practical to do them in an utility library. Most of these changes
will occur in subsequent CLs.

The new file 'aec_core_rdft.c' is a copy of 'modules/audio_processing/
utility/fft4g.c' whose size has been significantly reduced by removing
all code non-necessary to compute rdft. The main entry point and utility
functions have also been modified to take into account the fact that all
'rdft' calls performed by AEC have a length of 128. This yields:
* 1.8% AEC overall speedup for the straight C path.
* 2.3% AEC overall speedup for the SSE2 path.
Review URL: http://webrtc-codereview.appspot.com/44008

git-svn-id: http://webrtc.googlecode.com/svn/trunk@126 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-24 18:22:47 +00:00
cduvivier@google.com
5af7a804ea Optimization of "overdrive and suppress":
* float accuracy pow function, vectorized pow approximation, general
  vectorization.
* 10.2% AEC overall speedup for the straight C path.
* 16.1% AEC overall speedup for the SSE2 path.
Review URL: http://webrtc-codereview.appspot.com/24016

git-svn-id: http://webrtc.googlecode.com/svn/trunk@72 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-13 18:56:48 +00:00
leozwang@google.com
0b0c28c495 add android makefile, some modification in vpx makefile to build encoder from c source for now
Review URL: http://webrtc-codereview.appspot.com/29012

git-svn-id: http://webrtc.googlecode.com/svn/trunk@50 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-07 17:24:39 +00:00
cduvivier@google.com
a4f6303c5d Vectorization of "FilterAdaptation":
* 1.0% AEC overall speedup for straight C path.
* 6.2% AEC overall speedup for SSE2 path.
* fix warnings, make code compile with "-std=gnu89
-Wstrict-prototypes -Wold-style-definition -Wmissing-prototypes
-Wmissing-declarations -Wdeclaration-after-statement -Wextra -Wall
-Werror"
Review URL: http://webrtc-codereview.appspot.com/24012

git-svn-id: http://webrtc.googlecode.com/svn/trunk@38 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-02 23:50:06 +00:00
cduvivier@google.com
936b36dbf6 Partial vectorization of "ProcessBlock":
* new file for SSE2 code, code selection through function pointers.
* structure change for array of complex numbers.
* 3.8% AEC overall speedup for straight C path.
* 8.8% AEC overall speedup for SSE2 path.
Review URL: http://webrtc-codereview.appspot.com/34002

git-svn-id: http://webrtc.googlecode.com/svn/trunk@36 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-02 01:38:10 +00:00
niklase@google.com
9ed826feea Review URL: http://webrtc-codereview.appspot.com/29009
git-svn-id: http://webrtc.googlecode.com/svn/trunk@27 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-01 07:29:32 +00:00
cduvivier@google.com
d357f2ca3b Partial vectorization of "ProcessBlock":
* new file for SSE2 code, code selection through function pointers.
* structure change for array of complex numbers.
* 3.8% AEC overall speedup for straight C path.
* 8.8% AEC overall speedup for SSE2 path.
Review URL: http://webrtc-codereview.appspot.com/33003

git-svn-id: http://webrtc.googlecode.com/svn/trunk@26 4adac7df-926f-26a2-2b94-8c16560cd09d
2011-06-01 01:20:06 +00:00
niklase@google.com
77ae29bc81 git-svn-id: http://webrtc.googlecode.com/svn/trunk@4 4adac7df-926f-26a2-2b94-8c16560cd09d 2011-05-30 11:22:19 +00:00