openh264

History

Sindre Aamås 8a0af4a3f2 [Processing/x86] DyadicBilinearDownsample optimizations

Average vertically before horizontally; horizontal averaging is more
worksome. Doing the vertical averaging first reduces the number of
horizontal averages by half.

Use pmaddubsw and pavgw to do the horizontal averaging for a slight
performance improvement.

Minor tweaks.

Improve the SSSE3 dyadic downsample routines and drop the SSE4 routines.
The non-temporal loads used in the SSE4 routines do nothing for cache-
backed memory AFAIK.

Adjust tests because averaging vertically first gives slightly different
output.

~2.39x speedup for the widthx32 routine on Haswell when not memory-bound.
~2.20x speedup for the widthx16 routine on Haswell when not memory-bound.

Note that the widthx16 routine can be unrolled for further speedup.

2016-06-02 13:44:28 +02:00

BaseDecoderTest.cpp

Fix the decoder init failed case in UT

2016-03-14 17:06:58 +08:00

BaseEncoderTest.cpp

adjust encoder test case to cover multi-thread without loadbalancing

2015-12-09 09:58:03 -08:00

c_interface_test.c

add new API as DecodeFrameNoDelay for immediate decoding, which will be recommended decoding method for h.264 bitstream

2014-12-30 23:43:47 -08:00

cpp_interface_test.cpp

add new API as DecodeFrameNoDelay for immediate decoding, which will be recommended decoding method for h.264 bitstream