vpx/vp9
Ronald S. Bultje 26b6318de8 Make get_coef_context() branchless.
This should significantly speedup cost_coeffs(). Basically what the
patch does is to make the neighbour arrays padded by one item to
prevent an eob check in get_coef_context(), then it populates each
col/row scan and left/top edge coefficient with two times the same
neighbour - this prevents a single/double context branch in
get_coef_context(). Lastly, it populates neighbour arrays in pixel
order (rather than scan order), so we don't have to dereference the
scantable to get the correct neighbours.

Total encoding time of first 50 frames of bus (speed 0) at 1500kbps
goes from 2min10.1 to 2min5.3, i.e. a 2.6% overall speed increase.

Change-Id: I42bcd2210fd7bec03767ef0e2945a665b851df56
2013-07-01 16:34:10 -07:00
..
common Make get_coef_context() branchless. 2013-07-01 16:34:10 -07:00
decoder Make get_coef_context() branchless. 2013-07-01 16:34:10 -07:00
encoder Make get_coef_context() branchless. 2013-07-01 16:34:10 -07:00
exports_dec support building vp8 and vp9 into a single lib 2012-11-15 10:46:17 -08:00
exports_enc support building vp8 and vp9 into a single lib 2012-11-15 10:46:17 -08:00
vp9_common.mk Add Neon optimized loop filter functions. 2013-06-27 16:14:45 -07:00
vp9_cx_iface.c Using vp9 function prefix instead of vp8 inside vp9_cx_iface.c. 2013-05-22 18:04:08 -07:00
vp9_dx_iface.c Quick fix to stop vpxdec infinite loop 2013-06-12 13:46:47 -04:00
vp9_iface_common.h Align frame size to 8 instead of 16. 2013-06-07 17:20:50 -07:00
vp9cx.mk Quantize (64-bit only, for now) SSSE3 SIMD. 2013-07-01 11:36:07 -07:00
vp9dx.mk add Neon optimized add constant residual functions 2013-06-28 19:06:51 -07:00