Christophe Gisquet 996697e266 x86: float dsp: unroll SSE versions
vector_fmul and vector_fmac_scalar are guaranteed that they can process
elements in batches of 16, but their SSE versions only do 8 at a time.

Therefore, unroll them a bit.
This goes from 299 to 261 cycles for 256 elements in vector_fmac_scalar on
Arrandale/Win64.
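
As a rough illustration of the unrolling described above, here is a C sketch
using SSE intrinsics. It is not the actual assembly from this commit: the
function name fmac_scalar_unrolled is hypothetical, unaligned loads are used
so the sketch makes no alignment assumption, and a scalar fallback is included
for builds without SSE. It assumes len is a positive multiple of 16.

```c
#include <stddef.h>
#if defined(__SSE__)
#include <xmmintrin.h>
#endif

/* Hypothetical sketch: dst[i] += src[i] * mul for i in [0, len).
 * Assumes len is a positive multiple of 16. */
static void fmac_scalar_unrolled(float *dst, const float *src,
                                 float mul, int len)
{
#if defined(__SSE__)
    const __m128 m = _mm_set1_ps(mul);   /* broadcast mul to all 4 lanes */
    for (int i = 0; i < len; i += 16) {
        /* Four 4-float registers per iteration, so 16 elements per pass
         * instead of 8. */
        __m128 p0 = _mm_mul_ps(_mm_loadu_ps(src + i),      m);
        __m128 p1 = _mm_mul_ps(_mm_loadu_ps(src + i + 4),  m);
        __m128 p2 = _mm_mul_ps(_mm_loadu_ps(src + i + 8),  m);
        __m128 p3 = _mm_mul_ps(_mm_loadu_ps(src + i + 12), m);
        _mm_storeu_ps(dst + i,      _mm_add_ps(_mm_loadu_ps(dst + i),      p0));
        _mm_storeu_ps(dst + i + 4,  _mm_add_ps(_mm_loadu_ps(dst + i + 4),  p1));
        _mm_storeu_ps(dst + i + 8,  _mm_add_ps(_mm_loadu_ps(dst + i + 8),  p2));
        _mm_storeu_ps(dst + i + 12, _mm_add_ps(_mm_loadu_ps(dst + i + 12), p3));
    }
#else
    /* Scalar fallback with the same result, for targets without SSE. */
    for (int i = 0; i < len; i++)
        dst[i] += src[i] * mul;
#endif
}
```

Doing 16 elements per pass halves the number of loop-counter updates and
branches compared with an 8-element body, which is the likely source of the
saving; the sketch itself makes no claim about cycle counts.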

Signed-off-by: Janne Grunau <janne-libav@jannau.net>
2014-02-20 14:18:05 +01:00