Christophe Gisquet 4369b9dc7b x86: simple_idct(_put): 10bits versions
Modeled from the prores version. Clips to [0;1023] and is bitexact.
Bitexactness requires to add offsets in different places compared to
prores or C, and makes the function approximately 2% slower.

For 16 frames of a DNxHD 4:2:2 10bits test sequence:

C:    60861 decicycles in idct, 1048205 runs,    371 skips
sse2: 27567 decicycles in idct, 1048216 runs,    360 skips
avx:  26272 decicycles in idct, 1048171 runs,    405 skips

The add version is not implemented, so the corresponding dsp
function is set to NULL to make it clear in a code executing it.

Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
2015-10-13 13:32:21 +02:00
..
2015-08-04 20:13:09 +02:00
2015-02-06 23:35:02 +01:00
2015-08-11 11:12:01 +02:00
2015-08-04 20:13:09 +02:00
2015-08-11 11:12:01 +02:00
2015-08-11 11:12:01 +02:00
2015-08-11 11:12:01 +02:00
2015-08-11 11:12:01 +02:00
2015-08-11 11:12:01 +02:00
2015-08-04 20:13:09 +02:00
2014-12-03 23:03:13 +01:00
2015-08-04 20:13:09 +02:00
2015-08-04 20:13:09 +02:00
2015-08-11 11:12:01 +02:00
2015-08-11 11:12:01 +02:00
2015-08-04 20:13:09 +02:00
2015-08-11 11:12:01 +02:00
2015-08-04 20:13:09 +02:00
2015-08-11 11:12:01 +02:00
2015-08-11 11:12:01 +02:00
2015-08-04 20:13:09 +02:00
2015-09-11 23:27:03 -03:00