* without border extrapolation * with aligned write * process 4 pixels per thread in 8u case
41 KiB
41 KiB
* without border extrapolation * with aligned write * process 4 pixels per thread in 8u case