8624d18ca5
* without border extrapolation * with aligned write * process 4 pixels per thread in 8u case
* without border extrapolation * with aligned write * process 4 pixels per thread in 8u case