fffac2f085
Inlining and customizing sampling functions to reduce memory traffic and compute Improve calcOrientation implementation. Using more efficient rounding routines. Removing unnecessary use of local memory