Fall back on the C version of the algorithm here, just as in McCopy_sse2. This fixes 2 of the McCopy tests on arm/aarch64.