Introduction to ARM64 NEON assembly
This article was written back in 2013, right after Apple released ARM64-based iPhones and iPads.
If you own a somewhat recent iPhone or iPad, you already own a shiny ARM64 CPU to play with.
Let’s start with a trivial operation: adding two vectors of 32-bits floats.
auto add_to(float *pDst, const float *pSrc, long…