Part 2 shows how to optimize DSP “kernels,” i.e., inner loops. It also shows how to write fast floating-point and fractional code. Part 3 explains how to access DSP features like circular addressing ...