This paper investigates the implementation of the Bruun FFT on the TMS320C30. It presents effective implementation techniques for butterfly computation,
loop control and data transfer. The resulting program runs faster than radix-2