Abstract
Modern Single Instruction Multiple Data (SIMD) microprocessor architectures allow parallel floating-point operations over four contiguous elements in memory. The radix-2 FFT algorithm is well suited to these architectures after the second stage (decimation-in-time case). In this paper, a general radix-2 FFT algorithm is developed for modern SIMD architectures. This algorithm (SIMD-FFT) is implemented on the Intel Pentium and Motorola PowerPC architectures for 1D and 2D transforms. The results are compared against Intel's implementation of the split-radix FFT for the SIMD architecture [2] and against FFTW [3]. Overall, the SIMD-FFT was found to be faster than the other two implementations both for complex 1D input data (ranging from 95.9% up to 372%) and for complex 2D input data (ranging from 68.8% up to 343%).
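The remark that radix-2 decimation-in-time suits four-wide SIMD "after the second stage" can be illustrated with a plain scalar sketch. This is not the paper's SIMD-FFT. It is a textbook iterative radix-2 DIT FFT, and the function names (`fft_radix2_dit`, `contiguous_run_per_stage`) are hypothetical, chosen only for this illustration. In stage s, each butterfly group pairs `half = 2**(s-1)` contiguous elements with the next `half` elements, so from the third stage onward every inner loop covers at least four contiguous elements, which is the width the abstract attributes to the SIMD units.

```python
import cmath


def bit_reverse_permute(x):
    """Return a copy of x reordered by bit-reversed index (len(x) must be a power of two)."""
    n = len(x)
    bits = n.bit_length() - 1
    out = [0j] * n
    for i in range(n):
        r = 0
        v = i
        for _ in range(bits):
            r = (r << 1) | (v & 1)
            v >>= 1
        out[r] = x[i]
    return out


def fft_radix2_dit(x):
    """Scalar iterative radix-2 decimation-in-time FFT (illustrative sketch only)."""
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    a = bit_reverse_permute([complex(v) for v in x])
    half = 1  # butterflies per group; doubles every stage
    while half < n:
        step = 2 * half
        for start in range(0, n, step):
            # The k-loop touches a[start : start+half] and a[start+half : start+step],
            # two contiguous runs of `half` elements. Once half >= 4 (stage 3 onward),
            # four consecutive k values could map onto one four-wide SIMD operation.
            for k in range(half):
                w = cmath.exp(-2j * cmath.pi * k / step)
                t = w * a[start + k + half]
                u = a[start + k]
                a[start + k] = u + t
                a[start + k + half] = u - t
        half *= 2
    return a


def contiguous_run_per_stage(n):
    """Length of the contiguous run handled by each stage's inner loop."""
    runs = []
    half = 1
    while half < n:
        runs.append(half)
        half *= 2
    return runs
```

For a 16-point transform, `contiguous_run_per_stage(16)` gives the run lengths 1, 2, 4, 8: the first two stages work on runs shorter than four elements, and every later stage works on runs that are a multiple of four.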
Original language | English |
---|---|
Pages (from-to) | III/3220-III/3223 |
Publication | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
Volume | 3 |
Status | Published - 2002 |
Published externally | Yes |
Event | 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing - Orlando, FL, United States. Duration: 13 May 2002 → 17 May 2002 |