Abstract
In this paper we discuss architecture-specific performance tuning for fast Fourier transforms (FFTs) implemented in the UHFFT library, an adaptive and portable software library for FFTs developed by the authors. We present the optimization methods used at different levels, starting with the algorithm selection used for library code generation and ending with the actual implementation and the specification of appropriate compiler optimization options. We report performance results for several modern microprocessor architectures.
Original language | English (US) |
---|---|
Pages (from-to) | 47-64 |
Number of pages | 18 |
Journal | International Journal of High Performance Computing Applications |
Volume | 18 |
Issue number | 1 |
State | Published - 2004 |
Externally published | Yes |
Keywords
- Automatic performance tuning
- Discrete Fourier transform (DFT)
- Fast Fourier transform (FFT)
- Software libraries
ASJC Scopus subject areas
- Software
- Theoretical Computer Science
- Hardware and Architecture