Automatically tuned linear algebra software (1998)

by R. Clint Whaley , Jack J. Dongarra
Venue:CONFERENCE ON HIGH PERFORMANCE NETWORKING AND COMPUTING
Citations:372 - 31 self

Documents Related by Co-Citation

227 Optimizing Matrix Multiply using PHiPAC: a Portable, High-Performance, ANSI C Coding Methodology – Jeff Bilmes, Krste Asanovic , Chee-Whye Chin , Jim Demmel - 1996
449 FFTW: An Adaptive Software Architecture For The FFT – Matteo Frigo, Steven G. Johnson - 1998
153 A Fast Fourier Transform Compiler – Matteo Frigo - 1999
202 Tile Size Selection Using Cache Organization and Data Layout – Stephanie Coleman, Kathryn S. M Kinley - 1995
742 A set of level 3 basic linear algebra subprograms – J DONGARRA, J DUCROZ, I S DUFF, S HAMMARLING - 1990
509 The Cache Performance and Optimizations of Blocked Algorithms – Monica S. Lam, Edward E. Rothberg, Michael E. Wolf - 1991
42 Iterative Compilation in Program Optimization – T. Kisuki, P.M.W. Knijnenburg, M.F.P. O'Boyle, H. A. G. Wijshoff - 2000
133 Combining Loop Transformations Considering Caches and Scheduling – M E Wolf, D E Maydan, D-K Chen - 1998
319 Computational Frameworks for the Fast Fourier Transform – Charles van Loan - 1992
532 Basic linear algebra subprograms for fortran usage – C L Lawson, R J Hanson, D R Kincaid, F T Krogh - 1979
293 Improving Data Locality with Loop Transformations – Kathryn S. McKinley, Steve Carr, Chau-Wen Tseng - 1996
1534 The Grid: Blueprint for a New Computing Infrastructure – I Foster, C Kesselman - 1999
82 SPL: A Language and Compiler for DSP Algorithms – Jianxin Xiong, Jeremy Johnson, Robert Johnson, David Padua - 2001
705 A data locality optimizing algorithm – Michael E. Wolf, Monica S. Lam - 1991
3973 Computer Architecture: A Quantitative Approach, 3 rd ed – J L Hennessy, D A Patterson, D Goldberg - 2002
91 Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation – T. Kisuki, P.M.W. Knijnenburg - 2000
14 Automatic implementation of FFT algorithms – L AUSLANDER, J R JOHNSON, R W JOHNSON - 1996
83 Algorithms for Discrete Fourier Transform and Convolution – R Tolimieri, M An, C Lu - 1997
78 A methodology for designing, modifying, and implementing Fourier transform algorithms on various architectures – J. Johnson, R. W. Johnson, D. Rodriguez, R. Tolimieri - 1990