Automatically tuned linear algebra software (1998)

by R. Clint Whaley, Jack J. Dongarra
Venue: Conference on High Performance Networking and Computing
Citations: 312 - 31 self

Documents Related by Co-Citation

199 Optimizing Matrix Multiply using PHiPAC: a Portable, High-Performance, ANSI C Coding Methodology – Jeff Bilmes, Krste Asanovic, Chee-Whye Chin, Jim Demmel - 1996
372 FFTW: An Adaptive Software Architecture For The FFT – Matteo Frigo, Steven G. Johnson - 1998
129 A Fast Fourier Transform Compiler – Matteo Frigo - 1999
681 A set of level 3 basic linear algebra subprograms – Jack J Dongarra, Jeremy Du Croz, Sven Hammarling, Iain Duff - 1990
486 The Cache Performance and Optimizations of Blocked Algorithms – Monica S. Lam, Edward E. Rothberg, Michael E. Wolf - 1991
193 Tile Size Selection Using Cache Organization and Data Layout – Stephanie Coleman, Kathryn S. McKinley - 1995
677 A data locality optimizing algorithm – Michael E. Wolf, Monica S. Lam - 1991
37 Iterative Compilation in Program Optimization – T. Kisuki, P.M.W. Knijnenburg, M.F.P. O'Boyle, H. A. G. Wijshoff - 2000
116 Combining loop transformations considering caches and scheduling – Michael E Wolf, Dror E Maydan, Ding-Kai Chen - 1996
480 Basic linear algebra subprograms for FORTRAN usage – C Lawson, R Hanson, D Kincaid, F Krogh - 1979
285 Computational Frameworks for the Fast Fourier Transform – Van Loan - 1992
275 Improving Data Locality with Loop Transformations – Kathryn S. McKinley, Steve Carr, Chau-Wen Tseng - 1996
1324 The Grid: Blueprint for a New Computing Infrastructure – I Foster, C Kesselman - 1999
67 SPL: A Language and Compiler for DSP Algorithms – Jianxin Xiong, Jeremy Johnson, Robert Johnson, David Padua - 2001
16 Load Balancing And Data Locality Via Fractiling: An Experimental Study – Susan Flynn Hummel, Ioana Banicescu, Chui-Tzu Wang, Joel Wein - 1995
168 More Iteration Space Tiling – M Wolfe - 1989
251 Evaluating Associativity in CPU Caches – M Hill, A Smith - 1989
304 Gaussian elimination is not optimal – V Strassen - 1969
3637 Computer Architecture: A Quantitative Approach, Fourth edition – J L Hennessy, D A Patterson - 2007