|
681
|
A set of level 3 basic linear algebra subprograms
– Jack J Dongarra, Jeremy Du Croz, Sven Hammarling, Iain Duff
- 1990
|
|
60
|
Matrix algorithms on a hypercube i: Matrix multiplication. Parallel Computing 3
– G Fox, S Otto, A Hey
- 1987
|
|
19
|
The Multicomputer Toolbox: Scalable Parallel Libraries for Large-Scale Concurrent Applications
– Anthony Skjellum, Chuck Baldwin
- 1994
|
|
15
|
The Data-Distribution-Independent Approach to Scalable Parallel Libraries
– Purushotham V. Bangalore, Purushotham V. Bangalore, Anthony Skjellum, Edwin Ellis, Clayborne D. Taylor, Richard D. Koshel
- 1995
|
|
397
|
LAPACK’s user’s guide
– E ANDERSON, Z BAI, C BISCHOF, J DEMMEL, J DONGARRA, J D CROZ, A GREENBAUM, S HAMMARLING, A MCKENNEY, S OSTROUCHOV, D SORENSEN
- 1992
|
|
57
|
PUMMA: Parallel Universal Matrix Multiplication Algorithms on Distributed Memory Concurrent Computers
– Jaeyong Choi, Jack J. Dongarra, David W. Walker
- 1993
|
|
32
|
de Geijn. Parallel implementation of BLAS: General techniques for level 3 BLAS
– Almadena Chtchelkanova, John Gunnels, Greg Morrow, James Overfelt, Robert Van
- 1995
|
|
34
|
PB-BLAS: A Set of Parallel Block Basic Linear Algebra Subprograms
– J CHOI, J DONGARRA, D WALKER
- 1993
|
|
28
|
Block-Cyclic Dense Linear Algebra
– Woody Lichtenstein, S. Lennart Johnsson
- 1992
|
|
480
|
Basic linear algebra subprograms for FORTRAN usage
– C Lawson, R Hanson, D Kincaid, F Krogh
- 1979
|
|
73
|
A Proposal for a Set of Parallel Basic Linear Algebra Subprograms, , LAPACK Working Note #100
– J Choi, J Dongarra, S Ostrouchov, A Petitet, D Walker, R Whaley
- 1995
|
|
39
|
A high performance matrix multiplication algorithm on a distributedmemory parallel computer, using overlapped communication
– R Agarwal, F Gustavson, M Zubair
- 1994
|
|
409
|
An Extended Set of Fortran Basic Linear Algebra Subprograms
– Jack J. Dongarra, Jeremy Du Croz, Sven Hammarling, Richard J. Hanson
- 1986
|
|
30
|
A Three-Dimensional Approach to Parallel Matrix Multiplication
– R.C. Agarwal, S. M. Balle, F. G. Gustavson, M. Joshi, P. Palkar
- 1995
|
|
13
|
ªThe Parallelization of Level 2
– M Aboelaze, N Chrisochoides, E Houstis
- 1991
|
|
58
|
Summa: Scalable universal matrix multiplication algorithm
– Robert A. Van De Geijn, Jerrell Watts
- 1997
|
|
12
|
der Vorst. Parallel Triangular System Solving on a mesh network of Transputers
– R Bisseling, J van
- 1991
|
|
43
|
ªA New Method for Solving Triangular Systems on Distributed-Memory Message-Passing Multiprocessor,º
– G Li, T Coleman
- 1989
|
|
30
|
A parallel triangular solver for a distributed-memory multiprocessor
– G Li, T F Coleman
- 1986
|