Reducing Software Overheads in Parallel Linear Algebra Libraries

by Peter E. Strazdins
Citations:11 - 8 self

Active Bibliography

3 OPTIMAL LOAD BALANCING TECHNIQUES FOR BLOCK-CYCLIC DECOMPOSITIONS FOR MATRIX FACTORIZATION – Peter Strazdins
1 Load Balance and Communication Tradeoffs in Parallel Matrix Factorization. – Peter Strazdins
21 LOOKAHEAD AND ALGORITHMIC BLOCKING TECHNIQUES COMPARED FOR PARALLEL MATRIX FACTORIZATION – Peter E. Strazdins
IN – Antoine P. Petitet, Jack J. Dongarra
10 Execution Time of Symmetric Eigensolvers – Kendall Swenson Stanley - 1997
7 Transporting Distributed BLAS to the Fujitsu AP3000 and VPP-300 – Peter E. Strazdins - 1998
DISTRIBUTED BLAS User's Guide – Australian National
5 A High Performance Version of Parallel LAPACK: Preliminary Report – Peter Strazdins, Hari Koesmanro - 1996
22 Algorithmic redistribution methods for block cyclic decompositions – Antoine Petitet - 1996
13 A High Performance, Portable Distributed BLAS Implementation – Peter Strazdins - 1996
2 A Dense Complex Symmetric Indefinite Solver for the Fujitsu AP3000 – Peter E. Strazdins - 1999
Project Descrition: Achieving Optimum Performance for Dense Linear Algebra Computations on Parallel Computers: Extending ScaLAPACK for Distributed Panels – P.E. Strazdins - 1996
8 A Parallel Algorithm for Householder Tridiagonalization – Christopher Smith, Bruce Hendrickson, Elizabeth Jessup - 1994
1 A Study on Parallel Implementation of Large Scale Eigenproblem Solver for Distributed Memory Architecture Parallel Machines – Takahiro Katagiri - 1998
4 Load Balancing Strategies for Dense Linear Algebra Kernels on Heterogeneous Two-dimensional Grids – Olivier Beaumont, Vincent Boudet, Fabrice Rastello, Yves Robert - 2000
5 Algorithms and Tools for (Distributed) Heterogeneous Computing: A Prospective Report – J. F. Mehaut, Y. Robert, Unite Mixte, Recherche Cnrs-inria-ens Lyon - 1999
16 A Parallel Divide and Conquer Algorithm for the Symmetric Eigenvalue Problem on Distributed Memory Architectures – Françoise Tisseur, Jack Dongarra - 1999
Block Size Selection of Parallel LU and QR on PVP-based and RISC-based Supercomputers – Yunquan Zhang, Ying Chen - 2004
31 Scheduling block-cyclic array redistribution – Frederic Desprez, Jack Dongarra, Antoine Petitet, Cyril R, Yves Robert - 1998