Recursive Array Layouts and Fast Parallel Matrix Multiplication (1999)

Cached

Download Links

by Siddhartha Chatterjee , Alvin R. Lebeck , Praveen K. Patnala , Mithuna Thottethodi
Venue:In Proceedings of Eleventh Annual ACM Symposium on Parallel Algorithms and Architectures
Citations:48 - 4 self

Documents Related by Co-Citation

76 Auto-Blocking Matrix-Multiplication or Tracking BLAS3 Performance from Source Code – Jeremy Frens, David S. Wise - 1997
8557 R.: Introduction to Algorithms – T Cormen, C Leiserson, Rivest - 1990
72 Nonlinear Array Layouts for Hierarchical Memory Systems – Siddhartha Chatterjee, Vibhor V. Jain, Alvin R. Lebeck, Shyam Mundhra, Mithuna Thottethodi - 1999
512 The Cache Performance and Optimizations of Blocked Algorithms – Monica S. Lam, Edward E. Rothberg, Michael E. Wolf - 1991
26 Ahnentafel indexing into Morton-ordered arrays, or matrix locality for free – David S. Wise - 2000
168 I/O complexity: the red-blue pebble game – Jia-Wei Hong, H T Kung - 1981
546 The input/output complexity of sorting and related problems – A Aggarwal, J S Vitter - 1988
96 Locality Of Reference In Lu Decomposition With Partial Pivoting – Sivan Toledo - 1997
56 Dynamic Partitioning of Non-Uniform Structured Workloads with Spacefilling Curves – John R. Pilkington, Scott B. Baden - 1995
272 Space-Filling Curves – Hans Sagan - 1994
105 An analysis of dag-consistent distributed shared-memory algorithms – Robert D. Blumofe, Matteo Frigo, Christopher F. Joerg - 1996
2452 The Design and Analysis of Computer Algorithms – A V Aho, J E Hopcroft, J D Ullman - 1974
381 Gaussian Elimination is Not Optimal – V Strassen - 1969
185 Linear clustering of objects with multiple attributes – H V Jagadish - 1990
58 Space-Filling Curves: Their Generation and Their Application to Bandwidth Reduction – T Bially - 1969
372 Automatically tuned linear algebra software – R. Clint Whaley, Jack J. Dongarra - 1998
958 Advanced Compiler Design and Implementation – Steven S Muchnick - 1997
152 Unifying Data and Control Transformations for Distributed Shared-Memory Machines – Michal Cierniak, Wei Li - 1994
38 Tuning Strassen's Matrix Multiplication for Memory Efficiency – Mithuna Thottethodi, Siddhartha Chatterjee, Alvin R. Lebeck - 1998