Recursive Array Layouts and Fast Parallel Matrix Multiplication (1999)

Cached

Download Links

by Siddhartha Chatterjee , Alvin R. Lebeck , Praveen K. Patnala , Mithuna Thottethodi
Venue:In Proceedings of Eleventh Annual ACM Symposium on Parallel Algorithms and Architectures
Citations:44 - 3 self

Documents Related by Co-Citation

69 Auto-Blocking Matrix-Multiplication or Tracking BLAS3 Performance from Source Code – Jeremy Frens, David S. Wise - 1997
7324 Cache-oblivious algorithms – M FRIGO, C E LEISERSON, H PROKOP, S RAMACHANDRAN - 1999
67 Nonlinear Array Layouts for Hierarchical Memory Systems – Siddhartha Chatterjee, Vibhor V. Jain, Alvin R. Lebeck, Shyam Mundhra, Mithuna Thottethodi - 1999
486 The Cache Performance and Optimizations of Blocked Algorithms – Monica S. Lam, Edward E. Rothberg, Michael E. Wolf - 1991
25 Ahnentafel indexing into Morton-ordered arrays, or matrix locality for free – David S. Wise - 2000
142 I/O complexity: The red-blue pebble game – J W HONG, H T KUNG - 1981
494 The input/output complexity of sorting and related problems. Commun – A AGGARWAL, J S VITTER - 1988
51 Dynamic Partitioning of Non-Uniform Structured Workloads with Spacefilling Curves – John R. Pilkington, Scott B. Baden - 1995
222 Space-Filling Curves – Hans Sagan - 1994
84 Locality Of Reference In Lu Decomposition With Partial Pivoting – Sivan Toledo - 1997
304 Gaussian elimination is not optimal – V STRASSEN - 1969
174 Linear clustering of objects with multiple attributes. A – H V Jagadish - 1990
53 Space–filling curves: Their generation and their application to bandwidth reduction – T Bially - 1969
312 Automatically tuned linear algebra software – R. Clint Whaley, Jack J. Dongarra - 1998
150 Unifying Data and Control Transformations for Distributed Shared-Memory Machines – Michal Cierniak, Wei Li - 1994
34 Tuning Strassen's Matrix Multiplication for Memory Efficiency – Mithuna Thottethodi, Siddhartha Chatterjee, Alvin R. Lebeck - 1998
31 High Performance Fortran for Highly Irregular Problems – Yu Charlie Hu, S. Lennart Johnsson, Shang-Hua Teng, Y. Charlie, Hu S. Lennart, Johnsson Shang--hua Teng - 1996
43 Towards a Theory of Cache-Efficient Algorithms – Sandeep Sen, Siddhartha Chatterjee, Neeraj Dumir - 1999
98 An analysis of dag-consistent distributed shared-memory algorithms – Robert D. Blumofe, Matteo Frigo, Christopher F. Joerg - 1996