Recursive Array Layouts and Fast Parallel Matrix Multiplication (1999)

by Siddhartha Chatterjee, Alvin R. Lebeck, Praveen K. Patnala, Mithuna Thottethodi
Venue: In Proceedings of the Eleventh Annual ACM Symposium on Parallel Algorithms and Architectures
Citations: 48 (4 self)

Active Bibliography

31 Recursive Array Layouts and Fast Matrix Multiplication – Siddhartha Chatterjee, Alvin R. Lebeck, Praveen K. Patnala, Mithuna Thottethodi - 1999
72 Nonlinear Array Layouts for Hierarchical Memory Systems – Siddhartha Chatterjee, Vibhor V. Jain, Alvin R. Lebeck, Shyam Mundhra, Mithuna Thottethodi - 1999
26 An Overview of Cache Optimization Techniques and Cache-Aware Numerical Algorithms – Markus Kowarschik, Christian Weiß - 2003
Analyzing the Behavior of Loop Nests in . . . – Erin Parker - 2004
4 Guiding Program Transformations with Modal Performance Models – Nicholas Matthew Mitchell, Professor Larry Carter - 2000
Alternative Array Storage Layouts for Regular Scientific Programs – Thiyagalingam Jeyarajan - 2005
8 Array Restructuring for Cache Locality – Shun-Tak A. Leung - 1996
38 Tuning Strassen's Matrix Multiplication for Memory Efficiency – Mithuna Thottethodi, Siddhartha Chatterjee, Alvin R. Lebeck - 1998
23 Cache-Efficient Matrix Transposition – Siddhartha Chatterjee, Sandeep Sen
Efficient Data . . . Multidimensional Array Operations Based on the EKMR Scheme for Distributed Memory Multicomputers – Chun-yuan Lin, Yeh-ching Chung, Jen-shiuh Liu - 2003
89 Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings – John Mellor-Crummey, David Whalley, Ken Kennedy - 2001
6 Adaptive Winograd’s Matrix Multiplications – Paolo D'Alberto, Alexandru Nicolau - 2008
2 Exploiting Parallelism in Matrix-Computation Kernels for Symmetric Multiprocessor Systems: Matrix-Multiplication and Matrix-Addition Algorithm Optimizations by Software Pipelining and Threads Allocation – Paolo D’Alberto, Marco Bodrato, Alexandru Nicolau - 2011
Exploiting Parallelism in Matrix-Computation Kernels for Symmetric Multiprocessor Systems: Matrix-Multiplication and Matrix-Addition Algorithm Optimizations by Software Pipelining and Threads Allocation – Paolo D’Alberto, Marco Bodrato
23 Locality Optimizations for Multi-Level Caches – Gabriel Rivera, Chau-wen Tseng - 1999
Experiments with Data Layouts – M. Kandemir, A. Choudhary, N. Shenoy, P. Banerjee, J. Ramanujam - 1997
Sequential decomposition of operations and compilers optimization – Mumtaz Ahmad, Serge Burckel - 2009
4 How to write fast numerical code: A Small Introduction – Srinivas Chellappa, Franz Franchetti, Markus Püschel - 2008
23 Self Adapting Software for Numerical Linear Algebra and LAPACK for Clusters – Zizhong Chen, Jack Dongarra, Piotr Luszczek, Kenneth Roche - 2003