The Cache Performance and Optimizations of Blocked Algorithms (1991)

by Monica S. Lam , Edward E. Rothberg , Michael E. Wolf
Venue:In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems
Citations:508 - 5 self

Active Bibliography

1 TOLERATING LATENCY THROUGH SOFTWARE-CONTROLLED DATA PREFETCHING – Albert Macovksi - 1994
467 Design and Evaluation of a Compiler Algorithm for Prefetching – Todd C. Mowry, Monica S. Lam, Anoop Gupta - 1992
89 Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings – John Mellor-crummey, David Whalley, Ken Kennedy - 2001
8 Informing Loads: Enabling Software To Observe And React To Memory Behavior – Mark Horowitz, Margaret Martonosi, Todd C. Mowry, Michael D. Smith - 1995
4 Dynamic Access Ordering for Symmetric Shared-Memory Multiprocessors – Sally A. McKee - 1994
31 A Matrix-Based Approach to Global Locality Optimization – Mahmut Kandemir, Alok Choudhary, J. Ramanujam, Prith Banerjee - 1998
50 Memory-Hierarchy Management – Steve Carr - 1994
8 A Blocked All-Pairs Shortest-Paths Algorithm – Gayathri Venkataraman , Sartaj Sahni, Srabani Mukhopadhyaya - 2003
112 The Uniform Memory Hierarchy Model of Computation – Bowen Alpern, Larry Carter, Ephraim Feig, Ted Selker - 1992
56 Informing Memory Operations: Providing Memory Performance Feedback in Modern Processors – Mark Horowitz, Margaret Martonosi, Todd C. Mowry, Michael D. Smith - 1996
Memory Latency Rediction via Data Prefetching and Data Forwarding in Shared Memory Multiprocessors – David Kristian Poulsen, David Kristian Poulsen, Ph. D - 1994
15 Data Access Reorganizations in Compiling Out-of-core Data Parallel Programs on Distributed Memory Machines – Rajesh Bordawekar, Alok Choudhary, Rajeev Thakur - 1994
38 Automatic and Interactive Parallelization – Kathryn S. McKinley, Kathryn S. Mckinley, Kathryn S. Mckinley - 1994
94 Optimizing for Parallelism and Data Locality – Ken Kennedy, Kathryn S. M Kinley - 1992
Experiments with Data Layouts – M. Kandemir, A. Choudhary, A. Choudhary, N. Shenoy, N. Shenoy, P. Banerjee, P. Banerjee, J. Ramanujam, J. Ramanujam - 1997
40 Access Order and Memory-Conscious Cache Utilization – Sally A. Mckee, Wm. A. Wulf - 1995
Sequential decomposition of operations and compilers optimization – Mumtaz Ahmad, Serge Burckel, Thème Sym, Mumtaz Ahmad, Serge Burckel, Thème Sym Systèmes Symboliques - 2009
14 Informing Memory Operations: Memory Performance Feedback Mechanisms and Their Applications – Mark Horowitz, Margaret Martonosi, Todd C. Mowry, Michael D. Smith - 1998
138 Practical Dependence Testing – Gina Goff, Ken Kennedy, Chau-Wen Tseng - 1991