Towards optimal multi-level tiling for stencil computations (2007)

Cached

Download Links

by Lakshminarayanan Renganarayana , Manjukumar Harthikote-matha , Rinku Dewri , Sanjay Rajopadhye
Venue:21st IEEE International Parallel and Distributed Processing Symposium (IPDPS
Citations:10 - 0 self

Active Bibliography

1 On the Scalability of Loop Tiling Techniques – David G. Wonnacott, Michelle Mills Strout
14 Sparse Tiling for Stationary Iterative Methods – Michelle Mills Strout, Larry Carter, Jeanne Ferrante, Barbara Kreaseck - 2004
MINES ParisTech – Spécialité Informatique, Ramakrishna Upadrasta, Président Prof, Florent Hivert, Université Paris-sud, Prof François Irigoin, Prof Christian Lengauer, Universität Passau, Prof Cédric Bastoul, Prof Sanjay Rajopadhye, Université Paris-sud - 2013
23 Locality Optimizations for Multi-Level Caches – Gabriel Rivera, Chau-wen Tseng - 1999
4 A Performance Study for Iterative Stencil Loops on GPUs with Ghost Zone Optimizations – Jiayuan Meng, Kevin Skadron, Jiayuan Meng, Kevin Skadron
94 Loop Parallelization in the Polytope Model – Christian Lengauer - 1993
3 A Step Towards Unifying Schedule and Storage Optimization – William Thies, Frédéric Vivien
3 Time-Minimal Tiling When Rise is Larger Than Zero – Jingling Xue, Wentong Cai - 2002
Smashing: Folding Space to Tile through Time – Nissa Osheim, Michelle Mills Strout, Dave Rostron, Sanjay Rajopadhye
(Or, Can Adding Scalable Locality to Distributed Shared Memory Yield SuperComputer Power?) – Tim Douglas, Sharon Warner, David G. Wonnacott - 2009
6 Parameterized Tiled Loops for Free – Lakshminarayanan Renganarayanan, DaeGon Kim, Sanjay Rajopadhye, Michelle Mills Strout - 2007
5 Pipelined Scheduling of Tiled Nested Loops onto Clusters of SMPs using Memory Mapped Network Interfaces – Maria Athanasaki, Evangelos Koukis, Nectarios Koziris - 2002
25 Selecting Tile Shape for Minimal Execution Time – Karin Hogstedt, Larry Carter, Jeanne Ferrante - 1999
7 Code tiling for improving the cache performance of pde solvers – Qingguang Huang, Jingling Xue - 2003
10 Generating Efficient Tiled Code for Distributed Memory Machines – Peiyi Tang - 2000
Adobe Systems Incorporated – Bin Bao, Chen Ding
3 Reuse-Driven Tiling for Improving Data Locality – Jingling Xue, Chua-Huang Huang - 1998
Optimal task scheduling at run time to exploit intra-tile parallelism – Fabric Rastello , Amit Rao , Santosh Pande - 2003
37 Communication-Minimal Tiling of Uniform Dependence Loops – Jingling Xue - 1996