Loop Fusion in High Performance Fortran (1998)
| Venue: | IN PROCEEDINGS OF THE 1998 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING |
| Citations: | 8 - 2 self |
BibTeX
@INPROCEEDINGS{Roth98loopfusion,
author = {Gerald Roth and Ken Kennedy},
title = {Loop Fusion in High Performance Fortran},
booktitle = {IN PROCEEDINGS OF THE 1998 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING},
year = {1998},
pages = {125--132},
publisher = {ACM Press}
}
OpenURL
Abstract
In this paper we investigate a unique problem associated with fusing loops within a High Performance Fortran (HPF) program. In particular, we discuss the issue of performing loop fusion in an HPF compiler when compiling Fortran90 array assignment statements for execution on a distributedmemory machine. During compilation of an HPF program, Fortran90 array assignment statements must be scalarized into loop nests. We show how a certain class of these loop nests, when fused, can cause problems for the compiler's distributed-memory code generator. We then present an algorithm which not only prevents the fusion of these loops, but also increases the amount of useful fusion that can be performed.







