Reduction of cache coherence overhead by compiler data layout and loop transformation (1992)

by Y Ju, H Dietz
Venue:In Proc. Languages and Compilers for Parallel Computing (LCPC’92), U. Banerjee et al. (Eds.), Lecture Notes in Computer Science