Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology (1997)

by J Bilmes, K Asanović, C W Chin, J Demmel
Venue:In Proceedings of the International Conference on Supercomputing