Optimizing matrix multiply using phipac: a portable, high-performance, ansi c coding methodology (1997)

by J Bilmes, K Asanovic, C-W Chin, J Demmel
Venue:in: ICS ’97: Proceedings of the 11th international conference on Supercomputing, ACM