Auto-Blocking Matrix-Multiplication or Tracking BLAS3 Performance from Source Code (1997)

by Jeremy Frens, David S. Wise
Venue: In Proceedings of the Sixth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming