Autotuning and specialization: Speeding up matrix multiply for small matrices with compiler technology (2009)

by Jaewook Shin, Mary W Hall, Jacqueline Chame, Chun Chen, Paul D Hovland
Venue:In The Fourth International Workshop on Automatic Performance Tuning