Autotuning and specialization: Speeding up matrix multiply for small matrices with compiler technology (2009)

by Jaewook Shin, Mary W. Hall, Jacqueline Chame, Chun Chen, Paul D. Hovland
Venue:IN THE FOURTH INTERNATIONAL WORKSHOP ON AUTOMATIC PERFORMANCE TUNING