Enabling and scaling matrix computations on heterogeneous multi-core and multi-GPU systems (2012)

by Fengguang Song, Stanimire Tomov, Jack Dongarra
Venue: In Proceedings of the 26th ACM international conference on Supercomputing, ICS ’12