Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA (2008)

by S Ryoo, C I Rodrigues, S S Baghsorkhi, S S Stone, D B Kirk, W M W Hwu
Venue: In Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP ’08)