Streamlining GPU Applications on the Fly: Thread Divergence Elimination through Runtime Thread-Data Mapping (2010)

by E Z Zhang, Y Jiang, Z Guo, X Shen
Venue:In: Proceedings of the 24 th ACM International Conference on Supercomputing (ICS 2010