Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on gpu. (2013)

by B Wu, Z Zhao, E Z Zhang, Y Jiang, X Shen
Venue:In Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP ’13,