A network-failure-tolerant message-passing system for terascale clusters (2003)

by R L Graham, S-E Choi, D J Daniel, N N Desai, R G Minnich, C E Rasmussen, L D Risinger, M W Sukalski
Venue: International Journal of Parallel Programming