A network-failure-tolerant message-passing system for terascale clusters (2003)

by R L Graham, S-E Choi, D J Daniel, N N Desai, R G Minnich, C E Rasmussen, L Risinger, M W Sukalski
Venue:In International Journal of Parallel Programming