The design and implementation of checkpoint/restart process fault tolerance for Open MPI (2007)

by J Hursey
Venue: In Intl. Symp. on Parallel and Distributed Processing