Active policy learning for robot planning and exploration under uncertainty (2007)

by Ruben Martinez-Cantin, Nando de Freitas, Arnaud Doucet, José A Castellanos
Venue:In Proceedings of Robotics: Science and Systems