Automatic Programming of Behavior-based Robots using Reinforcement Learning (1991)

by S. Mahadevan, J. Connell, C. Sammut, R. Sutton, Temporal Phd