Temporal Credit Assignment in Reinforcement Learning (1984)

by R Sutton