Temporal credit assignment in reinforcement learning (1984)

by R S Sutton