Temporal credit assignment in reinforcement learning. Doctoral dissertation (1984)

by R. S. Sutton