Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning (1995)

by Paweł Cichosz
Venue:Journal of Artificial Intelligence Research
Citations:24 - 8 self