Technical update: Least-squares temporal difference learning (2002)

by Justin A. Boyan
Venue: Machine Learning
Citations: 94 - 2 self

Active Bibliography

95 Least-Squares Temporal Difference Learning – Justin A. Boyan - 1999
long-term sum of future rewards obtained when the – Justin A. Boyan
2 A Study on Architecture, Algorithms, and Applications of Approximate Dynamic Programming Based Approach to Optimal Control – Jong Min Lee - 2004
1324 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
18 A unifying framework for computational reinforcement learning theory – Lihong Li - 2009
440 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning – Richard Sutton, Doina Precup, Satinder Singh - 1999
426 Decision-Theoretic Planning: Structural Assumptions and Computational Leverage – Craig Boutilier, Thomas Dean, Steve Hanks - 1999
540 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
671 Being There: Putting Brain, Body, and World Together Again – Andy Clark - 1997