Planning by Incremental Dynamic Programming (1991)

by Richard S. Sutton
Venue: In Proceedings of the Eighth International Workshop on Machine Learning
Citations: 61 - 2 self

Documents Related by Co-Citation

473 Integrated architectures for learning, planning, and reacting based on approximating dynamic programming – Richard S. Sutton - 1990
1227 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
1321 Learning from Delayed Rewards – C Watkins - 1989
526 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
373 Dynamic Programming: Deterministic and Stochastic Models – D P Bertsekas - 1987
2611 Dynamic Programming – R Bellman - 1957
613 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959
619 Parallel and Distributed Computation: Numerical Methods – D P Bertsekas, J N Tsitsiklis - 1989
246 Temporal credit assignment in reinforcement learning – R S Sutton - 1984
368 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992
92 Efficient Learning and Planning Within the Dyna Framework – Jing Peng, Ronald J. Williams - 1993
516 Dynamic Programming and Markov Processes – R Howard - 1960
134 Input generalization in delayed reinforcement learning: An algorithm and performance comparisons – David Chapman, Leslie Pack Kaelbling - 1991
314 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993
73 Metrics for evaluating dialogue strategies in a spoken language system – M Danieli, E Gerbino - 1995
17 The Composition of Messages in Speech-Graphics Interactive Systems – Alan Biermann, Philip M. Long - 1996
247 Introduction to Stochastic Dynamic Programming – S Ross - 1983
47 Reinforcement learning is direct adaptive optimal control – Richard S. Sutton, Andrew G. Barto, Ronald J. Williams - 1991
373 Temporal difference learning and TD-Gammon – G Tesauro - 1995