Learning to act using real-time dynamic programming (1993)

by Andrew G. Barto , Steven J. Bradtke , Satinder P. Singh
Venue:
Citations:526 - 18 self

Documents Related by Co-Citation

2611 Dynamic Programming – R Bellman - 1957
1227 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
1321 Learning from Delayed Rewards – C Watkins - 1989
516 Dynamic Programming and and Markov Processes – R Howard - 1960
207 Convergence of Stochastic Iterative Dynamic Programming Algorithms – Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh - 1994
153 Learning to predict by the methods of temporal di erences – R S Sutton - 1988
224 Exploiting structure in policy construction – Craig Boutilier, Richard Dearden, Mois├ęs Goldszmidt - 1995
1740 STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving – R Fikes, N Nilsson - 1971
473 Integrated architectures for learning, planning, and reacting based on approximating dynamic programming – Richard S. Sutton - 1990
224 The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces – Andrew W. Moore, Christopher G. Atkeson - 1995
396 Real-time heuristic search – R Korf - 1990
303 Learning in Embedded Systems – L P Kaelbling - 1993
334 Automatic Programming of Behavior-based Robots using Reinforcement Learning – S. Mahadevan, J. Connell, C. Sammut, R. Sutton, Temporal Phd - 1991
206 Learning to coordinate behaviors – Pattie Maes, Rodney A. Brooks - 1990
373 Dynamic Programming: Deterministic and Stochastic Models – D P Bertsekas - 1987
161 Planning Under Time Constraints in Stochastic Domains – Thomas Dean, Leslie Pack Kaelbling, Jak Kirman, Ann Nicholson - 1993
457 Dynamic Programming and Optimal Control, Athena Scientific, 3rd edition – D P Bertsekas - 2007
423 Learning and executing generalized robot plans – R Fikes, P Hart, N Nilsson - 1972
92 Efficient Learning and Planning Within the Dyna Framework – Jing Peng, Ronald J. Williams - 1993