Improving Elevator Performance Using Reinforcement Learning (1996)

Cached

Download Links

by Robert Crites , Andrew Barto
Venue:Advances in Neural Information Processing Systems 8
Citations:278 - 3 self

Documents Related by Co-Citation

1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
363 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992
373 Temporal Difference Learning and TD-Gammon – G TESAURO - 1995
1309 Learning from Delayed Rewards – C Watkins - 1989
1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
114 Reinforcement Learning Methods for Continuous-Time Markov Decision Problems – Steven J. Bradtke, Michael O. Duff - 1994
208 Stable Function Approximation in Dynamic Programming – Geoffrey J. Gordon - 1995
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
62 Practical Issues in Temporal Di erence Learning – G Tesauro - 1992
252 Motivated Reinforcement Learning – Peter Dayan - 2001
160 Transfer of Learning by Composing Solutions of Elemental Sequential Tasks – Satinder Pal Singh - 1992
237 Residual Algorithms: Reinforcement Learning with Function Approximation – Leemon Baird - 1995
847 Reinforcement Learning – Richard S. Sutton, Presented Pirooz Chubak, Dyna Architecture, Dyna Architecture - 1998
224 TD-gammon, a self-teaching backgammon program, achieves master-level play – G J Tesauro - 1994
285 On-Line Q-Learning Using Connectionist Systems – G. A. Rummery, M. Niranjan - 1994
355 Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding – Richard S. Sutton - 1996
612 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959
153 Learning to predict by the methods of temporal diā†µerences – R S Sutton - 1988