Improving Elevator Performance Using Reinforcement Learning (1996)

Cached

Download Links

by Robert Crites , Andrew Barto
Venue:Advances in Neural Information Processing Systems 8
Citations:294 - 3 self

Documents Related by Co-Citation

1246 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
372 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992
540 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
1342 Learning from delayed rewards – C Watkins - 1989
1324 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
3892 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
386 Temporal difference learning and TD-gammon – Gerald Tesauro - 1995
873 Reinforcement Learning – Richard S. Sutton, Presented Pirooz Chubak, Dyna Architecture, Dyna Architecture - 1998
626 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959