Improving Elevator Performance Using Reinforcement Learning (1996)

Cached

Download Links

by Robert Crites , Andrew Barto
Venue:Advances in Neural Information Processing Systems 8
Citations:296 - 3 self

Documents Related by Co-Citation

1328 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
384 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992
563 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
1412 Learning from Delayed Rewards – Christopher J C H Watkins - 1989
1405 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
4165 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
414 Temporal difference learning and TD-Gammon – Gerald J Tesauro - 1995
948 Reinforcement Learning – Richard S. Sutton, Presented Pirooz Chubak, Dyna Architecture, Dyna Architecture - 1998
663 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959