Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems

by Satinder Singh , Dimitri Bertsekas
Citations:136 - 6 self

Documents Related by Co-Citation

4165 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
1328 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
249 TD-Gammon, a self-teaching backgammon program, achieves master-level play – Gerald J Tesauro
663 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959
1405 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
1412 Learning from Delayed Rewards – Christopher J C H Watkins - 1989
294 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
512 Neuronlike adaptive elements that can solve difficult learning control problems – Andrew G Barto, Richard S Sutton, Satinder P Singh
829 Neuro-dynamic programming – D P Bertsekas, J N Tsitsiklis - 1996