Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems

by Satinder Singh, Dimitri Bertsekas
Citations: 130 - 6 self

Documents Related by Co-Citation

3892 Reinforcement Learning: An Introduction – Richard S. Sutton, Andrew G. Barto - 1998
1246 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
234 TD-Gammon, a self-teaching backgammon program, achieves master-level play – G. Tesauro - 1994
626 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959
1324 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
1342 Learning from delayed rewards – C. Watkins - 1989
283 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
486 Neuronlike adaptive elements that can solve difficult learning control problems – A. G. Barto, R. S. Sutton, C. W. Anderson - 1983
774 Nonlinear programming – D. P. Bertsekas - 1999