Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems

by Satinder Singh, Dimitri Bertsekas
Citations: 124 (5 self)

Active Bibliography

1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
114 Machine-Learning Research -- Four Current Directions – Thomas G. Dietterich
4 What Makes a Good Co-Evolutionary Learning Environment? – Alan D. Blair, Jordan B. Pollack - 1997
109 An introduction to collective intelligence – David H. Wolpert, Kagan Tumer - 1999
94 Robust Non-linear Control through Neuroevolution – Faustino John Gomez - 2003
46 Explanation-based learning and reinforcement learning: A unified view – Thomas G. Dietterich, Nicholas S. Flann, Andrew Barto - 1995
15 An Approach to Learning Mobile Robot Navigation – Sebastian Thrun - 1995
7 Missile Defense and Interceptor Allocation by Neuro-Dynamic Programming – Dimitri Bertsekas, Mark L. Homer, David A. Logan, Stephen D. Patek, Nils R. Sandell - 1999
45 Problem Solving With Reinforcement Learning – Gavin Adrian Rummery - 1995
22 A Counterexample to Temporal Differences Learning – Dimitri P. Bertsekas - 1995
18 Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents – Robert Harry Crites - 1996
2 How to Make Software Agents Do the Right Thing: An Introduction to Reinforcement Learning – Satinder Singh, Peter Norvig, David Cohn, Harlequin Inc - 1996
7 A Dynamic Channel Assignment Policy through Q-Learning – Junhong Nie, Simon Haykin - 1999
24 Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning – Paweł Cichosz - 1995
17 Value Function Based Production Scheduling – Jeff G. Schneider, Justin A. Boyan, Andrew W. Moore - 1998
10 A Tutorial Survey of Reinforcement Learning – S. Sathiya Keerthi, B. Ravindran
32 Modular Neural Networks for Learning Context-Dependent Game Strategies – Justin A. Boyan - 1992
55 TD(λ) Converges with Probability 1 – Peter Dayan, Terrence J. Sejnowski - 1994
316 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993