Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results (1996)

Cached

Download Links

by Sridhar Mahadevan
Citations:97 - 12 self

Documents Related by Co-Citation

1309 Learning from Delayed Rewards – C Watkins - 1989
94 A Reinforcement Learning Method for Maximizing Undiscounted Rewards – A Schwartz - 1993
207 Convergence of Stochastic Iterative Dynamic Programming Algorithms – Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh - 1994
278 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
316 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993
473 Integrated architectures for learning, planning, and reacting based on approximating dynamic programming – Richard S. Sutton - 1990
151 Asynchronous Stochastic Approximation and Q-Learning – John N. Tsitsiklis, Richard Sutton - 1994
500 Markov games as a framework for multi-agent reinforcement learning – Michael L. Littman - 1994
455 Dynamic programming and optimal control. Athena Scientific – D Bertsekas - 2001
1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
2593 On the theory of dynamic programming – Richard E Bellman - 1952
208 Stable Function Approximation in Dynamic Programming – Geoffrey J. Gordon - 1995
124 Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems – Satinder Singh, Dimitri Bertsekas
237 Residual Algorithms: Reinforcement Learning with Function Approximation – Leemon Baird - 1995
513 Dynamic Programming and Markov Processes – R A Howard - 1960
3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
111 Reinforcement Learning with Soft State Aggregation – Satinder P. Singh, Tommi Jaakkola, Michael I. Jordan - 1995