Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results (1996)

by Sridhar Mahadevan
Citations: 97 (12 self)

Active Bibliography

1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
49 Learning to Solve Markovian Decision Processes – Satinder P. Singh - 1994
17 To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning – Sridhar Mahadevan - 1994
10 A Tutorial Survey of Reinforcement Learning – S. Sathiya Keerthi, B. Ravindran
1 C3 Reinforcement Learning – S. Sathiya Keerthi, B. Ravindran
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
3 Hierarchical Reinforcement Learning: A Hybrid Approach – Malcolm Ross Kinsella Ryan - 2002
22 Exploration and Inference in Learning from Reinforcement – Jeremy Wyatt - 1997
29 Auto-exploratory Average Reward Reinforcement Learning – Dokyeong Ok, Prasad Tadepalli - 1996
45 Problem Solving With Reinforcement Learning – Gavin Adrian Rummery - 1995
175 Algorithms for Sequential Decision Making – Michael Lederman Littman - 1996
2 Optimality Criteria in Reinforcement Learning – Sridhar Mahadevan - 1996
1 A Study on Architecture, Algorithms, and Applications of Approximate Dynamic Programming Based Approach to Optimal Control – Jong Min Lee - 2004
20 Incremental Dynamic Programming for On-Line Adaptive Optimal Control – Steven J. Bradtke - 1994
Solution of Delayed Reinforcement Learning Problems Having Continuous Action Spaces – B. Ravindran - 1996
8 The Sensorimotor Foundations of Phonology: A Computational Model of Early Childhood Articulatory and Phonetic Development – Kevin Lee Markey - 1994
42 Exploration of Multi-State Environments: Local Measures and Back-Propagation of Uncertainty – Nicolas Meuleau, Sridhar Mahadevan - 1998
18 A unifying framework for computational reinforcement learning theory – Lihong Li - 2009
4 Reinforcement Learning in Non-Markov Environments – Steven D. Whitehead, Long Ji Lin - 1992