Hierarchical Learning in Stochastic Domains: Preliminary Results (1993)

by Leslie Pack Kaelbling
Venue: In Proceedings of the Tenth International Conference on Machine Learning
Citations: 99 - 8 self

Active Bibliography

28 Learning to Achieve Goals – Leslie Pack Kaelbling - 1993
22 Exploration and Inference in Learning from Reinforcement – Jeremy Wyatt - 1997
49 Learning to Solve Markovian Decision Processes – Satinder P. Singh - 1994
99 Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results – Sridhar Mahadevan - 1996
1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
56 Lifelong Robot Learning – Sebastian Thrun, Tom M. Mitchell - 1993
8 The Sensorimotor Foundations of Phonology: A Computational Model of Early Childhood Articulatory and Phonetic Development – Kevin Lee Markey - 1994
2 Learning Evaluation Functions – Justin A. Boyan, Scott E. Fahlman, Tom Mitchell - 1996
24 Learning to Solve Multiple Goals – Jonas Karlsson, Dana H. Ballard - 1997
17 Automated Learning of Load-Balancing Strategies For A Distributed Computer System – P. Mehra - 1992
Load Balancing as a Strategy-Learning Task – n.n. - 1992
275 Self-improving reactive agents based on reinforcement learning, planning and teaching – Long-Ji Lin - 1992
3 Hierarchical Reinforcement Learning: A Hybrid Approach – Malcolm Ross Kinsella Ryan - 2002
Feudal Q-Learning – Peter Dayan - 1995
4 The interaction of representations and planning objectives for decision-theoretic planning tasks – Sven Koenig, Yaxin Liu - 2002
17 To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning – Sridhar Mahadevan - 1994
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
24 Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning – Paweł Cichosz - 1995
6 An Actor/Critic Algorithm that is Equivalent to Q-Learning – Robert H. Crites, Andrew G. Barto - 1995