Active Bibliography

175 Algorithms for Sequential Decision Making – Michael Lederman Littman - 1996
18 Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents – Robert Harry Crites - 1996
1 A Study on Architecture, Algorithms, and Applications of Approximate Dynamic Programming Based Approach to Optimal Control – Jong Min Lee - 2004
18 A unifying framework for computational reinforcement learning theory – Lihong Li - 2009
20 Incremental Dynamic Programming for On-Line Adaptive Optimal Control – Steven J. Bradtke - 1994
24 Generalized markov decision processes: dynamicprogramming and reinforcement-learning algorithms – Csaba Szepesvari, Michael L. Littman - 1996
49 Learning to Solve Markovian Decision Processes – Satinder P. Singh - 1994
5 Focus of attention in reinforcement learning – Lihong Li, Vadim Bulitko, Russell Greiner - 2004
102 Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma – Tuomas W. Sandholm, Robert H. Crites - 1995
www.cs.uu.nl Learning to Play Board Games using Temporal Difference Methods – Marco A. Wiering, Jan Peter Patist, Henk Mannen, Marco A. Wiering, Henk Mannen, Jan Peter Patist - 2007
2 EFFICIENT APPROXIMATE POLICY ITERATION METHODS for . . . – Michail G. Lagoudakis - 2003
7 Reinforcement learning for factored markov decision processes – Brian Sallans - 2002
526 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
43 A Generalized Reinforcement-Learning Model: Convergence and Applications – Michael L. Littman, Csaba Szepesv├íri - 1996
10 Abstraction in Control Learning – Richard Yee - 1992
3 Co-Learning in Differential Games – John W. Sheppard
161 Recent advances in hierarchical reinforcement learning – Andrew G. Barto - 2003
17 To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning – Sridhar Mahadevan - 1994
3 Learning and Planning in Structured Worlds – Richard W. Dearden - 2000