Active Bibliography

177 Algorithms for Sequential Decision Making – - 1996
18 Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents – - 1996
1 A Study on Architecture, Algorithms, and Applications of Approximate Dynamic Programming Based Approach to Optimal Control – - 2004
18 A unifying framework for computational reinforcement learning theory – - 2009
20 Incremental Dynamic Programming for On-Line Adaptive Optimal Control – - 1994
24 Generalized markov decision processes: dynamicprogramming and reinforcement-learning algorithms – - 1996
48 Learning to Solve Markovian Decision Processes – - 1994
5 Focus of attention in reinforcement learning – - 2004
104 Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma – - 1995
www.cs.uu.nl Learning to Play Board Games using Temporal Difference Methods – - 2007
2 EFFICIENT APPROXIMATE POLICY ITERATION METHODS for . . . – - 2003
7 Reinforcement learning for factored markov decision processes – - 2002
532 Learning to act using real-time dynamic programming – - 1993
43 A Generalized Reinforcement-Learning Model: Convergence and Applications – - 1996
10 Abstraction in Control Learning – - 1992
3 Co-Learning in Differential Games
164 Recent advances in hierarchical reinforcement learning – - 2003
3 Learning and Planning in Structured Worlds – - 2000
17 To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning – - 1994