|
158
|
Algorithms for Sequential Decision Making
– Michael Lederman Littman
- 1996
|
|
1
|
A Study on Architecture, Algorithms, and Applications of Approximate Dynamic Programming Based Approach to Optimal Control
– Jong Min Lee
- 2004
|
|
18
|
Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents
– Robert Harry Crites
- 1996
|
|
13
|
A unifying framework for computational reinforcement learning theory
– Lihong Li
- 2009
|
|
43
|
Learning to Solve Markovian Decision Processes
– Satinder P. Singh
- 1994
|
|
20
|
Incremental Dynamic Programming for On-Line Adaptive Optimal Control
– Steven J. Bradtke
- 1994
|
|
23
|
Generalized Markov Decision Processes: Dynamic-programming and Reinforcement-learning Algorithms
– Csaba Szepesvári, Michael L. Littman
- 1996
|
|
472
|
Learning to act using real-time dynamic programming
– Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh
- 1993
|
|
4
|
Focus of attention in reinforcement learning
– Lihong Li, Vadim Bulitko, Russell Greiner
- 2004
|
|
17
|
To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning
– Sridhar Mahadevan
- 1994
|
|
80
|
Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results
– Sridhar Mahadevan
- 1996
|
|
91
|
Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma
– Tuomas W. Sandholm, Robert H. Crites
- 1995
|
|
|
Learning to Play Board Games using Temporal Difference Methods
– Marco A. Wiering, Jan Peter Patist, Henk Mannen
- 2007
|
|
1
|
A Tutorial on Reinforcement Learning Techniques
– Carlos Henrique Costa Ribeiro
|
|
|
Aprendizado por Reforço
– Carlos Henrique Costa Ribeiro
- 1999
|
|
4
|
Reinforcement Learning in Non-Markov Environments
– Steven D. Whitehead, Long Ji Lin
- 1992
|
|
5
|
Reinforcement learning for factored Markov decision processes
– Brian Sallans
- 2002
|
|
35
|
A Generalized Reinforcement-Learning Model: Convergence and Applications
– Michael L. Littman, Csaba Szepesvári
- 1996
|
|
10
|
Abstraction in Control Learning
– Richard Yee
- 1992
|