|
220
|
Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents
– Ming Tan
- 1993
|
|
249
|
The dynamics of reinforcement learning in cooperative multiagent systems
– Caroline Claus, Craig Boutilier
- 1998
|
|
237
|
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm
– Junling Hu, Michael P. Wellman
- 1998
|
|
1134
|
Reinforcement learning: a survey
– Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore
- 1996
|
|
135
|
Learning to Coordinate Without Sharing Information
– Sandip Sen, Ip Sen, Mahendra Sekaran, John Hale
- 1994
|
|
1137
|
Learning from delayed rewards
– C J C H Watkins
- 1989
|
|
1060
|
Learning to predict by the methods of temporal differences
– Richard S. Sutton
- 1988
|
|
545
|
Some Studies in Machine Learning using the Game of Checkers
– A Samuel
- 2000
|
|
159
|
Stochastic games
– L S Shapley
- 1953
|
|
561
|
The Theory of Learning in Games
– D Fudenberg, D Levine
- 1998
|
|
2829
|
Reinforcement Learning I: Introduction
– Richard S. Sutton, Andrew G. Barto
- 1998
|
|
585
|
The evolution of cooperation. Basic
– R Axelrod
- 1984
|
|
472
|
Learning to act using real-time dynamic programming
– Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh
- 1993
|
|
878
|
Markov Decision Processes: Discrete Stochastic Dynamic Programming
– M L Puterman
- 1994
|
|
19
|
Learning to coordinate actions in multi-agent systems
– G Wei
- 1993
|
|
91
|
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
– Satinder Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesv Ari
- 1998
|
|
166
|
Variance-penalized Markov decision processes
– J A Filar, L C M Kallenberg, H-M Lee
- 1989
|
|
158
|
On the synthesis of useful social laws for artificial agent societies
– Yoav Shoham, Moshe Tennenholtz
- 1992
|
|
1197
|
An introduction to game theory
– M Osborne
- 2004
|