|
117
|
Multiagent Planning with Factored MDPs
– Carlos Guestrin, Daphne Koller, Ronald Parr
- 2001
|
|
60
|
Coordinated Reinforcement Learning
– Carlos Guestrin, Michail Lagoudakis, Ronald Parr
- 2002
|
|
2829
|
Reinforcement Learning: An Introduction
– Richard S. Sutton, Andrew G. Barto
- 1998
|
|
198
|
The Complexity of Decentralized Control of Markov Decision Processes
– Daniel S. Bernstein, Robert Givan, Neil Immerman, Shlomo Zilberstein
- 2000
|
|
37
|
A Concise Introduction to Multiagent Systems and Distributed
– Nikos Vlassis
- 2007
|
|
244
|
Multiagent Systems: A Survey from a Machine Learning Perspective
– Peter Stone, Manuela Veloso
- 1997
|
|
53
|
Solving transition independent decentralized Markov decision processes
– Raphen Becker, Shlomo Zilberstein, Claudia V. Goldman
- 2004
|
|
121
|
Reinforcement Learning in the Multi-Robot Domain
– Maja J. Mataric
- 1997
|
|
5667
|
Probabilistic reasoning in intelligent systems
– Judea Pearl
- 1988
|
|
13
|
All Learning is Local: Multi-agent learning in global reward games
– Yu-han Chang, Tracey Ho, Leslie Pack Kaelbling
|
|
38
|
Reinforcement Learning of Coordination in Cooperative Multi-Agent Systems
– Spiros Kapetanakis, Daniel Kudenko
- 2002
|
|
89
|
Dynamic Programming for Partially Observable Stochastic Games
– Eric A. Hansen
- 2004
|
|
122
|
Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings
– Ranjit Nair, Milind Tambe, Makoto Yokoo, David Pynadath, Stacy Marsella
- 2003
|
|
249
|
The dynamics of reinforcement learning in cooperative multiagent systems
– Caroline Claus, Craig Boutilier
- 1998
|
|
88
|
A Scalable Method for Multiagent Constraint Optimization
– Adrian Petcu, Boi Faltings
|
|
8
|
DCOPs Meet the Real World: Exploring Unknown Reward Matrices with Applications to Mobile Sensor Networks
– Manish Jain, Matthew Taylor, Milind Tambe, Makoto Yokoo
|
|
162
|
Adopt: asynchronous distributed constraint optimization with quality guarantees
– Pragnesh Jay Modi, Wei-Min Shen, Milind Tambe, Makoto Yokoo
- 2005
|
|
126
|
Solving distributed constraint optimization problems using cooperative mediation
– R. Mailler, V. Lesser
- 2004
|
|
106
|
Inverted autonomous helicopter flight via reinforcement learning
– Andrew Y. Ng, H. Jin Kim, Michael I. Jordan, Shankar Sastry
- 2004
|