Collaborative Multiagent Reinforcement Learning by Payoff Propagation (2006)

by Jelle R. Kok , Nikos Vlassis
Venue:JOURNAL OF MACHINE LEARNING RESEARCH
Citations:32 - 2 self

Documents Related by Co-Citation

142 Multiagent Planning with Factored MDPs – Carlos Guestrin, Daphne Koller, Ronald Parr - 2001
84 Coordinated Reinforcement Learning – Carlos Guestrin, Michail Lagoudakis, Ronald Parr - 2002
53 A Concise Introduction to Multiagent Systems and Distributed AI – Nikos Vlassis - 2003
3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
287 The Complexity of Decentralized Control of Markov Decision Processes – Daniel S. Bernstein, Robert Givan, Neil Immerman, Shlomo Zilberstein - 2000
292 Multiagent Systems: A Survey from a Machine Learning Perspective – Peter Stone, Manuela Veloso - 1997
1163 Factor Graphs and the Sum-Product Algorithm – Frank R. Kschischang, Brendan J. Frey, Hans-Andrea Loeliger - 1998
64 Networked Distributed POMDPs: A Synthesis of Distributed Constraint Optimization and POMDPs – Ranjit Nair, Pradeep Varakantham, Milind Tambe, Makoto Yokoo - 2005
53 Decentralised Coordination of Low-Power Embedded Devices Using the Max-Sum Algorithm – A. Farinelli, A. Rogers, A. Petcu, N. R. Jennings - 2008
27 Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies – Pradeep Varakantham, Janusz Marecki, Yuichi Yabu, Milind Tambe, Makoto Yokoo - 2007
417 Decision-Theoretic Planning: Structural Assumptions and Computational Leverage – Craig Boutilier, Thomas Dean, Steve Hanks - 1999
119 Dynamic Programming for Partially Observable Stochastic Games – Eric A. Hansen - 2004
822 Planning and acting in partially observable stochastic domains – Leslie Pack Kaelbling, Michael L. Littman, Anthony R. Cassandra - 1998
18 Reinforcement learning for true adaptive traffic signal control – B Abdulhai, R Pringle, G J Karakoulas - 2003
13 Traffic Light Control Using SARSA with Three State Representations – Thomas L. Thorpe, Charles W. Anderson - 1996
43 A.: Loopy belief propagation as a basis for communication in sensor networks – C Crick, Pfeffer
300 Understanding belief propagation and its generalizations – J Yedidia, W T Freeman, Y Weiss - 2001
157 Exploiting Causal Independence in Bayesian Network Inference – Nevin Lianwen Zhang, David Poole - 1996
113 Cooperative Multi-Agent Learning: The State of the Art – Liviu Panait, Sean Luke - 2005