AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response against Stationary Opponents (2006)

by Vincent Conitzer , Tuomas Sandholm
Venue:IN PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING
Citations:81 - 5 self

Documents Related by Co-Citation

304 The dynamics of reinforcement learning in cooperative multiagent systems – Caroline Claus, Craig Boutilier - 1998
180 Multiagent Learning Using a Variable Learning Rate – Michael Bowling, Manuela Veloso - 2002
498 Markov games as a framework for multi-agent reinforcement learning – Michael L. Littman - 1994
65 Convergence and no-regret in multiagent learning – Michael Bowling - 2005
91 Nash Convergence of Gradient Dynamics in General-Sum Games – Satinder Singh, Michael Kearns, Yishay Mansour - 2000
779 The theory of Learning in Games – D Fudenburg, D K Levine - 1998
236 R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning – Ronen I. Brafman, Moshe Tennenholtz, Pack Kaelbling - 2001
45 Efficient learning equilibrium – Ronen I. Brafman, Moshe Tennenholtz - 2002
283 Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm – Junling Hu, Michael P. Wellman - 1998
40 Learning against opponents with bounded memory – Rob Powers - 2005
215 Rational Learning Leads to Nash Equilibrium – Author(s) Ehud Kalai, Ehud Lehrer - 1993
219 A Simple Adaptive Procedure Leading to Correlated Equilibrium – Hart, Andreu Mas-colell
40 Implicit Negotiation in Repeated Games – Michael L. Littman, Peter Stone - 2001
49 New criteria and a new algorithm for learning in multi-agent systems – R Powers, Y Shoham - 2005
56 Correlated Q-learning – Amy Greenwald, Martin Zinkevich, Pack Kaelbling - 2003
82 Rational and Convergent Learning in Stochastic Games – Michael Bowling , Manuela Veloso - 2001
78 Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games – Xiaofeng Wang, Tuomas Sandholm - 2002
64 Run the GAMUT: A comprehensive approach to evaluating game-theoretic algorithms – Eugene Nudelman, Jennifer Wortman, Yoav Shoham, Kevin Leyton-brown - 2004
119 Friend-or-foe q-learning in generalsum games – M L Littman