Active Bibliography

9 Regret minimization in repeated matrix games with variable stage duration – Shie Mannor, Nahum Shimkin - 2006
33 Regret minimization under partial monitoring – Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz - 2004
32 Potential-based Algorithms in On-line Prediction and Game Theory – Nicolo Cesa-Bianchi, Gabor Lugosi
23 Stochastic approximations and differential inclusions – Michel Benaïm, Josef Hofbauer, Sylvain Sorin - 2005
17 The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes – Shie Mannor, Nahum Shimkin - 2002
Decision Making in Uncertain and Changing Environments – Karl H. Schlag, et al. - 2009
Online learning of graphical models – Frédéric Koriche - 2010
3 EXPONENTIAL WEIGHT ALGORITHM IN CONTINUOUS TIME – Sylvain Sorin - 2006
No-Regret Learning and . . . – Casey Alvin Marks - 2008
26 On No-Regret Learning, Fictitious Play, and Nash Equilibrium – Amir Jafari, Amy Greenwald, David Gondek, Gunes Ercal - 2001
81 AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents – Vincent Conitzer, Tuomas Sandholm - 2003
8 Learning to Search: Structured Prediction Techniques for Imitation Learning – Nathan D. Ratliff, James Kuffner, Andrew Ng - 2009
24 Global Nash convergence of Foster and Young’s regret testing – Fabrizio Germano, Gábor Lugosi - 2007
2 Chapter 4: Learning, Regret-Minimization, and Equilibria – A. Blum, Y. Mansour - 2007
5 Learning, regret minimization, and equilibria – Avrim Blum, Yishay Monsour, A. Blum, Y. Mansour - 2007
13 No-regret algorithms for structured prediction problems – Geoffrey J. Gordon - 2005
3 Reinforcement Learning Without Rewards – Umar Ali Syed - 2010
10 Efficient Algorithms Using The Multiplicative Weights Update Method – Satyen Kale - 2006
3 Regret Minimization in Signal Space for Repeated Matrix Games with Partial Observations – Shie Mannor, Nahum Shimkin - 2000