Bandit based Monte-Carlo Planning (2006)

by Levente Kocsis, Csaba Szepesvári
Venue: ECML-06. Number 4212 in LNCS
Citations: 217 - 6 self

Documents Related by Co-Citation

398 Finite-time analysis of the multiarmed bandit problem – Peter Auer, Nicolò Cesa-Bianchi, Paul Fischer - 2002
3760 Reinforcement Learning: An Introduction – Richard S. Sutton, Andrew G. Barto - 1998
114 Efficient selectivity and backup operators in Monte-Carlo tree search – Rémi Coulom - 2006
53 Modification of UCT with patterns in Monte-Carlo Go – Sylvain Gelly, Yizao Wang, Rémi Munos, Olivier Teytaud - 2006
1498 Probability inequalities for sums of bounded random variables – Wassily Hoeffding - 1963
45 Bandit Algorithms for Tree Search – Pierre-Arnaud Coquelin, Rémi Munos - 2007
272 Asymptotically efficient adaptive allocation rules – T. L. Lai, H. Robbins - 1985
171 A sparse sampling algorithm for near-optimal planning in large Markov decision processes – Michael Kearns, Yishay Mansour, Andrew Y. Ng - 1999
109 Using Confidence Bounds for Exploitation-Exploration Trade-offs – Peter Auer - 2002
11 Improved rates for the stochastic continuum-armed bandit problem – Peter Auer, Ronald Ortner, Csaba Szepesvári - 2007
27 The complexity of dynamic programming – Chee-Seng Chow, John N. Tsitsiklis - 1989
43 PAC model-free reinforcement learning – Alexander L. Strehl, Lihong Li, Eric Wiewiora, John Langford, Michael L. Littman - 2006
136 Stochastic Optimal Control: The Discrete Time Case – D. P. Bertsekas, S. E. Shreve - 1978
16 Model-based reinforcement learning with nearly tight exploration complexity bounds – István Szita, Csaba Szepesvári
17 Bayesian generation and integration of K-nearest-neighbor patterns for 19x19 Go – Bruno Bouzy - 2005
19 Associating Domain-Dependent Knowledge and Monte Carlo Approaches within a Go Program – Bruno Bouzy - 2003
78 Computer Go: an AI Oriented Survey – Bruno Bouzy, Tristan Cazenave - 2001
78 Combining Online and Offline Knowledge in UCT – Sylvain Gelly, David Silver - 2007
39 Exploration exploitation in Go: UCT for Monte-Carlo Go. In: NIPS-2006 On-line Trading of Exploration and Exploitation Workshop – Sylvain Gelly, Yizao Wang - 2006