Searching for "R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning." – sorted by Relevance.
-
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
- R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal
- Cited by 97 (7 self) – Add To MetaCart
-
Journal of Machine Learning Research 3 (2002) 213-231 Submitted 11/01; Published 10/02 R-max -- A
- General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning Ronen I. Brafman brafman
- Add To MetaCart

