R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning (2001)

by Ronen I. Brafman, Moshe Tennenholtz, Pack Kaelbling
Citations: 176 (9 self)