Near-optimal reinforcement learning in polynomial time (1998)

by Michael Kearns
Venue:Machine Learning
Citations:235 - 3 self