Near-optimal reinforcement learning in polynomial time (1998)

by Michael Kearns
Venue: Machine Learning
Citations: 236 - 3 self