Near-polynomial reinforcement learning in polynomial time (2002)

by M Kearns, S Singh
Venue:Machine Learning