Near-optimal reinforcement learning in polynomial time (1998)

by M Kearns, S Singh
Venue:Proceedings of the 15th International Conference on Machine Learning