Analysis of temporal-diffference learning with function approximation (1997)

by J N Tsitsiklis, B Van Roy
Venue:In Advances in Neural Information Processing Systems