Basis function adaptation in temporal difference reinforcement learning (2005)

by I Menache, S Mannor, N Shimkin
Venue: Annals of Operations Research