Basis function adaptation in temporal difference reinforcement learning (2005)

by Ishai Menache , Shie Mannor , Nahum Shimkin
Venue:Annals of Operations Research
Citations:54 - 3 self