Learning and value function approximation in complex decision processes (1998)

by B Van Roy
Venue:Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology