Kernel-based least squares policy iteration for reinforcement learning. (2006)

by X Xu, D Hu, X Lu
Venue:Mathematics of Operations Research,