Least-Squares λ Policy Iteration: Bias-Variance Tradeoff in Control Problems (2010)

by C Thiéry, B Scherrer
Venue: ICML