Off-policy reinforcement learning with gaussian processes. Acta Automatica Sinica (2014)

by Girish Chowdhary, Miao Liu, Robert C Grande, Thomas J Walsh, Jonathan P How, Lawrence Carin