Stochastic kernel temporal difference for reinforcement learning (2011)

by J Bae, L S Giraldo, P Chhatbar, J T Francis, J C Sanchez, J C Principe
Venue:in Proceedings of the 21st IEEE International Workshop on Machine Learning for Signal Processing (MLSP ’11