Reinforcement learning by reward-weighted regression for operational space control (2007)

by Jan Peters, Stefan Schaal
Venue:In In: Proceedings of the International Conference on Machine Learning (ICML