Reinforcement learning by reward-weighted regression for operational space control (2007)

by J Peters, S Schaal
Venue:in Proc. Int. Conf. Machine Learning