Stochastic policy gradient reinforcement learning on a simple 3D biped (2004)

by Russ Tedrake
Venue:Proc. of the 10th Int. Conf. on Intelligent Robots and Systems
Citations:52 - 5 self