Policy Gradient Methods for Reinforcement Learning with Function Approximation (1999)

by Richard S. Sutton , David Mcallester , Satinder Singh , Yishay Mansour
Citations:262 - 13 self

Document Versions

Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 1999
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 1999 — In Advances in Neural Information Processing Systems 12
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour — 2000 — IN ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 2000 — In Advances in Neural Information Processing Systems 12
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour — 2000 — IN ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12
Policy gradient methods for reinforcement learning with function approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 2000 — In Advances in Neural Information Processing Systems 12