Document Versions

Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 1999
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 1999 — In Advances in Neural Information Processing Systems 12
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour — 2000 — IN ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 2000 — In Advances in Neural Information Processing Systems 12
Policy Gradient Methods for Reinforcement Learning with Function Approximation –Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour — 2000 — IN ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12
Policy gradient methods for reinforcement learning with function approximation –Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour — 2000 — In Advances in Neural Information Processing Systems 12