Adaptive Choice of Grid and Time in Reinforcement Learning (1997)

Cached

Download Links

by Stephan Pareigis
Venue:IN NIPS ’97: PROCEEDINGS OF THE 1997 CONFERENCE ON ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10
Citations:15 - 1 self

Documents Related by Co-Citation

224 The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces – Andrew W. Moore, Christopher G. Atkeson - 1995
355 Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding – Richard S. Sutton - 1996
3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
471 Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems – A G Barto, R S Sutton, C W Anderson - 1983
237 Residual Algorithms: Reinforcement Learning with Function Approximation – Leemon Baird - 1995
1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
426 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning – Richard Sutton, Doina Precup, Satinder Singh - 1999
278 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
309 Learning from demonstration – Stefan Schaal - 1997
27 Temporal Difference Learning in Continuous Time and Space – Kenji Doya - 1996
92 Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces – Juan Carlos Santamaría, Richard S. Sutton, Ashwin Ram - 1996
208 Stable Function Approximation in Dynamic Programming – Geoffrey J. Gordon - 1995
1309 Learning from Delayed Rewards – C Watkins - 1989
218 An analysis of temporal-difference learning with function approximation – John N. Tsitsiklis, Benjamin Van Roy - 1997
367 Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition – Thomas G. Dietterich - 2000
108 Hierarchical Control and Learning for Markov Decision Processes – Ronald Edward Parr - 1998
38 A numerical approach to the infinite horizon problem of deterministic control theory – M Falcone - 1987
114 Reinforcement Learning Methods for Continuous-Time Markov Decision Problems – Steven J. Bradtke, Michael O. Duff - 1994