Tight Performance Bounds on Greedy Policies Based on Imperfect Value Functions (1993)

by Ronald Williams , Leemon C. Baird
Citations:84 - 1 self

Documents Related by Co-Citation

1222 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
1308 Learning from delayed rewards – Christopher J C H Watkins - 1989
470 Integrated architectures for learning, planning, and reacting based on approximating dynamic programming – Richard S. Sutton - 1990
2587 Dynamic Programming – R E Bellman - 1957
526 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
237 Residual Algorithms: Reinforcement Learning with Function Approximation – Leemon Baird - 1995
205 Convergence of Stochastic Iterative Dynamic Programming Algorithms – Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh - 1994
513 Dynamic Programming and Markov Processes – R A Howard - 1960
314 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993
372 Dynamic Programming: Deterministic and Stochastic Models – D Bertsekas
274 Acting Optimally in Partially Observable Stochastic Domains – Anthony R. Cassandra, Leslie Pack Kaelbling, Michael L. Littman - 1994
206 Stable Function Approximation in Dynamic Programming – Geoffrey J. Gordon - 1995
95 Markov Decision Processes—Discrete Stochastic Dynamic Programming – M L Puterman - 1994
152 The Complexity of Stochastic Games – Anne Condon - 1992
193 Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach – Lonnie Chrisman - 1992
1192 Markov Decision Processes. Discrete Stochastic Dynamic Programming – M L Puterman - 1994
223 Exploiting structure in policy construction – Craig Boutilier, Richard Dearden, MoisĂ©s Goldszmidt - 1995
334 The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs – Edward J Sondik - 1978
121 Optimal control of Markov decision processes with incomplete state estimation – K J Astrom - 1965