Algorithms for Sequential Decision Making (1996)

by Michael Lederman Littman
Citations:175 - 8 self

Documents Related by Co-Citation

2593 On the theory of dynamic programming – Richard E Bellman - 1952
334 The Optimal Control of Partially Observable Markov Processes – E J Sondik - 1971
258 An Algorithm for Probabilistic Planning – Nicholas Kushmerick, Steve Hanks, Daniel Weld - 1995
234 Learning policies for partially observable environments: Scaling up – Michael L. Littman, Anthony R. Cassandra, Leslie Pack Kaelbling - 1995
191 A survey of partially observable Markov decision processes: Theory, models, and algorithms – G E Monahan - 1982
175 A survey of algorithmic methods for partially observable Markov decision processes – W S Lovejoy - 1991
274 Acting Optimally in Partially Observable Stochastic Domains – Anthony R. Cassandra, Leslie Pack Kaelbling, Michael L. Littman - 1994
580 Markov decision processes – M L Puterman - 2005
306 The complexity of Markov decision processes – C Papadimitriou, J Tsisiklis - 1987
121 Optimal control of markov decision processes with incomplete state estimation – K J Ă„strom - 1965
120 Approximating Optimal Policies for Partially Observable Stochastic Domains – Ronald Parr, Stuart Russell - 1995
292 The optimal control of partially observable markov processes over a finite horizon – R Smallwood, E Sondik - 1971
1737 STRIPS: A new approach to the application of theorem proving to problem solving. Arti cial Intelligence 2:189{208 – R Fikes, N J Nilsson - 1971
157 Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes – Anthony Cassandra, Michael L. Littman, Nevin L. Zhang - 1997
75 Algorithms for Partially Observable Markov Decision Processes – H T Cheng - 1988
94 Computationally feasible bounds for partially observed Markov decision processes – W S Lovejoy - 1991
93 Solving POMDPs by Searching in Policy Space – Eric A. Hansen - 1998
251 Generalization in Reinforcement Learning: Safely Approximating the Value Function – Justin A. Boyan, Andrew W. Moore - 1995
1202 Markov Decision Processes: Discrete Stochastic Dynamic Programming – M L Puterman - 1994