Planning With Deadlines in Stochastic Domains (1993)

Cached

Download Links

by Thomas Dean , Leslie Pack Kaelbling , Jak Kirman , Ann Nicholson
Venue:In Proceedings of the Eleventh National Conference on Artificial Intelligence
Citations:137 - 10 self

Documents Related by Co-Citation

2593 On the theory of dynamic programming – Richard E Bellman - 1952
274 Acting Optimally in Partially Observable Stochastic Domains – Anthony R. Cassandra, Leslie Pack Kaelbling, Michael L. Littman - 1994
308 Planning and Control – T Dean, M Wellman - 1991
69 Using abstractions for decision-theoretic planning with time constraints – Craig Boutilier, Richard Dearden - 1994
461 A model for reasoning about persistence and causation – T Dean, K Kanazawa - 1989
36 Control Strategies for a Stochastic Planner – Jonathan Tash, Stuart Russell - 1994
513 Dynamic Programming and Markov Processes – R A Howard - 1960
7069 Probabilistic Reasoning in Intelligent Systems – J Pearl - 1988
226 Exploiting structure in policy construction – Craig Boutilier, Richard Dearden, Mois├ęs Goldszmidt - 1995
51 Modified policy iteration algorithms for discounted Markov decision problems – M Puterman, M Shin - 1978
95 Utility Models for Goal-Directed Decision-Theoretic Planners – Peter Haddawy, Peter Haddawy, Steve Hanks, Steve Hanks - 1993
120 Approximating Optimal Policies for Partially Observable Stochastic Domains – Ronald Parr, Stuart Russell - 1995
334 The Optimal Control of Partially Observable Markov Processes – E J Sondik - 1971
1202 Markov Decision Processes: Discrete Stochastic Dynamic Programming – M L Puterman - 1994
292 The optimal control of partially observable markov processes over a finite horizon – R Smallwood, E Sondik - 1971
175 A survey of algorithmic methods for partially observable Markov decision processes – W S Lovejoy - 1991
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
1737 STRIPS: A new approach to the application of theorem proving to problem solving. Arti cial Intelligence 2:189{208 – R Fikes, N J Nilsson - 1971
133 Input generalization in delayed reinforcement learning: An algorithm and performance comparisons – David Chapman, Leslie Pack Kaelbling - 1991