Incremental Dynamic Programming for On-Line Adaptive Optimal Control (1994)
| Citations: | 20 - 2 self |
BibTeX
@MISC{Bradtke94incrementaldynamic,
author = {Steven J. Bradtke},
title = {Incremental Dynamic Programming for On-Line Adaptive Optimal Control},
year = {1994}
}
Years of Citing Articles
OpenURL
Abstract
Reinforcement learning algorithms based on the principles of Dynamic Programming (DP) have enjoyed a great deal of recent attention both empirically and theoretically. These algorithms have been referred to generically as Incremental Dynamic Programming (IDP) algorithms. IDP algorithms are intended for use in situations where the information or computational resources needed by traditional dynamic programming algorithms are not available. IDP algorithms attempt to find a global solution to a DP problem by incrementally improving local constraint satisfaction properties as experience is gained through interaction with the environment. This class of algorithms is not new, going back at least as far as Samuel's adaptive checkers-playing programs,...







