Efficient Exploration In Reinforcement Learning (1992)
Citations: 113 (4 self)
BibTeX
@TECHREPORT{Thrun92efficientexploration,
author = {Sebastian B. Thrun},
title = {Efficient Exploration In Reinforcement Learning},
institution = {},
year = {1992}
}
Abstract
Exploration plays a fundamental role in any active learning system. This study evaluates the role of exploration in active learning and describes several local techniques for exploration in finite, discrete domains, embedded in a reinforcement learning framework (delayed reinforcement). This paper distinguishes between two families of exploration schemes: undirected and directed exploration. While the former family is closely related to random walk exploration, directed exploration techniques memorize exploration-specific knowledge, which is used to guide the exploration search. In many finite deterministic domains, any learning technique based on undirected exploration is inefficient in terms of learning time, i.e., learning time is expected to scale exponentially with the size of the state space (Whitehead, 1991b). We prove that for all these domains, reinforcement learning using a directed technique can always be performed in polynomial time, demonstrating the important role of e...
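The contrast between the two families can be illustrated with a small sketch. It is not taken from the report: the chain domain with a "reset" action, the function names (`step`, `undirected_steps`, `directed_steps`), and the assumption that the agent can query each action's successor state are all hypothetical choices made for illustration. Undirected exploration is modelled as a uniform random walk, and directed exploration as a simple visit-counter rule that prefers the least-visited successor.

```python
import random


def step(state, action, n):
    """Hypothetical deterministic chain: action 1 moves one state forward,
    action 0 resets the agent to the start state."""
    if action == 1:
        return min(state + 1, n - 1)
    return 0


def undirected_steps(n, rng, max_steps=1_000_000):
    """Random-walk exploration: count steps until the last state is reached
    (or until max_steps is hit)."""
    state, steps = 0, 0
    while state != n - 1 and steps < max_steps:
        state = step(state, rng.choice((0, 1)), n)
        steps += 1
    return steps


def directed_steps(n, max_steps=1_000_000):
    """Counter-based exploration (assumes the successor of each action is known):
    always take the action whose successor state has been visited least often."""
    counts = [0] * n
    state, steps = 0, 0
    counts[0] = 1
    while state != n - 1 and steps < max_steps:
        action = min((0, 1), key=lambda a: counts[step(state, a, n)])
        state = step(state, action, n)
        counts[state] += 1
        steps += 1
    return steps


if __name__ == "__main__":
    n = 12
    print("directed:", directed_steps(n))
    print("undirected:", undirected_steps(n, random.Random(0)))
```

In this toy domain the counter-based rule never takes the reset action, so it reaches the last state in n - 1 steps, whereas the random walk must happen to pick the forward action n - 1 times in a row.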