Searching for "On the Convergence of Optimistic Policy Iteration." – sorted by Relevance.
On the Convergence of Optimistic Policy Iteration
- On the Convergence of Optimistic Policy Iteration. John N. Tsitsiklis; LIDS, Room 35-209, Massachusetts
- Cited by 5 (0 self)
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
- on the policy iteration method, in which the value function must be estimated for a sequence of fixed policies
- Cited by 20 (6 self)

