Searching for "An Analysis of Actor/Critic Algorithms Using Eligibility Traces Reinforcement Learning with Imperfect Value Function." – sorted by Relevance.
-
Fast and stable learning of quasi-passive dynamic walking by an unstable biped robot based on off
- ., and Kobayashi, S., “An Analysis of Actor/Critic Algorithms using Eligibility Traces: Reinforcement Learning
- Cited by 3 (1 self) – Add To MetaCart

