Searching for "Neural Q-learning." – sorted by Relevance.
-
Neural Q-Learning
- Neural Q-Learning Stephan ten Hagen Ben Krose Real World Computing Partnership, Novel Functions
- Cited by 1 (0 self) – Add To MetaCart
-
Neural reinforcement learning for behaviour synthesis
- of a research aimed at improving the Q-learning method through the use of artificial neural networks
- Cited by 13 (2 self) – Add To MetaCart
-
Neural Reinforcement Learning For An Obstacle Avoidance Behavior
- that increases the updating speed of the memory by the re-use of executed experiments. Neural Q-learning
- Add To MetaCart
-
Q-Learning for systems with continuous state and action spaces.
- function approximators. In this paper we will propose Neural Q-learning as a continuous state-action space
- Cited by 2 (0 self) – Add To MetaCart
-
An Hybrid Methodology for RL-based Behavior Coordination In a Target Following Mission with an AUV
- and parameter selection, the algorithm demonstrated to converge. The neural Q_learning algorithm structure
- Cited by 1 (0 self) – Add To MetaCart
-
High-Level Control Of Autonomous Robots Using A Behavior based Scheme And Reinforcement Learning
- Learning (RL). A new continuous approach of the Q_learning algorithm, implemented with a multi-layer neural
- Cited by 1 (0 self) – Add To MetaCart
-
Towards a Continuous Reinforcement Learning Module for Navigation in Video Games
- actions Neural Q-learning One way to deal with continuous states and actions consists in using a Neural
- Add To MetaCart
-
Self-Learning Neural Control of a Mobile Robot
- of neural Q-Learning [Wat89]. In a first attempt, the controller learns to move an ideal model of the mobile
- Cited by 1 (0 self) – Add To MetaCart
-
Continuous State Space Q-Learning for Control of Nonlinear Systems
- .8 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 5 Neural Q-Learning
- Cited by 6 (0 self) – Add To MetaCart
-
A human-robot collaborative learning system using a virtual reality telerobotic interface
- .g., Q-learning) will be used to find the best action (e.g., determining the optimal grasping point
- Cited by 2 (2 self) – Add To MetaCart

