Learning to Control Dynamic Systems Via Associative Reinforcement Learning
by
Vijaykumar Gullapalli
BibTeX
@MISC{Gullapalli_learningto,
author = {Vijaykumar Gullapalli},
title = {Learning to Control Dynamic Systems Via Associative Reinforcement Learning},
year = {}
}
OpenURL
Abstract
this paper. The internal critic network has 8 input units, a hidden layer of 10 back-propagation units, and a single temporal difference output unit. The controller has 4 input units and a single action unit. In simulations of the supervised learning method, a "noisy" linear unit was used as the action unit, while in simulations of the reinforcement learning method, a stochastic real-valued (SRV) unit [25] was used.







