Temporal sequence learning, prediction and control - a review of different models and their relation to biological mechanisms (2004)
| Venue: | Neural Computation |
| Citations: | 17 - 3 self |
BibTeX
@ARTICLE{Wörgötter04temporalsequence,
author = {Florentin Wörgötter and Bernd Porr},
title = {Temporal sequence learning, prediction and control - a review of different models and their relation to biological mechanisms},
journal = {Neural Computation},
year = {2004},
volume = {17},
pages = {2005}
}
OpenURL
Abstract
In this article we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spiketiming dependent plasticity. This review will briefly introduce the most influential models and focus on two questions: 1) To what degree are reward-based (e.g. TD-learning) and correlation based (hebbian) learning related? and 2) How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We will first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe, that reward-based and correlation based learning are indeed very similar. Machine-control is then used to introduce the problem of closed-loop control (e.g. “actor-critic architectures”). Here the problem of evaluative (“rewards”) versus nonevaluative (“correlations”) feedback from the environment will be discussed showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question we will compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus and







