Results 1 - 10
of
26
Biologically Plausible Error-driven Learning using Local Activation Differences: The Generalized Recirculation Algorithm
- NEURAL COMPUTATION
, 1996
"... The error backpropagation learning algorithm (BP) is generally considered biologically implausible because it does not use locally available, activation-based variables. A version of BP that can be computed locally using bi-directional activation recirculation (Hinton & McClelland, 1988) instead of ..."
Abstract
-
Cited by 70 (10 self)
- Add to MetaCart
The error backpropagation learning algorithm (BP) is generally considered biologically implausible because it does not use locally available, activation-based variables. A version of BP that can be computed locally using bi-directional activation recirculation (Hinton & McClelland, 1988) instead of backpropagated error derivatives is more biologically plausible. This paper presents a generalized version of the recirculation algorithm (GeneRec), which overcomes several limitations of the earlier algorithm by using a generic recurrent network with sigmoidal units that can learn arbitrary input/output mappings. However, the contrastiveHebbian learning algorithm (CHL, a.k.a. DBM or mean field learning) also uses local variables to perform error-driven learning in a sigmoidal recurrent network. CHL was derived in a stochastic framework (the Boltzmann machine), but has been extended to the deterministic case in various ways, all of which rely on problematic approximationsand assumptions, le...
Conjunctive Representations in Learning and Memory: Principles of Cortical and Hippocampal Function
- PSYCHOLOGICAL REVIEW
, 2001
"... We present a theoretical framework for understanding the roles of the hippocampus and neocortex in learning and memory. This framework incorporates a theme found in many theories of hippocampal function, that the hippocampus is responsible for developing conjunctive representations binding together ..."
Abstract
-
Cited by 59 (11 self)
- Add to MetaCart
We present a theoretical framework for understanding the roles of the hippocampus and neocortex in learning and memory. This framework incorporates a theme found in many theories of hippocampal function, that the hippocampus is responsible for developing conjunctive representations binding together stimulus elements into a unitary rep- resentation that can later be recalled from partial input cues. This idea appears problematic, however, because it is contradicted by the fact that hippocampally lesioned rats can learn nonlinear discrimination problems that require conjunctive representations. Our framework accommodates this finding by establishing a principled division of labor between the cortex and hippocampus, where the cortex is responsible for slow learning that integrates over multiple experiences to extract generalities, while the hippocampus performs rapid learning of the arbitrary contents of individual experiences. This framework shows that nonlinear discrimination problems are not good tests of hippocampal function, and suggests that tasks involving rapid, incidental conjunctive learning are better. We implement this framework in a computational neural network model, and show that it can account for a wide range of data in animal learning, thus validating our theoretical ideas, and providing a number of insights and predictions about these learning phenomena.
Six Principles for Biologically-Based Computational Models of Cortical Cognition
- TRENDS IN COGNITIVE SCIENCES
, 1998
"... This paper describes and motivates six principles for computational cognitive neuroscience models: biological realism, distributed representations, inhibitory competition, bidirectional activation propagation, errordriven task learning, and Hebbian model learning. Although these principles are suppo ..."
Abstract
-
Cited by 43 (14 self)
- Add to MetaCart
This paper describes and motivates six principles for computational cognitive neuroscience models: biological realism, distributed representations, inhibitory competition, bidirectional activation propagation, errordriven task learning, and Hebbian model learning. Although these principles are supported by a number of cognitive, computational, and biological motivations, the prototypical neural network model (a feedforward backpropagation network) incorporates only two of them, and no widely used model incorporates all of them. This paper argues that these principles should be integrated into a coherent overall framework, and discusses some potential synergies and conflicts in doing so.
From the lexicon to expectations about kinds: a role for associative learning
- Psychological Review
, 2005
"... In the novel noun generalization task, 2 1/2-year-old children display generalized expectations about how solid and nonsolid things are named, extending names for never-before-encountered solids by shape and for never-before-encountered nonsolids by material.This distinction between solids and nonso ..."
Abstract
-
Cited by 34 (13 self)
- Add to MetaCart
In the novel noun generalization task, 2 1/2-year-old children display generalized expectations about how solid and nonsolid things are named, extending names for never-before-encountered solids by shape and for never-before-encountered nonsolids by material.This distinction between solids and nonsolids has been interpreted in terms of an ontological distinction between objects and substances.Nine simulations and behavioral experiments tested the hypothesis that these expectations arise from the correlations characterizing early learned noun categories.In the simulation studies, connectionist networks were trained on noun vocabularies modeled after those of children.These networks formed generalized expectations about solids and nonsolids that match children’s performances in the novel noun generalization task in the very different languages of English and Japanese.The simulations also generate new predictions supported by new experiments with children.Implications are discussed in terms of children’s development of distinctions between kinds of categories and in terms of the nature of this knowledge. Concepts are hypothetical constructs, theoretical devices hypothesized to explain data, what people do, and what people say. The question of whether a particular theory can explain children’s concepts is therefore semantically strange because strictly speaking this question asks about an explanation of an explanation.We begin with this reminder because the goal of the research reported here is to understand the role of associative processes in children’s systematic attention to the shape of solid things and to the material of nonsolid things in the task of forming new lexical categories. These attentional biases have been interpreted in terms of children’s concepts about the ontological kinds of object and substance
Generalization in Interactive Networks: The Benefits of Inhibitory Competition and Hebbian Learning
- Neural Computation
, 2001
"... Computational models in cognitive neuroscience should ideally use biological properties and powerful computational principles to produce behavior consistent with psychological findings. Error-driven backpropagation is computationally powerful, and has proven useful for modeling a range of psycholo ..."
Abstract
-
Cited by 28 (5 self)
- Add to MetaCart
Computational models in cognitive neuroscience should ideally use biological properties and powerful computational principles to produce behavior consistent with psychological findings. Error-driven backpropagation is computationally powerful, and has proven useful for modeling a range of psychological data, but is not biologically plausible. Several approaches to implementing backpropagation in a biologically plausible fashion converge on the idea of using bidirectional activation propagation in interactive networks to convey error signals. This paper demonstrates two main points about these error-driven interactive networks: (a) they generalize poorly due to attractor dynamics that interfere with the network's ability to systematically produce novel combinatorial representations in response to novel inputs; and (b) this generalization problem can be remedied by adding two widely used mechanistic principles, inhibitory competition and Hebbian learning, that can be independent...
Learning continuous probability distributions with symmetric diffusion networks
- Cognitive Science
, 1993
"... in this article we present symmetric diffusion networks, a family of networks that instantiate the principles of continuous, stochastic, adaptive and interactive pro-pagation of information. Using methods of Markovlon diffusion theory, we for-malize the activation dynamics of these networks and then ..."
Abstract
-
Cited by 24 (4 self)
- Add to MetaCart
in this article we present symmetric diffusion networks, a family of networks that instantiate the principles of continuous, stochastic, adaptive and interactive pro-pagation of information. Using methods of Markovlon diffusion theory, we for-malize the activation dynamics of these networks and then show that they can be trained to reproduce entire muitivariote probability distributions an their outputs using the contrastive Hebbian learning rule (CHL).,We show that CHL performs gradient descent on an error function that captures differences between desired and obtolned continuous multivoriate probability distributions. This allows the learning algorithm to go beyond expected values of output units and to approxi-mate complete probability distributions on continuous muitivariote activation spaces. We argue that learning continuous distributions is an important task underlying a variety of real-life situations that were beyond the scope of previous connectionist networks. Deterministic networks, like back propagation, cannot ieorn this task because they ore limited to learning average values of indepen-dent output units. Previous stochastic connectionist networks could learn pro-bobility distributions but they were limited to discrete variables. Simulations show that symmetric diffusion networks can be trained with the CHL rule to op-proximate discrete and continuous probability distributions of various types. 1.
A New Learning Algorithm for Mean Field Boltzmann Machines
, 2002
"... We present a new learning algorithm for Mean Field Boltzmann Machines based on the contrastive divergence optimization criterion. In addition to minimizing the divergence between the data distribution and the equilibrium distribution that the network believes in, we maximize the divergence betwe ..."
Abstract
-
Cited by 23 (4 self)
- Add to MetaCart
We present a new learning algorithm for Mean Field Boltzmann Machines based on the contrastive divergence optimization criterion. In addition to minimizing the divergence between the data distribution and the equilibrium distribution that the network believes in, we maximize the divergence between one-step reconstructions of the data and the equilibrium distribution. This eliminates the need to estimate equilibrium statistics, so we do not need to approximate the multimodal probablility distribution of the free network with the unimodal mean field distribution. We test the learning algorithm on the classification of digits. A New Learning Algorithm for Mean Field Boltzmann Machines Max Welling G.E. Hinton Gatsby Unit 1 Boltzmann Machines The stochastic Boltzmann machine (BM) is a probabilistic neural network of symmetrically connected binary units taking values f0; 1g (Ackley, Hinton & Sejnowski, 1985). The variant used for unsupervised learning consists of a set of visi...
The dynamics of perceptual learning: an incremental reweighting model
- PSYCHOLOGICAL REVIEW
, 2005
"... The mechanisms of perceptual learning are analyzed theoretically, probed in an orientationdiscrimination experiment involving a novel nonstationary context manipulation, and instantiated in a detailed computational model. Two hypotheses are examined: modification of early cortical representations ve ..."
Abstract
-
Cited by 12 (2 self)
- Add to MetaCart
The mechanisms of perceptual learning are analyzed theoretically, probed in an orientationdiscrimination experiment involving a novel nonstationary context manipulation, and instantiated in a detailed computational model. Two hypotheses are examined: modification of early cortical representations versus task-specific selective reweighting. Representation modification seems neither functionally necessary nor implied by the available psychophysical and physiological evidence. Computer simulations and mathematical analyses demonstrate the functional and empirical adequacy of selective reweighting as a perceptual learning mechanism. The stimulus images are processed by standard orientation- and frequency-tuned representational units, divisively normalized. Learning occurs only in the “read-out” connections to a decision unit; the stimulus representations never change. An incremental Hebbian rule tracks the task-dependent predictive value of each unit, thereby improving the signal-to-noise ratio of their weighted combination. Each abrupt change in the environmental statistics induces a switch cost in the learning curves as the system temporarily works with suboptimal weights.
Where do relations come from
, 1998
"... Relational knowledge is a hallmark of human cognition and the subject of a vast body of research. In this paper we argue that existing accounts of relations are inadequate because they have little to say abouthowrelations arise in the rst place and because they tend to be limited to particular sorts ..."
Abstract
-
Cited by 10 (5 self)
- Add to MetaCart
Relational knowledge is a hallmark of human cognition and the subject of a vast body of research. In this paper we argue that existing accounts of relations are inadequate because they have little to say abouthowrelations arise in the rst place and because they tend to be limited to particular sorts of relational tasks. We present a new approach to the learning and representation of relations, an approach that makes use of what we call micro-relation units (MRUs). Each MRU represents a relation between features of di erent objects rather than between objects themselves. We show howthis approach o ers an account of the grounding of relations, and we describe a neural-network implementation of the MRU framework and show howit enables a variety of relational tasks to be performed by the same system.
A knowledge-resonance (KRES) model of category learning
- Psychonomic Bulletin & Review
, 2003
"... In this article we present a connectionist model of category learning that takes into account the prior knowledge that people bring to many new learning situations. This model, which we call the Knowledge-Resonance Model or KRES, employs a recurrent network with bidirectional connections which are u ..."
Abstract
-
Cited by 10 (7 self)
- Add to MetaCart
In this article we present a connectionist model of category learning that takes into account the prior knowledge that people bring to many new learning situations. This model, which we call the Knowledge-Resonance Model or KRES, employs a recurrent network with bidirectional connections which are updated according to a contrastive-Hebbian learning rule. When prior knowledge is incorporated into a KRES network, the KRES activation dynamics and learning procedure accounts for a range of empirical results regarding the effects prior knowledge on category learning, including the accelerated learning that occurs in the presence of knowledge, the reinterpretation of features in light error correcting feedback, and the unlearning of prior knowledge which is inappropriate for a particular category.

