Results 1 - 10
of
20
Recurrent Neural Networks With Small Weights Implement Definite Memory Machines
- NEURAL COMPUTATION
, 2003
"... Recent experimental studies indicate that recurrent neural networks initialized with `small' weights are inherently biased towards definite memory machines (Tino, Cernansky, Benuskova, 2002a; Tino, Cernansky, Benuskova, 2002b). This paper establishes a theoretical counterpart: transition funct ..."
Abstract
-
Cited by 21 (5 self)
- Add to MetaCart
Recent experimental studies indicate that recurrent neural networks initialized with `small' weights are inherently biased towards definite memory machines (Tino, Cernansky, Benuskova, 2002a; Tino, Cernansky, Benuskova, 2002b). This paper establishes a theoretical counterpart: transition function of recurrent network with small weights and `squashing ' activation function is a contraction. We prove that recurrent networks with contractive transition function can be approximated arbitrarily well on input sequences of unbounded length by a definite mem-
Rule Extraction from Recurrent Neural Networks: a Taxonomy and Review
- Neural Computation
, 2005
"... this paper, the progress of this development is reviewed and analysed in detail. In order to structure the survey and to evaluate the techniques, a taxonomy, specifically designed for this purpose, has been developed. Moreover, important open research issues are identified, that, if addressed pr ..."
Abstract
-
Cited by 15 (3 self)
- Add to MetaCart
this paper, the progress of this development is reviewed and analysed in detail. In order to structure the survey and to evaluate the techniques, a taxonomy, specifically designed for this purpose, has been developed. Moreover, important open research issues are identified, that, if addressed properly, possibly can give the field a significant push forward
The Applicability of Recurrent Neural Networks for Biological Sequence Analysis
- IEEE/ACM Transactions on Computational Biology and Bioinformatics
, 2005
"... Selection of machine learning techniques requires a certain sensitivity to the requirements of the problem. In particular the problem can be made more tractable by deliberately using algorithms that are biased towards solutions of the requisite kind. ..."
Abstract
-
Cited by 9 (1 self)
- Add to MetaCart
Selection of machine learning techniques requires a certain sensitivity to the requirements of the problem. In particular the problem can be made more tractable by deliberately using algorithms that are biased towards solutions of the requisite kind.
Prediction of subcellular localization using sequence-biased recurrent networks
- Bioinformatics
, 2005
"... doi:10.1093/bioinformatics/bti372 ..."
Learn more by training less: systematicity in sentence processing by recurrent networks
- Connection Science
"... Connectionist models of sentence processing must learn to behave systematically by generalizing from a small training set. To what extent recurrent neural networks manage this generalization task is investigated. In contrast to Van der Velde et al. (Connection Sci., 16, pp. 21–46, 2004), it is found ..."
Abstract
-
Cited by 6 (2 self)
- Add to MetaCart
Connectionist models of sentence processing must learn to behave systematically by generalizing from a small training set. To what extent recurrent neural networks manage this generalization task is investigated. In contrast to Van der Velde et al. (Connection Sci., 16, pp. 21–46, 2004), it is found that simple recurrent networks do show so-called weak combinatorial systematicity, although their performance remains limited. It is argued that these limitations arise from overfitting in large networks. Generalization can be improved by increasing the size of the recurrent layer without training its connections, thereby combining a large short-term memory with a small long-term memory capacity. Performance can be improved further by increasing the number of word types in the training set.
Improved Access to Sequential Motifs: A note on the architectural bias of recurrent networks
, 2005
"... For many biological sequence problems the available data occupies only sparse regions of the problem space. To use machine learning e#ectively for the analysis of sparse data we must employ architectures with an appropriate bias. By experimentation we show that the bias of recurrent neural networ ..."
Abstract
-
Cited by 5 (4 self)
- Add to MetaCart
For many biological sequence problems the available data occupies only sparse regions of the problem space. To use machine learning e#ectively for the analysis of sparse data we must employ architectures with an appropriate bias. By experimentation we show that the bias of recurrent neural networks -- recently analysed by Tino, Cernansky and Benuskova [8], and Hammer and Tino [9, 3] -- o#ers superior access to motifs (sequential patterns) compared to the, in bioinformatics, standardly used feed forward neural networks.
Dynamics and topographic organization in recursive self-organizing map
- NEURAL COMPUTATION
, 2006
"... Recently, there has been an outburst of interest in extending topo-graphic maps of vectorial data to more general data structures, such as sequences or trees. However, at present, there is no general consensus as to how best to process sequences using topographic maps and this topic remains a very a ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
Recently, there has been an outburst of interest in extending topo-graphic maps of vectorial data to more general data structures, such as sequences or trees. However, at present, there is no general consensus as to how best to process sequences using topographic maps and this topic remains a very active focus of current neurocomputational research. The representational capabilities and internal representations of the models are not well understood. We rigorously analyze a generalization of the Self-Organizing Map (SOM) for processing sequential data, Recursive SOM (RecSOM) (Voegtlin, 2002), as a non-autonomous dynamical system consisting of a set of fixed input maps. We argue that contractive fixed input maps are likely to produce Markovian organizations of re-ceptive fields on the RecSOM map. We derive bounds on parameter β (weighting the importance of importing past information when process-ing sequences) under which contractiveness of the fixed input maps is guaranteed. Some generalizations of SOM contain a dynamic module responsible for processing temporal contexts as an integral part of the model. We show that Markovian topographic maps of sequential data can be produced using a simple fixed (non-adaptable) dynamic module externally feeding a standard topographic model designed to process static vectorial data of fixed dimensionality (e.g. SOM). However, by allowing trainable feedback connections one can obtain Markovian maps with superior memory depth and topography preservation. We elaborate upon the importance of non-Markovian organizations in topographic maps of 2sequential data.
Topographic Organization of Receptive Fields in Recursive Self-Organizing Map
- In Advances in Natural Computation (pp. 676-685). Lecture Notes in Computer Science
, 2005
"... Abstract. Recently, there has been an outburst of interest in extending topographic maps of vectorial data to more general data structures, such as sequences or trees. The representational capabilities and internal representations of the models are not well understood. We concentrate on a generaliza ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
Abstract. Recently, there has been an outburst of interest in extending topographic maps of vectorial data to more general data structures, such as sequences or trees. The representational capabilities and internal representations of the models are not well understood. We concentrate on a generalization of the Self-Organizing Map (SOM) for processing sequential data – the Recursive SOM (RecSOM [1]). We argue that contractive fixed-input dynamics of RecSOM is likely to lead to Markovian organizations of receptive fields on the map. We show that Markovian topographic maps of sequential data can be produced using a simple fixed (non-adaptable) dynamic module externally feeding a standard topographic model designed to process static vectorial data of fixed dimensionality (e.g. SOM). We elaborate upon the importance of non-Markovian organizations in topographic maps of sequential data. 1
Generalization and Systematicity in Echo State Networks
"... Echo state networks (ESNs) are recurrent neural networks that can be trained efficiently because the weights of recurrent connections remain fixed at random values. Investigations of these networks ’ ability to generalize in sentence-processing tasks have resulted in mixed outcomes. Here, we argue t ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
Echo state networks (ESNs) are recurrent neural networks that can be trained efficiently because the weights of recurrent connections remain fixed at random values. Investigations of these networks ’ ability to generalize in sentence-processing tasks have resulted in mixed outcomes. Here, we argue that ESNs do generalize but that they are not systematic, which we define as the ability to generally outperform Markov models on test sentences that violate the training sentences ’ grammar. Moreover, we show that systematicity in ESNs can easily be obtained by switching from arbitrary to informative representations of words, suggesting that the information provided by such representations facilitates connectionist systematicity.
Detecting and Sorting Targeting Peptides with Neural Networks and
"... This paper presents a composite multi-layer classifier system for predicting the subcellular localization of proteins based on their amino acid sequence. The work is an extension of our previous predictor PProwler v1.1 which is itself built upon the series of predictors SignalP and TargetP. In th ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
This paper presents a composite multi-layer classifier system for predicting the subcellular localization of proteins based on their amino acid sequence. The work is an extension of our previous predictor PProwler v1.1 which is itself built upon the series of predictors SignalP and TargetP. In this study we outline experiments conducted to improve the classifier design. The major improvement came from using Support Vector machines as a 'smart gate' sorting the outputs of several di#erent targeting peptide detection networks

