Results 1 -
3 of
3
Phoneme Probability Estimation with Dynamic Sparsely Connected Artificial Neural Networks
, 1997
"... This paper presents new methods for training large neural networks for phoneme probability estimation. An architecture combining time-delay windows and recurrent connections is used to capture the important dynamic information of the speech signal. Because the number of connections in a fully connec ..."
Abstract
-
Cited by 23 (1 self)
- Add to MetaCart
This paper presents new methods for training large neural networks for phoneme probability estimation. An architecture combining time-delay windows and recurrent connections is used to capture the important dynamic information of the speech signal. Because the number of connections in a fully connected recurrent network grows super-linear with the number of hidden units, schemes for sparse connection and connection pruning are explored. It is found that sparsely connected networks outperform their fully connected counterparts with an equal number of connections. The implementation of the combined architecture and training scheme is described in detail. The networks are evaluated in a hybrid HMM/ANN system for phoneme recognition on the TIMIT database, and for word recognition on the WAXHOLM database. The achieved phone error-rate, 27.8%, for the standard 39 phoneme set on the core test-set of the TIMIT database is in the range of the lowest reported. All training and simulation softwar...
Neural Fuzzy Techniques in Vehicle Acoustic Signal Classification
, 1997
"... Vehicle acoustic signals have long been considered as unwanted traffic noise. In this research acoustic signals generated by each vehicle will be used to detect its presence and classify its type. Circular arrays of microphones were designed and built to detect desired signals and suppress unwanted ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Vehicle acoustic signals have long been considered as unwanted traffic noise. In this research acoustic signals generated by each vehicle will be used to detect its presence and classify its type. Circular arrays of microphones were designed and built to detect desired signals and suppress unwanted ones. Circular arrays with multiple rings have an interesting and important property that is constant sidelobe levels. A modified genetic algorithm that can work directly with real numbers is used in the circular array design. It offers more effective ways to solve numerical problems than a standard genetic algorithm.
The Free Speech Journal, Issue 5(1997)
"... This paper presents new methods for training large neural networks for phoneme probability estimation. An architecture combining time-delay windows and recurrent connections is used to capture the important dynamic information of the speech signal. Because the number of connections in a fully con ..."
Abstract
- Add to MetaCart
This paper presents new methods for training large neural networks for phoneme probability estimation. An architecture combining time-delay windows and recurrent connections is used to capture the important dynamic information of the speech signal. Because the number of connections in a fully connected recurrent network grows super-linear with the number of hidden units, schemes for sparse connection and connection pruning are explored. It is found that sparsely connected networks outperform their fully connected counterparts with an equal number of connections. The implementation of the combined architecture and training scheme is described in detail. The networks are evaluated in a hybrid HMM/ANN system for phoneme recognition on the TIMIT database, and for word recognition on the WAXHOLM database. The achieved phone error-rate, 27.8%, for the standard 39 phoneme set on the core test-set of the TIMIT database is in the range of the lowest reported. All training and simulation software used is made freely available by the author, and detailed information about the software and the training process is given in an Appendix.

