Results 1 -
6 of
6
Exploiting Syntactic Structure for Natural Language Modeling
, 2000
"... The thesis presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an original probabilistic parameterization of a shift-reduce parse ..."
Abstract
-
Cited by 27 (0 self)
- Add to MetaCart
The thesis presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an original probabilistic parameterization of a shift-reduce parser. A maximum likelihood reestimation procedure belonging to the class of expectation-maximization algorithms is employed for training the model. Experiments on the Wall Street Journal, Switchboard and Broadcast News corpora show improvement in both perplexity and word error rate -- word lattice rescoring -- over the standard 3-gram language model. The significance of the thesis lies in presenting an original approach to language modeling that uses the hierarchical -- syntactic -- structure in natural language to improve on current 3-gram modeling techniques for large vocabulary speech recognition.
Recognition Performance of a Structured Language Model
, 1999
"... A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history --- thus enabling the use of extended distance dependencies --- in an attempt t ..."
Abstract
-
Cited by 19 (1 self)
- Add to MetaCart
A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history --- thus enabling the use of extended distance dependencies --- in an attempt to complement the locality of currently used trigram models. The structured language model, its probabilistic parameterization and performance in a two-pass speech recognizer are presented. Experiments on the SWITCHBOARD corpus show an improvement in both perplexity and word error rate over conventional trigram models. 1. INTRODUCTION The main goal of the present work is to develop and evaluate a language model that uses syntactic structure to model longdistance dependencies. The model we present is closely related to the one investigated in [1], however different in a few important aspects: ffl our model operates in a left-to-right manner, allowing the decoding of word lattices, as oppose...
Techniques for modelling Phonological Processes in Automatic Speech Recognition
, 2001
"... Declaration This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration, except where stated. It has not been submitted in whole or part for a degree at any other university. The length of this thesis including footnotes and appendices does ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
Declaration This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration, except where stated. It has not been submitted in whole or part for a degree at any other university. The length of this thesis including footnotes and appendices does not exceed 29,500 words and includes no more than 40 figures. 1 Systems which automatically transcribe carefully dictated speech are now commercially available, but their performance degrades dramatically when the speaking style of users becomes more relaxed or conversational. This dissertation focuses on techniques that aim to improve the robustness of statistical speech transcription systems to conversational speaking styles. The dissertation shows first that the performance degradation occuring as speech becomes more conversational is severe and is partially attributable to differences in the acoustic realizations of sentences. Hypothesizing that the quantifiably wider range of
A Twostage Statistical Word Segmentation System for Chinese
- Proceedings of The 2nd SIGHAN Workshop on Chinese Language Processing
, 2003
"... In this paper we present a two-stage statistical word segmentation system for Chinese based on word bigram and wordformation models. This system was evaluated on Peking University corpora at the First International Chinese Word Segmentation Bakeoff. We also give results and discussions on this evalu ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
In this paper we present a two-stage statistical word segmentation system for Chinese based on word bigram and wordformation models. This system was evaluated on Peking University corpora at the First International Chinese Word Segmentation Bakeoff. We also give results and discussions on this evaluation. 1
Active Learning An Explicit Treatment of Unreliable Parameters
, 2008
"... Active learning reduces annotation costs for supervised learning by concentrating la-belling efforts on the most informative data. Most active learning methods assume that the model structure is fixed in advance and focus upon improving parameters within that structure. However, this is not appropri ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Active learning reduces annotation costs for supervised learning by concentrating la-belling efforts on the most informative data. Most active learning methods assume that the model structure is fixed in advance and focus upon improving parameters within that structure. However, this is not appropriate for natural language processing where the model structure and associated parameters are determined using labelled data. Ap-plying traditional active learning methods to natural language processing can fail to produce expected reductions in annotation cost. We show that one of the reasons for this problem is that active learning can only select examples which are already cov-ered by the model. In this thesis, we better tailor active learning to the need of natural language processing as follows. We formulate the Unreliable Parameter Principle: Active learning should explicitly and additionally address unreliably trained model parameters in order to optimally reduce classification error. In order to do so, we should target both missing events and infrequent events. We demonstrate the effectiveness of such an approach for a range of natural lan-
Using HMMs to Quantify Signals from DNA
"... It was recently shown that individual molecules of single-stranded DNA can be forced through a nanoscale pore by an electric field. We demonstrate signal processing methods to detect and measure sub-states in the DNA-induced current blockades during transport. The current flow is approximately piece ..."
Abstract
- Add to MetaCart
It was recently shown that individual molecules of single-stranded DNA can be forced through a nanoscale pore by an electric field. We demonstrate signal processing methods to detect and measure sub-states in the DNA-induced current blockades during transport. The current flow is approximately piecewise stationary during these transport events, and we used an ergodic HMM to make maximum likelihood estimates of the event sub-states. Interestingly, the signal amplitude distributions caused by polynucleotides with the same length and composition depends on the direction the polymers transit the pore. Our methods indicate automatic extraction of structural information from individual DNA molecules is possible.

