Results 1 -
3 of
3
Introducing Linguistic Constraints into Statistical Language Modeling
- In International Conference on Spoken Language Processing
, 1996
"... Building robust stochastic language models is a major issue in speech recognition systems. Conventional word-based n-gram models do not capture any linguistic constraints inherent in speech. In this paper the notion of function and contentwords #open#closed word classes# is used to provide linguisti ..."
Abstract
-
Cited by 8 (2 self)
- Add to MetaCart
Building robust stochastic language models is a major issue in speech recognition systems. Conventional word-based n-gram models do not capture any linguistic constraints inherent in speech. In this paper the notion of function and contentwords #open#closed word classes# is used to provide linguistic knowledge that can be incorporated into language models. Function words are articles, prepositions, personal pronouns # contentwords are nouns, verbs, adjectives and adverbs. Based on this class de#nition resulting in function and contentword markers, a new language model is de#ned. A combination of the word-based model with this new model will be introduced. The combined model shows modest improvements both in perplexity results and recognition performance. 1.
Exploiting headword dependency and predictive clustering for language modeling
- In EMNLP
, 2002
"... This paper presents several practical ways of incorporating linguistic structure into language models. A headword detector is first applied to detect the headword of each phrase in a sentence. A permuted headword trigram model (PHTM) is then generated from the annotated corpus. Finally, PHTM is exte ..."
Abstract
-
Cited by 7 (5 self)
- Add to MetaCart
This paper presents several practical ways of incorporating linguistic structure into language models. A headword detector is first applied to detect the headword of each phrase in a sentence. A permuted headword trigram model (PHTM) is then generated from the annotated corpus. Finally, PHTM is extended to a cluster PHTM (C-PHTM) by defining clusters for similar words in the corpus. We evaluated the proposed models on the realistic application of Japanese Kana-Kanji conversion. Experiments show that C-PHTM achieves 15 % error rate reduction over the word trigram model. This
Capturing long distance dependency for language modeling: an empirical study
- In Proceedings of the First International Joint Conference on Natural Language Processing (IJCNLP-04
, 2004
"... This paper presents an extensive empirical study on two language modeling techniques, linguistically-motivated word skipping and predictive clustering, both of which are used in capturing long distance word dependencies that are beyond the scope of a word trigram model. We compare the techniques to ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
This paper presents an extensive empirical study on two language modeling techniques, linguistically-motivated word skipping and predictive clustering, both of which are used in capturing long distance word dependencies that are beyond the scope of a word trigram model. We compare the techniques to others that were proposed previously for the same purpose. We evaluate the resulting models on the task of Japanese Kana-Kanji conversion. We show that the two techniques, while simple, outperform existing methods studied in this paper, and lead to language models that perform significantly better than a word trigram model. We also investigate how factors such as training corpus size and genre affect the performance of the models. 1

