A Maximum Entropy Approach to Adaptive Statistical Language Modeling
Computer, Speech and Language, 1996
Cited by 257 (12 self)
An adaptive statistical language model is described, which successfully integrates long-distance linguistic information with other knowledge sources. Most existing statistical language models exploit only the immediate history of a text. To extract information from further back in the document's history, we propose and use trigger pairs as the basic information-bearing elements. This allows the model to adapt its expectations to the topic of discourse. Next, statistical evidence from multiple sources must be combined. Traditionally, linear interpolation and its variants have been used, but these are shown here to be seriously deficient. Instead, we apply the principle of Maximum Entropy (ME). Each information source gives rise to a set of constraints, to be imposed on the combined estimate. The intersection of these constraints is the set of probability functions which are consistent with all the information sources. The function with the highest entropy within that set is the ME solution...
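The constraint-and-entropy idea in this abstract can be written compactly. The block below is a sketch of the standard maximum entropy formulation, not a quotation from the paper: the symbols h (history), w (next word), f_i (feature functions, e.g. an indicator for a trigger pair), K_i (target expectations estimated from data), and \lambda_i (weights) are notation assumed here for illustration.

```latex
% Each information source i contributes a constraint on the combined estimate p:
\sum_{h,w} p(h,w)\, f_i(h,w) = K_i , \qquad i = 1, \dots, n

% Among all p satisfying every constraint, choose the one with the highest entropy:
p^{*} = \arg\max_{p} \; -\sum_{h,w} p(h,w) \log p(h,w)

% When the constraints are consistent, the solution has exponential (log-linear) form:
p^{*}(h,w) = \frac{1}{Z} \exp\!\Big( \sum_{i=1}^{n} \lambda_i f_i(h,w) \Big),
\qquad
Z = \sum_{h,w} \exp\!\Big( \sum_{i=1}^{n} \lambda_i f_i(h,w) \Big)
```

By contrast, linear interpolation would average the component models' probabilities with fixed weights rather than solving for a single distribution that satisfies all constraints at once.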
Adaptive language modeling using the maximum entropy principle
Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993
Cited by 38 (5 self)
We describe our ongoing efforts at adaptive statistical language modeling. Central to our approach is the Maximum Entropy (ME) Principle, allowing us to combine evidence from multiple sources, such as long-distance triggers and conventional short-distance trigrams. Given consistent statistical evidence, a unique ME solution is guaranteed to exist, and an iterative algorithm exists which is guaranteed to converge to it. Among the advantages of this approach are its simplicity, its generality, and its incremental nature. Among its disadvantages are its computational requirements. We describe a succession of ME models, culminating in our current Maximum Likelihood / Maximum Entropy (ML/ME) model. Preliminary results with the latter show a 27% perplexity reduction as compared to a conventional trigram model.
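The abstract mentions an iterative algorithm that converges to the unique ME solution but does not name it. A common choice for this kind of problem is Generalized Iterative Scaling (GIS), and the following is a minimal sketch of it on a toy problem, assuming that is the kind of algorithm meant. The function name `gis`, the two binary features, and the target expectations 0.7 and 0.4 are all hypothetical and chosen only for illustration; a real language model would use trigram and trigger features over a large vocabulary.

```python
import math
from itertools import product


def gis(outcomes, features, targets, iterations=500):
    """Fit a maximum entropy distribution over `outcomes` whose feature
    expectations match `targets`, using Generalized Iterative Scaling."""
    # GIS assumes the feature values of every outcome sum to one constant C,
    # so a slack feature is appended to make up the difference.
    rows = [[f(x) for f in features] for x in outcomes]
    C = max(sum(row) for row in rows)
    rows = [row + [C - sum(row)] for row in rows]
    goals = list(targets) + [C - sum(targets)]
    lambdas = [0.0] * len(goals)  # all-zero weights give the uniform distribution

    def distribution():
        # Log-linear model: p(x) is proportional to exp(sum_i lambda_i * f_i(x)).
        weights = [math.exp(sum(l * v for l, v in zip(lambdas, row))) for row in rows]
        z = sum(weights)
        return [w / z for w in weights]

    for _ in range(iterations):
        probs = distribution()
        # Expected value of each feature under the current model.
        expected = [sum(p * row[i] for p, row in zip(probs, rows))
                    for i in range(len(goals))]
        # GIS update: move each weight toward matching its target expectation.
        lambdas = [l + math.log(g / e) / C
                   for l, g, e in zip(lambdas, goals, expected)]
    return dict(zip(outcomes, distribution()))


# Toy problem: outcomes are pairs (a, b) of bits, with constraints
# E[a] = 0.7 and E[b] = 0.4 (illustrative numbers, not from the paper).
outcomes = list(product([0, 1], repeat=2))
features = [lambda x: x[0], lambda x: x[1]]
model = gis(outcomes, features, targets=[0.7, 0.4])
for outcome in outcomes:
    print(outcome, round(model[outcome], 4))
```

Because the only constraints here are the two marginals, the maximum entropy solution is the distribution in which the bits are independent, so the fitted probability of (1, 1) should approach 0.7 × 0.4 = 0.28. The cost of recomputing every feature expectation on each pass is one way to see the computational requirements the abstract lists as a disadvantage.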
Automated Speech Understanding: The Next Generation
Modern speech understanding systems merge interdisciplinary technologies from Signal Processing, Pattern Recognition, Natural Language, and Linguistics into a unified statistical framework. These systems, which have applications in a wide range of signal processing problems, represent a revolution in Digital Signal Processing (DSP). Once a field dominated by vector-oriented processors and linear algebra-based mathematics, the current generation of DSP-based systems relies on sophisticated statistical models implemented using a complex software paradigm. Such systems are now capable of understanding continuous speech input for vocabularies of several thousand words in operational environments. The current generation of deployed systems, based on small vocabularies of isolated words, will soon be replaced by a new technology offering natural language access to vast information resources such as the Internet, and provide completely
EFFICIENT SEARCH ALGORITHMS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
Automatic speaker-independent speech recognition has made significant progress from the days of isolated word recognition. Today, state-of-the-art systems are capable of performing large-vocabulary continuous speech recognition (LVCSR) over complex domains such as news broadcasts and telephone conversations. A significant contribution to this advancement in technology is due to the development of search techniques that support efficient, suboptimal decoding over large search spaces and complex statistical models. Moreover, these decoding strategies are capable of dynamically integrating information from a number of diverse knowledge sources to determine the correct word hypothesis.