Results 1 - 10
of
10
A Maximum Entropy approach to Natural Language Processing
- COMPUTATIONAL LINGUISTICS
, 1996
"... The concept of maximum entropy can be traced back along multiple threads to Biblical times. Only recently, however, have computers become powerful enough to permit the widescale application of this concept to real world problems in statistical estimation and pattern recognition. In this paper we des ..."
Abstract
-
Cited by 847 (6 self)
- Add to MetaCart
The concept of maximum entropy can be traced back along multiple threads to Biblical times. Only recently, however, have computers become powerful enough to permit the widescale application of this concept to real world problems in statistical estimation and pattern recognition. In this paper we describe a method for statistical modeling based on maximum entropy. We present a maximum-likelihood approach for automatically constructing maximum entropy models and describe how to implement this approach efficiently, using as examples several problems in natural language processing.
A Gaussian Prior for Smoothing Maximum Entropy Models
, 1999
"... In certain contexts, maximum entropy (ME) modeling can be viewed as maximum likelihood training for exponential models, and like other maximum likelihood methods is prone to overfitting of training data. Several smoothing methods for maximum entropy models have been proposed to address this problem, ..."
Abstract
-
Cited by 181 (1 self)
- Add to MetaCart
In certain contexts, maximum entropy (ME) modeling can be viewed as maximum likelihood training for exponential models, and like other maximum likelihood methods is prone to overfitting of training data. Several smoothing methods for maximum entropy models have been proposed to address this problem, but previous results do not make it clear how these smoothing methods compare with smoothing methods for other types of related models. In this work, we survey previous work in maximum entropy smoothing and compare the performance of several of these algorithms with conventional techniques for smoothing n-gram language models. Because of the mature body of research in n-gram model smoothing and the close connection between maximum entropy and conventional n-gram models, this domain is well-suited to gauge the performance of maximum entropy smoothing methods. Over a large number of data sets, we find that an ME smoothing method proposed to us by Lafferty [1] performs as well as or better tha...
The Web as a Parallel Corpus
- Computational Linguistics
, 2003
"... Parallel corpora have become an essential resource for work in multilingual natural language processing. In this report, we describe our work using the STRAND system for mining parallel text on the World Wide Web, first reviewing the original algorithm and results and then presenting a set of signif ..."
Abstract
-
Cited by 101 (3 self)
- Add to MetaCart
Parallel corpora have become an essential resource for work in multilingual natural language processing. In this report, we describe our work using the STRAND system for mining parallel text on the World Wide Web, first reviewing the original algorithm and results and then presenting a set of significant enhancements. These enhancements include the use of supervised learning based on structural features of documents to improve classification performance, a new content-based measure of translational equivalence, and adaptation of the system to take advantage of the Internet Archive for mining parallel text from the Web on a large scale.
Automating knowledge acquisition for machine translation
- AI Mag
, 1997
"... How can we write a computer program to translate an English sentence into Japanese? Anyone who has taken a graduate-level course in Arti cial Intelligence knows the answer. First, compute the meaning of the English sentence. That is, convert it into logic or your favorite knowledge ..."
Abstract
-
Cited by 30 (3 self)
- Add to MetaCart
How can we write a computer program to translate an English sentence into Japanese? Anyone who has taken a graduate-level course in Arti cial Intelligence knows the answer. First, compute the meaning of the English sentence. That is, convert it into logic or your favorite knowledge
Cross-Language Text Retrieval Research in the USA. Presented at 3 rd ERCIM DELOS Workshop
- TREC-8 Experiments at Maryland: CLIR, QA and Routing. In Proceedings of the Eighth Text Retrieval Conference (TREC8
, 1997
"... The increasing availability of networked access to multilingual text collections has generated increased interest in the development ofe ective and e cient cross-language text retrieval technology. Examples of cross-language text retrieval applications are discussed and a classi cation of known appr ..."
Abstract
-
Cited by 20 (0 self)
- Add to MetaCart
The increasing availability of networked access to multilingual text collections has generated increased interest in the development ofe ective and e cient cross-language text retrieval technology. Examples of cross-language text retrieval applications are discussed and a classi cation of known approaches is introduced. This is used to structure a comprehensive discussion of published research and known commercial practice in the United States on the topic. The paper concludes by describing the structure of sponsored research on cross-language text retrieval in the United States and some brief observations of the potential for collaboration with European researchers on aspects of the problem which are of mutual interest. 1
Grammar Inference and Statistical Machine Translation
, 1998
"... NLP researchers face a dilemma: on one side, it is unarguably accepted that languages have internal structure rather than strings of words. On the other side, they find it very difficult and expensive to write grammars that have good coverage of language structures. Statistical machine translation ..."
Abstract
-
Cited by 13 (0 self)
- Add to MetaCart
NLP researchers face a dilemma: on one side, it is unarguably accepted that languages have internal structure rather than strings of words. On the other side, they find it very difficult and expensive to write grammars that have good coverage of language structures. Statistical machine translation tries to cope with this problem by ignoring language structures and using a statistical models to depict the translation process. Most of the translation models are word-based. While the approach has achieved surprisingly good performance comparable to the best commercial systems, many questions remain in the machine translation community. Can the statistical word-based translation still perform well on language pairs with radically different linguistic structures? How would it function with less training data or with spoken languages? The thesis work investigated these questions. In summary, word-based alignment model is a major cause of errors in German-English statistical spoken language...
A domain speci c lexicon acquisition tool for cross-language information retrieval
- In Proceedings of RIAO'97 Conference on ComputerAssisted Searching on the Internet
, 1997
"... With the recent enormous increase of information dissemination via the web as incentive there is a growing interest in supporting tools for cross-language retrieval. In this paper we describe a disclosure and retrieval approach that ful lls the needs of both information providers and users by o erin ..."
Abstract
-
Cited by 3 (3 self)
- Add to MetaCart
With the recent enormous increase of information dissemination via the web as incentive there is a growing interest in supporting tools for cross-language retrieval. In this paper we describe a disclosure and retrieval approach that ful lls the needs of both information providers and users by o ering fast and cheap access to a large amounts of documents from various language domains. Relevant information can be retrieved irrespective of the language used for the speci cation of a query. In order to realize this type of multilingual functionality theavailability of several translation tools is needed, both of a generic and a domain speci c nature. Domain speci c tools are often not available or only against large costs. In this paper we will therefore focus on a way to reduce these costs, namely the automatic derivation of multilingual resources from so-called parallel text corpora. The bene ts of this approach will be illustrated for an example system, i.e. the demonstrator developed within the project Twenty-One, which is tuned to information from the area of sustainable development.
Confidence Factor Assignment to Translation Templates
, 1998
"... that I have read this thesis and that in my opinion it is fully adequate, in scope ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
that I have read this thesis and that in my opinion it is fully adequate, in scope
Information as Statistical Translation
"... We propose a new probabilistic approach to information retrieval based upon the ideas and methods of statistical machine translation. The central ingredientinthis approach is a statistical model of how a user might distill or "translate" a given document into a query. To assess the relevance of a d ..."
Abstract
- Add to MetaCart
We propose a new probabilistic approach to information retrieval based upon the ideas and methods of statistical machine translation. The central ingredientinthis approach is a statistical model of how a user might distill or "translate" a given document into a query. To assess the relevance of a document to a user's query, we estimate the probability that the query would have been generated as a translation of the document, and factor in the user's general preferences in the form of a prior distribution over documents. We propose a simple, well motivated model of the document-to-query translation process, and describe an algorithm for learning the parameters of this model in an unsupervised manner from a collection of documents. As we show, one can view this approach as a generalization and justification of the "language modeling" strategy recently proposed by Ponte and Croft. In a series of experiments on TREC data, a simple translation-based retrieval system performs well in compari...

