Results 1 - 10
of
58
Unsupervised Language Acquisition: Theory and Practice
, 2001
"... In this thesis I present various algorithms for the unsupervised machine learning of aspects of natural languages using a variety of statistical models. The scientific object of the work is to examine the validity of the so-called Argument from the Poverty of the Stimulus advanced in favour of the p ..."
Abstract
-
Cited by 32 (0 self)
- Add to MetaCart
In this thesis I present various algorithms for the unsupervised machine learning of aspects of natural languages using a variety of statistical models. The scientific object of the work is to examine the validity of the so-called Argument from the Poverty of the Stimulus advanced in favour of the proposition that humans have language-specific innate knowledge. I start by examining an a priori argument based on Gold's theorem, that purports to prove that natural languages cannot be learned, and some formal issues related to the choice of statistical grammars rather than symbolic grammars. I present three novel algorithms for learning various parts of natural languages: first, an algorithm for the induction of syntactic categories from unlabelled text using distributional information, that can deal with ambiguous and rare words; secondly, a set of algorithms for learning morphological processes in a variety of languages, including languages such as Arabic with nonconcatenative morphology; thirdly an algorithm for the unsupervised induction of a context-free grammar from tagged text. I carefully examine the interaction between the various components, and show how these algorithms can form the basis for a empiricist model of language acquisition. I therefore conclude that the Argument from the Poverty of the Stimulus is unsupported by the evidence.
The Order of Prenominal Adjectives in Natural Language Generation
, 2000
"... The order of prenominal adjectival modifiers in English is governed by complex and difficult to describe constraints which straddle the boundary between competence and performance. ..."
Abstract
-
Cited by 19 (0 self)
- Add to MetaCart
The order of prenominal adjectival modifiers in English is governed by complex and difficult to describe constraints which straddle the boundary between competence and performance.
Robust, Applied Morphological Generation
- IN IN PROCEEDINGS OF THE FIRST INTERNATIONAL NATURAL LANGUAGE GENERATION CONFERENCE
, 2000
"... In practical natural language generation systems it is often advantageous to have a separate component that deals purely with morphological processing. We present such a component: a fast and robust morphological generator for English based on finite-state techniques that generates a word form given ..."
Abstract
-
Cited by 19 (1 self)
- Add to MetaCart
In practical natural language generation systems it is often advantageous to have a separate component that deals purely with morphological processing. We present such a component: a fast and robust morphological generator for English based on finite-state techniques that generates a word form given a specification of the lemma, part-of-speech, and the type of inflection required. We describe how this morphological generator is used in a prototype system for automatic simplification of English newspaper text, and discuss practical morphological and orthographic issues we have encountered in generation of unrestricted text within this application.
A Probabilistic Account of Logical Metonymy
, 2003
"... In this article we investigate logical metonymy, that is, constructions in which the argument of a word in syntax appears to be different from that argument in logical form (e.g., enjoy the book means enjoy reading the book, and easy problem means a problem that is easy to solve). The systematic var ..."
Abstract
-
Cited by 15 (1 self)
- Add to MetaCart
In this article we investigate logical metonymy, that is, constructions in which the argument of a word in syntax appears to be different from that argument in logical form (e.g., enjoy the book means enjoy reading the book, and easy problem means a problem that is easy to solve). The systematic variation in the interpretation of such constructions suggests a rich and complex theory of composition on the syntax/semantics interface. Linguistic accounts of logical metonymy typically fail to describe exhaustively all the possible interpretations, or they don't rank those interpretations in terms of their likelihood. In view of this, we acquire the meanings of metonymic verbs and adjectives from a large corpus and propose a probabilistic model that provides a ranking on the set of possible interpretations. We identify the interpretations automatically by exploiting the consistent correspondences between surface syntactic cues and meaning. We evaluate our results against paraphrase judgments elicited experimentally from humans and show that the model's ranking of meanings correlates reliably with human intuitions.
Exploiting Parallel Texts to Produce a Multilingual Sense Tagged Corpus for Word Sense Disambiguation
- In Proceedings of RANLP-05, Borovets
, 2005
"... We describe an approach to the automatic creation of a sense tagged corpus intended to train a word sense disambiguation (WSD) system for English-Portuguese machine translation. The approach uses parallel corpora, translation dictionaries and a set of straightforward heuristics. In an evaluati ..."
Abstract
-
Cited by 9 (6 self)
- Add to MetaCart
We describe an approach to the automatic creation of a sense tagged corpus intended to train a word sense disambiguation (WSD) system for English-Portuguese machine translation. The approach uses parallel corpora, translation dictionaries and a set of straightforward heuristics. In an evaluation with nine corpora containing 10 ambiguous verbs, the approach achieved an average precision of 94%, compared with 58% when a state of the art statistical alignment tool was used. The resulting corpus consists of 113,802 instances tagged with the senses (i.e., translations) of the 10 verbs. Besides the word-sense tags, this corpus provides other useful information, such as POS-tags, and can be readily used as input to supervised machine learning algorithms in order to build WSD models for machine translation.
Frequency of basic English grammatical structures: A corpus analysis
- JOURNAL OF MEMORY AND LANGUAGE
, 2007
"... Many recent models of language comprehension have stressed the role of distributional frequencies in determining the
relative accessibility or ease of processing associated with a particular lexical item or sentence structure. However, there
exist relatively few comprehensive analyses of structural ..."
Abstract
-
Cited by 9 (1 self)
- Add to MetaCart
Many recent models of language comprehension have stressed the role of distributional frequencies in determining the
relative accessibility or ease of processing associated with a particular lexical item or sentence structure. However, there
exist relatively few comprehensive analyses of structural frequencies, and little consideration has been given to the appro-
priateness of using any particular set of corpus frequencies in modeling human language. We provide a comprehensive set
of structural frequencies for a variety of written and spoken corpora, focusing on structures that have played a critical role
in debates on normal psycholinguistics, aphasia, and child language acquisition, and compare our results with those from
several recent papers to illustrate the implications and limitations of using corpus data in psycholinguistic research.
Interactive clustering of text collections according to a user-specified criterion
- In Proceedings of IJCAI
, 2007
"... Document clustering is traditionally tackled from the perspective of grouping documents that are topically similar. However, many other criteria for clustering documents can be considered: for example, documents ’ genre or the author’s mood. We propose an interactive scheme for clustering document c ..."
Abstract
-
Cited by 9 (1 self)
- Add to MetaCart
Document clustering is traditionally tackled from the perspective of grouping documents that are topically similar. However, many other criteria for clustering documents can be considered: for example, documents ’ genre or the author’s mood. We propose an interactive scheme for clustering document collections, based on any criterion of the user’s preference. The user holds an active position in the clustering process: first, she chooses the types of features suitable to the underlying task, leading to a task-specific document representation. She can then provide examples of features— if such examples are emerging, e.g., when clustering by the author’s sentiment, words like ‘perfect’, ‘mediocre’, ‘awful ’ are intuitively good features. The algorithm proceeds iteratively, and the user can fix errors made by the clustering system at the end of each iteration. Such an interactive clustering method demonstrates excellent results on clustering by sentiment, substantially outperforming an SVM trained on a large amount of labeled data. Even if features are not provided because they are not intuitively obvious to the user—e.g., what would be good features for clustering by genre using partof-speech trigrams?—our multi-modal clustering method performs significantly better than k-means and Latent Dirichlet Allocation (LDA). 1
A Feedback-Augmented Method for Detecting Errors in the Writing of Learners of English
"... This paper proposes a method for detecting errors in article usage and singular plural usage based on the mass count distinction. First, it learns decision lists from training data generated automatically to distinguish mass and count nouns. Then, in order to improve its performance, it is augmented ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
This paper proposes a method for detecting errors in article usage and singular plural usage based on the mass count distinction. First, it learns decision lists from training data generated automatically to distinguish mass and count nouns. Then, in order to improve its performance, it is augmented by feedback that is obtained from the writing of learners. Finally, it detects errors by applying rules to the mass count distinction. Experiments show that it achieves a recall of 0.71 and a precision of 0.72 and outperforms other methods used for comparison when augmented by feedback.
Unsupervised Argument Identification for Semantic Role Labeling
"... The task of Semantic Role Labeling (SRL) is often divided into two sub-tasks: verb argument identification, and argument classification. Current SRL algorithms show lower results on the identification sub-task. Moreover, most SRL algorithms are supervised, relying on large amounts of manually create ..."
Abstract
-
Cited by 8 (1 self)
- Add to MetaCart
The task of Semantic Role Labeling (SRL) is often divided into two sub-tasks: verb argument identification, and argument classification. Current SRL algorithms show lower results on the identification sub-task. Moreover, most SRL algorithms are supervised, relying on large amounts of manually created data. In this paper we present an unsupervised algorithm for identifying verb arguments, where the only type of annotation required is POS tagging. The algorithm makes use of a fully unsupervised syntactic parser, using its output in order to detect clauses and gather candidate argument collocation statistics. We evaluate our algorithm on PropBank10, achieving a precision of 56%, as opposed to 47 % of a strong baseline. We also obtain an 8 % increase in precision for a Spanish corpus. This is the first paper that tackles unsupervised verb argument identification without using manually encoded rules or extensive lexical or syntactic resources. 1
Identifying non-referential it: a machine learning approach incorporating linguistically motivated patterns
- In Proceedings of the ACL Workshop on Feature Selection for Machine Learning in NLP, Ann Arbor
, 2005
"... In this paper, we present a machine learning system for identifying non-referential it. Types of non-referential it are examined to determine relevant linguistic patterns. The patterns are incorporated as features in a machine learning system which performs a binary classification of it as referenti ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
In this paper, we present a machine learning system for identifying non-referential it. Types of non-referential it are examined to determine relevant linguistic patterns. The patterns are incorporated as features in a machine learning system which performs a binary classification of it as referential or non-referential in a POS-tagged corpus. The selection of relevant, generalized patterns leads to a significant improvement in performance. 1

