• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Combining Distributional and Morphological Information for Part of Speech Induction (2003)

by Alexander Clark
Add To MetaCart

Tools

Sorted by:
Results 11 - 20 of 21
Next 10 →

Controlling Complexity in Part-of-Speech Induction

by João V. Graça, Kuzman Ganchev, Luísa Coheur, Fernando Pereira, Ben Taskar
"... We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories from unlabeled text. The standard maximum-likelihood hidden Markov model for this task performs poorly, because of its weak inductive bias and large model capacity. We address this problem by refining t ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories from unlabeled text. The standard maximum-likelihood hidden Markov model for this task performs poorly, because of its weak inductive bias and large model capacity. We address this problem by refining the model and modifying the learning objective to control its capacity via parametric and non-parametric constraints. Our approach enforces word-category association sparsity, adds morphological and orthographic features, and eliminates hard-to-estimate parameters for rare words. We develop an efficient learning algorithm that is not much more computationally intensive than standard training and provide an open-source implementation. Our experiments on five diverse languages (Bulgarian, Danish, English, Portuguese, Spanish) achieve significant improvements compared with previous methods for the same task. 1.

Online Entropy-based Model of Lexical Category Acquisition

by Grzegorz Chrupała, Afra Alishahi
"... Children learn a robust representation of lexical categories at a young age. We propose an incremental model of this process which efficiently groups words into lexical categories based on their local context using an information-theoretic criterion. We train our model on a corpus of childdirected s ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Children learn a robust representation of lexical categories at a young age. We propose an incremental model of this process which efficiently groups words into lexical categories based on their local context using an information-theoretic criterion. We train our model on a corpus of childdirected speech from CHILDES and show that the model learns a fine-grained set of intuitive word categories. Furthermore, we propose a novel evaluation approach by comparing the efficiency of our induced categories against other category sets (including traditional part of speech tags) in a variety of language tasks. We show the categories induced by our model typically

Bootstrapping Dependency Grammar Inducers from Incomplete Sentence Fragments via Austere Models

by Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky, Jeffrey Heinz, Colin De La Higuera, Tim Oates
"... Modern grammarinduction systems often employ curriculum learning strategies that begin by training on a subset of all available input that is considered simpler than the full data. Traditionally, filtering has been at granularities of whole input units, e.g., discarding entire sentences with too man ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Modern grammarinduction systems often employ curriculum learning strategies that begin by training on a subset of all available input that is considered simpler than the full data. Traditionally, filtering has been at granularities of whole input units, e.g., discarding entire sentences with too many words or punctuation marks. We propose instead viewing interpunctuation fragments as atoms, initially, thus making some simple phrases and clauses of complex sentences available to training sooner. Splitting input text at punctuation in this way improved our state-of-the-art grammar induction pipeline. We observe that resulting partial data, i.e., mostly incomplete sentence fragments, can be analyzed using reduced parsing models which, we show, can be easier to bootstrap than more nuanced grammars. Startingwithanew, baredependency-and-boundarymodel(DBM-0), ourgrammarinducer attained 61.2 % directed dependency accuracy on Section 23 (all sentences) of the Wall Street Journal corpus: more than 2 % higher than previous published results for this task.

Deriving an Ambiguous Word’s Part-of-Speech Distribution from Unannotated Text

by unknown authors
"... A distributional method for part-of-speech induction is presented which, in contrast to most previous work, determines the part-of-speech distribution of syntactically ambiguous words without explicitly tagging the underlying text corpus. This is achieved by assuming that the word pair consisting of ..."
Abstract - Add to MetaCart
A distributional method for part-of-speech induction is presented which, in contrast to most previous work, determines the part-of-speech distribution of syntactically ambiguous words without explicitly tagging the underlying text corpus. This is achieved by assuming that the word pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at this position, and by clustering the neighbor pairs on the basis of their middle words as observed in a large corpus. The results obtained in this way are evaluated by comparing them to the part-of-speech distributions as found in the manually tagged Brown corpus. 1

A Practical Solution to the Problem of Automatic Part-of-Speech Induction from Text

by unknown authors
"... The problem of part-of-speech induction from text involves two aspects: Firstly, a set of word classes is to be derived automatically. Secondly, each word of a vocabulary is to be assigned to one or several of these word classes. In this paper we present a method that solves both problems with good ..."
Abstract - Add to MetaCart
The problem of part-of-speech induction from text involves two aspects: Firstly, a set of word classes is to be derived automatically. Secondly, each word of a vocabulary is to be assigned to one or several of these word classes. In this paper we present a method that solves both problems with good accuracy. Our approach adopts a mixture of statistical methods that have been successfully applied in word sense induction. Its main advantage over previous attempts is that it reduces the syntactic space to only the most important dimensions, thereby almost eliminating the otherwise omnipresent problem of data sparseness. 1

Categorizing Local Contexts as a Step in Grammatical Category Induction

by Markus Dickinson, Charles Jochim
"... Building on the use of local contexts, or ..."
Abstract - Add to MetaCart
Building on the use of local contexts, or

Online Entropy-based Model of Lexical Category Acquisition

by Afra Alishahi
"... Humans incrementally learn lexical categories from exposure to language Children form robust lexical categories early on ..."
Abstract - Add to MetaCart
Humans incrementally learn lexical categories from exposure to language Children form robust lexical categories early on

Unsupervised Learning of . . .

by Burcu Can, Suresh Manandhar
"... This paper presents a method for unsupervised learning of morphology that exploits the syntactic categories of words. Previous research [4][12] on learning of morphology and syntax has shown that both kinds of knowledge affect each other making it possible to use one type of knowledge to help the ot ..."
Abstract - Add to MetaCart
This paper presents a method for unsupervised learning of morphology that exploits the syntactic categories of words. Previous research [4][12] on learning of morphology and syntax has shown that both kinds of knowledge affect each other making it possible to use one type of knowledge to help the other. In this work, we make use of syntactic information i.e. Part-of-Speech (PoS) tags of words to aid morphological analysis. We employ an existing unsupervised PoS tagging algorithm for inducing the PoS categories. A distributional clustering algorithm is developed for inducing morphological paradigms.

Computational Learning Theory and Language Acquisition

by Alexander Clark, et al. , 2010
"... ..."
Abstract - Add to MetaCart
Abstract not found

The Effect of Word-internal Properties on Syntactic Categorization: A Computational Modeling Approach

by Fatmeh Torabi Asr, Afsaneh Fazly, Zohreh Azimifar
"... We study the acquisition of abstract syntactic categories of words in children by using a computational model of categorization. Especially, we examine the effect of word-internal properties, such as morphological and phonological cues, on the identification of different categories, such as nouns, v ..."
Abstract - Add to MetaCart
We study the acquisition of abstract syntactic categories of words in children by using a computational model of categorization. Especially, we examine the effect of word-internal properties, such as morphological and phonological cues, on the identification of different categories, such as nouns, verbs, and determiners. To evaluate our model, we use it to determine the syntactic category of actual novel words selected from naturalistic child-directed utterances. We argue that such an evaluation is necessary for a better understanding of the effect of different cues (including word-internal properties and contextual
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University