Results 1 - 10
of
102
CoNLL-X shared task on multilingual dependency parsing
- In Proc. of CoNLL
, 2006
"... Each year the Conference on Computational Natural Language Learning (CoNLL) 1 features a shared task, in which participants train and test their systems on exactly the same data sets, in order to better compare systems. The tenth CoNLL (CoNLL-X) saw a shared task on Multilingual Dependency Parsing. ..."
Abstract
-
Cited by 161 (2 self)
- Add to MetaCart
Each year the Conference on Computational Natural Language Learning (CoNLL) 1 features a shared task, in which participants train and test their systems on exactly the same data sets, in order to better compare systems. The tenth CoNLL (CoNLL-X) saw a shared task on Multilingual Dependency Parsing. In this paper, we describe how treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured. We also give an overview of the parsing approaches that participants took and the results that they achieved. Finally, we try to draw general conclusions about multi-lingual parsing: What makes a particular language, treebank or annotation scheme easier or harder to parse and which phenomena are challenging for any dependency parser? Acknowledgement Many thanks to Amit Dubey and Yuval Krymolowski, the other two organizers of the shared task, for discussions, converting treebanks, writing software and helping with the papers. 2
Memory-based dependency parsing
- In Proceedings of CoNLL
, 2004
"... In order to realize the full potential of dependency-based syntactic parsing, it is desirable to allow non-projective dependency structures. We show how a datadriven deterministic dependency parser, in itself restricted to projective structures, can be combined with graph transformation techniques t ..."
Abstract
-
Cited by 153 (32 self)
- Add to MetaCart
In order to realize the full potential of dependency-based syntactic parsing, it is desirable to allow non-projective dependency structures. We show how a datadriven deterministic dependency parser, in itself restricted to projective structures, can be combined with graph transformation techniques to produce non-projective structures. Experiments using data from the Prague Dependency Treebank show that the combined system can handle nonprojective constructions with a precision sufficient to yield a significant improvement in overall parsing accuracy. This leads to the best reported performance for robust non-projective parsing of Czech. 1
Automatic Acquisition of Domain Knowledge for Information Extraction
- In Proceedings of the 18th International Conference on Computational Linguistics
, 2000
"... In developing an Information Extraction (IE) system for a new class of events or relations, one of the major tasks is identifying the many ways in which these events or relations may be expressed in text. This has generally involved the manual analysis and, in some cases, the annotation of large qua ..."
Abstract
-
Cited by 38 (4 self)
- Add to MetaCart
In developing an Information Extraction (IE) system for a new class of events or relations, one of the major tasks is identifying the many ways in which these events or relations may be expressed in text. This has generally involved the manual analysis and, in some cases, the annotation of large quantities of text involving these events. This paper presents an alternative proach, based on an automatic discovery procedure, ExDIsCO, which identifies a set of relevant documents and a set of event patterns from un-annotated text, starting from a small set of "seed patterns." We evaluate ExDISCO by comparing the performance of discovered patterns against that of manually constructed systems on actual extraction tasks.
Sentiment analysis using support vector machines with diverse information sources
- In Proceedings of Conference on Empirical Methods in Natural Language Processing
, 2004
"... sources ..."
A Classification Approach to Word Prediction
, 2000
"... The eventual goal of a language model is to accurately predict the value of a missing word given its context. We present an approach to word prediction that is based on learning a representation for each word as a function of words and linguistics predicates in its context. This approach raises a fe ..."
Abstract
-
Cited by 33 (8 self)
- Add to MetaCart
The eventual goal of a language model is to accurately predict the value of a missing word given its context. We present an approach to word prediction that is based on learning a representation for each word as a function of words and linguistics predicates in its context. This approach raises a few new questions that we address. First, in order to learn good word representations it is necessary to use an expressive representation of the context. We present a way that uses external knowledge to generate expressive context representations, along with a learning method capable of handling the large number of features generated this way that can, potentially, contribute to each prediction. Second, since the number of words "competing" for each prediction is large, there is a need to "focus the attention" on a smaller subset of these. We exhibit the contribution of a "focus of attention" mechanism to the performance of the word predictor. Finally, we describe a large scale experimental study in which the approach presented is shown to yield significant improvements in word prediction tasks.
Tagging Grammatical Functions
- In Proceedings of EMNLP-97
, 1997
"... This paper addresses issues in automated treebank construction. We show how standard part-of-speech tagging techniques extend to the more general problem of structural annotation, especially for determining grammatical functions and syntactic categories. Annotation is viewed as an interactive proces ..."
Abstract
-
Cited by 24 (4 self)
- Add to MetaCart
This paper addresses issues in automated treebank construction. We show how standard part-of-speech tagging techniques extend to the more general problem of structural annotation, especially for determining grammatical functions and syntactic categories. Annotation is viewed as an interactive process where manual and automatic processing alternate. Efficiency and accuracy results are presented. We also discuss further automation steps.
Efficient content creation on the semantic web using metadata schemas with domain ontology services (System description)
- IN: PROCEEDINGS OF THE EUROPEAN SEMANTIC WEB CONFERENCE ESWC 2007
, 2007
"... Metadata creation is one of the major challenges in developing the Semantic Web. This paper discusses how to make provision of metadata easier and costeffective by an annotation editor combined with shared ontology services. We have developed an annotation system supporting distributed collaboration ..."
Abstract
-
Cited by 20 (11 self)
- Add to MetaCart
Metadata creation is one of the major challenges in developing the Semantic Web. This paper discusses how to make provision of metadata easier and costeffective by an annotation editor combined with shared ontology services. We have developed an annotation system supporting distributed collaboration in creating annotations, and hiding the complexity of the annotation schema and the domain ontologies from the annotators. Our system adapts flexibly to different metadata schemas, which makes it suitable for different applications. Support for using ontologies is based on ontology services, such as concept searching and browsing, concept URI fetching, semantic autocompletion and linguistic concept extraction. The system is being tested in various practical semantic portal projects.
A broad-coverage parser for German based on defeasible constraints
- In KONVENS 2004, Beiträge zur 7. Konferenz zur Verarbeitung natürlicher Sprache
, 2004
"... We present a parser for German that achieves a competitive accuracy on unrestricted input while maintaining a coverage of 100%. By writing well-formedness rules as declarative, defeasible constraints that integrate di#erent sources of linguistic knowledge, very high robustness is achieved agains ..."
Abstract
-
Cited by 15 (3 self)
- Add to MetaCart
We present a parser for German that achieves a competitive accuracy on unrestricted input while maintaining a coverage of 100%. By writing well-formedness rules as declarative, defeasible constraints that integrate di#erent sources of linguistic knowledge, very high robustness is achieved against all sorts of language error.
Notions of Correctness when Evaluating Protein Name Taggers
- In Proceedings of the 19th International Conference on Computational Linguistics
, 2002
"... This paper introduces four dierent notions of correctness to be used when measuring the performance of protein name taggers, each of which reects certain characteristics of the tagger under evaluation. The discussion regarding the different notions is centered around the evaluation of two protein na ..."
Abstract
-
Cited by 13 (0 self)
- Add to MetaCart
This paper introduces four dierent notions of correctness to be used when measuring the performance of protein name taggers, each of which reects certain characteristics of the tagger under evaluation. The discussion regarding the different notions is centered around the evaluation of two protein name taggers; Yapex, developed by the authors, and KeX developed by Fukuda et al. (1998). For the purpose of illustrating the dierence between the ways of evaluation, both taggers are applied to a test corpus of 101 MEDLINE abstracts in which all occurrences of protein names have been marked up by domain experts.

