A Categorial Variation Database for English
- In NAACL/HLT 2003, Proceedings of the Human Language Technology and North American Association for Computational Linguistics Conference
, 2003
Cited by 18 (6 self)
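We describe our approach to the construction and evaluation of a large-scale database called "CatVar" which contains categorial variations of English lexemes. Due to the prevalence of cross-language categorial variation in multilingual applications, our categorial-variation resource may serve as an integral part of a diverse range of natural language applications.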
Interlingual Annotation of Multilingual Text Corpora
- In Proceedings of the NAACL/HLT Workshop: New Frontiers in Corpus Annotation
, 2004
Cited by 9 (5 self)
This paper describes a multi-site project to annotate six sizable bilingual parallel corpora for interlingual content. After presenting the background and objectives of the effort, we describe the data set that is being annotated, the interlingua representation language used, an interface environment that supports the annotation task, and the annotation process itself. We then present a preliminary version of our evaluation methodology and conclude with a summary of the current status of the project along with a number of issues that have arisen.
Interlingua Development and Testing through Semantic Annotation of Multilingual Text Corpora
, 2004
Cited by 1 (0 self)
This paper describes a multi-site project to annotate the interlingual content of six sizable bilingual parallel corpora. The project addresses several principal problems in parallel: specification of interlingua content and notation, development of reliable annotation methods, and evaluation of annotated corpora. As a by-product, a growing corpus of annotated texts is being produced, which may eventually be useful for machine learning of semantics-based processing.
A Categorial Variation Database for English
- In NAACL/HLT 2003, Proceedings of the Human Language Technology and North American Association for Computational Linguistics Conference
, 2003
We describe our approach to the construction and evaluation of a large-scale database called "CatVar" which contains categorial variations of English lexemes. Due to the prevalence of cross-language categorial variation in multilingual applications, our categorial-variation resource may serve as an integral part of a diverse range of natural language applications.
Semantic Annotation for Interlingual Representation of Multilingual Texts
This paper describes the annotation process being used in a multi-site project to create six sizable bilingual parallel corpora annotated with a consistent interlingua representation. After presenting the background and objectives of the effort, we describe the multilingual corpora and the three stages of interlingual representation being developed. We then focus on the annotation process itself, including an interface environment that supports the annotation task, and the methodology for evaluating the interlingua representation. Finally, we discuss some issues encountered during the annotation tasks. The resulting annotated multilingual corpora will be useful for a wide range of natural language processing research tasks, including machine translation, question answering, text summarization, and information extraction.

