• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 19
Next 10 →

Adding syntactic annotations to transcripts of parent–child dialogs

by Kenji Sagae, Brian Macwhinney, Alon Lavie - In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), 1815–18. Lisbon: European Language Resources Association , 2004
"... We describe an annotation scheme for syntactic information in the CHILDES database (MacWhinney, 2000), which contains several megabytes of transcribed dialogs between parents and children. The annotation scheme is based on grammatical relations (GRs) that are composed of bilexical dependencies (betw ..."
Abstract - Cited by 12 (6 self) - Add to MetaCart
We describe an annotation scheme for syntactic information in the CHILDES database (MacWhinney, 2000), which contains several megabytes of transcribed dialogs between parents and children. The annotation scheme is based on grammatical relations (GRs) that are composed of bilexical dependencies

Parsing of Grammatical Relations in Transcripts of Parent-Child Dialogs Thesis Summary

by Kenji Sagae, Jaime Carbonell, Lori Levin, John Carroll
"... Automatic analysis of syntax is one of the core problems in natural language processing. Despite significant advances in syntactic parsing of written text, the application of these techniques to spontaneous spoken language has received more limited attention. The recent explosive growth of online, a ..."
Abstract - Add to MetaCart
language in parent-child interactions. Specific emphasis is placed on the challenge of accurately annotating the English corpora in the CHILDES database with grammatical relations (such as subject, objects and adjuncts) that are of particular interest and utility to researchers in child language

A Formal Framework for Linguistic Annotation

by Steven Bird, Mark Liberman - Speech Communication , 2000
"... `Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added notations may include transcriptions of all sorts (from pho ..."
Abstract - Cited by 180 (25 self) - Add to MetaCart
`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added notations may include transcriptions of all sorts (from

A Multi-Strategy Approach for Parsing of Grammatical Relations in Transcripts of Parent-Child Dialogs

by Kenji Sagae, Jaime Carbonell, John Carroll , 2006
"... Automatic analysis of syntax is one of the core problems in natural language processing. Despite significant advances in syntactic parsing of written text, the application of these techniques to spontaneous spoken language has received more limited attention. The recent explosive growth of online, a ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Automatic analysis of syntax is one of the core problems in natural language processing. Despite significant advances in syntactic parsing of written text, the application of these techniques to spontaneous spoken language has received more limited attention. The recent explosive growth of online

Automatic measurement of syntactic development in child language

by Kenji Sagae, Alon Lavie, Brian Macwhinney, Kenji Sagae, Alon Lavie, Brian Macwhinney - Proceedings of the 43rd meeting of the Association for Computational Linguistics, Ann Arbor , 2005
"... To facilitate the use of syntactic infor-mation in the study of child language acquisition, a coding scheme for Gram-matical Relations (GRs) in transcripts of parent-child dialogs has been proposed by Sagae, MacWhinney and Lavie (2004). We discuss the use of current NLP tech-niques to produce the GR ..."
Abstract - Cited by 24 (10 self) - Add to MetaCart
To facilitate the use of syntactic infor-mation in the study of child language acquisition, a coding scheme for Gram-matical Relations (GRs) in transcripts of parent-child dialogs has been proposed by Sagae, MacWhinney and Lavie (2004). We discuss the use of current NLP tech-niques to produce

Automatic Measurement of Syntactic Development in Child Language

by Kenji Sagae And, Kenji Sagae, Alon Lavie - In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05 , 2005
"... To facilitate the use of syntactic information in the study of child language acquisition, a coding scheme for Grammatical Relations (GRs) in transcripts of parent-child dialogs has been proposed by Sagae, MacWhinney and Lavie (2004). ..."
Abstract - Add to MetaCart
To facilitate the use of syntactic information in the study of child language acquisition, a coding scheme for Grammatical Relations (GRs) in transcripts of parent-child dialogs has been proposed by Sagae, MacWhinney and Lavie (2004).

Syntactic annotation of spontaneous speech: application to call-center conversation data

by Thierry Bazillon, Melanie Delplano, Frederic Bechet, Alexis Nasr, Benoit Favre
"... This study describes the syntactic annotation process developped on the DECODA corpus. This corpus contains transcriptions of Human-Human conversations collected in a French public transport call-centre (RATP). The goal of the French ANR DECODA project is to propose new speech analytics methods targ ..."
Abstract - Cited by 3 (3 self) - Add to MetaCart
This study describes the syntactic annotation process developped on the DECODA corpus. This corpus contains transcriptions of Human-Human conversations collected in a French public transport call-centre (RATP). The goal of the French ANR DECODA project is to propose new speech analytics methods

Semantic annotations for conversational speech: from speech transcriptions to predicate argument structures

by Arianna Bisazza, Marco Dinarelli, Silvia Quarteroni, Sara Tonelli, Ro Moschitti - in Proceedings of IEEE-SLT’08 , 2008
"... In this paper, we describe the semantic content, which can be automatically generated, for the design of advanced dialog systems. Since the latter will be based on machine learning approaches, we created training data by annotating a corpus with the needed content. Given a sentence of our transcribe ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
transcribed corpus, domain concepts and other linguistic levels ranging from basic ones, i.e. part-of-speech tagging and constituent chunking level, to more advanced ones, i.e. syntactic and predicate argument structure (PAS) levels are annotated. In particular, the proposed PAS and taxonomy of dialog acts

Modeling Topics in User Dialog for Interactive Tablet Media

by Adrian Boteanu, Sonia Chernova
"... In this paper, we present a set of crowdsourcing and data processing techniques for annotating, segmenting and analyzing spoken dialog data to track topics of discussion between multiple users. Specifically, our system records the dialog between the parent and child as they interact with a reading ..."
Abstract - Add to MetaCart
In this paper, we present a set of crowdsourcing and data processing techniques for annotating, segmenting and analyzing spoken dialog data to track topics of discussion between multiple users. Specifically, our system records the dialog between the parent and child as they interact with a reading

Multi-Tier Annotations in the Verbmobil Corpus

by Karl Weilhammer, Uwe Reichel, Florian Schiel - In Proc. of the LREC 2002 , 2002
"... In very large and diverse scientific projects where as different groups as linguists and engineers with different intentions work on the same signal data or its orthographic transcript and annotate new valuable information, it will not be easy to build a homogeneous corpus. We will describe how this ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
In very large and diverse scientific projects where as different groups as linguists and engineers with different intentions work on the same signal data or its orthographic transcript and annotate new valuable information, it will not be easy to build a homogeneous corpus. We will describe how
Next 10 →
Results 1 - 10 of 19
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University