Chairperson, Departmental Committee on Graduate Students2Human-Machine Collaboration for Rapid Speech Transcription (2007)
BibTeX
@MISC{Roy07chairperson,departmental,
author = {Brandon C. Roy and Deb Roy and Prof Deb Roy and Brandon C. Roy},
title = {Chairperson, Departmental Committee on Graduate Students2Human-Machine Collaboration for Rapid Speech Transcription},
year = {2007}
}
OpenURL
Abstract
Inexpensive storage and sensor technologies are yielding a new generation of massive multimedia datasets. The exponential growth in storage and processing power makes it possible to collect more data than ever before, yet without appropriate content annotation for search and analysis such corpora are of little use. While advances in data mining and machine learning have helped to automate some types of analysis, the need for human annotation still exists and remains expensive. The Human Speechome Project is a heavily data-driven longitudinal study of language acquisition. More than 100,000 hours of audio and video recordings have been collected over a two year period to trace one child’s language development at home. A critical first step in analyzing this corpus is to obtain high quality transcripts of all speech heard and produced by the child. Unfortunately, automatic speech transcription has proven to be inadequate for these recordings, and manual transcription with existing tools is extremely labor intensive and therefore expensive. A new human-machine collaborative system for rapid speech transcription has been developed







