• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs (2006)

by T Schultz, A Black
Venue:In the Proceedings of ICASSP
Add To MetaCart

Tools

Sorted by:
Results 1 - 3 of 3

SPICE: Web-based Tools for Rapid Language Adaptation

by Tanja Schultz, Alan W Black, Sameer Badaskar, Matthew Hornyak, John Kominek - in Speech Processing Systems", In the Proceedings of INTERSPEECH , 2007
"... In this paper we describe the design and implementation of a user interface for SPICE, a web-based toolkit for rapid prototyping of speech and language processing components. We report on the challenges and experiences gathered from testing these tools in an advanced graduate hands-on course, in whi ..."
Abstract - Cited by 7 (5 self) - Add to MetaCart
In this paper we describe the design and implementation of a user interface for SPICE, a web-based toolkit for rapid prototyping of speech and language processing components. We report on the challenges and experiences gathered from testing these tools in an advanced graduate hands-on course, in which we created speech recognition, speech synthesis, and smalldomain translation components for 10 different languages within only 6 weeks.

The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System

by Nguyen Bach, Matthias Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, Thuylinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, Alan W Black
"... The paper describes our portable two-way speech-tospeech translation system using a completely eyesfree/hands-free user interface. This system translates between the language pair English and Iraqi Arabic as well as between English and Farsi, and was built within the framework of the DARPA TransTac ..."
Abstract - Cited by 5 (3 self) - Add to MetaCart
The paper describes our portable two-way speech-tospeech translation system using a completely eyesfree/hands-free user interface. This system translates between the language pair English and Iraqi Arabic as well as between English and Farsi, and was built within the framework of the DARPA TransTac program. The Farsi language support was developed within a 90-day period, testing our ability to rapidly support new languages. The paper gives an overview of the system’s components along with the individual component objective measures and a discussion of issues relevant for the overall usage of the system. We found that usability, flexibility, and robustness serve as severe constraints on system architecture and design. 1.

DATA SELECTION FOR SPEECH RECOGNITION

by Yi Wu, Rong Zhang, Er Rudnicky
"... This paper presents a strategy for efficiently selecting informative data from large corpora of transcribed speech. We propose to choose data uniformly according to the distribution of some target speech unit (phoneme, word, character, etc). In our experiment, in contrast to the common belief that “ ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
This paper presents a strategy for efficiently selecting informative data from large corpora of transcribed speech. We propose to choose data uniformly according to the distribution of some target speech unit (phoneme, word, character, etc). In our experiment, in contrast to the common belief that “there is no data like more data”, we found it possible to select a highly informative subset of data that produces recognition performance comparable to a system that makes use of a much larger amount of data. At the same time, our selection process is efficient and fast. Index Terms — data selection, maximum entropy, speech recognition, acoustic modeling 1.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University