• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Corpus-based speech synthesis: methods and challenges. In: Arbeitspapiere des Instituts für Maschinelle Sprachverarbeitung (Univ (2000)

by B Möbius
Venue:Stuttgart), AIMS
Add To MetaCart

Tools

Sorted by:
Results 1 - 7 of 7

From MBROLA to NU-MBROLA

by Baris Bozkurt, Michel Bagein, Thierry Dutoit , 2001
"... We introduce the NU-MBROLA project as an extension of the MBROLA project for designing large NU-MBROLA databases exhibiting the same properties as MBROLA diphone databases, without the restriction of being composed of diphones. The accompanying NU-MBROLA synthesizer implements the MBROLA algorithm o ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
We introduce the NU-MBROLA project as an extension of the MBROLA project for designing large NU-MBROLA databases exhibiting the same properties as MBROLA diphone databases, without the restriction of being composed of diphones. The accompanying NU-MBROLA synthesizer implements the MBROLA algorithm on speech segments only defined by their starting and ending points in natural speech files. It is distributed with the same terms and conditions as in the MBROLA project. The terms and conditions for the creation of NU-MBROLA database are slightly different from those related to the creation of MBROLA databases, mainly in that no speech segmentation is required, and in that we leave it to providers to distribute their NU-MBROLA databases.

Corpus-based unit selection for natural-sounding speech synthesis

by Jon Rong-Wei Yi , 2003
"... Speech synthesis is an automatic encoding process carried out by machine through which symbols conveying linguistic information are converted into an acoustic waveform. In the past decade or so, a recent trend toward a non-parametric, corpus-based approach has focused on using real human speech as s ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
Speech synthesis is an automatic encoding process carried out by machine through which symbols conveying linguistic information are converted into an acoustic waveform. In the past decade or so, a recent trend toward a non-parametric, corpus-based approach has focused on using real human speech as source material for producing novel natural-sounding speech. This work proposes a communication-theoretic formulation in which unit selection is a noisy channel through which an input sequence of symbols passes and an output sequence, possibly corrupted due to the coverage limits of the corpus, emerges. The penalty of approximation is quantified by substitution and concatenation costs which grade what unit contexts are interchangeable and where concatenations are not perceivable. These costs are semi-automatically derived from data and are found to agree with acoustic-phonetic knowledge.

FonDat1: A speech synthesis corpus for Norwegian

by Ingunn Amdal, Torbjørn Svendsen - in Proc. LREC 2006 , 2006
"... This paper describes the Norwegian speech database FonDat1 designed for development and assessment of Norwegian unit selection speech synthesis. The quality of unit selection speech synthesis systems depends highly on the database used. The database should contain sufficient phonemic and prosodic co ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
This paper describes the Norwegian speech database FonDat1 designed for development and assessment of Norwegian unit selection speech synthesis. The quality of unit selection speech synthesis systems depends highly on the database used. The database should contain sufficient phonemic and prosodic coverage. High quality unit selection synthesis also requires that the database is annotated with accurate information about identity and position of the units. Traditionally this involves much manual work, either by hand labeling the entire database or by correcting automatic annotations. We are working on methods for a complete automation of the annotation process. To validate these methods a realistic unit selection synthesis database is needed. In addition to serve as a testbed for annotation tools and synthesis experiments, the process of producing the database using automatic methods is in itself an important result. FonDat1 contains studio recordings of approximately 2000 sentences read by two professional speakers, one male and one female. 10 % of the database is manually annotated. 1.

Data-driven formant synthesis

by David Öhlin, Rolf Carlson
"... A new method of speech synthesis, which combines earlier work on data-driven formant synthesis with improved data extraction and concatenation of recorded voiceless segments, has been developed and implemented as a TTS system. A listening test has been carried out, which has shown that this hybrid s ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
A new method of speech synthesis, which combines earlier work on data-driven formant synthesis with improved data extraction and concatenation of recorded voiceless segments, has been developed and implemented as a TTS system. A listening test has been carried out, which has shown that this hybrid synthesis significantly raises the perception of naturalness in the synthesized speech compared to rule only formant synthesis.

Intuitive Human-Machine-Interaction and Implementation on a Household Robot Companion 1,2

by Christopher Parlitz, Winfried Baum, Ulrich Reiser, Martin Hägele
"... Abstract. The increasing capabilities of experimental household robot platforms require more and more sophisticated methods of interaction. While there are many developments in all directions of Human-Machine-Interaction, the integration and combination of several modalities into one robot system re ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract. The increasing capabilities of experimental household robot platforms require more and more sophisticated methods of interaction. While there are many developments in all directions of Human-Machine-Interaction, the integration and combination of several modalities into one robot system require some effort. To ease the development of applications supporting several types of interaction, Fraunhofer IPA has developed a framework named “Go”. Within this framework we have integrated different kinds of interaction methods into one robot platform “Care-O-bot 3”, a mobile service robot for accomplishing daily tasks. This framework and its interaction methods are presented here. 1

1 Introduction Design Issues of a Corpus-Based Speech Synthesizer

by András Nagy, Péter Pesti, Géza Németh, Tamás Bőhm
"... ..."
Abstract - Add to MetaCart
Abstract not found

- 1995 (XIII th ICPhS in Stockholm)

by Vi Chalmers, Edited Peter Br, Hartmut Traunmüller , 2004
"... ISBN 91-7265-901-7 printed version ISBN 91-7265-902-5 web version 2004-05-14 ..."
Abstract - Add to MetaCart
ISBN 91-7265-901-7 printed version ISBN 91-7265-902-5 web version 2004-05-14
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University