• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Toward better language models for spontaneous speech’, ICSLP-94 (1994)

by B SUHM, A WAIBEL
Add To MetaCart

Tools

Sorted by:
Results 11 - 20 of 21
Next 10 →

What makes a word: Learning base units in Japanese for speech recognition

by Laura Mayfield Tomokiyo - In Proceedings of the ACL Special Interest Group in Natural Language Learning (CoNLL , 1997
"... We describe an automatic process for learning word units in Japanese. Since the Japanese orthography has no spaces delimiting words, the first step in building a Japanese speech recognition system is to define the units that will be recognized. Our method applies a compound-finding algorithm, previo ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
We describe an automatic process for learning word units in Japanese. Since the Japanese orthography has no spaces delimiting words, the first step in building a Japanese speech recognition system is to define the units that will be recognized. Our method applies a compound-finding algorithm, previously used to find word sequences in English, to learning syllable sequences in Japanese. We report that we were able not only to extract meaningful units, eliminating the need for possibly inconsistent manual segmentation, but also to decrease perplexity using this automatic procedure, which relies on a statistical, not syntactic, measure of relevance. Our algorithm also uncovers the kinds of environments that help the recognizer predict phonological alternations, which are often hidden by morphologically-motivated tok- enization.

Learning a Syntagmatic and Paradigmatic Structure From Language Data With a Bi-Multigram Model

by Sabine Deligne, Yoshinori Sagisaka , 1998
"... In this paper, we present a stochastic language modeling tool which aims at retrieving variable-length phrases (multigrams), assuming bigram dependen- cies between them. The phrase retrieval can be intermixed with a phrase clustering procedure, so that the language data are iteratively structured at ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
In this paper, we present a stochastic language modeling tool which aims at retrieving variable-length phrases (multigrams), assuming bigram dependen- cies between them. The phrase retrieval can be intermixed with a phrase clustering procedure, so that the language data are iteratively structured at both a paradigmatic and a syntagmatic level in a fully integrated way. Perplexity results on ATR travel arrangement data with a bi-multigram model (assuming bigram correlations between the phrases) come very close to the trigram scores with a reduced number of entries in the language model. Also the ability of the class version of the model to merge semantically related phrases into a common class is illus- trated.

JANUS II - Advances in Spontaneous Speech Translation

by M. Woszczyna, M. Finke, T. Kemp, M. Maier, A. Lavie, L. Mayfield, I. Rogina, T. Sloboda, A. Waibel, P. Zahn, T. Zeppenfeld , 1995
"... JANUS II is a research system to design and test components of speech to speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features and recognition performance of the speech recognition component JANUS-SR and 2) the end-t ..."
Abstract - Add to MetaCart
JANUS II is a research system to design and test components of speech to speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features and recognition performance of the speech recognition component JANUS-SR and 2) the end-to-end performance of JANUS II, including a comparison of two machine translation strategies used for JANUS-MT (PHOENIX and GLR*). 1. INTRODUCTION Currently JANUS II components for English, German, Korean, Japanese, and Spanish speech input and translation are under development; though not all language pairs can always be kept at the same performance level, multilinguality is required to ensure generality in the recognition and translation approaches. A multitude of smaller and larger scale research projects contribute to the JANUS II system[1], including robust speech recognition[2], noise modeling[3], speaker and channel adaptation, strategies for porting recognition and translation to new...

Janus II - Advances in Spontaneous Speech Translation

by A. Waibel, M. Finke, D. Gates, M. Gavaldà, T. Kemp, A. Lavie, A. McNair, L. Mayfield, I. Rogina, K. Shima, T. Sloboda, M. Woszczyna, P. Zhan, T. Zeppenfeld , 1996
"... JANUS II is a research system to design and test components of speech to speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features and recognition performance of the speech recognition component JANUS-SR and 2) the end-t ..."
Abstract - Add to MetaCart
JANUS II is a research system to design and test components of speech to speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features and recognition performance of the speech recognition component JANUS-SR and 2) the end-to-end performance of JANUS II, including a comparison of two machine translation strategies used for JANUS-MT (PHOENIX and GLR*). 1. INTRODUCTION Currently JANUS II components for English, German, Korean, Japanese, and Spanish speech input and translation are under development; though not all language pairs can always be kept at the same performance level, multilinguality is required to ensure generality in the recognition and translation approaches. A multitude of smaller and larger scale research projects contribute to the JANUS II system[1], including robust speech recognition[2], noise modeling[3], speaker and channel adaptation, strategies for porting recognition and translation to new...

Tue Jul 25 20:03 1995 :: icassp96.janus.dvi

by Janus Ii Advances
"... JANUS II is a research system to design and test components of speech to speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features and recognition performance of the speech recognition component JANUS-SR and 2) the end-t ..."
Abstract - Add to MetaCart
JANUS II is a research system to design and test components of speech to speech translation systems as well as a research prototype for such a system. We will focus on two aspects of the system: 1) new features and recognition performance of the speech recognition component JANUS-SR and 2) the end-to-end performance of JANUS II, including a comparison of two machine translation strategies used for JANUS-MT (PHOENIX and GLR*). 1. INTRODUCTION Currently JANUS II components for English, German, Korean, Japanese, and Spanish speech input and translation are under development; though not all language pairs can always be kept at the same performance level, multilinguality is required to ensure generality in the recognition and translation approaches. A multitude of smaller and larger scale research projects contribute to the JANUS II system[1], including robust speech recognition[2], noise modeling[3], speaker and channel adaptation, strategies for porting recognition and translation to ne...

Evaluation of a Language Model using a Clustered Model Backoff

by John Miller And , 1996
"... In this paper, we describe and evaluate a language model using word classes automatically generated from a word clustering algorithm. Class based language models have been shown to be effective for rapid adaptation, training on small datasets, and reduced memory usage. In terms of model perplexity, ..."
Abstract - Add to MetaCart
In this paper, we describe and evaluate a language model using word classes automatically generated from a word clustering algorithm. Class based language models have been shown to be effective for rapid adaptation, training on small datasets, and reduced memory usage. In terms of model perplexity, prior work has shown diminished returns for class based language models constructed using very large training sets. This paper describes a method of using a class model as a backoff to a bigram model which produced significant benefits even when trained from a large text corpus. Tests results on the Whisper continuous speech recognition system show that for a given word error rate, the clustered bigram model uses 2/3 fewer parameters compared to a standard bigram model using unigram backoff.

CSE 256 (Spring 2004)

by Language Models For, Dustin Boswell
"... 1.4 The norm’s accepted ASTM Asm ANSI Cei Isa IEEE Nfpa Cee Oms En,afnor,bs Asme 1.2 technique condition Capacity 2 m3 /h for 8h non stop ???? To be confirmed. The system most be plugged to the national water network (Ade) at the pressure of 3 bars And temp following the ambient condition The caract ..."
Abstract - Add to MetaCart
1.4 The norm’s accepted ASTM Asm ANSI Cei Isa IEEE Nfpa Cee Oms En,afnor,bs Asme 1.2 technique condition Capacity 2 m3 /h for 8h non stop ???? To be confirmed. The system most be plugged to the national water network (Ade) at the pressure of 3 bars And temp following the ambient condition The caracteristique of the water to be treated is noted in the chapter 2.2.1 General demand Reference in this field. Experience in this type of installation Experience in the treatment of water reference in Algeria (all type of project) 3.1 The technique of water treatment Ref: drawing .1. Done by Didine Abdoune following 3.1 3.3 demands specified 1 traitement of water most following this point The treatment of water most be done following the data in the table of the caracteristique of the water table 1 Treatment of water containing floating particle Treatment of water wear the turbidity is high Correction of the smell the test and elimination of the chloral And the water most not contain um1 Treatment of bacterologique and disinfection and sterilization of the water For the sterilization we most use the UV method ///// most is in the norm of OMS OR THE EUROPEN UNION Demand of the equipments, most be \\\\\ Most be storage tank of the capacity of 15 M3 for the no treated water (venting valve) and man hole, over flow line, bady switched to indicate the level for the pump number 2 So the start and stop following the info given from the BDY SWICHE Storage tank for produced water in capacity of 4m3 x 2 in parallel (valve for by passe) sit glass safety valve and over flow, man hole. Bd swich for the control of start and stop of pump n 2 Pipeline Material most be STM a 120 SCH 80 OR EQUIVALENT The connection from the man most be EPOER PIPE MOST STAND THE FOLOWING SITUATION - dilation and the contraction thermique - vibration - effects of the temperature - support flexibility - over thickness 1,6 mm for the corrosion - the sludge and drain most be connected to the existing system - the treed most be type NPT ,drilled and treed conform to ANSI B 1.20.1 - The flanges most be ANSI class 150/PN20 - Manometer most be installed in the piping - No return valve drainer , venting , and sample - The pvc and the aplomb are excluded from the installation Civil engineering and still structure - All unites most be mounted in skids and chassis the chassis will be fix in foundation plate form of beton armed and welded inside structure the chassis most be from galvanized type and will be having low ding point - The offers most have also the offer of the installation housing still structure and...etc all proposition are well come the cover of the housing most be maid of TN40 - Construction most be erthequek proven Instrumentation and control - The installation most have the measurement and control of the following - The hardness - The ph - Pressure indicator - Pressure in the filters and alarms Power valuable - 380 v - phase 03 neutral - frequency HZ 50 Spare parts - list of spare parts for period of 2 years this list most have consumable part and changing part for the total unite - Most be list of spare parts of start up. - including the tousle and special tousles for the operation and maintenance of the installation - if there is chimiquell product to included in the prosses most provide the information of this one The noise restriction is 85 db The provider most guaranty 20 years of providing ( ) 20 years of spar parts The time of delivery is 6 month staring the day of the signature of the contract? Test the unite most be tested in eyes whiteness of 2 person from the sonatrach The submission most include - drawings of the implantation of the unite - foundation drawing and the still structure - process drawings - electrical drawings - specifications of the equipment List data scheet of the instrumentation Isometriques drawings Operating manuals Planning for the global phases

CSE 254 (Spring 2003)

by Growing Gram Trees, Dustin Boswell
"... We implement variable n-grams using a word-tree data structure, where nodes represent sequences of words ("contexts") and store how often that context appeared in a training corpus. We build the tree by growing it from the root up. Unlike other methods, there is no pruning step. Instead, we used ..."
Abstract - Add to MetaCart
We implement variable n-grams using a word-tree data structure, where nodes represent sequences of words ("contexts") and store how often that context appeared in a training corpus. We build the tree by growing it from the root up. Unlike other methods, there is no pruning step. Instead, we used the simple heuristic of maintaining a priority-queue of candidate leaves, sorted by how often those contexts occurred in the training text. The most popular leaves are then added to the tree, and this process repeats until a specified memory limit is reached. In this way, the tree was able to make branches for longer sentence fragments like "across the street from the" while saving the space from storing uncommon ones.

A New Lexicon Optimization Method For Lvcsr Based On Linguistic And Acoustic Characteristics Of Words

by Takahiro Shinozaki And, Takahiro Shinozaki, Sadaoki Furui , 2002
"... This paper proposes a new lexicon optimization method to improve recognition rate of large scale spontaneous speech recognition. Occurrence count and length of a word has strong correlation with difficulty of recognizing the word. First, we investigate the relation and make a word correctness proba ..."
Abstract - Add to MetaCart
This paper proposes a new lexicon optimization method to improve recognition rate of large scale spontaneous speech recognition. Occurrence count and length of a word has strong correlation with difficulty of recognizing the word. First, we investigate the relation and make a word correctness probability model. The proposed method optimizes the lexicon by making compound words or phrases step by step based on the word correctness probability model so as to improve the estimated recognition rate of the system. The optimization method is applied to a large scale Japanese spontaneous speech corpus. Experimental results show that the language model using the optimized lexicon improves the recognition rate.

Integrating history-length interpolation and classes in language modeling

by Hinrich Schütze
"... Building on earlier work that integrates different factors in language modeling, we view (i) backing off to a shorter history and (ii) class-based generalization as two complementary mechanisms of using a larger equivalence class for prediction when the default equivalence class is too small for rel ..."
Abstract - Add to MetaCart
Building on earlier work that integrates different factors in language modeling, we view (i) backing off to a shorter history and (ii) class-based generalization as two complementary mechanisms of using a larger equivalence class for prediction when the default equivalence class is too small for reliable estimation. This view entails that the classes in a language model should be learned from rare events only and should be preferably applied to rare events. We construct such a model and show that both training on rare events and preferable application to rare events improve perplexity when compared to a simple direct interpolation of class-based with standard language models. 1
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University