Results 1 -
6 of
6
A Framework and Toolkit for the Construction of Multimodal Learning Interfaces
, 1998
"... Multimodal human-computer interaction, in which the computer accepts input from multiple channels or modalities, is more flexible, natural, and powerful than unimodal interaction with input from a single modality. Many research studies ([Hauptmann89], [Nakagawa94], [Nishimoto94], [Oviatt97b], [Chu97 ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
Multimodal human-computer interaction, in which the computer accepts input from multiple channels or modalities, is more flexible, natural, and powerful than unimodal interaction with input from a single modality. Many research studies ([Hauptmann89], [Nakagawa94], [Nishimoto94], [Oviatt97b], [Chu97], to name a few) have reported that the combination of human communication means such as speech, gestures, handwriting, eye movement, etc. enjoys strong preference among users. Unfortunately, the development of multimodal applications is difficult and still suffers from a lack of generality, such that a lot of duplicated effort is wasted when implementing different applications sharing some common aspects. The research presented in this dissertation aims to provide a partial solution to the difficult problem of developing multimodal applications by creating a modular, distributed, and customizable infrastructure to facilitate the construction of such applications. This dissertation contribu...
Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package
- Journal of Statistical Software
, 2009
"... This introduction to the R package dtw is a (slightly) modified version of Giorgino (2009), published in the Journal of Statistical Software. Dynamic time warping is a popular technique for comparing time series, providing both a distance measure that is insensitive to local compression and stretche ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
This introduction to the R package dtw is a (slightly) modified version of Giorgino (2009), published in the Journal of Statistical Software. Dynamic time warping is a popular technique for comparing time series, providing both a distance measure that is insensitive to local compression and stretches and the warping which optimally deforms one of the two input series onto the other. A variety of algorithms and constraints have been discussed in the literature. The dtw package provides an unification of them; it allows R users to compute time series alignments mixing freely a variety of continuity constraints, restriction windows, endpoints, local distance definitions, and so on. The package also provides functions for visualizing alignments and constraints using several classic diagram types.
Tools and Basque language databases developed in the AhoLab Laboratory
- In Workshop Proc. LREC
, 2000
"... This paper gives an overview of the speech material used and generated in our laboratory (AhoLab), as well as of the software tools developed for its management. The databases were created in the context of the development of a text to speech converter for Basque, in the different fields of research ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
This paper gives an overview of the speech material used and generated in our laboratory (AhoLab), as well as of the software tools developed for its management. The databases were created in the context of the development of a text to speech converter for Basque, in the different fields of research. The software here described is freely available and is currently being used by some educational centre and individuals.
Nearest Neighbourhood Classifiers in Biometric Fusion Nearest Neighbourhood Classifiers in Biometric Fusion
"... This paper presents fusion decision technique comparisons based on nearestneighborhood (NN) classifiers family for a bimodal biometric verification system that ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
This paper presents fusion decision technique comparisons based on nearestneighborhood (NN) classifiers family for a bimodal biometric verification system that
Fréchet Distance based Approach for Searching Online Handwritten Documents
"... We propose a novel, language-neutral approach for searching online handwritten text using Fréchet distance. Online handwritten data, which is available as a time series (x,y,t), is treated as representing a parameterized curve in two-dimensions and the problem of searching online handwritten text is ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
We propose a novel, language-neutral approach for searching online handwritten text using Fréchet distance. Online handwritten data, which is available as a time series (x,y,t), is treated as representing a parameterized curve in two-dimensions and the problem of searching online handwritten text is posed as a problem of matching two curves in a two-dimensional Euclidean space. Fréchet distance is a natural measure for matching curves. The main contribution of this paper is the formulation of a variant of Fréchet distance that can be used for retrieving words even when only a prefix of the word is given as query. Extensive experiments on UNIPEN dataset1 consisting of over 16,000 words written by 7 users show that our method outperforms the state-of-the-art DTW method. Experiments were also conducted on a multilingual dataset, generated on a PDA, with encouraging results. Our approach can be used to implement useful, exciting features like auto-completion of handwriting in PDAs. 1.
[dsp EDUCATION] Research Developments and Directions in Speech Recognition and Understanding, Part 1
"... To advance research, it is important to identify promising future research directions, especially those that have not been adequately pursued or funded in the past. The working group producing this article was charged to elicit from the human language technology (HLT) community a set of well-conside ..."
Abstract
- Add to MetaCart
To advance research, it is important to identify promising future research directions, especially those that have not been adequately pursued or funded in the past. The working group producing this article was charged to elicit from the human language technology (HLT) community a set of well-considered directions or rich areas for future research that could lead to major paradigm shifts in the field of automatic speech recognition (ASR) and understanding. ASR has been an area of great interest and activity to the signal processing and HLT communities over the past several decades. As a first step, this group reviewed major developments in the field and the circumstances that led to their success and then focused on areas it deemed especially fertile for future research. Part 1 of this article will focus on historically significant developments in the ASR area, including several major research efforts that were guided by different funding agencies, and suggest general areas in which to focus research. Part 2 (to appear in the next issue) will explore in more detail several new avenues holding promise for substantial improvements in ASR performance. These entail cross-disciplinary research and specific approaches to address three-to-five-year grand challenges aimed at stimulating advanced research by dealing with realistic tasks of broad interest.

