Results 1 -
4 of
4
Designing Transition Networks for Multimodal VR-Interactions Using a Markup Language
- In: IEEE International Conference on Multimodal Interfaces (ICMI'02) Proceedings (2002
, 2002
"... This article presents one core component for enabling multimodal—speech and gesture—driven interaction in and for Virtual Environments. A so-called temporal Augmented Transition Network (tATN) is introduced. It allows to integrate and evaluate information from speech, gesture, and a given applicatio ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
This article presents one core component for enabling multimodal—speech and gesture—driven interaction in and for Virtual Environments. A so-called temporal Augmented Transition Network (tATN) is introduced. It allows to integrate and evaluate information from speech, gesture, and a given application context using a combined syntactic/semantic parse approach. This tATN represents the target structure for a multimodal integration markup language (MIML). MIML centers around the specification of multimodal interactions by letting an application designer declare temporal and semantic relations between given input utterance percepts and certain application states in a declarative and portable manner. A subsequent parse pass translates MIML into corresponding tATNs which are directly loaded and executed by a simulation engines scripting facility.
Match: Multimodal Access To City Help
- IN AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP, MADONNA DI CAMPIGLIO
, 2001
"... Interfaces to mobile information access devices need to allow users to interact using whichever mode or combination of modes are most appropriate, given their user preference, task at hand, and physical and social environment. This paper describes a multimodal application architecture which facilita ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
Interfaces to mobile information access devices need to allow users to interact using whichever mode or combination of modes are most appropriate, given their user preference, task at hand, and physical and social environment. This paper describes a multimodal application architecture which facilitates rapid prototyping of flexible nextgeneration multimodal interfaces. Our sample application MATCH (Multimodal Access To City Help) provides a mobile multimodal speech-pen interface to restaurant and subway information for New York City. Finite-state multimodal language processing technology enables input in pen, speech, or integrated combinations of the two. The system also features multimodal generation capabilities providing speech output synchronized with dynamic graphical displays.
EVALUATING DIALOGUE STRATEGIES IN MULTIMODAL DIALOGUE SYSTEMS
"... Previous research suggests that multimodal dialogue systems providing both speech and pen input, and outputting a combination of spoken language and graphics, are more robust than unimodal systems based on speech or graphics alone (Andr´e, 2002; Oviatt, 1999). Such systems are complex to build and ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
Previous research suggests that multimodal dialogue systems providing both speech and pen input, and outputting a combination of spoken language and graphics, are more robust than unimodal systems based on speech or graphics alone (Andr´e, 2002; Oviatt, 1999). Such systems are complex to build and signifi cant research and evaluation effort must typically be expended to generate well-tuned modules for each system component. This chapter describes experiments utilising two complementary evaluation methods that can expedite the design process: (1) a Wizard-of-Oz data collection and evaluation using a novel Wizard tool we developed; and (2) an Overhearer evaluation experiment utilising logged interactions with the real system. We discuss the advantages and disadvantages of both methods and summarise how these two experiments have informed our research on dialogue management and response generation for the multimodal dialogue system MATCH.
APPLYING VISION TO INTELLIGENT HUMAN-COMPUTER INTERACTION
, 2005
"... As powerful and affordable computers and sensors become virtually om-nipresent, constructing highly intelligent and convenient computation systems has never been so promising. Vision holds great promise in building advanced human-computer interaction (HCI) systems. We investigate different technique ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
As powerful and affordable computers and sensors become virtually om-nipresent, constructing highly intelligent and convenient computation systems has never been so promising. Vision holds great promise in building advanced human-computer interaction (HCI) systems. We investigate different techniques to integrate passive vision into various interaction environments. First, we propose a novel approach to integrate visual tracking into a haptics systems. Traditional haptic environments require that the user must be attached to the haptic device at all times, even though force feedback is not always being rendered. We design and implement an augmented reality system called VisHap that uses visual tracking to seamlessly integrate force feedback with tactile feedback to generate a “complete ” haptic experience. The VisHap framework allows the user to interact with combinations of virtual and real objects naturally, thereby combining active and passive haptics. The flexibility and extensibility of our framework is promising in that it supports many interaction modes and allows further integration with other

