Results 1 - 10
of
30
H.P.: Gesture modeling and animation based on a probabilistic re-creation of speaker style
- ACM Transactions on Graphics
, 2008
"... Animated characters that move and gesticulate appropriately with spoken text are useful in a wide range of applications. Unfortunately, this class of movement is very difficult to generate, even more so when a unique, individual movement style is required. We present a system that, with a focus on a ..."
Abstract
-
Cited by 20 (5 self)
- Add to MetaCart
Animated characters that move and gesticulate appropriately with spoken text are useful in a wide range of applications. Unfortunately, this class of movement is very difficult to generate, even more so when a unique, individual movement style is required. We present a system that, with a focus on arm gestures, is capable of producing full-body gesture animation for given input text in the style of a particular performer. Our process starts with video of a person whose gesturing style we wish to animate. A tool-assisted annotation process is performed on the video, from which a statistical model of the person’s particular gesturing style is built. Using this model and input text tagged with theme, rheme and focus, our generation algorithm creates a gesture script. As opposed to isolated singleton gestures, our gesture script specifies a stream of continuous gestures coordinated with speech. This script is passed to an animation system, which enhances the gesture description with additional detail. It then generates either kinematic or physically simulated motion based on this description. The system is capable of generating gesture animations for novel text that are consistent with a given performer’s style, as was successfully validated in an empirical user study.
The HUMAINE Database: Addressing the Collection and Annotation of Naturalistic and
- Induced Emotional Data,” Affective Computing and Intelligent Interaction
"... Abstract. The HUMAINE project is concerned with developing interfaces that will register and respond to emotion, particularly pervasive emotion (forms of feeling, expression and action that colour most of human life). The HUMAINE Database provides naturalistic clips which record that kind of materia ..."
Abstract
-
Cited by 11 (0 self)
- Add to MetaCart
Abstract. The HUMAINE project is concerned with developing interfaces that will register and respond to emotion, particularly pervasive emotion (forms of feeling, expression and action that colour most of human life). The HUMAINE Database provides naturalistic clips which record that kind of material, in multiple modalities, and labelling techniques that are suited to describing it. 1
Real-Time Prosody-Driven Synthesis of Body Language
"... “Which is also... one of those very funny episodes... that are in... this movie.” Figure 1: Data-driven body language is synthesized from live speech input. Human communication involves not only speech, but also a wide variety of gestures and body motions. Interactions in virtual environments often ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
“Which is also... one of those very funny episodes... that are in... this movie.” Figure 1: Data-driven body language is synthesized from live speech input. Human communication involves not only speech, but also a wide variety of gestures and body motions. Interactions in virtual environments often lack this multi-modal aspect of communication. We present a method for automatically synthesizing body language animations directly from the participants ’ speech signals, without the need for additional input. Our system generates appropriate body language animations by selecting segments from motion capture data of real people in conversation. The synthesis can be performed progressively, with no advance knowledge of the utterance, making the system suitable for animating characters from live human speech. The selection is driven by a hidden Markov model and uses prosody-based features extracted from speech. The training phase is fully automatic and does not require hand-labeling of input data, and the synthesis phase is efficient enough to run in real time on live microphone input. User studies confirm that our method is able to produce realistic and compelling body language.
A Story about Gesticulation Expression
"... Abstract. Gesticulation is essential for the storytelling experience thus, virtual storytellers should be endowed with gesticulation expression. This work proposes a gesticulation expression model based on psycholinguistics. The model supports: (a) real-time gesticulation animation described as sequ ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Abstract. Gesticulation is essential for the storytelling experience thus, virtual storytellers should be endowed with gesticulation expression. This work proposes a gesticulation expression model based on psycholinguistics. The model supports: (a) real-time gesticulation animation described as sequences of constraints on static (Portuguese Sign Language hand shapes, orientations and positions) and dynamic (motion profiles) features; (b) multimodal synchronization between gesticulation and speech; (c) automatic reproduction of annotated gesticulation according to GestuRA, a gesture transcription algorithm. To evaluate the model two studies, involving 147 subjects, were conducted. In both cases, the idea consisted of comparing the narration of the Portuguese traditional story “The White Rabbit ” by a human storyteller with a version by a virtual storyteller. Results indicate that synthetic gestures fared well when compared to real gestures however, subjects preferred the human storyteller. 1
Verbal or visual? How information is distributed across speech and gesture in spatial dialog
"... In spatial dialog like in direction giving humans make frequent use of speechaccompanying gestures. Some gestures convey largely the same information as speech while others complement speech. This paper reports a study on how speakers distribute meaning across speech and gesture, and depending on wh ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
In spatial dialog like in direction giving humans make frequent use of speechaccompanying gestures. Some gestures convey largely the same information as speech while others complement speech. This paper reports a study on how speakers distribute meaning across speech and gesture, and depending on what factors. Utterance meaning and the wider dialog context were tested by statistically analyzing a corpus of direction-giving dialogs. Problems of speech production (as indicated by discourse markers and disfluencies), the communicative goals, and the information status were found to be influential, while feedback signals by the addressee do not have any influence. 1
Multimodal Annotation of Conversational Data
"... We propose in this paper a broad-coverage approach for multimodal annotation of conversational data. Large annotation projects addressing the question of multimodal annotation bring together many different kinds of information from different domains, with different levels of granularity. We present ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
We propose in this paper a broad-coverage approach for multimodal annotation of conversational data. Large annotation projects addressing the question of multimodal annotation bring together many different kinds of information from different domains, with different levels of granularity. We present in this paper the first results of the OTIM project aiming at developing conventions and tools for multimodal annotation. 1
Multimodal Expression in Virtual Humans
"... Abstract. This work proposes a real-time virtual human multimodal expression model. Five modalities explore the affordances of the body: deterministic, non-deterministic, gesticulation, facial and vocal expression. Deterministic expression is keyframe body animation. Non-deterministic expression is ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Abstract. This work proposes a real-time virtual human multimodal expression model. Five modalities explore the affordances of the body: deterministic, non-deterministic, gesticulation, facial and vocal expression. Deterministic expression is keyframe body animation. Non-deterministic expression is robotics-based procedural body animation. Vocal expression is voice synthesis, through Festival, and parameterization, through SABLE. Facial expression is lip-synch and emotion expression through a parametric muscle-based face model. Inspired by psycholinguistics, gesticulation expression is unconventional, idiosyncratic and unconscious hand gestures animation described as sequences of Portuguese Sign Language hand shapes, position and orientation. Inspired by the arts, one modality goes beyond the body to explore the affordances of the environment and express emotions through camera, lights and music. To control multimodal expression, this work proposes a high-level integrated synchronized markup language – Expressive Markup Language. Finally, three studies, involving a total of 197 subjects, evaluated the model in storytelling contexts and produced promising results.
Providing Route Directions: Design of Robot’s Utterance, Gesture, and Timing
"... Providing route directions is a complicated interaction. Utterances are combined with gestures and pronounced with appropriate timing. This study proposes a model for a robot that generates route directions by integrating three important crucial elements: utterances, gestures, and timing. Two resear ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Providing route directions is a complicated interaction. Utterances are combined with gestures and pronounced with appropriate timing. This study proposes a model for a robot that generates route directions by integrating three important crucial elements: utterances, gestures, and timing. Two research questions must be answered in this modeling process. First, is it useful to let robot perform gesture even though the information conveyed by the gesture is given by utterance as well? Second, is it useful to implement the timing at which humans speaks? Many previous studies about the natural behavior of computers and robots have learned from human speakers, such as gestures and speech timing. However, our approach is different from such previous studies. We emphasized the listener's perspective. Gestures were designed based on the usefulness, although we were influenced by the basic structure of human gestures. Timing was not based on how humans speak, but modeled from how they listen. The experimental result demonstrated the effectiveness of our approach, not only for task efficiency but also for perceived naturalness.
Approachability: How People Interpret Automatic Door Movement as Gesture
"... Automatic doors exemplify the challenges of designing emotionally welcoming interactive systems. We attempt to broaden the automatic door’s repertoire of signals by examining how people respond to a variety of “door gestures” designed to offer different levels of approachability. In a pilot study, p ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
Automatic doors exemplify the challenges of designing emotionally welcoming interactive systems. We attempt to broaden the automatic door’s repertoire of signals by examining how people respond to a variety of “door gestures” designed to offer different levels of approachability. In a pilot study, participants (N=48) who walked past a physical gesturing door were asked to fill out a questionnaire about that experience. In our follow-up study, participants (N=51) viewed 12 video clips depicting a person walking toward and past an automatic door that moved with different speeds and trajectories. In both studies, our Likert-scale measures and open-ended responses indicate that participants viewing the door behavior prototypes show significant uniformity in the interpretation of the door’s behavior, and that they attribute these motions as gestures with human-like characteristics such as cognition and intent.

