Searching for authors named Barry Arons – sorted by Relevance.
-
Pitch-Based Emphasis Detection For Segmenting Speech Recordings
- This paper describes a technique to automatically locate emphasized segments of a speech recording based on pitch. These salient portions can be used in a variety of applications, but were originally designed to be used in an interactive system that enables high-speed skimming and browsing of speech
- Cited by 19 (0 self) – Add To MetaCart
-
Authoring and Transcription Tools for Speech-Based Hypermedia Systems
- Authoring is usually one of the most difficult parts in the design and implementation of hypertext and hypermedia systems. This problem is exacerbated if the data to be presented by the system is speech, rather than text or graphics, because of the slow and serial nature of speech. This paper provid
- Cited by 4 (1 self) – Add To MetaCart
-
The Design of Audio Servers and Toolkits for Supporting Speech in the User Interface
- An audio server is a software platform that oversees the sharing of audio resources in a distributed computing environment, and simplifies the task of integrating speech into the user interface. An audio toolkit layered on top of an audio server further simplifies the creation of voice interfaces, b
- Cited by 6 (2 self) – Add To MetaCart
-
Hyperspeech: Navigating in Speech-Only Hypermedia
- Most hypermedia systems emphasize the integration of graphics, images, video, and audio into a traditional hypertext framework. The hyperspeech system described in this paper, a speech-only hypermedia application, explores issues of navigation and system architecture in an audio environment without
- Cited by 50 (10 self) – Add To MetaCart
-
A Review of the Cocktail Party Effect
- The "cocktail party effect" -- the ability to focus one's listening attention on a single talker among a cacophony of conversations and background noise -- has been recognized for some time. This specialized listening ability may be because of characteristics of the human speech production system, t
- Cited by 66 (3 self) – Add To MetaCart
-
SpeechSkimmer: Interactively Skimming Recorded Speech
- Skimming or browsing audio recordings is much more difficult than visually scanning a document because of the temporal nature of audio. By exploiting properties of spontaneous speech it is possible to automatically select and present salient audio segments in a time-efficient manner. Techniques for
- Cited by 62 (2 self) – Add To MetaCart
-
Efficient listening with two ears: Dichotic time compression and spatialization
- To increase the amount of information we can collect in a given amount of time, it is possible to employ signal processing techniques to speed up the rate at which recorded sounds are presented to the ears. Besides simply speeding up the playback, it is possible to auditorily display the signals in
- Cited by 3 (0 self) – Add To MetaCart
-
SpeechSkimmer: A System for Interactively Skimming Recorded Speech
- Note that the text that appeared in printed journal contains very minor typographic and grammatical corrections that do not appear in this version. SpeechSkimmer:
- Cited by 78 (1 self) – Add To MetaCart
-
Techniques, Perception, and Applications of Time-Compressed Speech
- There are a variety of techniques for time-compressing speech that have been developed over the last four decades. This paper consists of a review of the literature on methods for time-compressing speech, including related perceptual studies of intelligibility and comprehension. 2 Motivation and App
- Cited by 18 (1 self) – Add To MetaCart
-
Tools for Building Asynchronous Servers to Support Speech and Audio Applications
- Distributed client/server models are becoming increasingly prevalent in multimedia systems and advanced user interface design. A multimedia application, for example, may play and record audio, use speechrecognition input, and usea window system for graphical I/O. The software architecture of such a
- Cited by 10 (2 self) – Add To MetaCart

