• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Retrieval of Spoken Documents: First Experiences (1997)

by Fabio Crestani, Mark Sanderson
Add To MetaCart

Tools

Sorted by:
Results 1 - 2 of 2

Mixing and Merging for Spoken Document Retrieval

by Mark Sanderson, Fabio Crestani - in Proceedings of SIGIR , 1998
"... . This paper describes a number of experiments that explored the issues surrounding the retrieval of spoken documents. Two such issues were examined. First, attempting to find the best use of speech recogniser output to produce the highest retrieval effectiveness. Second, investigating the potential ..."
Abstract - Cited by 12 (3 self) - Add to MetaCart
. This paper describes a number of experiments that explored the issues surrounding the retrieval of spoken documents. Two such issues were examined. First, attempting to find the best use of speech recogniser output to produce the highest retrieval effectiveness. Second, investigating the potential problems of retrieving from a so-called "mixed collection ", i.e. one that contains documents from both a speech recognition system (producing many errors) and from hand transcription (producing presumably near perfect documents). The result of the first part of the work found that merging the transcripts of multiple recognisers showed most promise. The investigation in the second part showed how the term weighting scheme used in a retrieval system was important in determining whether the system was affected detrimentally when retrieving from a mixed collection. 1 Introduction Over the past few years the field of Information Retrieval (IR) has directed increasing interest towards the retri...

Modular System Design for Multimedial Information Handling

by Botond Pakucs, Björn Gambäck, Preben Hansen
"... Often, information retrieval from various other media is analogous to text-based retrieval; however, accessing documents in e.g. audio or video formats causes some extra problems, in particular with respect to document segmentation, choice of indexing features, and robustness. We review these diffic ..."
Abstract - Add to MetaCart
Often, information retrieval from various other media is analogous to text-based retrieval; however, accessing documents in e.g. audio or video formats causes some extra problems, in particular with respect to document segmentation, choice of indexing features, and robustness. We review these difficulties, together with some previous attempts to overcome them, and then describe a very flexible, modular IR system which has been designed with a specific eye towards these issues.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University