• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 140
Next 10 →

Query expansion using lexical-semantic relations

by Ellen M. Voorhees - In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval , 1994
"... Applications such as office automation, news filtering, help facilities in complex systems, and the like require the ability to retrieve documents from full-text databases where vocabulary problems can be particularly severe. Experiments performed on small collections with single-domain thesauri sug ..."
Abstract - Cited by 395 (1 self) - Add to MetaCart
suggest that expanding query vectors with words that are lexically related to the original query words can ameliorate some of the problems of mismatched vocabularies. This paper examines the utility of lexical query expansion in the large, diverse TREC collection. Concepts are represented by Word

The Effect of L1-L2 Lexicalization Mismatch on Incidental Acquisition of Receptive Vocabulary Knowledge

by Mohammad Ali Heidari-shahreza
"... The researcher was interested to explore the possible effects of L1-L2 lexicalization mismatch on the acquisition and retention of receptive vocabulary knowledge by 90 Persian-speaking EFL learners. Lexicalization mismatch was defined in the context of this study as the lack of a lexically equivalen ..."
Abstract - Add to MetaCart
The researcher was interested to explore the possible effects of L1-L2 lexicalization mismatch on the acquisition and retention of receptive vocabulary knowledge by 90 Persian-speaking EFL learners. Lexicalization mismatch was defined in the context of this study as the lack of a lexically

Translation events in cross-language information retrieval: lexical ambiguity, lexical holes, vocabulary mismatch, and correct translations

by Anne R. Diekema , 2003
"... Cross-Language Information Retrieval (CLIR) systems enable users to formulate queries in their native language to retrieve documents in foreign languages. Because queries and documents in CLIR do not necessarily share the same language, translation is needed before matching can take place. This tran ..."
Abstract - Cited by 5 (0 self) - Add to MetaCart
Cross-Language Information Retrieval (CLIR) systems enable users to formulate queries in their native language to retrieve documents in foreign languages. Because queries and documents in CLIR do not necessarily share the same language, translation is needed before matching can take place. This translation step tends to cause a reduction in the retrieval performance of CLIR as compared to monolingual information retrieval. The prevailing CLIR approach and the focus of this study is query translation. The translation of queries is inherently difficult due to the lack of a one-to-one mapping of a lexical item and its meaning, which creates lexical ambiguity. This, and other translation problems, result in translation errors which impact CLIR performance. To understand the events occurring in cross-language retrieval query translation and the relation of these events to retrieval performance, the study explored the following research questions: 1) What kinds of translation events affect cross-language retrieval? 2) In what way does the presence of certain translation events in query translation affect retrieval performance? The study followed a two-phase multi-method approach. In phase one, a taxonomy of translation events was created through content analysis of queries and their translations in combination with an examination of the literature. In the second and final phase, a subset of the test queries was coded using the taxonomy resulting from phase one. These queries were then used in information retrieval experimentation to assess the impact of the translation events on retrieval performance.

When a Mismatch Can Be Good: Large vocabulary speech recognition trained with idealized Tandem features

by Arlo Faria, Nelson Morgan
"... This paper explores Tandem feature extraction used in a large-vocabulary speech recognition system. In this frame-work a multi-layer perceptron estimates phone probabilities which are treated as acoustic observations in a traditional HMM-GMM system. To determine a lower error bound, we simulated an ..."
Abstract - Add to MetaCart
This paper explores Tandem feature extraction used in a large-vocabulary speech recognition system. In this frame-work a multi-layer perceptron estimates phone probabilities which are treated as acoustic observations in a traditional HMM-GMM system. To determine a lower error bound, we simulated

Visual Word Ambiguity

by J. C. van Gemert, C. J. Veenman, A. W. M. Smeulders, J. M. Geusebroek - ACCEPTED IN IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
"... This paper studies automatic image classification by modeling soft-assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distributions of visual words in an image allow classification. One inh ..."
Abstract - Cited by 140 (11 self) - Add to MetaCart
This paper studies automatic image classification by modeling soft-assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distributions of visual words in an image allow classification. One

Large Vocabulary Continuous Speech Recognition

by Terrence Martin, Kishan Thambiratnam, Sridha Sridharan
"... The task of porting Automatic Speech Recognition (ASR) technology to many languages is hindered by a lack of transcribed acoustic data, which in turn prevents the development of accurate acoustic models necessary for the recognition task. To overcome this problem, recent research has sought to explo ..."
Abstract - Add to MetaCart
to exploit the similarity of sounds across languages, and use this similarity to adapt models from one or more data rich languages for use in recognising data poor target languages. Pronunciation variation and cross language context mismatch combine to make the task more difficult then a monolingual ASR

Scoring missing terms in information retrieval tasks

by Egidio Terra - In ACM Thirteenth Conference on Information and Knowledge Management (CIKM-2004 , 2004
"... An usual approach to address mismatching vocabulary problem is to augment the original query using dictionaries and other lexical resources and/or by looking at pseudo-relevant documents. Either way, terms are added to form a new query that will be used to score all documents in a subsequent retriev ..."
Abstract - Cited by 15 (0 self) - Add to MetaCart
An usual approach to address mismatching vocabulary problem is to augment the original query using dictionaries and other lexical resources and/or by looking at pseudo-relevant documents. Either way, terms are added to form a new query that will be used to score all documents in a subsequent

problems in Information Retrieval Systems. Query

by unknown authors
"... Abstract—Vocabulary and Word mismatches are common ..."
Abstract - Add to MetaCart
Abstract—Vocabulary and Word mismatches are common

A.: Finding consensus among words: lattice-based word error minimization. Eurospeech

by Lidia Mangu , Eric Brill , Andreas Stolcke , 1999
"... ABSTRACT We describe a new algorithm for finding the hypothesis in a recognition lattice that is expected to minimize the word error rate (WER). Our approach thus overcomes the mismatch between the word-based performance metric and the standard MAP scoring paradigm that is sentence-based, and that ..."
Abstract - Cited by 120 (10 self) - Add to MetaCart
ABSTRACT We describe a new algorithm for finding the hypothesis in a recognition lattice that is expected to minimize the word error rate (WER). Our approach thus overcomes the mismatch between the word-based performance metric and the standard MAP scoring paradigm that is sentence

Joint uncertainty decoding for robust large vocabulary speech recognition

by H. Liao, M. J. F. Gales , 2006
"... Standard techniques to increase automatic speech recognition noise robustness typically assume recognition models are clean trained. This “clean ” training data may in fact not be clean at all, but may contain channel variations, varying noise conditions, as well as different speakers. Hence rather ..."
Abstract - Cited by 36 (28 self) - Add to MetaCart
than considering noise robustness techniques as compensating clean acoustic models for environmental noise, they may be thought of as reducing the acoustic mismatch between training and test conditions. This report examines the application of VTS model compensation or model-based Joint uncertainty
Next 10 →
Results 1 - 10 of 140
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University