A Comparative Study of Query and Document Translation for Cross-language Information Retrieval
- In Proceedings of the Third Conference of the Association for Machine Translation in the Americas
, 1998
- Cited by 62 (5 self)
Cross-language retrieval systems use queries in one natural language to guide retrieval of documents that might be written in another. Acquisition and representation of translation knowledge plays a central role in this process. This paper explores the utility of two sources of translation knowledge for cross-language retrieval. We have implemented six query translation techniques that use bilingual term lists and one based on direct use of the translation output from an existing machine translation system; these are compared with a document translation technique that uses output from the same machine translation system. Average precision measures on a TREC collection suggest that arbitrarily selecting a single dictionary translation is typically no less effective than using every translation in the dictionary, that query translation using a machine translation system can achieve somewhat better effectiveness than simpler techniques, and that document translation may result in further improvements in retrieval effectiveness under some conditions.
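The two dictionary-based strategies contrasted in this abstract (keeping every listed translation versus arbitrarily keeping one) can be illustrated with a minimal sketch. The term list, the function name, and the choice of "first listed entry" as the arbitrary selection are all assumptions made for illustration; they are not the paper's implementation or data.

```python
# Hypothetical toy English-to-Spanish term list; not data from the paper.
TERM_LIST = {
    "language": ["idioma", "lengua", "lenguaje"],
    "retrieval": ["recuperación"],
    "document": ["documento"],
}


def translate_query(query, term_list, every_translation=True):
    """Translate a query term by term using a bilingual term list.

    every_translation=True keeps all listed translations of each term;
    False keeps only the first listed translation as an arbitrary single
    choice. Terms missing from the list are passed through unchanged.
    """
    translated = []
    for term in query.lower().split():
        candidates = term_list.get(term, [term])
        translated.extend(candidates if every_translation else candidates[:1])
    return translated
```

Calling `translate_query("language retrieval", TERM_LIST)` expands the ambiguous term into all three candidates, while passing `every_translation=False` produces a query of the same length as the original.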
Alternative approaches for cross-language text retrieval
- In AAAI Symposium on cross-language text and speech retrieval. American Association for Artificial Intelligence
, 1997
- Cited by 60 (5 self)
The explosive growth of the Internet and other sources of networked information have made automatic mediation of access to networked information sources an increasingly important problem. Much of this information
Cross-Language Information Retrieval with the UMLS Metathesaurus
- In: Proc. of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, 1998
- Cited by 34 (0 self)
We investigate an automatic method for Cross Language Information Retrieval (CLIR) that utilizes the multilingual UMLS Metathesaurus to translate Spanish and French natural language queries into English. Two experiments are presented using OHSUMED, a subset of MEDLINE. Both experiments examine retrieval effectiveness of the translated queries. However, in the second experiment, the query translation procedure is augmented with digram-based vocabulary normalization procedures. In this comparative study of retrieval effectiveness the measures used are: 11-point-average precision score (11-AvgP); average interpolated precision at recall of 0.1; and noninterpolated (i.e., exact) precision after 10 retrieved documents. Our results indicate that for Spanish the UMLS Metathesaurus-based CLIR method appears equivalent to multilingual dictionary-based approaches investigated in the current literature. French yields less favorable results, and our analysis suggests that linguistic differences may have caused the performance differences.
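The evaluation measures named in this abstract can be sketched for a single query as follows. This is a minimal illustration using the common textbook definitions (interpolated precision at a recall level is the best precision reached at that recall or higher); the function names and the input format (a ranked list of document ids plus a set of relevant ids) are assumptions, and the paper's exact evaluation procedure may differ.

```python
def precision_at_k(ranked, relevant, k=10):
    """Non-interpolated precision after k retrieved documents."""
    hits = sum(1 for doc in ranked[:k] if doc in relevant)
    return hits / k


def eleven_point_average_precision(ranked, relevant):
    """Mean interpolated precision at recall 0.0, 0.1, ..., 1.0."""
    if not relevant:
        return 0.0
    points = []  # (recall, precision) recorded at each relevant document
    hits = 0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            points.append((hits / len(relevant), hits / rank))
    total = 0.0
    for i in range(11):
        level = i / 10
        # Interpolated precision: best precision at any recall >= level.
        # The small tolerance guards against floating-point rounding.
        total += max((p for r, p in points if r >= level - 1e-9), default=0.0)
    return total / 11
```

Averaging either value over all queries in a test collection gives the collection-level score.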
Query translation by text categorization
- COLING
, 2004
- Cited by 6 (6 self)
We report on the development of a cross language information retrieval system, which translates user queries by categorizing these queries into terms listed in a controlled vocabulary. Unlike usual automatic text categorization systems, which rely on data-intensive models induced from large training data, our automatic text categorization tool applies data-independent classifiers: a vector-space engine and a pattern matcher are combined to improve ranking of Medical Subject Headings (MeSH). The categorizer also benefits from the availability of large thesauri, where variants of MeSH terms can be found. For evaluation, we use an English collection of MedLine records: OHSUMED. French OHSUMED queries, translated from the original English queries by domain experts, are mapped into French MeSH terms; then we use the MeSH controlled vocabulary as interlingua to translate French MeSH terms into English MeSH terms, which are finally used to query the OHSUMED document collection. The first part of the study focuses on the text to MeSH categorization task. We use a set of MedLine abstracts as input documents in order to tune the categorization system. The second part compares the performance of a machine translation-based cross language information retrieval (CLIR) system with the categorization-based system: the former results in a CLIR ratio close to 60%, while the latter achieves a ratio above 80%. A final experiment, which combines both approaches, achieves a result above 90%.
Survey: Multilingual Text Retrieval and Access. UNESCO Press. Retrieved from http://webplaza.pt.lu/craynaud/CLIR/SurveyMultilingualTextRetrievalandAccess.pdf Hall-Ellis
- Descriptive Impressions of Entry-Level Cataloger Positions as Reflected in American Libraries, AutoCAT, and the Colorado State Library Jobline, 2000-2003. Cataloging & Classification Quarterly
, 1999
SemEQUAL: Multilingual Semantic Matching in Relational Systems
- Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA)
, 2005
- Cited by 3 (1 self)
In an increasingly multilingual world, it is critical that information management tools organically support the simultaneous use of multiple natural languages. A pre-requisite for efficiently achieving this goal is that the underlying database engines must provide seamless matching of text data across languages. We propose here SemEQUAL, a new SQL functionality for semantic matching of multilingual attribute data. Our current implementation defines matches based on the standard WordNet linguistic ontologies. A performance evaluation of SemEQUAL, implemented using standard SQL:1999 features on a suite of commercial database systems, indicates unacceptably slow response times. However, by tuning the schema and index choices to match typical linguistic features, we show that the performance can be improved to a level commensurate with online user interaction.
A Comparative Study of Knowledge-Based Approaches for Cross-Language Information Retrieval
, 1998
- Cited by 2 (1 self)
Cross-language retrieval systems seek to use queries in one natural language to guide the retrieval of documents that might be written in another. Acquisition and representation of translation knowledge plays a central role in this process. This paper explores the utility of two sources of manually encoded translation knowledge, bilingual dictionaries and translation lexicons, for cross-language retrieval. We have implemented six query translation techniques that use bilingual dictionaries, one based on lexical-semantic analysis, and one based on direct use of the translation output from an existing machine translation system; these are compared with a document translation technique that uses output from the same existing translation system. Average precision measures on portions of the TREC collection suggest that arbitrarily selecting a single translation from a bilingual dictionary is typically no less effective than using every translation in the dictionary, that query translation u...
Accès Multilingue Aux Systèmes D'information (Multilingual Access to Information Systems)
, 2001
- Cited by 2 (0 self)
Introduction. The global information society has radically transformed the way knowledge is acquired, disseminated, and exchanged, bringing about a revolution in the world of libraries. Users of networked, internationally distributed collections need to be able to find, retrieve, and understand relevant information, whatever its language and form of storage. Many users have partial knowledge of foreign languages, but their ability may prove insufficient to correctly formulate the search queries suited to their information need. These users will be considerably helped if they can enter their query in their native language, because they are able to examine and extract information from relevant documents even when these are not translated. Monolingual users, for their part, will be able to use translation aids to understand search results in a ...
Integration of language attributes in Metadata and Customization for Domain Specific Repositories
The paper presents a system for a customized metadata facility adapted to expose domain-specific metadata along with language attributes. Quality metadata is crucial for efficient information retrieval. Use of a standard and common vocabulary leads to normalization of differences that arise due to geographic, cultural, and domain-specific environments, even amongst digital libraries with the same or similar collection scopes. The present work demonstrates implementation of language attributes (which are missing in the official version of DSpace) for metadata elements. As a case study we demonstrate the AGRIS (Agricultural Information System) application profile customized for metadata in DSpace repositories. It serves as a full text content management platform for the agricultural community, and especially the language-wise disparate and geographically distributed AGRIS network, through enriched domain metadata in standard format for description, management, and better metadata sharing.
Knowledge Framework for Intelligent Manufacturing Systems
, 2009
Nowadays, competition is experienced not only among companies but among global supply chains and business networks. There is a demand for intelligent world-class solutions capable of reinforcing partnerships and collaborations with an improved cross-cultural understanding. However, due to the proliferation of terminology, organizations from similar business environments have trouble cooperating and have difficulty electronically exchanging vital information, such as product and manufacturing data, even when using international standards. To address similar interoperability problems, the Intelligent Manufacturing Systems program