• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Resolving translation ambiguity and target polysemy in crosslanguage information retrieval (1999)

by H H Chen, G W Bian, W C Lin
Venue:In ACL 99
Add To MetaCart

Tools

Sorted by:
Results 1 - 9 of 9

Japanese/English Cross-Language Information Retrieval: Exploration of Query . . .

by Atsushi Fujii, Tetsuya Ishikawa - COMPUTERS AND THE HUMANITIES , 2001
"... Cross-language information retrieval (CLIR), where queries and documents are in different languages, has of late become one of the major topics within the information retrieval community. This paper ..."
Abstract - Cited by 21 (8 self) - Add to MetaCart
Cross-language information retrieval (CLIR), where queries and documents are in different languages, has of late become one of the major topics within the information retrieval community. This paper

Morphological typology of languages for IR

by Ari Pirkola - Journal of Documentation , 2001
"... This paper presents a morphological classification of languages from the IR perspective. Linguistic typology research has shown that the morphological complexity of every language in the world can be described by two variables, index of synthesis and index of fusion. These variables provide a theore ..."
Abstract - Cited by 20 (2 self) - Add to MetaCart
This paper presents a morphological classification of languages from the IR perspective. Linguistic typology research has shown that the morphological complexity of every language in the world can be described by two variables, index of synthesis and index of fusion. These variables provide a theoretical basis for IR research handling morphological issues. A common theoretical framework is needed in particular because of the increasing significance of cross-language retrieval research and CLIR systems processing different languages. The paper elaborates the linguistic morphological typology for the purposes of IR research. It studies how the indexes of synthesis and fusion could be used as practical tools in mono- and cross-lingual IR research. The need for semantic and syntactic typologies is discussed. The paper also reviews studies made in different languages on the effects of morphology and stemming in IR. 1.

A Multilingual News Summarizer

by Hsin-Hsi Chen, Chuan-Jie Lin - IN PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS , 2000
"... Huge multilingual news articles are reported anct dissolninatod on the Internot. itow to oxlract the key inforlnation and save the reading time is a crucial issue. This paper proposes architluro of multilingual news sumlnarizer, including monolingual and multilingual clustering, similarity mea ..."
Abstract - Cited by 10 (1 self) - Add to MetaCart
Huge multilingual news articles are reported anct dissolninatod on the Internot. itow to oxlract the key inforlnation and save the reading time is a crucial issue. This paper proposes architluro of multilingual news sumlnarizer, including monolingual and multilingual clustering, similarity measure among moauingfid units, and proscntatiou of smnmarization results. Translation among news stories, idiosyncrasy among htnguages, implicit information, aud user prellronco are addressed.

Structured Translation for Cross-Language Information Retrieval

by Ruth Sperer, Douglas W. Oard - In ACM SIGIR , 2000
"... The paper introduces a query translation model that re ects the structure of the cross-language information retrieval task. The model is based on a structured bilingual dictionary in which the translations of each term are clustered into groups with distinct meanings. Query translation is modeled as ..."
Abstract - Cited by 9 (0 self) - Add to MetaCart
The paper introduces a query translation model that re ects the structure of the cross-language information retrieval task. The model is based on a structured bilingual dictionary in which the translations of each term are clustered into groups with distinct meanings. Query translation is modeled as a two-stage process, with the system rst determining the intended meaning of a query term and then selecting translations appropriate to that meaning that might appear in the document collection. An implementation of structured translation based on automatic dictionary clustering is described and evaluated by using Chinese queries to retrieve English documents. Structured translation achieved an average precision that was statistically indistinguishable from Pirkola's technique for very short queries, but Pirkola's technique outperformed structured translation on long queries. The paper concludes with some observations on future work to improve retrieval e ectiveness and on other potential uses of structured translation in interactive cross-language retrieval applications. 1.

Building a Chinese-English WordNet for Translingual Applications

by Hsin-hsi Chen, Chi-ching Lin, Wen-cheng Lin - ACM Transactions on Asian Languages Information Processing , 2002
"... A WordNet-like linguistic resource is useful, but difficult to construct. This article proposes a method to integrate five linguistic resources, including English/Chinese sense-tagged corpora, English/Chinese thesauruses, and a bilingual dictionary. Chinese words are mapped into WordNet. A Chinese W ..."
Abstract - Cited by 6 (0 self) - Add to MetaCart
A WordNet-like linguistic resource is useful, but difficult to construct. This article proposes a method to integrate five linguistic resources, including English/Chinese sense-tagged corpora, English/Chinese thesauruses, and a bilingual dictionary. Chinese words are mapped into WordNet. A Chinese WordNet and a Chinese-English WordNet are derived by following the structures of WordNet. Experiments with Chinese-English information retrieval are developed to evaluate the applicability of the Chinese-English WordNet. The best model achieves 0.1010 average precision, 69.23 % of monolingual information retrieval. It also gains a 10.02 % increase relative to a model that resolves translation ambiguity and target polysemy problems together.

Cross-Language Image Retrieval via Spoken Query

by Wen-cheng Lin, Ming-shun Lin, Hsin-hsi Chen - Proceedings of RIAO 2004: Coupling Approaches, Coupling Media and Coupling Languages for Information Retrieval , 2004
"... This paper studies cross-language cross-medium information retrieval. We introduce several approaches to unify the languages and media of queries and documents. We experiment on cross-language image retrieval via spoken query. Two approaches are proposed to recognize and translate spoken queries. We ..."
Abstract - Cited by 4 (3 self) - Add to MetaCart
This paper studies cross-language cross-medium information retrieval. We introduce several approaches to unify the languages and media of queries and documents. We experiment on cross-language image retrieval via spoken query. Two approaches are proposed to recognize and translate spoken queries. We also propose a similarity-based approach to identify and backward transliterate named entities in a spoken query. 1.

Spoken Cross-Language Access to Image Collection via Captions

by Hsin-hsi Chen - In Proceedings of Eurospeech 2003 , 2003
"... This paper presents a framework of using Chinese speech to access images via English captions. The formulation and the structure mapping rules of Chinese and English named entities are extracted from an NICT foreign location name corpus. For ..."
Abstract - Cited by 2 (2 self) - Add to MetaCart
This paper presents a framework of using Chinese speech to access images via English captions. The formulation and the structure mapping rules of Chinese and English named entities are extracted from an NICT foreign location name corpus. For

Merging Mechanisms in Multilingual Information Retrieval

by Wen-cheng Lin, Hsin-hsi Chen
"... National Taiwan University (NTU) Natural Language Processing Laboratory (NLPL) participated in MLIR task in CLEF 2002. We submitted five official multilingual runs. In this paper, we try to resolve the collection fusion problem. We experimented with several merging strategies that merge the results ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
National Taiwan University (NTU) Natural Language Processing Laboratory (NLPL) participated in MLIR task in CLEF 2002. We submitted five official multilingual runs. In this paper, we try to resolve the collection fusion problem. We experimented with several merging strategies that merge the results of several intermediate runs. 1.

Structured Translation for Cross-Language Information Retrieval

by Ruth Sperer Emotion, Ruth Sperer, Douglas W. Oard - In ACM SIGIR , 2000
"... The paper introduces a query translation model that reflects the structure of the cross-language information retrieval task. The model is based on a structured bilingual dictionary in which the translations of each term are clustered into groups with distinct meanings. Query translation is modeled a ..."
Abstract - Add to MetaCart
The paper introduces a query translation model that reflects the structure of the cross-language information retrieval task. The model is based on a structured bilingual dictionary in which the translations of each term are clustered into groups with distinct meanings. Query translation is modeled as a two-stage process, with the system first determining the intended meaning of a query term and then selecting translations appropriate to that meaning that might appear in the document collection. An implementation of structured translation based on automatic dictionary clustering is described and evaluated by using Chinese queries to retrieve English documents. Structured translation achieved an average precision that was statistically indistinguishable from Pirkola's technique for very short queries, but Pirkola's technique outperformed structured translation on long queries. The paper concludes with some observations on future work to improve retrieval effectiveness and on other potent...
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University