INTELLIGENCE CHINESE DOCUMENT SEMANTIC INDEXING SYSTEM
BibTeX
@MISC{Shi_intelligencechinese,
author = {Zhongzhi Shi and Qing He and Ziyan Jia and Jiayou Li},
title = {INTELLIGENCE CHINESE DOCUMENT SEMANTIC INDEXING SYSTEM},
year = {}
}
OpenURL
Abstract
With the rapid growth of the Internet, how to get information from this huge information space becomes an even more important problem. In this paper, An Intelligence Chinese Document Semantic Indexing System; ICDSIS, is proposed. Some new technologies are integrated in ICDSIS to obtain good performance. ICDSIS is composed of four key procedures. A parallel, distributed and configurable Spider is used for information gather; a multi-hierarchy document classification approach combining the information gain initially processes gathered web documents; a swarm intelligence based document clustering method is used for information organization; a concept-based retrieval interface is applied for user interactive retrieval. ICDSIS is an all-sided solution for information retrieval on the Internet.







