A Scalable Self-organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation (1998)
| Venue: | Communication Cognition and Artificial Intelligence, Spring |
| Citations: | 23 - 5 self |
BibTeX
@ARTICLE{Roussinov98ascalable,
author = {Dmitri G. Roussinov and Hsinchun Chen},
title = {A Scalable Self-organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation},
journal = {Communication Cognition and Artificial Intelligence, Spring},
year = {1998},
volume = {15},
pages = {81--112}
}
Abstract
The rapid proliferation of textual and multimedia online databases, digital libraries, Internet servers, and intranet services has turned researchers' and practitioners' dream of creating an information-rich society into a nightmare of info-gluts. Many researchers believe that turning an info-glut into a useful digital library requires automated techniques for organizing and categorizing large-scale information. This paper presents research in which we sought to develop a scalable textual classification and categorization system based on Kohonen's self-organizing feature map (SOM) algorithm. In our paper, we show how self-organization can be used for automatic thesaurus generation. Our proposed data structure and algorithm took advantage of the sparsity of coordinates in the document input vectors and reduced the SOM computational complexity by several orders of magnitude. The proposed Scalable SOM (SSOM) algorithm makes large-scale textual categorization tasks feasible. A...
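
The abstract does not spell out the data structure, so the following is only a minimal sketch of one common way sparsity can reduce the cost of the SOM winner search. It is not the authors' SSOM algorithm. It assumes a document is stored as a mapping from term index to nonzero value, and uses the identity ||x - w||^2 = ||x||^2 - 2 x.w + ||w||^2, so that with ||w||^2 cached per node only the nonzero coordinates of x need to be visited. All function names here (`squared_norms`, `sparse_bmu`, `dense_bmu`) are hypothetical and chosen for illustration.

```python
from typing import Dict, List


def squared_norms(weights: List[List[float]]) -> List[float]:
    """Cache ||w||^2 for every map node (assumed to be kept up to date elsewhere)."""
    return [sum(w * w for w in row) for row in weights]


def sparse_bmu(weights: List[List[float]],
               sq_norms: List[float],
               doc: Dict[int, float]) -> int:
    """Find the best-matching node for a sparse document vector.

    `doc` maps term index -> nonzero value. ||x||^2 is the same for
    every node, so it is dropped from the comparison.
    """
    best_node, best_score = -1, float("inf")
    for node, row in enumerate(weights):
        # Only the nonzero coordinates of the document are touched.
        dot = sum(value * row[term] for term, value in doc.items())
        score = sq_norms[node] - 2.0 * dot
        if score < best_score:
            best_node, best_score = node, score
    return best_node


def dense_bmu(weights: List[List[float]], doc_dense: List[float]) -> int:
    """Reference winner search over every coordinate, for comparison."""
    dists = [sum((w - x) ** 2 for w, x in zip(row, doc_dense))
             for row in weights]
    return dists.index(min(dists))


# Toy map with 3 nodes over a 5-term vocabulary (made-up numbers).
weights = [
    [0.9, 0.1, 0.0, 0.8, 0.0],
    [0.0, 0.7, 0.6, 0.0, 0.1],
    [0.2, 0.2, 0.2, 0.2, 0.2],
]
doc = {0: 1.0, 3: 1.0}  # document containing only terms 0 and 3
winner = sparse_bmu(weights, squared_norms(weights), doc)
```

In this sketch the per-node cost of the winner search depends on the number of nonzero terms in the document rather than on the vocabulary size. The weight-update step, which the abstract also implies is optimized, is not covered here.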







