Results 1 -
2 of
2
Identifying the Subject of Documents in Digital Libraries Automatically Using
, 2002
"... Contemporary information databases contain millions of electronic documents. The immense number of documents makes it difficult to conduct efficient searches on the Internet. Several studies have found that associating documents with a subject or list of topics can make them easier to locate online ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Contemporary information databases contain millions of electronic documents. The immense number of documents makes it difficult to conduct efficient searches on the Internet. Several studies have found that associating documents with a subject or list of topics can make them easier to locate online [5] [6] [7]. Effective cataloging of information is performed manually, requiring extensive resources. Consequently, at present most information is not cataloged. This paper will present the findings of a study based on a software tool (TextAnalysis) that automatically identifies the subject of a document. We tested documents in two subject categories: geography and family studies. The present study follows an earlier one that examined the subject categories of industrial management and general management.
Using Text Analysis to Inform Clients of the Subject of a Document
, 2003
"... Contemporary informa tion databases contain many millions of electronic documents. Locating information on the Internet today is problematic, due to the enormous number of documents it contains. Several other studies have found that associating documents with a subject or list of topics can improve ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Contemporary informa tion databases contain many millions of electronic documents. Locating information on the Internet today is problematic, due to the enormous number of documents it contains. Several other studies have found that associating documents with a subject or list of topics can improve locatability of information on the Internet (Drori, 2000a 2000b 2000c). Effective cataloguing of information is performed manually, requiring extensive resources. Consequently, most information is currently not catalogued. This paper aims to present a software tool that automatically locates the subject of a document and to show the results of a test performed, using the software tool, TextAnalysis, specially developed for this purpose. The main purpose of this study is to inform clients of the subject of the corpus of texts it obtains from search engines as a search results list.

