A Probabilistic Learning Approach for Document Indexing (1991)
Cached
Download Links
- [ls6-www.informatik.uni-dortmund.de]
- [www.doc.ic.ac.uk]
- DBLP
Other Repositories/Bibliography
| Venue: | ACM TRANSACTIONS ON INFORMATION SYSTEMS |
| Citations: | 84 - 12 self |
BibTeX
@ARTICLE{Fuhr91aprobabilistic,
author = {Norbert Fuhr and Chris Buckley},
title = {A Probabilistic Learning Approach for Document Indexing},
journal = {ACM TRANSACTIONS ON INFORMATION SYSTEMS},
year = {1991},
volume = {9},
pages = {223--248}
}
Years of Citing Articles
OpenURL
Abstract
We describe a method for probabilistic document indexing using relevance feedback data that has been collected from a set of queries. Our approach is based on three new concepts: (1) Abstraction from specific terms and documents, which overcomes the restriction of limited relevance information for parameter estimation. (2) Flexibility of the representation, which allows the integration of new text analysis and knowledge-based methods in our approach as well as the consideration of document structures or different types of terms. (3) Probabilistic learning or classification methods for the estimation of the indexing weights making better use of the available relevance information. Our approach can be applied under restrictions that hold for real applications. We give experimental results for five test collections which show improvements over other indexing methods.







