The paper combines a comprehensive account of the probabilistic model of retrieval with new systematic experiments on TREC Programme material. It presents the model from its foundations through its logical development to cover more aspects of retrieval data and a wider range of system functions. Each step in the argument is matched by comparative retrieval tests, to provide a single coherent account of a major line of research. The experiments demonstrate, for a large test collection, that the probabilistic model is effective and robust, and that it responds appropriately, with major improvements in performance, to key features of retrieval situations.
|
2329
|
Introduction to modern information retrieval
– Salton
- 1983
|
|
1102
|
An Algorithm for Suffix Stripping
– Porter
- 1980
|
|
915
|
Term-weighting approaches in automatic text retrieval
– Salton, Buckley
- 1988
|
|
594
|
Relevance feedback in information retrieval
– Rocchio
- 1971
|
|
465
|
Improving retrieval performance by relevance feedback
– Salton, Buckley
- 1990
|
|
411
|
Relevance Weighting of Search Terms
– Robertson, Sparck-Jones
- 1976
|
|
215
|
Some simple effective approximations to 2-Poisson method for probabilistic weighted retrieval
– Robertson, Walker
- 1994
|
|
189
|
Inference networks for document retrieval
– Turtle, Croft
- 1990
|
|
184
|
A statistical interpretation of term specificity and its application in retrieval
– Jones, K
- 1972
|
|
172
|
Evaluation of an Inference Network-Based Retrieval Model
– Turtle, Croft
- 1991
|
|
153
|
The Probability Ranking Principle in IR
– Robertson
- 1977
|
|
149
|
A non-classical logic for information retrieval
– Rijsbergen
- 1986
|
|
127
|
On relevance, probabilistic indexing, and information retrieval
– Maron, Kuhns
- 1960
|
|
121
|
Using probabilistic models of document retrieval without relevance information. Readings in information retrieval
– Croft, Harper
- 1979
|
|
120
|
New retrieval approaches using SMART: TREC 4
– Buckley, Singhal, et al.
- 1996
|
|
94
|
On term selection for query expansion
– Robertson
- 1990
|
|
90
|
A theoretical basis for the use of co-occurrence data in information retrieval
– Rijsbergen
- 1977
|
|
75
|
Analysis of Binary Data
– Cox, Snell
- 1989
|
|
75
|
A probabilistic learning approach for document indexing
– Fuhr, Buckley
- 1991
|
|
72
|
The limitations of term cooccurrence data for query expansion in document retrieval systems
– Peat, Willett
- 1991
|
|
54
|
New experiments in relevance feedback
– Ide
- 1971
|
|
52
|
An analysis of statistical and syntactic phrases
– Mitra, Buckley, et al.
|
|
50
|
Full text retrieval based on probabilistic equations with coefficients fitted by logistic regression
– Cooper, Chen, et al.
- 1994
|
|
50
|
A probabilistic approach to automatic keyword indexing (part i & ii
– Harter
- 1975
|
|
46
|
On relevance weights with little relevance information
– Robertson, Walker
- 1997
|
|
45
|
Retrieving spoken documents by combining multiple index sources
– Jones, Foote, et al.
- 1996
|
|
40
|
Probabilistic models of indexing and searching
– Robertson, van-Rijsbergen, et al.
- 1981
|
|
37
|
An evaluation of feedback in document retrieval using co-occurrence data
– HARPER, RIJSBERGEN
- 1978
|
|
32
|
Probability of relevance: a unification of two competing models for information retrieval
– Robertson, Maron, et al.
- 1982
|
|
27
|
A network approach to probabilistic information retrieval
– Kwok
- 1995
|
|
25
|
The probability ranking principle
– Robertson
- 1997
|
|
24
|
A theory of indexing
– Salton
- 1975
|
|
21
|
The automatic indexing system AIR/PHYS — from research to application
– Biebricher, Fuhr, et al.
- 1988
|
|
20
|
Foundations of probabilistic and utility-theoretic indexing
– Cooper, Maron
- 1978
|
|
19
|
Experiments in relevance weighting of search terms
– Jones, K
- 1979
|
|
19
|
What is the role of NLP in Text Retrieval
– Jones
- 1999
|
|
18
|
Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval
– Cooper
- 1995
|
|
17
|
Search term relevance weighting given little relevance information
– Jones, K
- 1979
|
|
16
|
Overview of TREC-6 Very Large Collection Track
– Hawking, Thistlewaite
- 1997
|
|
14
|
Information retrieval (2nd ed
– RIJSBERGEN
- 1979
|
|
13
|
A test for the separation of relevant and non-relevant documents in experimental retrieval collections
– Rijsbergen, Jones, et al.
- 1973
|
|
9
|
Statistical Problems in the Application of Probabilistic Models to Information Retrieval
– Robertson, Bovey
- 1982
|
|
8
|
MEDLARS: Report on the evaluation of its operating efficiency
– Lancaster
- 1969
|
|
6
|
Probabilistic learning approaches for indexing and retrieval with the TREC-2 collection
– Fuhr, Pfeifer, et al.
- 1994
|
|
5
|
Summary performance comparisons: TREC-2, TREC-3, TREC-4, TREC-5, TREC-6
– Jones, K
- 1998
|
|
5
|
Automatic text structuring and summarisation
– Salton, Singhal, et al.
- 1997
|
|
5
|
Spanish and Chinese document retrieval in TREC-5
– Smeaton, Wilkinson
- 1997
|
|
3
|
Research on relevance weighting 1976–1979
– Jones, K, et al.
- 1980
|
|
3
|
A proposal for a task-based evaluation of text summarisation systems
– Hand
- 1997
|
|
2
|
A performance yardstick for test collections
– Jones, K
- 1975
|