MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Model-based Feedback in the Language Modeling Approach to Information Retrieval (2001) [68 citations — 5 self]

by Chengxiang Zhai ,  John Lafferty
In Proceedings of Tenth International Conference on Information and Knowledge Management
Add To MetaCart

Abstract:

The language modeling approach to retrieval has been shown to perform well empirically. One advantage of this new approach is its statistical foundations. However, feedback, as one important component in a retrieval system, has only been dealt with heuristically in this new retrieval approach: the original query is usually literally expanded by adding ditional terms to it. Such expansion-based feedback creates an inconsistent interpretation of the original and the expanded query. In this paper, we present a more principled approach to feedback in the language modeling approach. Specifically, we treat feedback as updating the query language model based on the extra evidence carried by the feedback documents. Such a model-based feedback strategy easily fits into an extension of the language modeling approach. We propose and evaluate two different approaches to updating a query language model based on feedback documents, one based on a generarive probabilistic model of feedback documents and one based on minimization of the KL-divergence over feedback documents. Experiment resuits show that both approaches are effective and outperform the Rocchio feedback approach.

Citations

4923 Elements of Information Theory – Cover, Thomas - 1991
4735 Maximum Likelihood from incomplete data via the EM algorithm – Dempster, Laird, et al. - 1977
594 Relevance feedback in information retrieval – Rocchio - 1971
472 A language modeling approach to information retrieval – Ponte, Croft - 1998
411 Relevance Weighting of Search Terms – Robertson, Sparck-Jones - 1976
280 A study of smoothing methods for language models applied to ad hoc information retrieval – Zhai, Lafferty
168 Information retrieval as statistical translation – Berger, Lafferty - 1999
164 Document language models, query models, and risk minimization for information retrieval – Lafferty, Zhai - 2001
153 Using Language Models for Information Retrieval – Hiemstra - 2001
148 Relevance-based language models – Lavrenko, Croft - 2001
132 A hidden Markov model information retrieval system – MILLER, LEEK, et al. - 1999
113 Okapi/Keenbow at TREC-8 – Robertson, Walker - 2000
106 A general language model for information retrieval – Song, Croft - 1999
104 Cluster-based language models for distributed retrieval – Xu, Croft - 1999
87 Twenty-one at TREC-7: Ad hoc and cross language track – Hiemstra, Kraaij
37 A maximum likelihood ratio information retrieval model – Ng - 1999
20 A probability distribution model for information retrieval – Wong, Yao - 1989