Empirical Development of an Exponential Probabilistic Model for Text Retrieval: Using Textual Analysis to Build a Better Model (2003)

by Jaime Teevan , David R. Karger
Venue:In Proceedings of the 26th Annual ACM Conference on Research and Development in Information Retrieval
Citations:11 - 0 self

Documents Related by Co-Citation

107 Tackling the Poor Assumptions of Naive Bayes Text Classifiers – Jason D. M. Rennie, Lawrence Shih, Jaime Teevan, David R. Karger - 2003
753 A comparison of event models for Naive Bayes text classification – Andrew McCallum, Kamal Nigam - 1998
877 A Language Modeling Approach to Information Retrieval – Jay M. Ponte, W. Bruce Croft - 1998
2350 Latent dirichlet allocation – David M. Blei, Andrew Y. Ng, Michael I. Jordan, John Lafferty - 2003
79 Distribution of content words and phrases in text and language modelling – S Katz - 1996
49 Modeling word burstiness using the Dirichlet distribution – Rasmus E. Madsen, David Kauchak, Charles Elkan - 2005
304 Document Language Models, Query Models, and Risk Minimization for Information Retrieval – John Lafferty, Chengxiang Zhai - 2001
352 Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval – S. E. Robertson, S. Walker - 1994
592 K.: Relevance weighting of search terms – S E Robertson, Sparck Jones
49 Probabilistic models of indexing and searching – S Robertson, C V Rijsbergen, M Porter - 1981
369 On Discriminative vs. Generative classifiers: A comparison of logistic regression and naive Bayes – Andrew Y. Ng, Michael I. Jordan - 2001
1686 Text Categorization with Support Vector Machines: Learning with Many Relevant Features – Thorsten Joachims - 1998
261 Using Maximum Entropy for Text Classification – Kamal Nigam, John Lafferty, Andrew Mccallum - 1999
696 A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval – Chengxiang Zhai, John Lafferty
346 Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval – David D. Lewis - 1998
6 Parametric Models of Linguistic Count Data – Martin Jansche - 2003
319 Human Behaviour and the Principle of Least Effort – G Zipf - 1949
91 A probabilistic approach to automatic keyword indexing. part I: On the distribution of specialty words words in a technical literature – S P HARTER - 1975
47 A GENERATIVE THEORY OF RELEVANCE – Victor Lavrenko - 2004