NewsWeeder: Learning to Filter Netnews (1995) [299 citations — 0 self]
http://www.fi.muni.cz/usr/popelinsky/newsweeder.ps
http://www.cse.fau.edu/~xqzhu/cap5615/reading/news
CiteULike | DBLP
CACHED:
Abstract:
A significant problem in many information filtering systems is the dependence on the user for the creation and maintenance of a user profile, which describes the user's interests. NewsWeeder is a netnews-filtering system that addresses this problem by letting the user rate his or her interest level for each article being read (1-5), and then learning a user profile based on these ratings. This paper describes how NewsWeeder accomplishes this task, and examines the alternative learning methods used. The results show that a learning algorithm based on the Minimum Description Length (MDL) principle was able to raise the percentage of interesting articles to be shown to users from 14% to 52% on average. Further, this performance significantly outperformed (by 21%) one of the most successful techniques in Information Retrieval (IR), termfrequency /inverse-document-frequency (tf-idf) weighting. 1
Citations
| 441 | Using collaborative filtering to weave an information tapestry – Goldberg, Nichols, et al. - 1992 |
| 97 | Modelling by the shortest data description. Automatica – Rissanen - 1978 |
| 89 | A Mathematical Theory of – Shannon - 1948 |
| 83 | An example-based mapping method for text categorization and retrieval – Yang, Chute - 1994 |
| 69 | A learning approach to personalized information filtering – Sheth - 1994 |
| 58 | Applied Bayesian and Classical Inference: The Case of the Federalist Papers – Mosteller, Wallace - 1984 |
| 55 | Index structures for selective dissemination of information – YAN, GARCIA-MOLINA - 1992 |
| 20 | et al., “GroupLens: An Open Architecture for Collaborative Filtering of Netnews – Resnick - 1994 |
| 4 | et al. Using Latent Semantic Analysis to Improve Access to Textual Information – Dumais - 1988 |
| 1 | et al. A Summary of the CLARIT – Evans - 1991 |

