Abstract:
Recommender systems leverage product and community information to target products to consumers. Researchers have developed collaborative recommenders, content-based recommenders, and a few hybrid systems. We propose a unified probabilistic framework for merging collaborative and content-based recommendations. We extend Hofmann's aspect model to incorporate three-way co-occurrence data among users, items, and item content. The relative influence of collaboration data versus content data is not imposed as an exogenous parameter, but rather emerges naturally from the given data sources. However, global probabilistic models coupled with standard EM learning algorithms tend to drastically overfit in the sparse-data situations typical of recommendation applications. We show that secondary content information can often be used to overcome sparsity. Experiments on data from the ResearchIndex library of Computer Science publications show that appropriate mixture models incorporating secondary data produce significantly better quality recommenders than $k$-nearest neighbors ($k$-NN). Global probabilistic models also allow more general inferences than local methods like $k$-NN.
Citations
|
2329
|
Introduction to modern information retrieval
– Salton
- 1983
|
|
663
|
GroupLens: An open architecture for collaborative filtering of netnews
– Resnick, Iacovou, et al.
- 1994
|
|
620
|
Social Information Filtering: Algorithms for Automating "Word of Mouth" SIGCHI '95
– Shardanand, Maes
- 1995
|
|
590
|
The X Window System
– R, Gettys
- 1986
|
|
547
|
Empirical analysis of predictive algorithms for collaborative filtering
– Breese, Heckerman, et al.
- 1998
|
|
441
|
Using collaborative filtering to weave an information tapestry
– Goldberg, Nichols, et al.
- 1992
|
|
225
|
An efficient boosting algorithm for combining preferences
– Freund, Iyer, et al.
- 1998
|
|
219
|
Probabilistic latent semantic analysis
– Hofmann
- 1999
|
|
211
|
Digital Libraries and Autonomous Citation Indexing
– Lawrence, Giles, et al.
- 1999
|
|
196
|
Learning to order things
– Cohen, Shapire, et al.
- 1999
|
|
183
|
Learning collaborative information filters
– Billsus, Pazzani
- 1998
|
|
159
|
Recommendation as Classification: Using Social and Content-Based
– Basu, Hirsh, et al.
- 1998
|
|
140
|
Combining Collaborative Filtering with Personal Agents for Better Recommendations
– Good, Schafer, et al.
- 1999
|
|
126
|
Application of dimensionality reduction in recommender system–a case study
– Sarwar, Karypis, et al.
- 2000
|
|
114
|
Recommender systems in e-commerce
– Schafer, Konstan, et al.
- 1999
|
|
107
|
The missing link - a probabilistic model of document content and hypertext connectivity
– Cohn, Hofmann
- 2001
|
|
106
|
Content-based book recommending using learning for text categorization
– Mooney, Roy
- 2000
|
|
104
|
Collaborative Filtering by Personality Diagnosis: A Hybrid Memory- and Model-Based Approach
– Pennock, Horvitz, et al.
- 2000
|
|
92
|
Combining Content-Based and Collaborative Filters in an Online Newspaper
– Claypool, Gokhale, et al.
- 1999
|
|
92
|
Latent Class Models for Collaborative Filtering
– Hofmann, Puzicha
- 1999
|
|
41
|
Social choice theory and recommender systems: Analysis of the axiomatic foundations of collaborative filtering
– Pennock, Horvitz, et al.
- 2000
|
|
33
|
Collaborative filtering using weighted majority prediction algorithms
– Nakamura, Abe
- 1998
|
|
20
|
Discovering relevant scientific literature on the Web
– Bollacker, Lawrence, et al.
- 2000
|
|
5
|
Dependency networks for collaborative filtering and data visualization
– Heckerman, Chickering, et al.
- 2000
|