There are many applications in which it is desirable to order rather than classify instances. Here we consider the problem of learning how to order, given feedback in the form of preference judgments, i.e., statements to the effect that one instance should be ranked ahead of another. We outline a two-stage approach in which one first learns by conventional means a preference function, of the form PREF(u; v), which indicates whether it is advisable to rank u before v. New instances are then ordered so as to maximize agreements with the learned preference function. We show that the problem of finding the ordering that agrees best with a preference function is NP-complete, even under very restrictive assumptions. Nevertheless, we describe a simple greedy algorithm that is guaranteed to find a good approximation. We then discuss an on-line learning algorithm, based on the "Hedge" algorithm, for finding a good linear combination of ranking "experts." We use the ordering algorith...
|
2329
|
Introduction to modern information retrieval
– Salton
- 1983
|
|
1205
|
Schapire, “Decision-theoretic generalization of on-line learning and application to boosting
– Freund, E
- 1997
|
|
499
|
Learning quickly when irrelevant attributes abound: A new linearthreshold algorithm
– Littlestone
- 1988
|
|
438
|
The weighted majority algorithm
– Littlestone, Warmuth
- 1994
|
|
404
|
Approximation Algorithms for NP-Hard Problems
– Hochbaum
- 1995
|
|
297
|
Fab: content-based, collaborative recommendation
– Balabanović, Shoham
- 1997
|
|
225
|
An efficient boosting algorithm for combining preferences
– Freund, Iyer, et al.
- 1998
|
|
159
|
Utility theory for Decision Making
– Fishburn
- 1970
|
|
111
|
Automatic combination of multiple ranked retrieval systems
– Bartell, Cottrell, et al.
- 1994
|
|
100
|
The Theory of Committees and Elections
– Black
- 1958
|
|
98
|
Decision Theory: An Introduction to The Mathematics of Rationality
– French
- 1986
|
|
96
|
Computers and intractability, A Guide to the Theory of NP-Completeness (Freeman
– Gary, Johnson
- 1979
|
|
89
|
The Collection Fusion Problem
– Voorhees, Gupta, et al.
- 1994
|
|
80
|
Divide-and-conquer approximation algorithms via spreading metrics
– Even, Naor, et al.
|
|
60
|
A machine learning architecture for optimizing Web search engine
– Boyan, Freitag, et al.
- 1996
|
|
60
|
Voting schemes for which it can be difficult to tell who won the election
– Tovey, Trick
- 1989
|
|
58
|
Expected Search Length: A Single Measure of Retrieval Effectiveness Based on the Weak Ordering Action of Retrieval Systems
– Cooper
- 1968
|
|
58
|
Dynamic reference sifting: A case study in the homepage domain
– Shakes, Langheinrich
- 1997
|
|
56
|
Approximating minimum feedback sets and multicuts in directed graphs. Algorithmica
– Even, Naor, et al.
- 1998
|
|
55
|
Packing directed circuits fractionally
– Seymour
- 1995
|
|
52
|
Two kinds of training information for evaluation function learning
– Utgoff
- 1991
|
|
51
|
Cut problems and their application to divide-and-conquer
– Shmoys
- 1997
|
|
34
|
Using the Future to \Sort Out" the Present: Rankprop and Multitask Learning for Medical Risk Prediction
– Caruana, Baluja, et al.
- 1996
|
|
34
|
Measuring retrieval effectiveness based on user preference of documents
– Yao
- 1995
|
|
28
|
Mathematics without numbers
– Kemeny
- 1959
|
|
26
|
Efficient information gathering on the Internet
– Etzioni, Hanks, et al.
- 1996
|
|
26
|
The Theory of Social Choice
– Fishburn
- 1973
|
|
25
|
Learning collection fusion strategies for information retrieval
– Towell, Voorhees, et al.
- 1995
|
|
22
|
Measurement theory with applications to decision making, utility and the social sciences
– Roberts
- 1979
|
|
19
|
Learning a Preference Predicate
– P, Saxena
- 1987
|
|
15
|
Comparing and combining the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval
– Lochbaum, Streeter
- 1989
|
|
14
|
Introduction (to the special section on recommender systems
– Resnik, Varian
- 1997
|
|
12
|
Decision Level Data Fusion for Routing of Documents in the TREC3 Context: A Best Case Analysis of Worst Case Results
– Kantor
- 1994
|
|
9
|
Cyclic ordering is np-complete
– Galil, Megido
- 1977
|
|
7
|
Two kinds of training information for evaluation function learning
– Utgo, Clouse
- 1991
|
|
6
|
Trick: Voting schemes for which it can be dicult to tell who won 1655. The same study, though, found that hundreds of ballots were thrown out in Palm Beach and Volusia counties that had marks no dierent from ballots deemed valid { Gore would have been ele
– Bartholdi, Tovey, et al.
- 1989
|
|
5
|
Learning a preference predicate
– Utgo
- 1987
|
|
4
|
A method for taking votes on more than two issues
– Dodgson
- 1876
|
|
2
|
Comparing and combining the eectiveness of latent semantic indexing and the ordinary vector space model for information retrieval. Information processing and management
– Lochbaum
- 1989
|
|
1
|
FAB: Content-based, collaborative recommendation
– Balabanovc
- 1997
|
|
1
|
Tight bounds for the acyclic subgraph problem
– Berger, Shor
- 1997
|
|
1
|
Learning to Order Things
– Caruana, Baluja
- 1996
|
|
1
|
Comparing and combiningthe effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval. Information processing and management
– Streeter
- 1989
|