Combining Labeled and Unlabeled Data with Co-Training (1998) [640 citations — 18 self]
http://l2r.cs.uiuc.edu/~danr/Teaching/CS598-05/Pap
http://luthuli.cs.uiuc.edu/~daf/courses/Learning/P
http://axon.cs.byu.edu/~martinez/classes/678/Paper
http://www-connex.lip6.fr/~amini/./RelatedWorks/Bl
http://www-connex.lip6.fr/~amini/RelatedWorks/BlMi
http://www.cs.cmu.edu/afs/cs/Web/People/avrim/Pape
http://www-2.cs.cmu.edu/~avrim/Papers/cotrain.ps.g
http://www.cs.cmu.edu/afs/cs/usr/avrim/www/Papers/
http://www.ri.cmu.edu/pub_files/pub1/blum_a_1998_1
CiteULike
CACHED:
Abstract:
We consider the problem of using a large unlabeled sample to boost performance of a learning algorithm when only a small set of labeled examples is available. In particular, we consider a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views. For example, the description of a web page can be partitioned into the words occurring on that page, and the words occurring in hyperlinks that point to that page. We assume that either view of the example would be sufficient for learning if we had enough labeled data, but our goal is to use both views together to allow inexpensive unlabeled data to augment a much smaller set of labeled examples. Specifically, the presence of two distinct views of each example suggests strategies in which two learning algorithms are trained separately on each view, and then each algorithm 's predictions on new unlabeled examples are used to enlarge the training s...

