An Information-Theoretic Definition of Similarity (1998) [325 citations — 0 self]

by Dekang Lin
In Proceedings of the 15th International Conference on Machine Learning

Abstract:

Similarity is an important and widely used concept. Previous definitions of similarity are tied to a particular application or a form of knowledge representation. We present an information-theoretic definition of similarity that is applicable as long as there is a probabilistic model. We demonstrate how our definition can be used to measure the similarity in a number of different domains.

Citations

4701 Probabilistic Reasoning in Intelligent Systems – Pearl - 1988
1072 Introduction to WordNet: An On-line Lexical Database – Miller, Beckwith, et al. - 1990
787 Instance-based learning algorithms – Aha, Kibler, et al. - 1991
546 Features of Similarity – Tversky - 1977
407 Distributional clustering of English words – Pereira, Tishby, et al. - 1993
400 Toward Memory-Based Reasoning – Stanfill, Waltz - 1986
374 Using information content to evaluate semantic similarity in a taxonomy – Resnik - 1995
204 Representing and Reasoning with Probabilistic Knowledge: A Logical Approach to Probabilities – Bacchus - 1990
165 Noun classification from predicate-argument structure – Hindle - 1990
158 Elements of Information Theory. Wiley Series in Telecommunications – Cover, Thomas - 1991
149 Overview of the first text retrieval conference (TREC-1) – Harman - 1992
111 Contextual correlates of semantic similarity – Miller, Charles - 1991
104 Disambiguating Noun Groupings with Respect to WordNet Senses – Resnik - 1995
103 Verb Semantics and Lexical Selection – Wu, Palmer - 1994
58 Principle-based parsing without overgeneration – Lin - 1993
51 Training and scaling preference functions for disambiguation – Alshawi, Carter - 1994
43 Generalizing automatically generated selectional patterns – Grishman, Sterling - 1994
42 Information Retrieval Based on Conceptual Distance in ISA Hierarchies – Lee, Kim, et al. - 1993
26 Principar: an efficient, broad-coverage, principle-based parser – Lin - 1994
22 An evaluation of factors affecting document ranking by information retrieval systems – McGill, Koll, et al. - 1979
18 Experiments on linguistically based term associations – Ruge - 1991
2 Development and application of a metric on semantic nets – Rada, Mili, et al. - 1989
2 Random House College Thesaurus. Random – Stein, Flexner - 1984