Abstract:
Similarity is an important and widely used concept. Previous definitions of similarity are tied to a particular application or a form of knowledge representation. We present an informationtheoretic definition of similarity that is applicable as long as there is a probabilistic model. We demonstrate how our definition can be used to measure the similarity in a number of different domains.
Citations
|
4701
|
Probabilistic Reasoning in Intelligent Systems
– Pearl
- 1988
|
|
1072
|
Introduction to WordNet: An On-line Lexical Database
– Miller, Beckwith, et al.
- 1990
|
|
787
|
Instance-based learning algorithms
– Aha, Kibler, et al.
- 1991
|
|
546
|
Features of Similarity
– Tversky
- 1977
|
|
407
|
Distributional clustering of english words
– Pereira, Tishby, et al.
- 1993
|
|
400
|
Toward MemoryBased Reasoning
– Stanfill, Waltz
- 1986
|
|
374
|
Using information content to evaluate semantic similarity in a taxonomy
– Resnik
- 1995
|
|
204
|
Representing and Reasoning with Probabilistic Knowledge: A Logical Approach to Probabilities
– Bacchus
- 1990
|
|
165
|
Noun classification from predicate-argument structure
– Hindle
- 1990
|
|
158
|
Elements of Information Theory. Wiley Series in Telecommunications
– Cover, Thomas
- 1991
|
|
149
|
Overview of the first text retrieval conference (TREC-1
– Harman
- 1992
|
|
111
|
Contextual correlates of semantic similarity
– Miller, Charles
- 1991
|
|
104
|
Disambiguating Noun Groupings with Respect to WordNet Senses
– Resnik
- 1995
|
|
103
|
Verb Semantics and Lexical Selection
– Wu, Palmer
- 1994
|
|
58
|
Principle-based parsing without overgeneration
– Lin
- 1993
|
|
51
|
Training and scaling preference functions for disambiguation
– Alshawi, Carter
- 1994
|
|
43
|
Generalizing automatically generated selectional patterns
– Grishman, Sterling
- 1994
|
|
42
|
Y.J.: Information Retrieval Based on Conceptual Distance in ISA Hierarchies
– Lee, Kim, et al.
- 1993
|
|
26
|
Principar—an efficient, broadcoverage, principle-based parser
– Lin
- 1994
|
|
22
|
An evaluation of factors affecting document ranking by information retrieval systems
– McGill, Koll, et al.
- 1979
|
|
18
|
Experiments on linguistically based term associations
– Ruge
- 1991
|
|
2
|
Development and application ofa metric on semantic nets
– Rada, Mili, et al.
- 1989
|
|
2
|
Random House College Thesaurus. Random
– Stein, Flexner
- 1984
|