Results 1 -
3 of
3
An exploration of entity models, collective classification and relation description
- In Proceedings of KDD Workshop on Link Analysis and Group Detection
, 2004
"... Traditional information retrieval typically represents data using a bag of words; data mining typically uses a highly structured database representation. This paper explores the middle ground using a representation which we term entity models, in which questions about structured data may be posed an ..."
Abstract
-
Cited by 13 (1 self)
- Add to MetaCart
Traditional information retrieval typically represents data using a bag of words; data mining typically uses a highly structured database representation. This paper explores the middle ground using a representation which we term entity models, in which questions about structured data may be posed and answered, but the complexities and task-specific restrictions of ontologies are avoided. An entity model is a language model or word distribution associated with an entity, such as a person, place or organization. Using these perentity language models, entities may be clustered, links may be detected or described with a short summary, entities may be collectively classified, and question answering may be performed. On a corpus of entities extracted from newswire and the Web, we group entities by profession with 90 % accuracy, improve accuracy further on the task of classifying politicians as liberal or conservative using collective classification and conditional random fields, and answer questions about “who a person is ” with mean reciprocal rank (MRR) of 0.52. 1.
Entity Models: Construction and Applications
"... We propose entity language models, a probabilistic representation of the language used to describe a named entity (person, organization, or location). The model is purely statistical and constructed from snippets of text surrounding mentions of an entity. We evaluate the effectiveness of entity mode ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
We propose entity language models, a probabilistic representation of the language used to describe a named entity (person, organization, or location). The model is purely statistical and constructed from snippets of text surrounding mentions of an entity. We evaluate the effectiveness of entity models in three tasks: fact-based question answering, classification into pre-defined groups, and description of the relationship between two entities. The results on all tasks are promising.
Probabilistic Language Modelling
, 2002
"... Language models assign probabilities to strings of symbols. Their interpretation is reviewed and applied to text classi cation. A language recogniser is constructed from Bayes' theorem and a simple bigram model. This provides ..."
Abstract
- Add to MetaCart
Language models assign probabilities to strings of symbols. Their interpretation is reviewed and applied to text classi cation. A language recogniser is constructed from Bayes' theorem and a simple bigram model. This provides

