Stylistic Experiments For Information Retrieval (2000)
Citations: 47 (8 self)
BibTeX
@MISC{Karlgren00stylisticexperiments,
author = {Jussi Karlgren},
title = {Stylistic Experiments For Information Retrieval},
year = {2000}
}
Abstract
Information retrieval systems are built to handle texts as topical items: texts are tabulated by the occurrence frequencies of their content words, under the assumption that text topic is reasonably well modeled by content word occurrence. But texts have several interesting characteristics beyond topic. The experiments described in this text investigate stylistic variation. Roughly put, style is the difference between two ways of saying the same thing, and systematic stylistic variation can be used to characterize the genre of documents. These experiments investigate whether stylistic information is distinguishable using simple language engineering methods and, if so, whether this type of information can be used to improve information retrieval systems.