On-Line New Event Detection, Clustering, And Tracking (1999)
BibTeX
@TECHREPORT{Papka99on-linenew,
author = {Ron Papka},
title = {On-Line New Event Detection, Clustering, And Tracking},
institution = {},
year = {1999}
}
Abstract
In this work, we discuss and evaluate solutions to text classification problems associated with the events that are reported in on-line sources of news. We present solutions to three related classification problems: new event detection, event clustering, and event tracking. The primary focus of this thesis is new event detection, where the goal is to identify news stories that have not previously been reported, in a stream of broadcast news comprising radio, television, and newswire. We present an algorithm for new event detection, and analyze the effects of incorporating domain properties into the classification algorithm. We explore a solution that models the temporal relationship between news stories, and investigate the use of proper noun phrase







