Results 1 -
5 of
5
Novelty and redundancy detection in adaptive filtering
, 2002
"... This paper addresses the problem of extending an adaptive information filtering system to make decisions about the novelty and redundancy of relevant documents. It argues that relevance and redundance should each be modelled explicitly and separately. A set of five redundancy measures are proposed a ..."
Abstract
-
Cited by 82 (12 self)
- Add to MetaCart
This paper addresses the problem of extending an adaptive information filtering system to make decisions about the novelty and redundancy of relevant documents. It argues that relevance and redundance should each be modelled explicitly and separately. A set of five redundancy measures are proposed and evaluated in experiments with and without redundancy thresholds. The experimental results demonstrate that the cosine similarity metric and a redundancy measure based on a mixture of language models are both effective for identifying redundant documents.
Novelty Detection Based on Sentence Level Patterns
- CIKM
, 2005
"... The detection of new information in a document stream is an important component of many potential applications. In this paper, a new novelty detection approach based on the identification of sentence level patterns is proposed. Given a user’s information need, some patterns in sentences such as comb ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
The detection of new information in a document stream is an important component of many potential applications. In this paper, a new novelty detection approach based on the identification of sentence level patterns is proposed. Given a user’s information need, some patterns in sentences such as combinations of query words, named entities and phrases, may contain more important and relevant information than single words. Therefore, the proposed novelty detection approach focuses on the identification of previously unseen query-related patterns in sentences. Specifically, a query is preprocessed and represented with patterns that include both query words and required answer types. These patterns are used to retrieve sentences, which are then determined to be novel if it is likely that a new answer is present. An analysis of patterns in sentences was performed with data from the TREC 2002 novelty track and experiments on novelty detection were carried out on data from the TREC 2003 and 2004 novelty tracks. The experimental results show that the proposed pattern-based approach significantly outperforms all three baselines in terms of precision at top ranks.
Improving novelty detection for general topics using sentence level information patterns
- In Proceedings of
, 2006
"... The detection of new information in a document stream is an important component of many potential applications. In this work, a new novelty detection approach based on the identification of sentence level information patterns is proposed. First, the information- pattern concept for novelty detection ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
The detection of new information in a document stream is an important component of many potential applications. In this work, a new novelty detection approach based on the identification of sentence level information patterns is proposed. First, the information- pattern concept for novelty detection is presented with the emphasis on new information patterns for general topics (queries) that cannot be simply turned into specific questions whose answers are specific named entities (NEs). Then we elaborate a thorough analysis of sentence level information patterns on data from the TREC novelty tracks, including sentence lengths, named entities, sentence level opinion patterns. This analysis provides guidelines in applying those patterns in novelty detection particularly for the general topics. Finally, a unified pattern-based approach is presented to novelty detection for both general and specific topics. The new method for dealing with general topics will be the focus. Experimental results show that the proposed approach significantly improves the performance of novelty detection for general topics as well as the overall performance for all topics from the 2002-2004 TREC novelty tracks.
Summarisation and novelty in Mobile Information Access
"... The paper presents a user study which investigates the effects of incorporating novelty detection in automatic text summarisation. The motivation being the need to provide access to information that is tailored to small screen displays. Automatic text summarisation offers a means to deliver device-f ..."
Abstract
- Add to MetaCart
The paper presents a user study which investigates the effects of incorporating novelty detection in automatic text summarisation. The motivation being the need to provide access to information that is tailored to small screen displays. Automatic text summarisation offers a means to deliver device-friendly content. An effective summary could be one that includes only new information. However, a consequence of focusing exclusively on novel parts may result in a loss of context, which may have an impact on the ability to correctly interpret the meaning of a summary given the source document. In the user study we compares two strategies to produce summaries that incorporate novelty in different ways; an incremental summary and a constant length summary. The aim is to establish whether a summary that contains only novel sentences provides sufficient basis to determine relevance of a document, or do we need to include additional sentences to provide context. Findings from the study seem to suggest that there is minimal difference in performance for the tasks we set our users. Therefore, for the case of mobile information access a summary that contains only novel information would be more appropriate. 1.

